Skip to content
DevOps Jobs
Hybrid (San Francisco, California) $165k - $277k/yr Full-time Senior Level 10 benefits + 3 perks
Posted 1 day ago

About the role

Join the Cisco ThousandEyes FedRAMP team to build and operate a US GovCloud platform, ensuring the reliability, performance, and security of federal region infrastructure. You will architect, deliver, and maintain a critical FedRAMP offering, contributing to a platform that provides digital experience assurance for government organizations.

Skills

FedRAMP Compliance Python Go AWS Terraform Puppet Kubernetes Unix/Linux Infrastructure as Code Distributed Systems Cloud-native Services Incident Response Capacity Planning Vulnerability Management Security Assessments Monitoring and Logging
Onsite (Jersey City, NJ) $171k - $260k/yr Full-time Senior Level 3 benefits + 3 perks
Posted 1 day ago

About the role

Join JPMorgan Chase & Co. as a Senior Lead Site Reliability Engineer on the Chief Data & Analytics Office (CDAO) AI/ML & Data Platforms team. You will define and implement reliability targets for large-scale data platforms and AI/ML workloads, ensuring secure, scalable, and high-performing analytics.

Skills

Site Reliability Engineering Observability SLI/SLO/SLA Distributed Systems AI/ML Platforms Data Lake Ecosystems Grafana Dynatrace Prometheus Datadog Splunk AWS Databricks Spark Kubernetes Terraform
J

Site Reliability Engineer III

JPMorgan Chase & Co.

Onsite (Jersey City, NJ) $133k - $185k/yr Full-time Mid Level 3 benefits + 3 perks
Posted 1 day ago

About the role

Join JPMorgan Chase & Co. as a Site Reliability Engineer III on the Chief Data & Analytics Office (CDAO) AI/ML & Data Platforms team. You will modernize critical systems and solve complex business problems through code and cloud infrastructure, contributing to the availability, reliability, and scalability of applications and platforms.

Skills

Site Reliability Engineering Python Java AWS Kubernetes Databricks Snowflake CI/CD Observability System Design PySpark Incident Management SLI/SLO/SLA Infrastructure as Code Container Orchestration Disaster Recovery
Hybrid (McLean, Virginia) $103k - $150k/yr Full-time Mid Level 6 benefits
Posted 1 day ago

About the role

Medallia is a leader in Experience Management, offering a SaaS platform that enhances customer and employee experiences. As a Site Reliability Engineer II, you will operate and improve the reliability, scalability, and performance of global SaaS platform services.

Skills

Kubernetes AWS OCI GCP Linux Systems Administration Python Bash Go CI/CD GitOps Networking Fundamentals Distributed Systems Troubleshooting Terraform Prometheus Grafana Incident Response
Hybrid (Albuquerque, NM) $108k - $138k/yr Full-time Senior Level 7 benefits
Posted 1 day ago

About the role

Legence is a leading provider of engineering and consulting services for mission-critical systems in buildings, serving demanding sectors including the Nasdaq-100. This role involves designing, implementing, and managing network infrastructure to ensure optimal performance and security.

Skills

OSI Layer 2 and 3 Switch Configuration Wireless Access Points Router Management VLANs ACLs Firewall Rules Palo Alto Firewalls HPE Aruba Cisco IPv4 Routing Protocols NAT VPNs Network Troubleshooting
M

DevOps Engineer

Metova Federal

Onsite (Hanover, MD) $115k - $135k/yr Full-time Mid Level 11 benefits
Posted 1 day ago

About the role

By Light Professional IT Services LLC, through its company Cole Engineering Services (CESI), provides advanced modeling and simulation training solutions for federal agencies. This role focuses on researching, developing, and maintaining DevOps processes to automate the integration and deployment of complex DoD cybersecurity software applications.

Skills

DevOps CI/CD Scripting Containerization Virtualization Cloud Computing Network Configuration Automated Testing Configuration Management Agile Development Kubernetes Docker
Hybrid (Washington, District of Columbia) $325k - $360k/yr Full-time Senior Level 2 perks
Posted 2 days ago

About the role

Anthropic is building reliable, interpretable, and steerable AI systems to be safe and beneficial for society. The Endpoint team treats the device fleet as a distributed platform, managing every piece of device configuration as code to ensure security and efficiency.

Skills

MDM Configuration As Code macOS Internals Windows Internals Python Shell Scripting PowerShell GitOps CI/CD Infrastructure As Code Public Cloud Zero Touch Provisioning Patch Management Endpoint Security Swift Go
Onsite (USA - CA - Palo Alto, California) $232k - $335k/yr Full-time Senior Level 6 benefits + 1 perks
Posted 2 days ago

About the role

Uniphore is a B2B AI-native company focused on unifying and humanizing enterprise experiences through advanced AI. This Principal Site Reliability Engineer role will shape platform strategy and tackle complex scaling and reliability challenges within a multi-cloud environment.

Skills

Go Kubernetes AWS GCP Azure Terraform Multi-cloud Architecture Incident Management API Design Observability Platform Engineering SRE DevOps Technical Leadership Infrastructure Automation System Design
Onsite (Renton, Washington) $205k - $307k/yr Full-time Senior Level 7 benefits
Posted 2 days ago

About the role

Join Hasbro and Wizards of the Coast to build and operate the enterprise AI platform, shaping the future of AI access and leverage across iconic brands. You'll collaborate with passionate teams to create innovative experiences that inspire creativity and foster community through play.

Skills

Platform Engineering AI Infrastructure AWS Bedrock SageMaker MCP Protocol A2A Protocol Databricks Snowflake Microsoft Entra ID Okta IAM Prompt Management AI Safety Enterprise AI Administration System Engineering API Design
Onsite (McLean, Virginia) $209k - $286k/yr Full-time Senior Level 5 benefits
Posted 2 days ago

About the role

Capital One is a rapidly growing organization focused on technology and customer passion. This role offers a chance to provide technical leadership and risk oversight for enterprise-wide software engineering and Site Reliability Engineering (SRE) practices, ensuring seamless and highly reliable customer experiences.

Skills

Site Reliability Engineering Cloud Architecture Risk Management Gen AI Tooling Automation CI/CD Pipelines Observability Containerization Technical Leadership Executive Communication Cloud Migration Software Engineering
Onsite (Washington, District of Columbia) $180k - $280k/yr Full-time Senior Level 3 benefits
Posted 2 days ago

About the role

Shield AI, a defense-tech company, is seeking a Senior Manager, DevOps Engineering to lead a team responsible for the software development life cycle and build pipelines for intelligent systems. This role is crucial for enabling engineering velocity and modernizing build systems.

Skills

DevOps Engineering CI/CD Pipelines GitLab CMake Conan Poetry Nix Docker Artifactory Linux System Administration People Management Azure Gov Cloud Kubernetes C++ Python Node.js
Hybrid (New York, New York) $220k - $260k/yr Full-time Senior Level 3 benefits + 6 perks
Posted 2 days ago

About the role

Ripple is building a world where value moves like information, aiming to improve the global financial system and create economic opportunity. As a Senior Staff DevOps Engineer, you will contribute to the discovery, development, and implementation of solutions to enhance infrastructure and release pipelines, impacting automated product delivery.

Skills

Python Go Java Kubernetes AWS Docker Terraform CloudFormation CI/CD Infrastructure-as-Code Agile Distributed Services Container Schedulers Monitoring Instrumentation Security Methodologies
Hybrid (Chicago, Illinois) $220k - $260k/yr Full-time Senior Level 7 benefits + 6 perks
Posted 2 days ago

About the role

Ripple is building a world where value moves like information, aiming to improve the global financial system. As a Senior Staff DevOps Engineer, you will contribute to the discovery, development, and implementation of solutions to enhance infrastructure and release pipelines, impacting automated product delivery.

Skills

Python Go Java Kubernetes AWS Docker Terraform CloudFormation CI/CD Infrastructure-as-Code Agile Distributed Services Container Schedulers Multi-region Service Platforms Security Methodologies Automation
Onsite (US FL JAX 347, Georgia) $144k - $246k/yr Full-time Senior Level
Posted 2 days ago

About the role

Join FIS in building an AI-enabled, autonomous banking platform for Tier-1 financial institutions. This role offers direct impact on customer trust and operational continuity, with high visibility across customer, support, engineering, platform, and product teams.

Skills

Kubernetes Troubleshooting Linux Networking Observability Prometheus Grafana Loki OpenTelemetry Datadog Splunk Customer Support Distributed Systems APIs Containers AI-Assisted Operations
Onsite (Jacksonville, Florida) $144k - $246k/yr Senior Level
Posted 2 days ago

About the role

Join FIS Global in building an AI-enabled, autonomous banking platform for Tier-1 financial institutions. This role offers direct impact on customer trust and operational continuity for major financial clients.

Skills

Kubernetes Troubleshooting Linux Networking Customer Support Observability Prometheus Grafana Loki OpenTelemetry Datadog Splunk Distributed Systems API Containers AI-native Tooling
Onsite (San Francisco, California) $210k - $240k/yr Full-time Senior Level 1 perks
Posted 2 days ago

About the role

Alembic is a pioneering Causal AI platform backed by significant funding, utilizing cutting-edge NVIDIA DGX SuperPOD infrastructure. This role offers the opportunity to architect and operate the global network and reliability layer for a high-performance platform, with significant technical autonomy and impact.

Skills

Network Architecture SRE BGP VPN WAN Ansible Terraform Kubernetes Networking Linux Administration Prometheus Grafana Python Bash InfiniBand Network Security Capacity Planning
M

Manager Site Reliability Operations

Mercury Insurance Services, LLC

Onsite (Brea, CA) $118k - $230k/yr Senior Level 8 benefits + 4 perks
Posted 2 days ago

About the role

Join Mercury Insurance, a company recognized for its achievements and culture, as a Site Reliability Operations Manager. You will lead a team responsible for the end-to-end observability, monitoring, and operational response of critical platforms, driving system stability and minimizing customer impact.

Skills

Site Reliability Engineering Observability Incident Management Problem Management CI/CD Pipelines Root Cause Analysis Automation Infrastructure as Code Configuration as Code Team Leadership Monitoring and Alerting Cloud Platforms Kubernetes OpenShift AWS ITIL
Onsite (Remote (VA), Virginia) $98k - $228k/yr Full-time Senior Level 6 benefits + 1 perks
Posted 2 days ago

About the role

Join the Zoom Phone and Zoom Contact Center DevOps team to build and maintain the next generation of cloud and colocation infrastructure powering seamless communication. This role involves designing, implementing, and optimizing resilient VoIP infrastructure and services.

Skills

Kubernetes Docker Terraform Ansible AWS CI/CD Linux Systems Administration VoIP SIP RTP Prometheus Grafana ELK Datadog Network Configuration Cloud Infrastructure
Onsite (Work At Home-Texas, Idaho) $83k - $222k/yr Full-time Senior Level 6 benefits + 3 perks
Posted 2 days ago

About the role

Join CVS Health to simplify healthcare and shape a more connected, convenient, and compassionate health experience. As a Senior Platform Engineer on the Data & Performance Enablement team, you will design and implement platforms for distributed event streaming, directly enabling high-performance, data-intensive applications.

Skills

Kafka Redis Infrastructure as Code Bash Python Azure AWS GitHub Actions Database Administration Query Optimization DBRE Principles Data Modeling Event Streaming Automation Provisioning Troubleshooting
Hybrid (Bellevue, Washington) $154k - $220k/yr Full-time Senior Level 6 benefits + 1 perks
Posted 2 days ago

About the role

Zscaler is an AI-forward enterprise accelerating digital transformation to secure customers in the AI age. They seek innovators to join their high-performing teams, focusing on customer obsession, collaboration, and ownership to solve complex challenges.

Skills

Backend Services Design API Development Distributed Systems Docker Kubernetes AWS GCP Azure Object-Oriented Programming AI/ML Technologies CI/CD Pipelines DevOps Automation Agile/Scrum System Optimization Mentoring LLM Deployment