Browse 152 exciting jobs hiring in Prometheus now. Check out companies hiring such as SearchStax, Rula, NVIDIA in Durham, Grand Rapids, Knoxville.
Experienced Staff AWS SRE to lead scalability, automation, and reliability across a rapidly growing cloud platform serving enterprise search workloads.
Rula is hiring a Staff SRE & DevOps Engineer to drive observability, reliability, and SRE practices across its remote engineering teams.
Lead NVIDIA's multi-cloud GPU capacity strategy by combining analytics, automation, and cross-functional leadership to optimize compute resources for AI and HPC workloads.
Intuitive is hiring a Cloud Operations Engineer to design and operate automated, secure cloud-native infrastructure and CI/CD pipelines that support mission-critical healthcare systems.
Experienced AWS DevOps Engineer needed to build scalable infrastructure, automate deployments with Terraform and Jenkins, and drive reliability across cloud-native systems for a remote US-based engineering team.
ServiceNow is hiring a Staff DevOps Engineer to operate, automate, and support mission-critical Hadoop and Big Data infrastructure for its US Federal cloud environment.
Experienced Full-Stack Engineer (Haskell + PureScript) needed to build robust backends and functional frontends for cloud-native projects across LATAM and US clients.
AHEAD is hiring an Observability Engineer to design and operate scalable monitoring, logging, and tracing solutions across cloud-native and hybrid environments.
Apply your backend engineering experience to design and run automated chaos and reliability experiments that harden Camunda's distributed platform in a fully remote, async-first environment.
Hopin seeks an AWS DevOps Engineer to own and automate scalable, secure AWS infrastructure and CI/CD pipelines while mentoring teams and improving operational reliability in a remote US role.
Experienced AWS DevOps Engineer needed to build and maintain scalable, secure cloud infrastructure and CI/CD pipelines for a fast-growing remote US team.
Maintain and troubleshoot TensorWave's bare-metal Kubernetes clusters to ensure reliable, high-performance infrastructure for cutting-edge AI workloads.
Senior Systems Software Engineer to design and maintain observability and reliability systems for large-scale cloud services at NVIDIA.
Astronomer is hiring a Customer Reliability Engineer (Infrastructure) to own the reliability of Kubernetes and cloud infrastructure for its managed Airflow service and directly support enterprise customers.
Lead the design and implementation of orchestration layers that coordinate LLMs, agents, retrieval pipelines, and human-in-the-loop workflows for a mission-driven healthcare AI platform.
Lead a Platform Engineering team to design, operate, and scale core cloud services and integrations that power Canopy’s IoT and security products.
Work with Experian's Financial Services Division to design, build and operate real-time, high-throughput API platforms using Python and cloud-native technologies.
Arista Networks is hiring a Senior Site Reliability Engineer to design, operate, and scale the CloudVision SaaS platform running on Kubernetes across global regions.
Be part of NVIDIA’s performance engineering team to architect, tune, and validate large-scale GPU-accelerated systems and workflows for AI and datacenter workloads.
Help build and operate Tempo's production infrastructure—Kubernetes, bare metal, observability, and tooling—to enable fast, secure shipping of blockchain payment systems.
Experienced backend and DevOps engineer needed to design, automate, and operate secure, high-reliability cloud infrastructure for federal environments at Abnormal AI.
As a Production Engineer II on Yahoo's Media Platform team, you'll architect, automate, and operate large-scale cloud infrastructure and observability tooling to improve reliability and developer velocity.
Work on the Mailgun infrastructure team to design, automate, and operate cloud-native systems that deliver reliable, large-scale email services to global customers.
The Aspen Group is seeking a Senior SRE to design AI-driven observability, automate incident response, and scale resilient cloud infrastructure for its national healthcare platforms.
Exa is hiring an in-person Infrastructure Engineer in San Francisco to build and operate large-scale GPU and Kubernetes infrastructure that powers its AI search platform.
Senior Cloud Platform Engineer to design, automate, and operate AWS-based platform services, developer tooling, and CI/CD pipelines for a high-growth professional services fintech.
Lead and mentor a Customer Success Engineering team at Logz.io to drive presales wins, ensure postsale value realization, and act as the technical bridge between customers and internal teams.
Join Unstructured as a Public Sector SRE to architect and operate compliant, high-assurance cloud infrastructure that powers AI workloads for federal customers.
Bumble Inc. is seeking a Lead Backend Software Engineer in Austin to design and operate scalable AWS-native backend systems for Bumble Date while leading projects and mentoring engineers.
Bumble is hiring a Staff Backend Software Engineer to own and deliver scalable AWS-native backend systems that power the Bumble Date experience.
TensorWave is hiring an AI Infrastructure Engineer to design, operate, and optimize high-performance GPU clusters that power its AI cloud services.
Wyetech is hiring a Database Engineer 2 to design and optimize PostgreSQL databases and implement HA, replication, and automation for classified federal programs requiring TS/SCI clearance.
Lead Flow's platform and infrastructure efforts as a Platform Engineering Manager, driving reliability, automation, and developer velocity across cloud-native systems.
Senior Linux engineer needed to design, tune, and operate low-latency trading infrastructure for Point72's strategic order management and co-location environments.
NVIDIA seeks a hands-on Solutions Architect to advise DGX Cloud partners, accelerate production AI adoption, and scale best practices for high-performance GPU infrastructure.
Greenlight needs a Senior Production Operations Engineer to lead SRE practices, automation, and infrastructure reliability for its high-scale fintech platform.
Experienced SRE/Software Engineer III needed to support production reliability, automate systems at scale, and collaborate with development teams at LexisNexis Risk Solutions in Alpharetta, GA.
Morgan Stanley is hiring a VP-level Senior AI Platform Engineer to architect and implement a firmwide Generative AI platform using Python, Kubernetes/OpenShift, and large-scale data and API ecosystems.
Wyetech seeks a seasoned Software Engineer 2 to drive DevOps engineering, backend Java development, and platform automation for federal programs requiring active TS/SCI clearance.
NVIDIA's DGX Cloud team is hiring a Senior Site Reliability Engineer to operate and scale GPU-accelerated Kubernetes clusters across major cloud providers while driving reliability, observability, and performance.
Mirantis seeks a US-based Consulting Architect to lead technical client engagements, design cloud-native solutions, and act as a trusted liaison between customers and internal teams.
Platform Engineer needed to design, implement, and scale secure cloud infrastructure and CI/CD pipelines for PermitFlow’s AI-driven pre-construction platform in a hybrid NYC role.
Lead the architecture and operationalization of a secure, scalable multi-cloud data platform and FinOps governance stack across AWS, GCP, Azure, and on-prem environments for a Fortune 500 enterprise.
Halcyon, a remote-native adaptive security platform dedicated to stopping ransomware, is hiring a Senior DevOps Engineer to own and scale AWS infrastructure, CI/CD, and observability.
Antimetal is hiring a Systems Engineer to build and operate low-level, high-performance systems and scalable infrastructure that power our observability platform.
Technical lead role managing a small DevOps team to design, automate, and scale Azure infrastructure and CI/CD for a fast-growing pet-health and e-commerce portfolio.
Experienced SRE/Test Engineer needed to own post-production monitoring, incident response, and business-requirements-driven test automation for PHIL’s prescription management platform.
WorkOS is hiring a Site Reliability Engineer to improve platform reliability, observability, and performance across a distributed, TypeScript-based production environment.
Degreed is hiring a Senior DevOps Engineer to lead Azure and Terraform initiatives that improve platform reliability, security, and developer velocity.
Experienced software developer needed to design, prototype, and productionize low-latency sensor-processing algorithms in C++/Rust for DoD-focused systems based in Boulder, CO.
Below 50k*
0
|
50k-100k*
3
|
Over 100k*
145
|