Browse 81 exciting jobs hiring in Prometheus now. Check out companies hiring such as Flock Safety, PayJoy, Intel in Cincinnati, Long Beach, Philadelphia.
Lead the design and productionization of agentic AI systems and an evaluation platform to power Night Shift, Flock Safety’s investigator-facing LLM agent product.
PayJoy is hiring a Staff Engineer to lead and scale its AWS/Kubernetes platform and developer productivity tooling to support resilient, secure, and cost-efficient backend services.
Work within Intel's Information Security team to design and deploy secure, scalable network monitoring solutions supporting U.S. Government operations.
Lead the architecture and operation of NVIDIA's global observability platform to ensure reliable, high-performance telemetry for large-scale AI and data systems.
Lead DevOps Engineer needed to architect and modernize CI/CD and cloud infrastructure for large-scale enterprise applications in Dallas, TX.
Lead architecture and implementation of a scalable observability and data platform for NVIDIA’s global AI and HPC clusters as Principal Data Platform Architect.
Design and operate scalable observability and telemetry platforms that transform chip-build telemetry into actionable insights for NVIDIA's semiconductor engineering teams.
Sweed is seeking an experienced Head of DevOps to lead the DevOps organization, build scalable AWS infrastructure, and improve CI/CD, monitoring, and reliability for its enterprise cannabis retail platform.
Experienced platform engineer needed to design, operate, and automate large-scale observability stacks (Prometheus/Elasticsearch) and CI/CD pipelines for Motorola Solutions' cloud platform.
Build and maintain next-generation observability and telemetry platforms at NVIDIA to improve chip development efficiency and operational insight.
Sequen AI seeks a Staff Software Engineer (Infrastructure) to own and scale high‑performance cloud and ML infrastructure supporting training, research, and serving of frontier ranking models.
Experienced Kubernetes administrator needed to lead cluster design, GKE migrations, security hardening, and mentor a growing infrastructure team at Calix.
Senior cloud engineering leader to oversee AWS-based platform, SRE, and systems teams, driving FinOps, observability, and large-scale infrastructure modernization in a remote-first setting.
Iru is hiring a Principal Infrastructure Engineer in Coral Gables to design and scale AWS infrastructure, drive IaC and CI/CD automation, and strengthen platform security and observability.
Lead a remote software team at dLocal to design, implement and operate scalable, mission-critical payment applications for global enterprise customers.
Broadcom seeks a Senior Software Engineer to architect, deploy, and operate production-grade ML and LLM systems for Tanzu Intelligent Assist, ensuring scalability, security, and operational excellence.
Adaptable Intelligence is looking for a hands-on Senior Infrastructure Engineer to architect and operate scalable, secure cloud and GPU infrastructure from 0→1.
Senior Software Engineer needed to architect observability and cloud efficiency solutions across OpenShift-based CI/CD pipelines while leading technical teams and automation efforts.
Experienced SRE leader needed to architect, automate, and operate cloud-native infrastructure to deliver reliable, scalable services across regulated environments.
Experienced software engineer with strong DevOps and Linux skills needed to develop complex analytics and cloud/containerized systems in support of defense and intelligence customers.
Experienced senior engineer needed to lead QA automation and performance testing for cloud-native enterprise applications at a global analytics software company with HQ in Bozeman, MT.
OnePay seeks an experienced Site Reliability Engineer to improve platform reliability and observability for a high-scale consumer fintech platform serving millions of users.
Hoist the mainsail with Ivo as an Infrastructure Engineer shaping a secure, scalable multi-tenant platform for LLM-powered legal products.
Zoox is hiring an Infrastructure Platform Engineer Intern to help automate infrastructure workflows and build monitoring and metrics systems across on-prem and cloud platforms.
Help design and implement the next-generation machine learning platform at Samsung Ads to accelerate model development, deployment, and serving at massive scale.
Experienced engineering leader sought to manage and grow an SRE team that ensures reliability, scalability, and operational excellence for cloud-native production systems.
Fieldguide is hiring an experienced Infrastructure Platform Engineer to build and operate secure, scalable AWS infrastructure, automate environments with Terraform, and drive reliability and compliance across the platform.
Senior SRE leader needed to shape reliability practices, mentor engineers, and deliver resilient, scalable cloud infrastructure for a high‑throughput fintech platform.
Lead and grow the engineering team responsible for NMC²’s bare-metal Kubernetes platform, driving architecture, automation, and performance for large-scale GPU/CPU ML and HPC clusters.
Valinor is looking for an Infrastructure & Security Engineer to design, operate, and secure CI/CD pipelines and cloud/edge infrastructure for defense-focused products across its portfolio.
Lead Senior Software Engineer to design and operate scalable platform infrastructure and CI/CD tooling for Cars Commerce’s Marketplace Platform in a fully remote role.
Experienced engineering leader needed to drive architecture, mentor engineers, and oversee delivery of secure, scalable systems using Rust, C, cloud infrastructure, and modern CI/CD tooling.
At Poolside, this role will build and ship large-scale search, retrieval and agentic tooling to connect models with the right data and improve real-world AI performance.
Lead Peacock's SRE and DevSecOps efforts as Manager, guiding cloud architecture and engineering teams to deliver secure, scalable streaming services for millions of users.
Redwood Materials seeks a hands-on Software Engineer to architect and implement the Site Controller software for distributed second-life battery energy storage systems.
Senior Software Engineer - Reliability (remote, CA) to help build foundational SRE practices, observability, and infrastructure automation for secure, compliant cloud production systems.
Build and operate robust ML training and SaaS infrastructure at Basis, scaling GPU clusters, cloud services, and developer workflows to support cutting-edge research and commercial products.
Lead the design and operation of scalable Kubernetes and cloud-native infrastructure at Green Dot, driving CI/CD, observability, and team growth in a fully remote U.S. role.
Visa is hiring a Staff Software Engineer in Highlands Ranch to lead observability, automation, and scalable cloud-native system design for global payment platforms.
Experienced cloud-native engineer needed to lead design and automation of scalable Kubernetes platforms across AWS and OCI, driving reliability, cost optimization, and developer experience.
Visa is hiring a Sr. Systems Engineer to build and automate secure, scalable public cloud and Kubernetes infrastructure supporting its global payments platform.
Lead the design and build-out of network operations and reliability for Fluidstack's distributed datacenter fabric, owning Tier 2+ incident response, observability, automation, and team development.
Panorama seeks a Senior Software Engineer (Platform/SRE) to lead and improve AWS infrastructure, Kubernetes operations, and CI/CD practices for a growing production platform.
TLA is seeking an experienced AWS DevOps Engineer to build and operate secure, automated AWS infrastructure and CI/CD pipelines for mission-focused systems.
Keeper Security is hiring a hands-on Senior DevOps Program Manager to drive large-scale cloud, automation, and compliance programs across engineering, security, and operations.
Flock Safety seeks a Senior Backend Engineer skilled in Go and PostgreSQL to architect and deliver high-performance, reliable microservices for our nationwide safety platform.
Lead the design and operation of production-grade infrastructure at Decagon to deliver low-latency, highly available systems that power conversational AI at scale.
Lead efforts to improve reliability and performance of Alpaca's streaming infrastructure (RabbitMQ/Redpanda) as a Staff Site Reliability Engineer on a remote, North-America-based team.
Lead and mentor a high-performing DevOps team to deliver secure, reliable cloud and monitoring solutions for Kaseya’s Remote Monitoring & Management product.
Visa is hiring Software Engineer interns to tackle real-world projects in payments technology—building prototypes, automation, and scalable services while gaining mentorship and professional development.
Below 50k*
0
|
50k-100k*
4
|
Over 100k*
80
|