Browse 108 exciting jobs hiring in Reliability Engineer now. Check out companies hiring such as Rula, Runloop, LLNL in San Antonio, Gilbert, Cincinnati.
Lead technical design and delivery for Rula’s Patient Outcomes efforts, building reliable systems that improve patient engagement and capture outcomes data.
Runloop is hiring a Site Reliability Engineer to ensure the reliability, scalability, and security of our sandbox platform powering AI development.
Lawrence Livermore National Laboratory is hiring Nuclear Systems Reliability Engineers to develop and implement predictive and reliability-centered maintenance, perform RAMI analyses, and support mechanical system health for mission-critical nuclear facility equipment.
Experienced Staff AWS SRE to lead scalability, automation, and reliability across a rapidly growing cloud platform serving enterprise search workloads.
Peraton is hiring a Cloud Reliability Systems Engineer to provide 24x7 on-site monitoring, troubleshooting, and incident response for a multi-tenant DoD cloud environment at Chantilly, VA.
Experienced electrical/project engineer needed to lead automation, high-voltage distribution and reliability efforts at a heavy industrial metals mill.
Milwaukee Tool is hiring a Reliability Engineer II to lead reliability testing, life-data analysis, and quality qualification for new carpentry power tool products.
Commify is looking for a Site Reliability Engineer to drive operational excellence and reliability for its Azure-based, high-throughput messaging platforms through automation, monitoring and infrastructure-as-code.
Apply your backend engineering experience to design and run automated chaos and reliability experiments that harden Camunda's distributed platform in a fully remote, async-first environment.
Experienced Platform Engineer needed to optimize Whatnot's Python backend, scale storage and core platform services, and improve reliability across a growing live-commerce marketplace.
Join EchoTwin AI as a Prognostics Engineer to develop predictive maintenance and prognostic systems that keep smart city assets healthy and resilient.
Experienced Site Reliability Engineer to lead reliability, observability, automation, and DoD-aligned cybersecurity for mission-critical systems in a remote role.
Early-career PhD engineer needed to develop future-generation 3D NAND device architectures and process approaches at SanDisk's Milpitas technology development team.
Astronomer is hiring a Customer Reliability Engineer (Infrastructure) to own the reliability of Kubernetes and cloud infrastructure for its managed Airflow service and directly support enterprise customers.
Lead the design and implementation of orchestration layers that coordinate LLMs, agents, retrieval pipelines, and human-in-the-loop workflows for a mission-driven healthcare AI platform.
Experienced Site Reliability Engineer needed to support and improve mission-critical systems, drive CI/CD and Kubernetes-based scalability, and engage with government customers under an active TS/SCI clearance.
Experienced Reliability Engineer needed at Firestone Polymers' Lake Charles plant to drive maintenance strategies, root-cause investigations, and reliability improvements for polymer manufacturing equipment.
Work on the Mailgun infrastructure team to design, automate, and operate cloud-native systems that deliver reliable, large-scale email services to global customers.
Lead the design and delivery of complex SMART on FHIR and HL7v2 integrations and core platform features for an early-stage healthtech startup using TypeScript and AWS.
The Aspen Group is seeking a Senior SRE to design AI-driven observability, automate incident response, and scale resilient cloud infrastructure for its national healthcare platforms.
Join Unstructured as a Public Sector SRE to architect and operate compliant, high-assurance cloud infrastructure that powers AI workloads for federal customers.
Axon is hiring a Senior Site Reliability Engineer to build and operate cloud-native platform tooling that improves reliability, automation, and developer self-service for mission-critical services.
Greenlight needs a Senior Production Operations Engineer to lead SRE practices, automation, and infrastructure reliability for its high-scale fintech platform.
Experienced SRE/Software Engineer III needed to support production reliability, automate systems at scale, and collaborate with development teams at LexisNexis Risk Solutions in Alpharetta, GA.
NVIDIA's DGX Cloud team is hiring a Senior Site Reliability Engineer to operate and scale GPU-accelerated Kubernetes clusters across major cloud providers while driving reliability, observability, and performance.
Lead vehicle-level risk for Starship and Super Heavy by owning risk tracking, supporting flight readiness, and providing on-console authority during test and launch operations.
EAG Laboratories is hiring a Reliability Systems Engineer to maintain and deliver burn-in systems, perform diagnostics and repairs, and support semiconductor reliability qualification at our Santa Clara lab.
Lead Cantina's cloud infrastructure as the sole Senior DevOps Engineer, building scalable, secure AWS systems with Terraform and automated CI/CD pipelines.
AirGarage is hiring a Software Engineer to lead reliability and observability for its nationwide IoT device fleet, bridging embedded systems, backend services, and SRE practices to keep devices online and performant.
HomeVision is hiring a US-based Site Reliability Engineer to build and maintain cloud infrastructure, platform tooling, and observability for a fast-growing real-estate valuation SaaS.
Senior Infrastructure Engineer needed to manage and scale hybrid on-prem/cloud infrastructure, optimize performance, and streamline CI/CD for a leading quantum computing company in Boulder, CO.
Senior Site Reliability Engineer to design, operate, and scale Kraken's data platform—streaming, lakehouse, CI/CD and security—across cloud and hybrid environments.
Antimetal is hiring a Systems Engineer to build and operate low-level, high-performance systems and scalable infrastructure that power our observability platform.
Be the on-site process engineering lead for AMP’s Commerce City single-stream facility, owning commissioning, operator training, and reliability to meet throughput, recovery, and uptime targets.
ASG is hiring a mid-level Systems Engineer to lead requirements engineering, systems integration, and analysis supporting GEOINT missions for NSG/ASG and federal customers.
Qualdoc Staffing is seeking a Maintenance Reliability Engineer in Petersburg, VA to implement RCM/PM strategies, perform RCFA/FMEA, and improve equipment uptime in a heavy industrial manufacturing environment.
Astronomer seeks a Customer Reliability Engineer (Infrastructure) to operate and optimize cloud-native data platforms across AWS, Azure, and GCP while partnering directly with customers and engineering teams.
Lead observability and SRE initiatives to build scalable, reliable platform tooling and monitoring for Teraswitch’s global infrastructure.
NVIDIA is hiring a Senior Site Reliability Engineer to champion reliability, automation, and observability for large-scale AI infrastructure across global cloud environments.
Visa is looking for a Senior Site Reliability Engineer – Sr. Consultant to lead cloud migrations, automation, container-based reliability, and GenAI-driven operational improvements for mission-critical payment platforms.
Wellfit is hiring a Site Reliability Engineer to optimize Azure App Services, implement robust observability, and automate reliability at scale for a fast-growing dental fintech.
Help build and operate the reliable, end‑to‑end transactional systems that power payroll, benefits, and compliance automation at Central.
Crusoe Cloud is hiring a Principal Software Engineer to lead architecture decisions and scale a carbon-reducing cloud platform for AI workloads while mentoring engineering teams.
Senior Software Engineer to lead and deliver Supplier Quality applications, driving technical strategy, development, and production support for Toyota North America.
Experienced field engineer needed to diagnose, install, and maintain rotating machinery monitoring and protection systems for Bently Nevada, working extensively on customer sites across Louisiana and beyond.
Lead reliability and automation initiatives for Grindr’s production systems as a Staff Site Reliability Engineer on a hybrid Chicago-based team.
Lead reliability testing and failure analysis for next-generation wireless optical communication hardware, driving qualification and design improvements across semiconductor, photonic and system-level components.
SpaceX seeks a Sr. Electrical Design Engineer (Avionics) to lead avionics hardware design, validation, and reliability efforts for flight systems based in Hawthorne, CA.
Pythian is hiring a Team Lead, Site Reliability Engineering to lead a distributed SRE team responsible for designing, automating, and operating resilient cloud and AI/ML infrastructure.
Lightmatter is hiring a Reliability Engineer (Hardware) to design, run, and analyze reliability qualification tests for its photonics-based AI platform and to drive root-cause resolution across product and manufacturing teams.
Below 50k*
0
|
50k-100k*
5
|
Over 100k*
5
|