Browse 61 exciting jobs hiring in Monitoring Engineer now. Check out companies hiring such as Conversica, TensorWave, Ensono in Tacoma, Laredo, Oxnard.
Lead the design and production of scalable, reliable LLM-driven systems at Conversica, shaping architecture and best practices across product and engineering.
Help shape and operate TensorWave Cloud's end-to-end observability platform, driving high-quality metrics, dashboards, and alerting to keep systems measurable, debuggable, and reliable at scale.
Ensono is hiring a seasoned Site Reliability Engineer to lead IaC, CI/CD, monitoring, and incident resolution across cloud environments while engaging directly with clients and third-party suppliers.
Senior Unix System Engineer (remote, Kansas) needed to support 600+ RHEL/CentOS servers, expand Linux in AWS, and drive infrastructure reliability through automation, monitoring, and hardening.
Join a growth-oriented SaaS partner as a Cloud Engineer to architect and operate Azure-based infrastructure and CI/CD pipelines for enterprise clients while working remotely from the US.
Forward Deployed AI Engineer at Judgment Labs — embed and operate our ABM platform inside customer production systems, own integrations, and drive reliable agent deployments in San Francisco.
iHerb is seeking an experienced Senior Data Engineer to design and operate cloud-native data platforms and MLOps pipelines that enable production AI/ML at scale.
Experienced full-stack engineer needed to lead feature development across a React frontend, Python APIs, and AWS infrastructure within a collaborative, product-focused remote team.
NBCUniversal is hiring a Senior DevOps Engineer to build and automate CI/CD and cloud infrastructure for its Playout platform, ensuring reliable media distribution at scale.
Lead the architecture and production deployment of low-latency, explainable machine learning systems that drive personalized next-best-action decisioning across digital and assisted channels at Humana.
LinkedIn is hiring a Staff Software Engineer, Reliability to improve the reliability, scalability, and operational tooling for Espresso, the company’s core distributed NoSQL data service.
Field Service Engineer role supporting hands-on testing, deployment, and troubleshooting of Plus’s autonomous truck systems across field operations.
AllTrails is hiring a Senior DevOps Engineer to drive infrastructure reliability, security, and developer tooling for its global trail-discovery platform.
Zoox is hiring a Full Stack Software Engineer to develop scalable frontend and backend systems, APIs, and automation tooling that accelerate QA and developer productivity.
Senior Full Stack Software Engineer needed to design and implement scalable Java/Spring-based services and modern front-end experiences for Visa's global payments platform.
Senior Trading Systems Engineer needed to lead and improve low-latency trading infrastructure in an onsite, high-performance trading environment in Chicago.
Bedrock Energy is looking for a Senior Reliability Engineer to own system-level reliability of coiled tubing drilling spreads, lead failure analysis, and implement maintenance and design improvements across field operations.
Lead the design and evolution of enterprise-scale logging and observability platforms, applying deep technical expertise to build resilient, automated, AI-native monitoring solutions for a major healthcare company.
Senior Server Engineer II to join Strava’s B2B team in San Francisco, building scalable backend systems for sponsored experiences and mentoring engineers across the organization.
Experienced Elastic Stack engineer wanted to configure, administer, and optimize Elasticsearch/Kibana environments for a DoD-focused ICAM program with TS/SCI requirements.
Lead performance engineering at Salesforce to design high-scale automation, optimize systems and databases, and own the resolution of complex production performance issues.
Lead architecture and development of Visa's internal PaaS to enable scalable, secure Generative AI and cloud-native services across the company.
Vanilla is hiring a Software Engineer - AI Applications to build production-ready AI features using LLMs, microservices, and data pipelines to modernize estate planning.
Build and scale real-time ML systems to detect and prevent fraud at Rain, a high-growth fintech powering global stablecoin payments.
A global healthcare technology provider is hiring a Lead Systems Engineer (remote, Illinois) to modernize and maintain critical enterprise infrastructure that directly supports patient care.
At Path Robotics, a Senior Reliability Engineer will own L2 support and drive long-term process, tooling, and documentation improvements to make fielded robotic systems more reliable and easier to operate.
Leidos is seeking an experienced Principal AI/ML Engineer to lead the secure design, deployment, and operationalization of production-grade AI/ML systems for mission-critical applications.
Experienced AI/ML Engineer needed to build and scale ML-driven automation into Candid Health’s revenue cycle management platform to improve billing efficiency and outcomes.
At Ascertain, this Product Engineer - Voice AI role will develop and optimize voice agents that automate healthcare back-office workflows, combining hands-on Python development with direct customer and operations collaboration.
Help design and operate Airbnb’s global payments platform by building scalable, reliable billing and payouts systems and leading cross-functional technical initiatives.
Experienced Database Reliability Engineer needed to manage and optimize Oracle and PostgreSQL databases, design backup/DR solutions, and automate operations for SpaceX's production systems in Hawthorne, CA.
Senior Network Engineer to own and modernize the firm’s network infrastructure (LAN/WAN/wireless) for a mission-driven Boston law firm that treats internal technology as a strategic asset.
Visa is hiring a Senior Network Engineer to modernize and automate network monitoring and configuration management by integrating vendor tools with cloud, IaC, and GenAI-driven workflows.
Lead Aledade's AI/ML and cloud security efforts by designing robust controls, guiding cross-functional teams, and operationalizing protections for model training, inference, and data pipelines.
Senior Platform Engineer to lead automation, scalability, and reliability for Jerry.ai’s cloud infrastructure and CI/CD pipelines during a rapid scale-up phase.
Motorola Solutions is seeking a Software Engineer, Linux Operations to build automation, internal tooling, and SRE-focused solutions for its Access Control platform in Culver City.
Help build and operate the core infrastructure for a fast-growing adtech startup powering AI-native products and publisher/advertiser marketplaces.
Experienced Network Engineer needed to design, implement and support RRD's enterprise network infrastructure, including routing, switching, security, wireless and load balancing.
Lead development and deployment of maintenance and reliability programs for critical Cape Canaveral facilities and equipment to minimize downtime and improve asset performance.
Arthur Grand Technologies is hiring a Senior Network Engineer in Minneapolis to operate and secure multi-site LAN/WAN infrastructure and lead network upgrades and troubleshooting efforts.
The Data Engineer will build and maintain PWBM's data infrastructure and ETL pipelines to support large-scale economic models and an evolving model automation pipeline.
Built is hiring a Senior Product Security Engineer to lead application and AI/ML security across its cloud-native platform, embedding security into the SDLC and partnering with engineering teams to reduce risk and enable safe product innovation.
Mia Labs is hiring a Technical Support Engineer to drive triage, root-cause analysis, and production reliability for its AI-powered platform supporting dealerships.
Contribute as a full‑stack Engineer at CourseStorm, using clear judgment and AI to design, refine, and maintain simple, reliable e‑commerce software for education organizations.
Senior Site Reliability Engineer needed to own operability, automation, and production reliability for Poshmark's large-scale web services.
Lead architecture and delivery of complex Azure solutions as a Principal Software Engineer advising sponsor-level clients and guiding multi-workstream projects.
Belvedere Trading is hiring a Data Integrity Engineer in Chicago to build and monitor data pipelines, implement data quality checks, and partner with traders and quants to ensure trustworthy analytics using SQL, Python, and modern data platforms.
Kiddom is hiring a Senior Software Engineer (Infrastructure) to scale DevOps and infrastructure, build APIs, and enable engineering teams to deliver reliable education software.
Lead AI Engineer (Machine Learning) to architect and ship production ML/DL systems that drive measurable outcomes across Renuity's national home-improvement operations.
Experienced Linux systems engineer needed to design, automate, and maintain cloud/virtualized infrastructure for Teleflora while collaborating with application and database teams to ensure reliability and scalability.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
1
|