Browse 80 exciting jobs hiring in Ai Observability now. Check out companies hiring such as Fiddler AI, Hatch, DimRed in Sioux Falls, Orlando, Shreveport.
Fiddler is hiring a Senior Solutions Engineer to own technical wins and deliver compelling PoCs that demonstrate the business value of its AI observability platform to enterprise customers.
Lead the design and scaling of Hatch’s communications platform as a Staff Software Engineer (Elixir), shaping architecture and mentoring engineers to deliver reliable, high-performance user experiences.
Help build the core backend infrastructure at DimRed to power large-scale LLM evaluations, agent workflows, and secure multi-tenant production systems.
Lead the CI, Build and Packaging team at Shield AI to define strategy, own developer productivity roadmaps, and deliver scalable build and CI/CD systems that accelerate engineering velocity.
Lead the engineering of next-generation agent orchestration and AI-native tools to automate and enhance the full software delivery lifecycle at a fast-growing, remote US company.
Truelogic seeks a hands-on LLM Operations & Governance Specialist to own AI infrastructure, governance, monitoring, and adoption for a subscription-based market research client.
ServiceNow is hiring a Senior Software Engineer to build AI-powered observability UIs and lead end-to-end delivery of scalable, customer-facing cloud features.
Lead architecture and enterprise adoption of agentic AI solutions at NVIDIA, translating experimental prototypes into secure, scalable workflows that drive measurable transformation across IT and business systems.
Build scalable, reliable AI-driven systems at a fast-growing mental health startup, contributing end-to-end code and technical leadership in an on-site SoHo engineering team.
Drive the design and delivery of ServiceNow's Observability & Data platform for federal customers, building scalable distributed systems, leading engineers, and promoting engineering best practices.
Lead the development of scalable compute infrastructure for robotics foundation-model research at NVIDIA, optimizing GPU clusters, job orchestration, and observability for large training and evaluation workloads.
Lead the design and scaling of multi-agent, LLM-driven orchestration frameworks to embed Generative AI throughout the software delivery lifecycle at a remote U.S. company.
Lead architecture and enterprise adoption of scalable, secure AI agents at NVIDIA to transform IT and business workflows.
Lead and grow a distributed engineering team building production-grade Python/Django services, React front ends, and AWS data pipelines for an AI-driven fintech platform.
Lead the backend engineering efforts for Whatnot's Customer Experience Platform to build scalable systems and AI-driven automations that improve support reliability and customer satisfaction.
Design, build, and operate high-performance Go backend services from design to production as part of a small, high-ownership engineering team in New York City.
Crusoe is hiring a Senior Software Engineer to own and scale the observability stack that provides actionable telemetry for its global cloud and AI infrastructure.
Torch Dental is hiring a Senior Software Engineer to lead end-to-end AI infrastructure, ensuring secure, HIPAA-compliant, and scalable LLM workloads that power our healthcare procurement platform.
Lead the engineering and applied-LLM work to improve agent reliability, autonomy, and evaluation pipelines for a fast-moving startup building autonomous business agents.
Lead architecture and integration of production-scale enterprise AI solutions (Claude/MCP) with a focus on security, scalability, and observability for Nagarro's enterprise customers.
Lead the design and operation of large-scale data infrastructure for NVIDIA's robotics foundation-model efforts (Project GR00T) to enable multimodal training and robot learning research.
Help design and build scalable full‑stack systems and AI-enabled product features at Lightfield to power the next-generation CRM for high-growth tech companies.
Lead engineering excellence for LinkedIn's PSM and AI/product platforms, driving system health, observability, and cross-functional initiatives that improve reliability and scalability.
Lead the architecture and implementation of AI-augmented developer platform tooling at Palo Alto Networks to accelerate engineering velocity and secure cloud innovation.
Datadog is looking for a Staff Applied Scientist to design and prototype AI-native observability control planes that enable safe, cost-aware agent actions in production systems.
NBCUniversal is looking for a Principal Software Engineer to lead architecture and strategy for cloud, API, and AI-driven developer platform initiatives that scale across the enterprise.
Help build and scale the ML infrastructure that powers GTV's real-time generative video features, working closely with research and product teams in San Francisco.
A remote-friendly healthcare tech company is hiring a Staff Engineer to lead complex full-stack engineering efforts, adopt AI/automation tools, and shape long-term technical strategy.
Lead product strategy and execution for core engineering services and productivity application integration at Okta, using AI to streamline operations and boost developer productivity.
Experienced SRE with distributed systems and LLM experience needed to design and operate scalable, reliable managed AI services for a mission-driven, sustainability-focused AI infrastructure company.
Brellium is hiring a Software Engineer II to help build and scale backend and fullstack systems that power its AI medical review platform and improve patient care.
Experienced DevOps Engineer needed to design, operate, and automate cloud-native infrastructure and delivery pipelines for a fast-growing, remote-first AI marketing platform.
Lead the design and hands-on implementation of ServiceNow's next-generation multi-cloud orchestration and automation platform as a Principal Software Engineer on the Platform Infrastructure team.
Senior AI Engineer needed to architect and scale AI tooling and infrastructure that enables data science and product teams to deliver AI-driven customer and logistics experiences at Shipt.
Senior engineering role to architect and build large-scale distributed infrastructure and platforms at LinkedIn, driving technical strategy, operational excellence, and open-source collaboration.
Build and scale Node.js/TypeScript backend services at Vapi to power real-time voice AI, prioritizing reliability, performance, and security for developer and enterprise use.
ServiceNow is hiring a Senior Software Engineer to own and deliver scalable core platform capabilities, improve developer-facing APIs, and shape system architecture for high-scale distributed services.
Zocdoc is hiring an Engineering Manager to own and scale the Interop Platform — building enterprise-grade APIs, data pipelines, and observability to enable product growth and AI-enabled automation.
Sola, a fast-growing Series A AI automation startup backed by tier-1 investors and YC, is hiring a Senior Full-Stack Engineer to build polished, reliable frontend and backend systems for enterprise workflow automation in NYC.
Lead product strategy and execution for a fast-growing enterprise SaaS company focused on network observability for service providers, shaping roadmap, team, and customer outcomes.
Lead backend architecture and scale core systems at Mercor, helping power training pipelines used by the world’s top AI labs.
Work on next-generation engineer productivity platforms at NVIDIA, using LLMs and systems engineering to build scalable, AI-accelerated tools that boost chip-design productivity.
Be the technical owner for strategic Logz.io customers, driving adoption, onboarding, and value realization for our AI-powered observability platform.
Lead the design and implementation of AI-driven developer and operator tooling at Felicity, improving our DSL, self-healing automation, and LLM observability.
Lead full-stack, cloud-native engineering initiatives at ServiceNow to build scalable, reliable, and AI-aware products while mentoring teams and improving engineering practices.
Senior Back End Software Engineer to lead development of Python microservices for a consumer fintech platform, working remotely across the U.S. and driving architecture, reliability, and AI-enhanced engineering workflows.
Kandji is hiring a Staff AI Engineer to design and build scalable AI platform services and apply LLMs to create intelligent, production-ready features for device management and security.
Lead engineering for NVIDIA Mission Control to build and operate resilient, AI-enabled cluster automation for large-scale GPU and CPU infrastructure.
Join a VC-backed fintech startup as a backend engineer to design and build scalable services that power AI-driven regulatory compliance across enterprise customers.
Braintrust is hiring a Developer Advocate to grow and activate its developer community by producing technical content, engaging on forums, and representing the platform at events.
Below 50k*
0
|
50k-100k*
0
|
Over 100k*
1
|