Browse 184 exciting Grafana jobs hiring now. Check out companies hiring, such as Elsevier, Anyscale, and Netlify, in Chattanooga, Sacramento, and Los Angeles.
Experienced SRE leader needed to manage multiple teams and advance cloud reliability, automation, observability, and security for LexisNexis Risk Solutions.
Help ensure production-grade reliability for Anyscale's distributed ML platform by building test automation, simulation, and observability tooling for Ray-based workloads.
Netlify is hiring a Staff Data Pipeline Engineer to design and operate Kafka-based streaming pipelines and modernize the data infrastructure that powers usage-based billing and real-time analytics.
Work remotely on the team that operates and stabilizes detection content releases—managing deployments, runtime telemetry, first-level triage, and release communications for CrowdStrike's detection platform.
NXTKey is hiring a Backend BFF Developer to build performant Java-based backend services that power Angular user experiences and operate in containerized, CI/CD-driven environments.
Work on the core Observability Agent at NetBox Labs to build scalable, low-level network telemetry and discovery features used by both open-source and commercial products.
TheLoops is hiring a Senior Backend Software Engineer to build high-performance Java/Kafka-based distributed systems that power its enterprise AI Agent platform.
Nordstrom seeks an entry-level Platform Engineer to support and improve its Kubernetes compute platform using Terraform, GitOps, and modern observability and CI/CD practices.
Lead product marketing and GTM programs at Grafana Labs to clearly articulate the value of our open-source observability products to technical buyers and sellers.
Dexterity seeks a Senior Robotics Engineer to develop and productionize motion planning, control, and scalable robotic systems for warehouse automation.
Vultr is hiring a Junior Test Automation Engineer to develop and maintain UI and API test automation for its cloud infrastructure platform.
Lead the design and automation of Linea's cloud-native infrastructure as a Senior DevOps Engineer at Consensys, focusing on AWS, Kubernetes, Terraform, and observability to support a fast-moving Layer-2 blockchain.
Boeing is hiring a Cloud Engineer to implement and operate CI/CD, IaC, secrets management, and observability tooling for AWS applications in a regulated, hybrid environment.
Hardware Test Engineering Co-op (Software) at Shield AI to design and implement Python test automation and data pipelines for aerospace hardware validation.
BETA Technologies is hiring a Flight Data Analytics Engineer in South Burlington to build and maintain Python/SQL data pipelines, models, and dashboards that deliver accurate, actionable flight data to engineering and commercial stakeholders.
Lead the architecture and delivery of Visa's enterprise log management solution as a Senior Staff Software Engineer specializing in Java and Go on the Observability team.
Onebrief is hiring a Senior Site Reliability Engineer to own reliability, observability, and secure operations for on-prem and cloud military deployments in Colorado Springs.
Lead Visa's Site Reliability Engineering efforts to deliver highly available, secure, cloud-native application platforms while driving automation and operational excellence.
Visa is hiring a senior-level Systems Engineer to manage and automate virtualization and Linux infrastructure, ensuring 24x7 availability and operational excellence across enterprise environments.
Sierra is hiring a seasoned Site Reliability Engineer to own observability, scalability, and secure cloud infrastructure for its AI platform in San Francisco.
Lead GFiber's Network Reliability Engineering organization to define reliability strategy, run tier-2 incident response, and drive observability and automation across metro networks.
Canary seeks an experienced Lead Site Reliability Engineer to drive incident response, SLO frameworks, and platform reliability across its remote engineering organization.
NetBox Labs is hiring a Senior DevOps Engineer to own infrastructure automation, CI/CD, and observability for their SaaS and self-managed products in a fast-paced, product-focused environment.
Senior Software Engineer (Hardware Test) to build Python-driven test frameworks, drivers, and scalable test infrastructure that validate aircraft components and increase hardware reliability.
Contribute to aerospace hardware validation as an entry-level Software Engineer writing Python drivers, automated test frameworks, and CI/CD for scalable test stands at Shield AI.
Senior-level SRE role focused on automating infrastructure and security controls, maintaining observability and SLOs, and improving reliability across Sonar’s global platform.
Lead the design and implementation of large-scale observability systems for GPU-powered AI and HPC workloads at NVIDIA's MARS team, enabling telemetry, analytics, and intelligent monitoring across world-class GPU infrastructure.
Lead development of secure, enterprise-grade developer tooling and integrations at Coder, focusing on Go-based distributed systems, IDE integrations, and AI-enabled developer experiences.
Cartesia is hiring a Cluster Infrastructure Engineer in San Francisco to build and operate large-scale GPU clusters and automation that power state-of-the-art multimodal model training and inference.
Provide Linux systems engineering and device-management expertise to maintain and enhance remote in-store digital menu board platforms.
Senior DevOps Engineer needed to design and operate cloud-native, Kubernetes-based infrastructure and CI/CD pipelines for NBCUniversal's local media and broadcast workflows.
Cape is hiring a Site Reliability Engineer to build and operate privacy-focused telecommunications infrastructure, improve system reliability and monitoring, and own FedRAMP accreditation for a fast-growing, mission-driven startup.
Exegy’s Managed Services Engineering team is looking for a hands-on DevOps Engineer to build automation, CI/CD, and observability for high-performance market data systems.
Senior DevOps Engineer role supporting AKS-based production systems, CI/CD automation, cost optimization, and 24x7 incident response for a mature marine transportation company in New Orleans.
Work as a DevOps Engineer supporting cloud, on-prem, and containerized platforms to automate CI/CD, optimize platform performance, and improve operational reliability in a remote-first US role.
As a Senior Site Reliability Engineer for a high-growth platform, you will design and operate large-scale AWS infrastructure, build automation and observability, and partner with engineering teams to improve reliability and deployment velocity.
Poolside seeks an experienced Solutions Architect to help enterprise customers deploy and operate its scalable AI platform across cloud and hybrid infrastructures.
Senior Site Reliability Engineer needed to own large-scale AWS infrastructure, automate CI/CD and observability, and drive platform reliability for a high-growth, remote-friendly US company.
Build and operate scalable backend and ML-serving infrastructure for emotionally intelligent AI systems at an early-stage, high-growth company (remote, US).
ServiceNow is hiring an AI-native Staff/Senior Staff Product Manager to lead the design and delivery of predictive, compliance-aware network observability and autonomous remediation for hyperscaler and sovereign cloud deployments.
An experienced Cloud Infrastructure Engineer is needed to architect, automate, and operate Kubernetes-based cloud platforms for a large-scale enterprise in a fully remote US role.
Architect and implement scalable backend services and ML pipelines for a remote US-based AI startup focused on emotionally intelligent, personalized experiences.
Lead architecture and full-stack feature development for secure, high-performance healthcare platforms using React, Node, and Go while working remotely with distributed teams.
Lead the strategy and execution for enterprise-scale performance telemetry at ServiceNow to improve visibility, reliability, and operational decision-making across the Now Platform.
Experienced backend engineer needed to design and operate high-scale Java/Spring microservices and event-driven systems powering VGS’s credential management and payment tokenization platforms.
Lead the architecture and engineering of next-generation LLM-driven agentic workflows for enterprise observability within ServiceNow's Global Cloud Services team.
Kemper is hiring a Senior Java Full Stack Developer to design and deliver microservices-based solutions and lead development efforts across cloud and web applications.
SailPoint seeks a Software Engineer II to design, implement, and operate scalable microservices that ensure identity and account integrity across its Identity Security Cloud.
Senior-level monitoring analyst to architect and operate observability and log-management solutions (Splunk and related tools) that keep a high-volume global payments platform running 24x7.
Experienced HPC Support Engineer needed to troubleshoot GPU/HPC clusters, mentor peers, and deliver high-quality customer support for Lambda’s deep learning cloud.
Below 50k*: 0 | 50k-100k*: 9 | Over 100k*: 184