Browse 161 exciting jobs hiring in Grafana now. Check out companies hiring such as SearchStax, Rula, NVIDIA in Glendale, Buffalo, Raleigh.
Experienced Staff AWS SRE to lead scalability, automation, and reliability across a rapidly growing cloud platform serving enterprise search workloads.
Rula is hiring a Staff SRE & DevOps Engineer to drive observability, reliability, and SRE practices across its remote engineering teams.
Lead NVIDIA's multi-cloud GPU capacity strategy by combining analytics, automation, and cross-functional leadership to optimize compute resources for AI and HPC workloads.
Intuitive is hiring a Cloud Operations Engineer to design and operate automated, secure cloud-native infrastructure and CI/CD pipelines that support mission-critical healthcare systems.
Software Engineering Intern role at Aptiv's ASUX GPO in Troy to develop C/C++ and Python features for in-vehicle demo systems and work on networking, cloud integration, and UX-focused automotive projects.
ServiceNow is hiring a Staff DevOps Engineer to operate, automate, and support mission-critical Hadoop and Big Data infrastructure for its US Federal cloud environment.
Docker is hiring a Staff Backend Engineer to advance secure policy and supply chain protections across its registry and SaaS platform while working remotely across US/Canada Eastern or European CET/Western regions.
AHEAD is hiring an Observability Engineer to design and operate scalable monitoring, logging, and tracing solutions across cloud-native and hybrid environments.
Apply your backend engineering experience to design and run automated chaos and reliability experiments that harden Camunda's distributed platform in a fully remote, async-first environment.
ServiceNow is hiring a Staff Storage Engineer to build, automate, and operate large-scale storage infrastructure and ensure capacity, performance, and reliability for cloud services.
Maintain and troubleshoot TensorWave's bare-metal Kubernetes clusters to ensure reliable, high-performance infrastructure for cutting-edge AI workloads.
Customer-focused Technical Support Engineer needed to diagnose and resolve SaaS issues, work closely with engineering, and help improve product reliability at Base44 (part of Wix).
Senior Systems Software Engineer to design and maintain observability and reliability systems for large-scale cloud services at NVIDIA.
Lead platform reliability and performance initiatives at 0x by owning cloud and Kubernetes infrastructure, observability, CI/CD, and automation to support high-throughput, low-latency web3 products.
Lead the design and implementation of orchestration layers that coordinate LLMs, agents, retrieval pipelines, and human-in-the-loop workflows for a mission-driven healthcare AI platform.
Lead a Platform Engineering team to design, operate, and scale core cloud services and integrations that power Canopy’s IoT and security products.
Work with Experian's Financial Services Division to design, build and operate real-time, high-throughput API platforms using Python and cloud-native technologies.
Arista Networks is hiring a Senior Site Reliability Engineer to design, operate, and scale the CloudVision SaaS platform running on Kubernetes across global regions.
Comcast seeks a backend software engineer to help build and scale the entertainment metadata API platform powering TV, web, and mobile experiences.
Be part of NVIDIA’s performance engineering team to architect, tune, and validate large-scale GPU-accelerated systems and workflows for AI and datacenter workloads.
Help build and automate customer-facing tools and production observability at a fast-growing outage intelligence SaaS company trusted by major enterprises.
Help build and operate Tempo's production infrastructure—Kubernetes, bare metal, observability, and tooling—to enable fast, secure shipping of blockchain payment systems.
Work on the Mailgun infrastructure team to design, automate, and operate cloud-native systems that deliver reliable, large-scale email services to global customers.
The Aspen Group is seeking a Senior SRE to design AI-driven observability, automate incident response, and scale resilient cloud infrastructure for its national healthcare platforms.
Experienced full stack developer needed to design and maintain high-availability, event-driven systems for U-Haul's Rates and Distribution platform using .NET, C#, Kafka, and modern front-end technologies.
Senior Product Designer needed to lead UX for Grafana Cloud’s cost management and billing experience, creating clear, actionable, and role-driven interfaces for technical and finance audiences.
Flock Safety is hiring a Lead Aviation Support Engineer to own Tier 3 support, drive cross-functional issue resolution, and ensure reliable remote operations of its drone fleet.
Senior Cloud Platform Engineer to design, automate, and operate AWS-based platform services, developer tooling, and CI/CD pipelines for a high-growth professional services fintech.
Lead and mentor a Customer Success Engineering team at Logz.io to drive presales wins, ensure postsale value realization, and act as the technical bridge between customers and internal teams.
Join Unstructured as a Public Sector SRE to architect and operate compliant, high-assurance cloud infrastructure that powers AI workloads for federal customers.
Experienced Linux and virtualization engineer needed to maintain, automate, and support Visa's large-scale, secure infrastructure while collaborating with platform and engineering teams to meet operational SLAs.
Wyetech is hiring a Database Engineer 2 to design and optimize PostgreSQL databases and implement HA, replication, and automation for classified federal programs requiring TS/SCI clearance.
Lead Flow's platform and infrastructure efforts as a Platform Engineering Manager, driving reliability, automation, and developer velocity across cloud-native systems.
NVIDIA seeks a hands-on Solutions Architect to advise DGX Cloud partners, accelerate production AI adoption, and scale best practices for high-performance GPU infrastructure.
Greenlight needs a Senior Production Operations Engineer to lead SRE practices, automation, and infrastructure reliability for its high-scale fintech platform.
Experienced SRE/Software Engineer III needed to support production reliability, automate systems at scale, and collaborate with development teams at LexisNexis Risk Solutions in Alpharetta, GA.
Morgan Stanley is hiring a VP-level Senior AI Platform Engineer to architect and implement a firmwide Generative AI platform using Python, Kubernetes/OpenShift, and large-scale data and API ecosystems.
Talworx is recruiting a Compute Hardware L2 engineer to perform server and network hardware administration, troubleshooting, and incident resolution for a major IT services organization based in Spring, Texas.
Wyetech seeks a seasoned Software Engineer 2 to drive DevOps engineering, backend Java development, and platform automation for federal programs requiring active TS/SCI clearance.
NVIDIA's DGX Cloud team is hiring a Senior Site Reliability Engineer to operate and scale GPU-accelerated Kubernetes clusters across major cloud providers while driving reliability, observability, and performance.
Senior Developer to lead cloud modernization and AI-enabled document management initiatives for Expeditors' enterprise logistics systems in the Seattle area.
Platform Engineer needed to design, implement, and scale secure cloud infrastructure and CI/CD pipelines for PermitFlow’s AI-driven pre-construction platform in a hybrid NYC role.
Chronosphere is hiring a Senior Sales Advisor to accelerate adoption of Chronosphere Logs by leading technical evaluations, designing solutions, and translating customer insights into product impact.
OpenAI is hiring a senior Product Operations Manager in San Francisco to lead cross-functional coordination and change management for Integrity platform rollouts and policy enforcement programs.
Lead the architecture and operationalization of a secure, scalable multi-cloud data platform and FinOps governance stack across AWS, GCP, Azure, and on-prem environments for a Fortune 500 enterprise.
Senior Infrastructure Engineer needed to manage and scale hybrid on-prem/cloud infrastructure, optimize performance, and streamline CI/CD for a leading quantum computing company in Boulder, CO.
Halcyon, a remote-native adaptive security platform dedicated to stopping ransomware, is hiring a Senior DevOps Engineer to own and scale AWS infrastructure, CI/CD, and observability.
Technical lead role managing a small DevOps team to design, automate, and scale Azure infrastructure and CI/CD for a fast-growing pet-health and e-commerce portfolio.
Experienced SRE/Test Engineer needed to own post-production monitoring, incident response, and business-requirements-driven test automation for PHIL’s prescription management platform.
Be the on-site process engineering lead for AMP’s Commerce City single-stream facility, owning commissioning, operator training, and reliability to meet throughput, recovery, and uptime targets.
Below 50k*
0
|
50k-100k*
5
|
Over 100k*
151
|