Hyperbolic Labs is on a mission to democratize AI by breaking down the barriers to computing power with our Open-Access AI Cloud. By making better use of idle computing resources across the globe, we offer an innovative GPU marketplace and AI inference service that promise affordability and accessibility for all. As pioneers at the intersection of AI and open-source technology, we believe in an open future where AI innovation is limited only by imagination, not by access to resources. We're looking for forward-thinking individuals who share our passion for making AI universally accessible, secure, and affordable. Join us in building a platform that empowers innovators everywhere to turn their visionary AI projects into reality.
As we prepare for growth after our Series A, backed by industry leaders, our team — led by co-founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing.
We are hiring a Head of Infrastructure to lead the design, evolution, and reliability of Hyperbolic’s globally distributed GPU cloud. This role sits at the center of our mission: you will architect and scale the systems that power our peer-to-peer GPU marketplace, inference fabric, and core platform primitives.
You’ll own the infrastructure roadmap end-to-end—from distributed systems design and resource orchestration to networking, security, and global capacity strategy. You’ll grow and mentor a world-class engineering organization, establish engineering excellence standards, and partner closely with Product, Security, Platform, and GTM leadership to translate future AI workloads into infrastructure reality.
You are an infrastructure leader with a track record of scaling complex systems, guiding high-impact teams, and making deeply technical decisions in environments where reliability and performance are existential.
Leadership & Strategic Execution
10+ years in infrastructure, systems engineering, or distributed systems, including 5+ years leading managers and senior ICs.
Proven ability to own multi-year infrastructure roadmaps, align stakeholders, and translate ambiguous requirements into crisp technical direction.
Experience building, scaling, and mentoring high-performing engineering orgs across infrastructure, platform, and SRE disciplines.
Exceptional judgment in balancing velocity with reliability, cost, and security.
Comfortable working in fast-moving, high-stakes environments where infrastructure is the product.
Technical Depth & System Design
Deep expertise in distributed systems, operating systems internals, networking, and resource orchestration.
Hands-on experience with container orchestration and workload scheduling systems (Kubernetes, Nomad, SLURM, custom schedulers) at global scale.
Strong engineering background with the ability to read and write production code (Go, Rust, Python, or similar).
Experience architecting multi-cloud + on-prem + edge topologies, including GPU-centric workloads.
Expert-level understanding of infrastructure-as-code, automation frameworks, and GitOps workflows.
Expertise in designing observability systems (metrics, tracing, logging, alerting) and building operational excellence.
Operational Excellence & Security
A track record of owning 99.9–99.99% uptime targets, incident response processes, and resilience engineering.
Passionate about security-first infrastructure, including workload isolation, network security, IAM, hardening, and compliance.
Experience leading major capacity planning, load forecasting, and cost optimization initiatives.
Bonus Experience
Contributions to open-source infra tools, kernels, schedulers, or distributed systems libraries.
Familiarity with service mesh, mTLS, RPC frameworks, or low-latency communication patterns.
High impact: your work affects the entire stack and enables all engineering teams
Ownership: you’ll own production systems and have real autonomy
Learning: exposure to new infrastructure challenges and the chance to grow
List of perks & benefits: e.g. equity, health, remote policy, hardware budget, offsites, etc.
Inclusive culture: we strive to build a diverse, supportive team
Hyperbolic is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.