Cruose's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.
Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.
Overview
Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.
Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.
We are building Crusoe’s next-generation cloud orchestration platform, centered on Kubernetes at scale. As a Senior Software Engineer on the Managed Orchestration team, you’ll design and deliver features that power Crusoe’s managed Kubernetes service, enabling high-performance workloads on CPUs and GPUs in distributed environments.
This role requires deep technical expertise in distributed systems, Kubernetes internals, and modern cloud-native architectures. You’ll work closely with teams across GPUs, Networking, and Storage to build a reliable, scalable, and secure orchestration layer for customers running mission-critical workloads.
Architect, build, and operate features for Crusoe’s Managed Kubernetes platform (control plane, autoscaling, cluster lifecycle, upgrades, multi-tenancy).
Integrate and optimize GPU workloads within Kubernetes clusters, including device plugins, GPU operators, scheduling, and monitoring.
Enhance container networking through advanced CNI integration (Cilium, Calico, Multus) and support for high-performance networking (InfiniBand, RoCE).
Improve reliability and resilience of Kubernetes clusters, including HA control planes, node lifecycle management, and self-healing mechanisms.
Contribute to open-source and internal tooling that enhances observability, automation, and cluster security.
Participate in design reviews, provide mentorship to engineers, and help set long-term technical direction.
Troubleshoot complex distributed systems problems spanning containers, GPUs, and networking.
5+ years of software engineering experience in distributed systems, cloud, or infrastructure.
Deep understanding of Kubernetes internals (control plane, scheduling, operators, controllers, API machinery).
Strong proficiency in Go (preferred) or similar languages (Rust, C++, Python for systems work).
Experience with container networking (CNI plugins, service mesh, load balancing) and Linux networking fundamentals.
Exposure to GPU workloads in Kubernetes (device plugins, GPU operators, scheduling, autoscaling).
Familiarity with cloud platforms (AWS, GCP, or Azure) and infrastructure automation (Terraform, Helm, GitOps).
Strong debugging and performance optimization skills for distributed systems.
Passion for building reliable, developer-friendly platforms that abstract complexity for customers.
Familiarity with NVIDIA and AMD GPUs, device plugins, and operators for GPU lifecycle management.
Knowledge of network operators and CNI implementations (Cilium, Calico, Multus).
Experience with high-performance networking technologies (InfiniBand, RoCE).
Contributions to Kubernetes SIGs, CNCF projects, or related open-source communities.
Experience with Slurm, MPI, or HPC-style job schedulers.
Familiarity with service meshes (Istio, Linkerd) and multi-cluster networking.
Background in security for containers, GPUs, and Kubernetes (PodSecurity, RBAC, runtime scanning).
Compensation Range:
Compensation will be paid in the range of $166,000 - $204,000. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lead the technical direction and implementation of Crusoe’s managed Kubernetes platform to enable scalable, secure GPU-accelerated AI workloads across high-performance networking environments.
Lead the design, commissioning, and optimization of BMS/EPMS controls for Crusoe’s data centers to improve energy efficiency and operational reliability.
Technical role focused on packaging, troubleshooting, monitoring, and evolving Linux-based digital menu board devices for a remote-first retail technology team.
Scale is seeking a frontend-focused full-stack Forward Deployed Engineer to build AI-powered enterprise web applications and ship production features while working closely with technical customers in San Francisco or New York.
Lead the architecture and scaling of GoodLeap's payments platform, building secure, high-performance backend systems and mentoring engineers to deliver production-grade payment experiences.
OpenSesame seeks an Associate Engineering Manager to lead a remote engineering team building IAM solutions, balancing delivery, technical guidance, and team development.
Lead frontend development at Aiwyn to build a collaborative, AI-enabled tax workbench that makes complex tax workflows intuitive and fast for CPA teams.
Join Pattern’s creative engineering team to build AI-powered tools and workflow solutions that accelerate content creation for global ecommerce clients.
CloudZero is hiring a Spring 2026 Engineering Co-op for its Insights team in Boston to work on backend, serverless systems that deliver cloud cost insights and optimization features to real customers.
Build scalable backend services and APIs at doola to power seamless business formation and compliance workflows using Java, Node.js, and AWS.
Equifax is hiring a Security Software Engineer to convert Java crypto libraries to C, build a strong FFI foundation for modern languages, and lead secure, scalable cryptographic engineering.
Lead a small engineering team at Pattern to design and build AI-infused creative tooling that accelerates content production and insight-driven creative decisions.
GovDash seeks a full-stack Software Engineer to build and scale Dash, an agent-based AI orchestration platform that automates workflows for government contractors.
Early-stage, Cherry-backed AI fintech startup is hiring founding engineers to build a modern TypeScript/Next.js stack and agentic LLM pipelines to transform B2B payments.
Adtalem Global Education is hiring a Software Engineer (Web Technologies) to design, implement, and maintain scalable, cloud-ready web applications for their hybrid Columbia, MD engineering team.
We’re on a mission to align the future of computation with the future of the climate.
32 jobs