At TensorWave, we’re leading the charge in AI compute, building a versatile cloud platform that’s driving the next generation of AI innovation. We’re focused on creating a foundation that empowers cutting-edge advancements in intelligent computing, pushing the boundaries of what’s possible in the AI landscape.
We are seeking an exceptional Kubernetes Architect to lead the design, development, and deployment of our next-generation infrastructure platform. This is a very senior-level role for someone who not only understands Kubernetes deeply but can write complex manifests, operators, and controllers from scratch, and architect resilient, secure, and performant systems that scale to millions of users.
As a technical visionary and hands-on expert, you will lead the evolution of our cloud-native architecture, including designing serverless systems on Kubernetes, integrating with CI/CD, and ensuring observability, security, and cost-efficiency across environments.
Architect and implement end-to-end Kubernetes infrastructure for large-scale, cloud-native applications.
Design and build serverless platforms on top of Kubernetes using technologies such as Knative, OpenFaaS, or KEDA.
Develop and maintain Kubernetes custom resources (CRDs), controllers, operators, and admission controllers in Go or Python.
Define multi-tenant, multi-region architecture supporting millions of users with high availability and low latency.
Lead Kubernetes cluster lifecycle management (provisioning, upgrades, scaling, monitoring, troubleshooting).
Collaborate closely with engineering teams to containerize applications, write Helm charts or Kustomize overlays, and standardize deployment practices.
Implement infrastructure as code using tools like Terraform, Pulumi, or Crossplane.
Lead efforts around observability, policy enforcement, cost optimization, and RBAC/security hardening within the cluster.
Evaluate and integrate Kubernetes ecosystem tools (e.g., Istio/Linkerd, ArgoCD, Flux, Prometheus, Grafana, OPA, etc.).
Mentor and upskill DevOps engineers and SREs in Kubernetes best practices.
8+ years of experience in cloud infrastructure, DevOps, or platform engineering roles.
4+ years of hands-on Kubernetes experience, including deep knowledge of the Kubernetes API, internals, networking, and storage.
Proficiency in writing Kubernetes manifests, Helm charts, and custom Kubernetes controllers/operators (preferably in Go).
Proven experience designing cloud-native systems that scale globally (multi-region, multi-cloud or hybrid setups).
Experience with serverless technologies (Knative, OpenFaaS, AWS Lambda, etc.) in a production environment.
Strong knowledge of cloud platforms such as AWS, GCP, or Azure.
Experience with GitOps tools (ArgoCD, Flux), service meshes, policy engines (OPA/Gatekeeper), and CI/CD pipelines.
Deep understanding of security, compliance, and resilience in containerized workloads.
Contributions to Kubernetes open-source projects or CNCF-related tooling.
Experience with service mesh design (Istio, Linkerd).
Familiarity with eBPF, Cilium, or network-level observability.
Background in building PaaS or developer platforms on top of Kubernetes.
A production-grade Kubernetes platform that can support millions of users globally, with self-healing, autoscaling, and strong observability.
Developer teams can deploy serverless applications with ease, speed, and reliability.
Infrastructure is resilient, secure, cost-optimized, and compliant.
Kubernetes practices and tooling are well-documented, standardized, and continuously improved across the company.
Stock Options
100% paid Medical, Dental, and Vision insurance
Life and Voluntary Supplemental Insurance
Short Term Disability Insurance
Flexible Spending Account
401(k)
Flexible PTO
Paid Holidays
Parental Leave
Mental Health Benefits through Spring Health
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Software Engineer with Unity and C# experience to develop and maintain Lingraphica’s AAC speech-generating devices and supporting services in a remote, mission-driven environment.
Freshworks is seeking a Principal AI Knowledge Architect to architect and lead scalable, secure RAG and agentic knowledge systems that deliver accurate, context-aware answers for enterprise AI assistants.
Lead the design and implementation of AI security, governance, and compliance for Freshworks' enterprise agentic AI platform as the Principal AI Security Architect.
Molina Healthcare is hiring a Senior Cloud Engineer to architect and operate Azure infrastructure, lead cloud migrations, and implement secure, automated IaC and DevOps practices for a mission-driven healthcare organization.
Work on Eluvio's decentralized Content Fabric as a Senior Software Engineer focused on video processing, packaging and real-time delivery across on-demand and live formats.
Lead and grow an embedded and full‑stack engineering team to deliver Reach's cutting‑edge wireless power products from prototype through volume production.
Visa is hiring a Senior Site Reliability Engineer to develop, operate, and optimize ServiceNow CMDB and ITOM capabilities supporting a global payments network.
Contribute as a contract Full Stack Engineer on a high-impact healthtech platform, building scalable Next.js front-ends and Node.js/TypeScript serverless backends to improve patient access to medications.
Help build Okta's Device Access cloud-native backend services that secure desktop logins and enable seamless identity-driven device access.
Manifest is hiring a Senior Backend Engineer - AI to architect and scale high-throughput backend infrastructure that brings machine learning and LLMs into production.
Lead system-level Android development for GlobalProtect at Palo Alto Networks, building secure mobile networking features using Kotlin, Java, JNI/C++ and deep Android internals expertise.
Nova Dynamics seeks a Full Stack Junior Software Developer to work on-site building emergency communication tools for fire departments alongside the CEO.
Northrop Grumman seeks an experienced Principal/Sr. Principal Backend Software Engineer to design and implement robust, high-performance ETL pipelines and data architectures onsite in Rancho Bernardo, CA.
Supercharge your large-scale PyTorch LLM workloads with our cloud powered by AMD MI300X
13 jobs