Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Software Engineer Lead - Cloud Engineering image - Rise Careers
Job details

Software Engineer Lead - Cloud Engineering

The Cloud Infrastructure team at Kumo is responsible for managing and scaling our Kubernetes-based, cloud-native AI platform across multiple cloud providers. They set service level objectives, optimize resource allocation, enforce security compliance, and drive cost efficiency for the Multi-Cloud Platform.


As a key team member, you will architect and operate a highly scalable, resilient Kubernetes infrastructure to support massive Big Data and AI workloads. You’ll design and implement advanced cluster management strategies, fleet capacity scaling, optimize workload scheduling, and enhance observability at scale. Your expertise in Kubernetes internals, networking, and performance tuning will be critical in ensuring high availability and seamless scaling.


Joining early, you'll play a pivotal role in shaping platform reliability, automating infrastructure, and enabling ML engineers with efficient commit-to-production automation, Continuous Provisioning, CI/CD, ML Ops, and deployment orchestration and workflows. You'll collaborate with ML scientists, product engineers, and leadership to influence scaling strategies, develop self-service tooling, and drive multi-cloud resilience. Engineers at Kumo take ownership of core system design, building infrastructure that powers the next generation of AI applications.


Key Responsibilities
  • Design, build, and scale Kubernetes-based infrastructure to support Kumo’s multi-cloud AI platform, ensuring high availability, resilience, and performance.
  • Architect and optimize large-scale Kubernetes clusters, improving scheduling, networking (CNI), and workload orchestration for production environments.
  • Develop and extend Kubernetes controllers and operators to automate cluster management, lifecycle operations, and scaling strategies.
  • Enhance observability, diagnostics, and monitoring by building tools for real-time cluster health tracking, alerting, and performance tuning.
  • Lead efforts to automate fleet management, optimizing node pools, autoscaling, and multi-cluster deployments across AWS, GCP, and Azure.
  • Define and implement Kubernetes security policies, RBAC models, and best practices to ensure compliance and platform integrity.
  • Collaborate with ML engineers and platform teams to optimize Kubernetes for machine learning workloads, ensuring seamless resource allocation for AI/ML models.
  • Drive commit-to-production automation, cloud connectivity, and deployment orchestration, ensuring seamless application rollouts, zero-downtime upgrades, and global infrastructure reliability.


Required Skills and Experience
  • Kubernetes Mastery: 8-10+ years of experience managing large-scale Kubernetes clusters (EKS, GKE, AKS, or OpenSource) in production. Deep expertise in Kubernetes internals, including controllers, operators, scheduling, networking (CNI), and security policies.
  • Cloud-Native Infrastructure: 8-10+ years of experience building cloud-native Kubernetes-based infrastructure across AWS, Azure, and GCP.
  • Platform Engineering: 8-10+ years of experience building Kubernetes service meshes (Istio/Envoy, Traefik), networking policies (Calico/Tigera), and distributed ingress/egress control.
  • Fleet Management & Scaling: Proven experience in optimizing, scaling, and maintaining Kubernetes clusters across multi-cloud environments, ensuring high availability and performance.
  • Software Development: 8-10+ years of experience writing production-grade controllers and operators in Python, Go, or Rust to extend Kubernetes functionality.
  • Infrastructure-as-Code & Automation: Hands-on experience with Terraform, CloudFormation, Ansible, BASH and Make scripting to automate Kubernetes cluster provisioning and management.
  • Distributed Systems & SaaS: Expertise in building and operating large-scale distributed systems for cloud-native B2B SaaS applications running on Kubernetes.
  • Cloud Application Deployment: Deep expertise in building of container orchestration, workload scheduling, and runtime optimizations using Kubernetes, Argo or Flux.
  • Education: BS/MS in Computer Science or a related field (PhD preferred)


Nice to Have
  • Proficiency with cloud platforms such as AWS, GCP, or Azure.
  • Familiarity with chaos engineering tools and practices for testing system resilience.
  • Strong understanding of security best practices and compliance standards (GDPR, SOC2, ISO27001, vulnerability assessments, GRC, risk management).
  • Contributions to open-source projects, particularly in the Kubernetes or cloud-native ecosystem.
  • Expertise in Docker, Kubernetes, Jenkins, Flux, Argo, and Terraform in a Linux environment.
  • Hands-on experience with monitoring and observability tools such as Prometheus and Grafana.
  • Ability to develop customer-facing web frontends or public APIs/SDKs for platform services.


Benefits
  • Competitive salary and equity options.
  • Comprehensive medical and dental insurance.
  • An inclusive, diverse work environment where all employees are valued and supported.


$175,000 - $250,000 a year

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

KUMO Glassdoor Company Review
2.6 Glassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star iconGlassdoor star icon
KUMO DE&I Review
2.1 Glassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star iconGlassdoor star icon
CEO of KUMO
KUMO CEO photo
Unknown name
Approve of CEO

Average salary estimate

$212500 / YEARLY (est.)
min
max
$175000K
$250000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Posted 5 hours ago

An experienced Deputy Software Lead role is available at Northrop Grumman to drive cloud-native software development and technical leadership for a critical defense ground system.

Posted 14 hours ago

Exciting opportunity for a Senior Software Engineer to lead and innovate in Android application development for Yahoo Mail's global user base.

Posted 16 hours ago

Senior Software Developer role at GovCIO to lead and develop complex IT solutions for USPS Logistics Tech Services in a fully remote capacity.

Experienced Salesforce Developer with strong JavaScript and LWC skills needed to enhance and customize Salesforce solutions remotely for a leading Texas State Agency service provider.

Photo of the Rise User
Posted 10 hours ago
Inclusive & Diverse
Empathetic
Collaboration over Competition
Growth & Learning
Transparent & Candid
Medical Insurance
Dental Insurance
Mental Health Resources
Life insurance
Disability Insurance
Child Care stipend
Employee Resource Groups
Learning & Development

Contribute to American Express’s Global Loyalty & Benefits transformation as a Senior Software Engineer specializing in Java and scalable microservice development.

Photo of the Rise User
Posted 22 hours ago
Inclusive & Diverse
Empathetic
Collaboration over Competition
Growth & Learning
Transparent & Candid
Medical Insurance
Dental Insurance
Mental Health Resources
Life insurance
Disability Insurance
Child Care stipend
Employee Resource Groups
Learning & Development

Contribute to innovative software solutions at American Express as a Software Engineer II in their Global Servicing Technology team.

Photo of the Rise User
Posted 20 hours ago

A skilled Software Architect is needed at TekSynap to lead the design and implementation of secure, scalable software solutions for federal defense clients.

Photo of the Rise User
Posted 4 hours ago

Lead the frontend architecture and development for Procore's core construction management platform as a Principal Software Engineer, driving innovation and technical excellence.

Photo of the Rise User
Posted 10 hours ago
Inclusive & Diverse
Empathetic
Collaboration over Competition
Growth & Learning
Transparent & Candid
Medical Insurance
Dental Insurance
Mental Health Resources
Life insurance
Disability Insurance
Child Care stipend
Employee Resource Groups
Learning & Development

Contribute to American Express's cutting-edge personalization products as a Software Engineer skilled in Java, APIs, and machine learning technologies.

Photo of the Rise User
Posted 19 hours ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

A full-stack engineer role at OpenAI developing innovative observability and evaluation systems to enhance AI model behavior and user experience.

Posted 3 hours ago

SpeAR AI is looking for a Modernization Engineer to lead development efforts on nuclear submarine combat systems, combining legacy modernization with innovative software technologies.

Mechanical Orchard is seeking a skilled Infrastructure Software Engineer to join their remote team and help build and deploy their Generative AI platform using XP engineering practices and cloud infrastructure expertise.

Photo of the Rise User
Posted 2 hours ago

Contribute to Orb's next-generation billing infrastructure as a Fullstack Software Engineer at the forefront of modern monetization technology.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
August 7, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!