
Senior Software Engineer, Compute Platform

About AION

AION is building an interoperable, decentralized AI cloud platform that aims to transform high-performance computing (HPC). Purpose-built for bare-metal performance, AION democratizes access to compute and provides managed services, aiming to be an end-to-end AI lifecycle platform that takes organizations from data to deployed models through its forward-deployed engineering approach.

AI is transforming every business around the world, and the demand for compute is surging like never before. AION strives to be the gateway for dynamic compute workloads by building integration bridges with diverse data centers around the world and re-inventing the compute stack via its state-of-the-art serverless technology. We stand at the crossroads where enterprises are finding it hard to balance AI adoption with security. At AION, we take enterprise security and compliance very seriously and are re-thinking every piece of infrastructure, from hardware and network packets to APIs.

Led by high-pedigree founders with previous exits, AION is well-funded by major VCs and has strategic global partnerships. Headquartered in the US with a global presence, the company is building its initial core team in India/London.

Who You Are

You're a seasoned engineer who has built and scaled high-performance inference systems for AI/ML workloads. You've designed distributed systems that handle thousands of requests per second while maintaining sub-second response times and cost efficiency.

You're product-minded—you understand how your technical decisions impact developers using AION's platform and think about the end-to-end user experience. You're a team player comfortable wearing multiple hats—one day you're building product features, the next you're joining customer calls to understand their deployment challenges, and the day after you're helping with UI/UX, customer success, documentation and product ops.  

Experience with Golang is strongly preferred, and deep exposure to cloud infrastructure (AWS/GCP/Azure), Kubernetes, and distributed systems is essential. You take ownership of platform-level decisions, think strategically about multi-cloud architecture, and want your work to power AI workloads for thousands of developers globally.

What You'll Do

Compute Platform Architecture & Multi-Cloud Integration

  • Design and architect AION's multi-cloud compute platform, building abstraction layers that unify diverse cloud providers (AWS, GCP, Azure, bare-metal data centers)
  • Work directly with cloud providers to expand AION's compute pool—understanding pricing, availability zones, GPU types, and capacity planning
  • Build and maintain AION's managed services
  • Understand and abstract cloud provider differences in storage (block, object, file systems), networking (VPCs, subnets, security groups), and compute resources
  • Design composable platform components that enable forward deployments and promote reusability across AION's infrastructure stack

Managed Services Development & Platform Ownership

  • Own end-to-end development of managed services on the compute platform—from design and architecture through execution and production monitoring
  • Build scalable orchestration systems for GPU workloads, container scheduling, and resource allocation
  • Develop robust APIs and control planes for compute lifecycle management (provisioning, scaling, termination)
  • Lead technical discussions on platform reliability, performance optimization, and cost efficiency

Infrastructure & Peripheral Services

  • Execute on peripheral platform services including billing systems, usage accounting, observability infrastructure, and compliance tooling
  • Build monitoring and telemetry systems for compute utilization, cost tracking, and performance metrics
  • Establish engineering standards for platform development including code reviews, quality gates, and testing practices
  • Mentor engineers on infrastructure best practices and distributed systems design


Technical Skills & Experience

If you meet some of these requirements and feel comfortable catching up on the others, we encourage you to apply:

  • 4+ years of experience building and scaling complex backend systems, cloud infrastructure, or distributed platforms
  • Strong understanding of multi-cloud architectures and experience working with AWS, GCP, or Azure at scale
  • Deep knowledge of cloud abstractions: compute (EC2, GCE, VMs), storage (S3, GCS, EBS), networking (VPCs, load balancers, security groups)
  • Proficiency in Golang strongly preferred; Python, Rust, or other systems languages a plus
  • Experience with Kubernetes, container orchestration, and infrastructure-as-code (Terraform, Pulumi, CloudFormation)
  • Solid understanding of distributed systems principles, consensus algorithms, and state management
  • Experience building APIs, control planes, and platform services for infrastructure management
  • Familiarity with databases (PostgreSQL, Redis, etcd), message queues (Kafka, RabbitMQ), and event-driven architectures
  • Knowledge of GPU orchestration, AI/ML workloads, or HPC systems is highly desirable
  • Experience with observability tools (Prometheus, Grafana, Datadog) and distributed tracing
  • Understanding of cloud billing models, cost optimization strategies, and resource scheduling

Bonus / Good to Have

Having expertise in one or more of these specializations is highly desired:

  • HPC & Cluster Management: Experience handling large-scale HPC clusters using Kubernetes and Slurm for job scheduling, resource allocation, and workload orchestration
  • Data Engineering: Expertise with data pipelines, ETL systems, and large-scale data processing frameworks
  • Systems-Level Programming: Experience with low-level systems programming such as storage systems, Kubernetes operators, OS-level software development, or daemon services (llm-d, system agents)
  • ML Platform Engineering: Experience productionizing ML pipelines, batch job orchestration, model fine-tuning workflows, and Jupyter notebook orchestration systems

  • Enterprise Deployment: Experience platformizing and packaging software for on-premises deployments or customer VPC installations with emphasis on security, compliance, and operational simplicity

Preferred Attributes:

  • High ownership, self-driven, with a bias for action.
  • Strong strategic thinking and ability to connect technical decisions to business impact.
  • Excellent communication and mentoring skills.
  • Thrives in ambiguity, fast-paced environments, and early-stage startup culture.

Why Join AION?

  • Work directly with high-pedigree founders shaping technical and product strategy.
  • Build infrastructure powering the future of AI compute globally.
  • Significant ownership and impact with equity reflective of your contributions.
  • Competitive compensation, flexible work options, and wellness benefits.

Apply Now:
If you’re a strong engineer ready to lead architecture and scale next-generation AI infrastructure, we want to hear from you. Please share:

  • Your resume, highlighting relevant projects and leadership experience.
  • Links to products, code, or demos you’ve built.
  • A brief note on why AION’s mission excites you.

Average salary estimate

$185,000 / year (est.)
Min: $140,000
Max: $230,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Employment type: Full-time, remote
Date posted: December 18, 2025