Job details

Senior Platform Engineer

Overview

Pluralis Research is pioneering Protocol Learning, a fully decentralised way to train and deploy AI models that opens this layer to individuals rather than well-resourced corporations. By pooling compute from many participants, incentivising their efforts, and preventing any single party from controlling a model’s full weights, we’re creating a genuinely open, collaborative path to frontier-scale AI.

We’re looking for a Senior Platform Engineer with startup experience, or senior DevOps experience in big tech, and a passion for ML. You’ll help scale and own our systems infrastructure, orchestration, and services integration.

Responsibilities

Multi-Cloud Infrastructure: Design resource management systems that provision and orchestrate compute across AWS, GCP, and Azure using infrastructure-as-code (Pulumi/Terraform). Handle dynamic scaling, state synchronization, and concurrent operations across hundreds of heterogeneous nodes.
Distributed Training Systems: Architect fault-tolerant infrastructure for distributed ML: GPU clusters, NVIDIA runtime, S3 checkpointing, large dataset management and streaming, health monitoring, and resilient retry strategies.
Real-World Networking: Build systems that simulate and handle real-world network conditions (bandwidth shaping, latency injection, packet loss) while managing dynamic node churn and ensuring efficient data flow across workers with heterogeneous connectivity. Our training happens on consumer nodes and non-co-located infrastructure, not in a datacenter.

What You’ll Bring

Ideally, you’ll have 5+ years of work experience, with depth in:

Infrastructure-as-Code: Production Pulumi/Terraform/CloudFormation managing multi-cloud deployments. Lifecycle orchestration, automated provisioning, self-healing systems at scale.
Python Engineering: Idiomatic async Python with error handling, retry logic, concurrent execution. Asyncio, SSH libraries, cloud SDKs, CLI tools.
Container & GPU: Docker, Kubernetes/EKS, GPU workloads, heterogeneous clusters. Multi-GPU optimization, resource scheduling.
Networking: Decentralized topologies and routing, NAT hole punching, P2P multi-address coordination, traffic shaping, real-world bandwidth constraints.
ML Infrastructure: Distributed training workflows, checkpoint management, data sharding, model versioning, long-running job operations.
Observability & SRE: Monitoring systems (Prometheus/Grafana), logging, SLOs, incident response, bottleneck profiling, performance optimization.

What We’re Looking For

Experience in a startup environment with an emphasis on microservices orchestration, or a big tech background
Deep understanding of multi-cloud infra & distributed training systems
A team player with high attention to detail
A strong passion to work at the intersection of AI and decentralized systems

FYIs

We only hire in Australia and the United States. Visa sponsorship is limited to these countries.
Applicants must have professional-level English proficiency (written and spoken).
Pluralis is a remote team across Australia and the US. You’ll need to be comfortable working across timezones and collaborating with a diverse, distributed group.
Recruiters: we aren’t looking for agency support at this time. We’ll reach out if we need help.

Backed by Union Square Ventures and other tier-1 investors, we’re a world-class, deeply technical team of ML researchers. Pluralis is unapologetically ideological. We believe the world will be a better place if we succeed in what we are attempting, and we see Protocol Learning as the only plausible approach to preventing a handful of massive corporations from monopolising model development, access, and release, and from achieving massive economic capture. If this resonates, please apply.

Average salary estimate

$150,000 / year (est.)

Min: $110,000
Max: $190,000

Pluralis Research

Funding: Growth
Department: Software Engineering
Seniority level: Senior Level