About Krea
At Krea, we are building next-generation AI creative tools.
We are dedicated to making AI intuitive and controllable for creatives. Our mission is to build tools that empower human creativity, not replace it.
We believe AI is a new medium that allows us to express ourselves through various formats—text, images, video, sound, and even 3D. We're building better, smarter, and more controllable tools to harness this medium.
This job
Robust, reliable, and scalable distributed systems form the backbone of Krea. These systems support the infrastructure that powers our AI research, real-time user experiences, and large-scale model deployments.
As a Distributed Systems Engineer, you will…
… design, build, and maintain large-scale distributed infrastructure to reliably support AI research and real-time model serving.
… own and scale our multi-thousand-node Kubernetes GPU clusters, ensuring efficient and fault-tolerant operations.
… collaborate closely with ML engineers and researchers to architect systems that enable rapid experimentation and deployment.
… improve network architecture, optimize load balancing, and streamline operational practices across multi-zone cloud deployments.
Example projects
Own and manage a large-scale Kubernetes cluster designed to run extensive ML training and inference workloads.
Architect fault-tolerant systems ensuring uninterrupted model training and real-time inference despite individual node failures.
Develop and implement optimized load-balancing strategies to efficiently distribute workloads across zones.
Create comprehensive monitoring, alerting systems, and operational playbooks for high-availability clusters.
Migrate existing deployments to Infrastructure as Code (Terraform) for reproducibility and scalability.
Setting up IP-based rate-limiting to prevent GPU abuse.
Strong candidates may have experience with…
Kubernetes at scale (thousands of nodes)
Cloud infrastructure management (AWS/GCP/Azure)
High-performance and fault-tolerant networking
Low-level Linux interfaces and administration
Debugging complex distributed systems in production
Python, Golang, Ruby, Rust, and similar systems languages
Bonus: Infrastructure as Code (e.g. Terraform)
About us
We’re building AI creative tooling.
We’ve raised over $83M from the best investors in Silicon Valley.
We’re a team of 12 with millions of active users scaling aggressively.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Advance autonomous driving safety by owning behavioral requirements and verification processes at Waabi, an innovative AI-driven self-driving technology company.
Lead the design and development of advanced AI GPU SoC architectures at Intel to deliver high-performance, energy-efficient silicon solutions.
Contribute to pioneering satellite communication technology as a Junior Embedded Software Engineer at ALL.SPACE, focusing on embedded real-time software development.
Lead aerodynamic design and CFD simulation efforts as a Turbomachinery Engineer driving high-performance flowpath development for scalable carbon capture power cycles at Arbor.
Kimley-Horn seeks engineering graduates to join their Naples team to assist in civil engineering analysis and project delivery using advanced technical software.
Level99 is looking for a skilled Hardware/Software Integration Engineer to lead integration of hardware and software for their innovative gaming venues.
Lead mechanical design engineering projects at Intuitive to enhance robotic surgical systems through innovative problem solving and team leadership.
Lead systems engineering efforts at General Dynamics Mission Systems by developing and updating software-interacting models for cutting-edge weapon systems.
Moog seeks an Associate Design Engineer to develop and verify advanced control systems for military aircraft within a supportive hybrid work culture.
Serco Inc. is hiring a Junior Mechanical Engineer in Ludlow, MA to support defense-related mechanical design and manufacturing efforts with emphasis on GD&T expertise.
Contribute to pioneering PCB design solutions at Northrop Grumman as an Electrical CAD Designer supporting advanced armament systems.
Zapier is hiring an Engineering Manager to lead their Integrations team focused on AI and automation tools, collaborating across functions to deliver scalable, high-impact products.
Lead Emory University's engineering design services to drive innovation and energy efficiency as Director of Engineering Services.