Transform Language Models into Real-World Applications
We’re building AI systems for a global audience. We are living in an era of AI transition - this new project team will be focusing on building applications to enable more real world impact and highest usage for the world.
This role is a global role with hybrid work arrangement - combining flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure and data to build and scale impactful AI solutions.
You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.
Run and manage open-source models efficiently, optimizing for cost and reliability
Ensure high performance and stability across GPU, CPU, and memory resources
Monitor and troubleshoot model inference to maintain low latency and high throughput
Collaborate with engineers to implement scalable and reliable model serving solutions
Likes ownership and independence
Believe clarity comes from action - prototype, test, and iterate without waiting for perfect plans.
Stay calm and effective in startup chaos - shifting priorities and building from zero doesn’t faze you.
Bias for speed - you believe it’s better to deliver something valuable now than a perfect version much later.
See feedback and failure as part of growth - you’re here to level up.
Possess humility, hunger, and hustle, and lift others up as you go.
Experience with model serving platforms such as vLLM or HuggingFace TGI
Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, LambdaLabs
Ability to monitor latency, costs, and scale systems efficiently with traffic demands
Experience setting up inference endpoints for backend engineers
Flat structure & real ownership
Full involvement in direction and consensus decision making
Flexibility in work arrangement
High-impact role with visibility across product, data, and engineering
Top-of-market compensation and performance-based bonuses
Global exposure to product development
Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals
Health, dental & vision insurance
Global travel insurance (for you & your dependents)
Unlimited, flexible time off
We’re a densed, high-performance team focused on high quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.
BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through Bjak.com. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.
If you're excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lead a full-stack engineering team at Abridge to design and deliver secure, HIPAA-compliant EHR integrations and interoperability features using modern web technologies.
Lead the technical strategy for Production Engineering at GitLab, shaping how GitLab.com scales, performs, and remains highly reliable for millions of users worldwide.
BentoML seeks an Inference Optimization Engineer to accelerate LLM inference across GPUs and distributed serving stacks, reducing latency and GPU costs while contributing to open-source tooling.
Coates Group is hiring a Release and Environments Manager in Chicago to lead release orchestration, environment provisioning and CI/CD improvements across multi-environment deployments.
Lead the architecture and delivery of secure, agent-based AI systems that turn high-value data into customer-facing, production-grade solutions at 3E.
Zoox seeks a Perception Software Engineer to develop and deploy real-time scene understanding and perception pipelines for its autonomous vehicle platform in Foster City, CA.
Lead NVIDIA’s enterprise identity and access SRE efforts to design, automate, and scale zero-trust authentication and access controls across hybrid and cloud-native infrastructure.
NVIDIA seeks a Senior System Development Security Operations Engineer to build and integrate advanced DevSecOps tooling (SAST/DAST/SCA) and drive security posture for its data center systems and AI/HPC software stacks.
Lead and scale a global engineering organization to deliver scalable, high-performance ecommerce analytics software while setting technical strategy and driving operational excellence.
Apex.AI is hiring a Senior Application Engineer in Palo Alto to build and deploy C++-based applications and provide technical customer support for safety-critical mobility software.
Patreon is hiring a Senior Backend Platform Engineer to build scalable backend systems and platform tooling that accelerate product teams and support creator growth.
Attentive is hiring a Frontend Software Engineer to build scalable React/TypeScript UIs for its enterprise email marketing platform in a hybrid New York City role.
Yeet is hiring a Backend Engineer to help scale its Python microservices and build high-throughput, fault-tolerant streaming and ingestion systems for a real-time observability platform.
Bjak is focused on providing access to affordable and sustainable financial services for people in ASEAN. Headquartered in Malaysia, Bjak is the the largest insurance portal in Southeast Asia. Our main portal, Bjak.com, helps millions find the i...
1 jobs