Introducing Moonlake, AI for creating world simulations.
Training efficiency
Dataloaders, fusion, activation remat, gradient checkpointing.
FSDP/ZeRO/tensor+pipeline parallel; NCCL tuning.
GPU + kernel performance
Nsight profiling, Triton/CUDA kernels, fused ops.
Flash-attention–style speedups, sequence packing, KV-cache tricks.
Inference optimization
Low-latency serving, continuous batching, speculative decoding.
Quantization (GPTQ/AWQ), distillation, pruning.
Infra + reliability
SLURM/K8s multi-node jobs, checkpoint hygiene.
Determinism, env pinning, GPU failure handling.
We are committed to being an on-site, in-person team currently based in San Mateo
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lead development and validation of gene delivery strategies (viral and non-viral) to enable expression of next-generation molecular neural interfaces at Merge Labs' San Francisco bio lab.
Helm.ai seeks a Research Engineer to research and implement state-of-the-art model optimization techniques and deploy optimized deep learning models on GPUs and AI accelerators.
Lead technical R&D efforts developing test methods, fixtures, and manufacturable designs for medical device/IVD specimen management products in a high‑volume regulated environment.
Work on state-of-the-art unsupervised learning and perception research at Helm.ai, turning rapid experimental results into deployable algorithms for autonomous vehicles.
Toyota Research Institute is hiring a Senior Robotics Research Engineer to design and integrate hardware-software systems that advance general-purpose robotic manipulation in real-world environments.