Responsibilities:
We need 10X engineers to make this happen! True wizards that can deliver on a daily basis in the high-paced environment.
In this role, you will...
Enhance our retrieval-augmented generation (RAG) capabilities, ensuring our models effectively integrate and utilize diverse data sources.
Work on embedding generation techniques, optimizing them for accuracy and efficiency.
Keep abreast of the latest developments in AI models, their evolving capabilities, and the complexities of integration. Position yourself as the go-to expert within the team, guiding our end users in crafting their use cases with the most suitable models.
Collaborate with backend and frontend teams to integrate AI models into products and services.
We're all about our users - listen to their needs, help them, build strong relationships! Expect to dive into customer support now and then as everyone else in the team.
Required Skills and Qualifications:
You will be a great fit if you have...
Bachelor's or Master's degree in Computer Science, Machine Learning, AI, or related fields.
Proven experience in machine learning and AI, with a focus on generative models.
Knowledge of retrieval-augmented generation pipelines and their implementation.
Strong proficiency in Python and popular machine learning frameworks like TensorFlow or PyTorch.
Experience with embedding techniques and understanding of their applications in AI models.
Excellent problem-solving skills and the ability to work in fast-paced environments.
Strong communication and collaboration skills.
Preferred Qualifications:
You will be an exceptional fit if you also have...
PhD in a relevant field.
Published research or projects in the field of generative AI, embeddings, or retrieval-augmented generation.
Experience with large-scale ML deployments in a production environment.
Contribution to open-source AI/ML projects.
In-depth knowledge of generative AI hardware, including an understanding of GPU architectures, requirements, and optimization techniques for AI model deployment.
Beyond these skills, what we value the most is your ability to remain curious and open to learning new skills. As an early-stage startup joiner, you may need to wear many hats.
Stack AI is a no-code drag-and-drop tool to quickly design, test, and deploy AI workflows that leverage Large Language Models (LLMs), such as ChatGPT, to automate any business process.
Our core value is to make it extremely easy to build arbitrarily complex AI pipelines using a visual interface that allows you to connect different data sources with different AI models.
Our customers use Stack AI to build applications such as:
Chatbots and Assistants: AI agents that interact with users, answer questions, and complete tasks, using your internal data and APIs.
Document Processing: apps to answer questions, summarize, and extract insights from any document, no matter how long.
Answer Questions on Databases: connect GPT-like models to databases (such as Notion, Airtable, or Postgres) and ask questions about them.
Content Creation: generate tags, summaries, and transfer styles or formats between documents and data sources.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Stack AI is hiring a Business Development Representative to drive enterprise prospecting, nurture executive relationships, and create high-impact outreach for a fast-growing no-code AI workflow platform.
Help shape the backend architecture of a fast-growing no-code AI platform by building scalable systems, integrating LLMs, and working closely with the founding team and customers.
Stack AI is hiring a Founding Frontend Engineer to build and own high-quality React/TypeScript interfaces and the company’s landing pages at an early-stage AI startup.
Parallel Web Systems is hiring a hands-on Deployed Engineer in Palo Alto to partner with customers, implement API integrations, and scale product adoption.
Join Oumi as a Senior Platform Engineer to architect and build scalable backend systems and infrastructure for an open, research-driven AI platform.
Experienced backend-leaning engineers are sought to join Zapier’s Workflow Zone to design and ship scalable, reliable systems that power workflow creation, execution, and troubleshooting.
Lead front-end engineering efforts for Target.com by evaluating new technologies, designing robust architectures, and delivering high-quality, accessible web experiences.
Mercor is hiring experienced Swift engineers with open-source backgrounds to write and review comprehensive unit tests for production Swift projects used in cutting-edge AI research.
Quizlet is hiring a Senior Fullstack Engineer on the Activation & Retention team to design and ship experiments that increase user onboarding and retention using React, NextJS and server-side technologies.
Help build and scale Zams’ core SaaS product as a Full Stack Engineer focused on frontend excellence, full-stack ownership, and AI-driven tooling.
LangChain seeks a Full Stack Engineer to own and ship features for LangSmith’s observability and evals platform, working onsite in San Francisco across a Go/Python backend and a React+TypeScript frontend.
Notion is hiring an early-career Infrastructure Software Engineer to help design, build, and operate the platform that powers a global user base, focusing on reliability, scalability, and developer experience.
Help build backend-first developer tooling and scalable integration services at Zapier to enable AI-native automation and a better developer experience.
Highmark Health seeks a Full Stack Java Associate Software Engineer to design, build, and maintain scalable web services and user-facing applications in a remote, agile team environment.
Build the backbone of Chime's financial services as a Software Engineer focused on scalable, secure, high-performance backend systems.