Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Senior Software Engineer - Retrieval-Augmented Generation (RAG) System image - Rise Careers
Job details

Senior Software Engineer - Retrieval-Augmented Generation (RAG) System

liETtVLaARqgmMEbYzHNNLIzUPcdfPrwhYtVK7Qa.png Fast Facts

Seeking a Senior Software Engineer to design and operate production-scale retrieval-augmented generation (RAG) systems in the healthcare domain, focusing on document retrieval and response generation.

liETtVLaARqgmMEbYzHNNLIzUPcdfPrwhYtVK7Qa.png Responsibilities: Key responsibilities include architecting and implementing end-to-end RAG workflows, building low-latency services and APIs, collaborating with cross-functional teams, and ensuring security and compliance standards.

liETtVLaARqgmMEbYzHNNLIzUPcdfPrwhYtVK7Qa.png Skills: Candidates must have strong programming skills in Python, experience with RAG systems, ML workflow tooling, cloud infrastructure, and a solid understanding of data governance and security best practices.

liETtVLaARqgmMEbYzHNNLIzUPcdfPrwhYtVK7Qa.png Qualifications: Preferred qualifications include familiarity with agentic workflow tools, knowledge of evaluation methodologies for retrieval systems, and experience with performance optimization for AI services.

liETtVLaARqgmMEbYzHNNLIzUPcdfPrwhYtVK7Qa.png Location: The job is based in Philadelphia, PA, with no specified travel requirements.

liETtVLaARqgmMEbYzHNNLIzUPcdfPrwhYtVK7Qa.png Compensation: $86600 - $144400 / Annually




Job title: Senior Software Engineer – Retrieval-Augmented Generation (RAG) System

About the role, we are seeking an engineer to work with a team to build and support a healthcare centered production-scale RAG system that combines document retrieval with response generation to deliver accurate, context-aware answers. This engineer we be expected to design, implement, and operate end-to-end RAG pipelines— LLM interaction, API creation, and high-performance, secure delivery of knowledge-grounded capabilities. You will collaborate with data engineers, platform teams, and product partners to ship reliable, scalable, and observable systems.

About the team; This collaborative team is entrusted with building the Next Generation Health Solutions through the utilization of cutting-edge technology.

Role and responsibilities

  • Architecting, implementing, testing, and operating end-to-end RAG workflows:
  • Ingesting and normalizing documents from diverse sources
  • Generating and managing embeddings; index and query vector databases
  • Retrieve relevant passages, apply reranking or fusion strategies, and feed prompts to LLMs
  • Building scalable, low-latency services and APIs (Python preferred; other languages acceptable) and ensure production-grade reliability (monitoring, tracing, alerting)
  • Integrating with vector databases and embedding pipelines and optimize for latency, throughput, and cost
  • Designing and implementing ML Ops workflows: model/version management, experiments, feature stores, CI/CD for ML-enabled services, rollback plans
  • Developing robust data pipelines and governance around ingestion, provenance, quality checks, and access controls
  • Collaborating with data engineers to improve retrieval quality (embedding strategies, reranking, cross-encoder models, prompt engineering) and implement evaluation metrics (precision/recall, MRR, QA accuracy, user-centric metrics)
  • Implementing monitoring and observability for RAG components (latency, success rate, cache hit rate, retrieval quality, data drift)
  • Ensuring security, privacy, and compliance (authentication, authorization, data masking, PII handling, audit logging)

Required qualifications

  • 5+ years of professional software engineering experience designing and delivering production systems
  • Strong programming skills (Python required; NodeJs a plus)
  • Deep understanding of retrieval-augmented or application-scale NLP systems and practical experience building RAG-like pipelines
  • Hands-on experience with ML workflow tooling and MLOps concepts (model serving, versioning, experiments, feature stores, reproducibility)
  • Proficiency with cloud infrastructure and modern software practices (AWS/GCP/Azure; Docker; Kubernetes; CI/CD)
  • Strong problem-solving skills, excellent communication, and ability to work with cross-functional teams
  • Familiarity with data governance, privacy, and security best practices

Preferred qualifications

  • Experience with agentic workflow tools (LangGraph) and familiarity with prompt engineering for LLMs
  • Exposure to working with and evaluating different LLMs
  • Knowledge of evaluation methodologies for retrieval and QA systems and the ability to set up A/B tests and dashboards
  • Experience with data processing frameworks (SQL, Pandas, Spark) and working with large-scale data pipelines
  • Background in performance optimization for low-latency AI services (MLflow)
  • Experience with monitoring and logging via New Relic, K9s, Portkey, etc
  • Experience with minimizing token usage and cost optimization
  • Comfortable with design and implementation of security controls for data-intensive AI systems

Elsevier is a renowned global information analytics company that primarily focuses on providing scientific, technical, and medical (STM) research content, tools, and services. It is one of the largest publishers of academic journals and scholarly literature in the world.

Elsevier operates in various domains, including science, technology, medicine, social sciences, and more. They publish a vast number of peer-reviewed journals covering a wide range of disciplines. These journals act as platforms for researchers and academics to share their findings and contribute to the advancement of knowledge in their respective fields.

U.S. National Base Pay Range: $86,600 - $144,400. Geographic differentials may apply in some locations to better reflect local market rates. If performed in New Jersey, the base pay range is $97,867 - $156,333. This job is eligible for an annual incentive bonus.

We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location.

We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120.

Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.

Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

USA Job Seekers:

EEO Know Your Rights.

Average salary estimate

$115500 / YEARLY (est.)
min
max
$86600K
$144400K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Posted 16 hours ago

Senior Software Engineer to lead design and operation of Python-based backend systems and workflow platforms for client onboarding and reporting at Apollo Insurance Solutions Group in El Segundo.

Photo of the Rise User
Sydecar Hybrid San Francisco
Posted 9 hours ago

Sydecar is hiring a full-stack Software Engineer to build and maintain cloud-native fintech products using TypeScript, React, Node.js, and PostgreSQL in a hybrid San Francisco or NYC environment.

Photo of the Rise User
Posted 11 hours ago

Lead development of scalable, secure iOS features at Bumble, driving architecture, delivery, and mentorship for products used by millions.

Photo of the Rise User
Posted 17 hours ago

Bumble is hiring an Android Software Engineer in Austin to own end-to-end Android features, influence architecture, and mentor junior engineers in a fast-moving, user-focused product environment.

Photo of the Rise User
NBCUniversal Hybrid 30 Rockefeller Plaza, New York, NEW YORK
Posted 4 hours ago

NBCUniversal is hiring a Senior Software Engineer with strong Python and AWS serverless experience to develop scalable metadata and title services powering content across the company.

Photo of the Rise User

Demiurge Studios seeks a Unity Software Engineer to implement and maintain meta-game systems for an upcoming fast-paced, team-based PvP game.

Photo of the Rise User

Senior Mobile Architect wanted to design and lead secure, scalable mobile architectures across hybrid and native platforms for a US-based enterprise organization.

Photo of the Rise User

Senior Software Developer needed to architect and implement scalable Node.js backends and modern Angular frontends for a US-based remote engineering team.

Photo of the Rise User
MongoDB Hybrid Austin; Boston; Chicago; New York City; Pittsburgh; Raleigh; United States; Washington DC
Posted 9 hours ago

Work on MongoDB Atlas’s Infrastructure Security team to design and implement scalable security primitives, runtime protections, and automation that secure multi-cloud infrastructure at scale.

Photo of the Rise User
Posted 2 hours ago

Senior Cloud DevOps Engineer to lead rapid AWS lift-and-shift migrations and drive modernization and automation for mission-critical workloads at KBR (U.S. citizens only).

Photo of the Rise User
Posted 5 hours ago

Encord is hiring graduate full-stack engineers to help build scalable AI infrastructure and shipping impactful product features as part of a small, high-trust engineering team in San Francisco.

Photo of the Rise User

An early-career Java/DevOps role offering remote work, mentorship, and hands-on experience with Java, CI/CD tooling, and cloud deployments to jumpstart your tech career.

Photo of the Rise User

Airwallex seeks a Senior Frontend Engineer (Growth) in San Francisco to build scalable, data-driven React/TypeScript experiences and experimentation tooling that drive user acquisition.

Lead the way in advancing science, technology and health.

20 jobs
MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
INDUSTRY
TEAM SIZE
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
January 17, 2026
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!