Software Engineer II (Backend + Data pipelines)

About the company:

At Scribd (pronounced “scribbed”), our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our three products: Everand, Scribd, and Slideshare.

We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.

When it comes to workplace structure, we believe in balancing individual flexibility and community connections. It’s through our flexible work benefit, Scribd Flex, that employees – in partnership with their manager – can choose the daily work style that best suits their individual needs. A key tenet of Scribd Flex is our prioritization of intentional in-person moments to build collaboration, culture, and connection. For this reason, occasional in-person attendance is required for all Scribd employees, regardless of their location.

So what are we looking for in new team members? Well, we hire for “GRIT”. The textbook definition of GRIT is demonstrating the intersection of passion and perseverance towards long-term goals. At Scribd, we are inspired by the potential that this can unlock, and ask each of our employees to pursue a GRIT-ty approach to their work. In a tactical sense, GRIT is also a handy acronym that outlines the standards we hold ourselves and each other to. Here’s what that means for you: we’re looking for someone who showcases the ability to set and achieve Goals, achieve Results within their job responsibilities, contribute Innovative ideas and solutions, and positively influence the broader Team through collaboration and attitude.

About the team:

The ML Data Engineering team powers metadata extraction, enrichment, and content understanding across all Scribd brands. We process hundreds of millions of documents and billions of images, and deliver high-quality metadata to enable content discovery and trust for millions of users worldwide.

Our systems operate at massive scale, supporting diverse datasets like user-generated content (UGC), ebooks, audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely with applied research and product teams to deploy scalable ML and LLM-powered solutions in production.

Role Overview:

We’re seeking a Software Engineer II with strong backend development experience and a passion for solving complex data challenges at scale. In this role, you’ll design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You’ll work closely with ML engineers, product managers, and cross-functional partners to integrate machine learning models and LLM-based services into production pipelines and deliver impactful, high-performance solutions. This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.

Tech Stack:

Our team uses a range of technologies. The ones we use regularly are Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, SageMaker, CloudWatch), Datadog, and Terraform.

Key Responsibilities:

Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content.
Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines.
Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
Optimize and refactor existing systems for performance, scalability, and reliability.
Ensure data accuracy, integrity, and quality through automated validation and monitoring.
Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase.
Manage and maintain data pipelines, security, and infrastructure.

Requirements:

4+ years of professional software engineering experience
Proficiency in Python, Scala, Ruby, or similar languages
Experience designing and building distributed systems at scale
Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda
Experience with infrastructure-as-code tools like Terraform (or similar)
Experience working with a public cloud provider (AWS, Azure, or Google Cloud)
Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads
Proven ability to test, profile, and optimize systems for performance, scalability, and reliability
Bachelor’s degree in Computer Science or equivalent professional experience
Bonus: Experience working with LLMs or integrating ML models into production systems

At Scribd, your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role, level, and geographic location. San Francisco is our highest geographic market in the United States. In the state of California, the reasonably expected salary range is between $126,000 [minimum salary in our lowest geographic market within California] and $196,000 [maximum salary in our highest geographic market within California].

In the United States, outside of California, the reasonably expected salary range is between $103,500 [minimum salary in our lowest US geographic market outside of California] and $186,500 [maximum salary in our highest US geographic market outside of California].

In Canada, the reasonably expected salary range is between $131,500 CAD [minimum salary in our lowest geographic market] and $174,500 CAD [maximum salary in our highest geographic market].

We carefully consider a wide range of factors when determining compensation, including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been scoped. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for competitive equity ownership and a comprehensive and generous benefits package.

Working at Scribd, Inc.

Are you currently based in a location where Scribd is able to employ you?
Employees must have their primary residence in or near one of the following cities. This includes surrounding metro areas or locations within a typical commuting distance:

United States:

Canada:

Ottawa | Toronto | Vancouver

Mexico:

Mexico City

Benefits, Perks, and Wellbeing at Scribd

*Benefits/perks listed may vary depending on the nature of your employment with Scribd and the geographical location where you work.

Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
12 weeks paid parental leave
Short-term/long-term disability plans
401k/RSP matching
Onboarding stipend for home office peripherals + accessories
Learning & Development allowance
Learning & Development programs
Quarterly stipend for Wellness, WiFi, etc.
Mental Health support & resources
Free subscription to the Scribd Inc. suite of products
Referral Bonuses
Book Benefit
Sabbaticals
Company-wide events
Team engagement budgets
Vacation & Personal Days
Paid Holidays (+ winter break)
Flexible Sick Time
Volunteer Day
Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace.
Access to AI Tools: We provide free access to best-in-class AI tools, empowering you to boost productivity, streamline workflows, and accelerate bold innovation.

Want to learn more about life at Scribd? www.linkedin.com/company/scribd/life

We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing [email protected] about the need for adjustments at any point in the interview process.

Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences creates a foundation for the best ideas. Come join us in building something meaningful.
