At Scribd (pronounced “scribbed”), our mission is to spark human curiosity. Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our four products: Everand, Scribd, Slideshare, and Fable.
We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.
When it comes to workplace structure, we believe in balancing individual flexibility and community connections. It’s through our flexible work benefit, Scribd Flex, that employees – in partnership with their manager – can choose the daily work-style that best suits their individual needs. A key tenet of Scribd Flex is our prioritization of intentional in-person moments to build collaboration, culture, and connection. For this reason, occasional in-person attendance is required for all Scribd employees, regardless of their location.
So what are we looking for in new team members? Well, we hire for “GRIT”. The textbook definition of GRIT is demonstrating the intersection of passion and perseverance towards long term goals. At Scribd, we are inspired by the potential that this can unlock, and ask each of our employees to pursue a GRIT-ty approach to their work. In a tactical sense, GRIT is also a handy acronym that outlines the standards we hold ourselves and each other to. Here’s what that means for you: we’re looking for someone who showcases the ability to set and achieve Goals, achieve Results within their job responsibilities, contribute Innovative ideas and solutions, and positively influence the broader Team through collaboration and attitude.
About the team:
The ML Data Engineering team powers metadata extraction, enrichment, and content understanding across all Scribd brands. We process hundreds of millions of documents, billions of images, and deliver high-quality metadata to enable content discovery and trust for millions of users worldwide.
Our systems operate at massive scale, supporting diverse datasets like user-generated content (UGC), ebooks, audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely with applied research and product teams to deploy scalable ML and LLM-powered solutions in production.
Role Overview:
We’re seeking a Software Engineer II with deep experience building event-driven, distributed, and scalable systems in Python. In this role, you’ll design and optimize large-scale data and service pipelines running on AWS, supporting Scribd’s content enrichment and metadata systems. You’ll work closely with cross-functional teams to design reliable backend services that integrate machine learning models and LLM-based components when needed. This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.
Tech Stack:
Our backend systems are primarily built in Python, leveraging AWS services such as Lambda, ECS, SQS, and ElastiCache for event-driven and distributed processing. We also use Airflow, Spark, Databricks, Terraform, and Datadog for orchestration, data processing, and observability.
Key Responsibilities:
Design and implement event-driven, distributed systems to extract, enrich, and process metadata from large-scale document and media datasets.
Build and maintain scalable APIs and backend services for high-throughput content processing.
Leverage AWS services (ECS, Lambda, SQS, ElastiCache, CloudWatch) to design and deploy resilient, high-performance systems.
Collaborate with cross-functional teams to deliver backend solutions that power ML-driven features.
Optimize and refactor existing backend systems for scalability, reliability, and performance.
Ensure system health and data integrity through monitoring, observability, and automated testing.
Requirements:
5+ years of professional software engineering experience on Python or distributed systems development.
Strong proficiency in Python (3+ years). Experience with Scala is a plus.
Proven experience designing and building event-driven, distributed, and scalable systems.
Hands-on experience with AWS services (ECS, Lambda, SQS, SNS, CloudWatch, etc.).
Experience with infrastructure-as-code tools like Terraform.
Solid understanding of system performance, profiling, and optimization.
Bachelor’s degree in Computer Science or equivalent professional experience.
Bonus: Familiarity with data processing frameworks (Spark, Databricks) and workflow orchestration tools.
Bonus: Experience integrating ML or LLM-based models into production systems.
At Scribd, your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role, level, and geographic location. San Francisco is our highest geographic market in the United States. In the state of California, the reasonably expected salary range is between $126,000 [minimum salary in our lowest geographic market within California] to $196,000 [maximum salary in our highest geographic market within California].
In the United States, outside of California, the reasonably expected salary range is between $103,500 [minimum salary in our lowest US geographic market outside of California] to $186,500 [maximum salary in our highest US geographic market outside of California].
In Canada, the reasonably expected salary range is between $131,500 CAD[minimum salary in our lowest geographic market] to $174,500 CAD[maximum salary in our highest geographic market].
We carefully consider a wide range of factors when determining compensation, including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been scoped. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for a competitive equity ownership, and a comprehensive and generous benefits package.
Are you currently based in a location where Scribd is able to employ you?
Employees must have their primary residence in or near one of the following cities. This includes surrounding metro areas or locations within a typical commuting distance:
United States:
Atlanta | Austin | Boston | Dallas | Denver | Chicago | Houston | Jacksonville | Los Angeles | Miami | New York City | Phoenix | Portland | Sacramento | Salt Lake City | San Diego | San Francisco | Seattle | Washington D.C.
Canada:
Ottawa | Toronto | Vancouver
Mexico:
Mexico City
Benefits, Perks, and Wellbeing at Scribd
*Benefits/perks listed may vary depending on the nature of your employment with Scribd and the geographical location where you work.
Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
12 weeks paid parental leave
Short-term/long-term disability plans
401k/RSP matching
Onboarding stipend for home office peripherals + accessories
Learning & Development allowance
Learning & Development programs
Quarterly stipend for Wellness, WiFi, etc.
Mental Health support & resources
Free subscription to the Scribd Inc. suite of products
Referral Bonuses
Book Benefit
Sabbaticals
Company-wide events
Team engagement budgets
Vacation & Personal Days
Paid Holidays (+ winter break)
Flexible Sick Time
Volunteer Day
Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace.
Access to AI Tools: We provide free access to best-in-class AI tools, empowering you to boost productivity, streamline workflows, and accelerate bold innovation.
Want to learn more about life at Scribd? www.linkedin.com/company/scribd/life
We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing [email protected] about the need for adjustments at any point in the interview process.
Scribd is committed to equal employment opportunity regardless of race, color, religion, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage people of all backgrounds to apply, and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Veeam is hiring a Frontend Engineer to build responsive, reusable React interfaces for its Veeam Data Cloud AI platform, combining strong frontend craft with modern AI integrations.
Cape is hiring a Site Reliability Engineer to build and operate privacy-focused telecommunications infrastructure, improve system reliability and monitoring, and own FedRAMP accreditation for a fast-growing, mission-driven startup.
Senior Full Stack Engineer role at Sensor Tower focused on building and optimizing scalable backend systems, APIs, and automation to drive product impact.
Help shape Softlight’s product and AI infrastructure as a founding engineer focused on building novel models and shipping product features for PMs, designers, and engineers.
Sentry is hiring a Senior Full Stack Engineer to lead development of high-impact core product features using React/TypeScript and Django/Python, improving developer workflows and AI-driven capabilities.
Lead technical customer engineering and architectural design for NVIDIA's BlueField and ConnectX AI networking products to enable scalable, hardware-accelerated networking solutions.
Contribute to a market-leading SaaS product as a Front-end Engineer intern, building embeddable low-code widgets and interactive dashboards for global media clients.
Experienced backend engineer (Java/Python) needed to build scalable microservices, time-series data pipelines, and cloud-native infrastructure for innovative IoT and sustainability projects.
Experienced system-level engineer needed to design and deliver cross-platform, high-performance endpoint sensors and kernel-mode components for a cybersecurity product team.
Lead integration architecture and development for Forbright's digital banking platform, driving scalable, secure AWS- and event-driven solutions while mentoring engineering teams.
Work on the real-time backbone of a HIL test and simulation platform, building deterministic scheduling, low-latency I/O, and timing-safe runtimes in C++ or Rust for mission-critical systems.
Build and deploy scalable AI-driven healthcare applications as a remote AI Software Engineer, translating ML models into production-ready solutions across backend, frontend, and cloud environments.
Help build LM Studio's desktop app, background daemon, SDKs, and public/internal APIs as a Systems Engineer on a small, NYC-based team focused on delightful user and developer experiences.
Spark Human Curiosity
9 jobs