Job Title: DevOps Engineer - Lead
Job ID: 94330-1, 94329-1 & 94503-1
Only ex-Capital One candidates, C2C
Client: Capital One
Location: 15075 Capital One Drive, Richmond, VA 23238 (Hybrid)
Duration: 12+ months with possibility of extension
Key Skills & Tools:
Observability Tools: Proficiency in monitoring, logging, and tracing tools, including Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic, and cloud-native solutions like AWS CloudWatch.
Programming Languages: Expertise in languages such as Python and Go for scripting and automation.
Infrastructure & Cloud Platforms: Experience with cloud platforms (AWS, GCP, Azure) and container orchestration systems like Kubernetes.
Infrastructure as Code (IaC): Familiarity with Terraform and Ansible for managing infrastructure and configurations.
CI/CD & Automation: Experience with CI/CD pipelines and automation tools like Jenkins.
System & Software Engineering: A strong background in both system operations and software development.
Optimize cloud agent instrumentation; cloud certifications are a plus.
Datadog Fundamentals, APM and Distributed Tracing Fundamentals, and Datadog Demo certifications (Mandatory)
Strong understanding of Observability concepts (Logs, Metrics, Tracing)
Expertise in security & vulnerability management in observability
2 years of experience in cloud-based observability solutions, specializing in monitoring, logging, and tracing across AWS, Azure, and GCP environments.
Job Description:
Design & Implement Solutions: Build and maintain comprehensive observability platforms that provide deep insights into complex systems, incorporating logs, metrics, and traces.
System Instrumentation: Instrument applications, infrastructure, and services to collect telemetry data using frameworks like OpenTelemetry.
Data Analysis & Visualization: Develop dashboards, reports, and alerts using tools like Prometheus, Grafana, and Splunk to visualize system performance and detect issues.
Collaboration: Work with development, SRE, and DevOps teams to integrate observability best practices and align monitoring with business and operational goals.
Automation: Develop scripts and use Infrastructure as Code (IaC) tools like Ansible and Terraform to automate monitoring configurations and telemetry collection.
Implement and manage full-stack observability using Datadog, ensuring seamless monitoring across infrastructure, applications, and services.
Instrument agents for on-premise, cloud, and hybrid environments to enable comprehensive monitoring.
Design and deploy key service monitoring, including dashboards, monitor creation, SLA/SLO definitions, and anomaly detection with alert notifications.
Integrate Datadog with third-party services such as ServiceNow and other ITSM tools, and configure SSO.