Coupa Software

Lead Site Reliability Engineer – Data Platforms

full-time

Location Type: Remote

Location: Remote • Colorado • 🇺🇸 United States


Salary

💰 $125,000 - $162,000 per year

Job Level

Senior

Tech Stack

Amazon Redshift, Ansible, Apache, AWS, Azure, Chef, Cloud, DNS, Docker, ETL, Jenkins, Linux, MySQL, Python, Spark, Terraform

About the role

  • Manage end-to-end data pipelines (ETL), AWS infrastructure (S3, EMR, Redshift), and IaC tools like Terraform and Chef.
  • Support containerized applications (ECS, Docker) and administer Linux-based systems.
  • Collaborate with ML teams to manage ML/GenAI infrastructure and deliver AI-driven features.
  • Enable monitoring and observability across systems and applications.
  • Provide support for data pipelines, including on-call rotation and release planning with Dev teams.
  • Participate in design/code reviews, troubleshoot complex issues, and document root cause analyses (RCAs).

Requirements

  • 8+ years of experience in Big Data technologies, data pipelines, and Linux administration; strong scripting skills in Bash or Python.
  • 5+ years managing cloud platforms (AWS, Azure), including tools like ECS, EKS, AKS, Terraform, and Helm.
  • Hands-on experience with Infrastructure as Code, CI/CD tools (Chef, Ansible, Jenkins), and source control (Git).
  • Familiarity with Generative AI tools (SageMaker, Bedrock, Azure ML), vector databases, and a strong interest in AI technologies.
  • Solid knowledge of networking (DNS, load balancers), MySQL, Apache Spark, and BI/data lake platforms (e.g., Looker).
  • Strong communication skills; self-driven, with a global mindset; able to resolve complex issues and deliver projects independently.

Benefits

  • Health insurance
  • Equal employment opportunities
  • Welcoming and inclusive work environment

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
ETL, AWS, Linux administration, Bash, Python, Infrastructure as Code, CI/CD, MySQL, Apache Spark, Big Data
Soft skills
strong communication skills, self-driven, global thinking, problem-solving, independent project delivery