FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Lead Data Platform Engineer
Coupa SoftwareLead Data Platform Engineer managing end-to-end data pipeline operations at Coupa. Collaborate on AI-driven features and optimize cloud infrastructure within a global impact framework.
Posted 6/26/2026full-timeRemote • New York • 🇺🇸 United StatesSenior💰 $125,000 - $174,333 per yearWebsite
Tech Stack
Tools & technologiesAmazon RedshiftAnsibleAWSAzureChefCloudDNSDockerETLJavaJenkinsLinuxMySQLPythonTerraform
About the role
Key responsibilities & impact- Manage end-to-end **Data pipeline **(ETL jobs) within agreed SLAs.
- Manage AWS core and **big data services** (S3, IAM, EMR, Redshift, etc..).
- Running applications in containers (ECS, Docker).
- Lead Day 2 operational lifecycle for ML and GenAI infrastructure. This includes designing, deploying, and maintaining high-availability production LLM serving platforms, implementing automated scaling, self-healing, and infrastructure-as-code patterns. Focus on proactive reliability, model performance observability, and continuous cost optimization for high-compute AI workloads.
- Collaborate closely with our product development and engineering teams to create AI-driven features.
- Drive cloud operations consistency by automating platform maintenance, standardizing infrastructure configurations (IaC), and implementing robust release management processes to minimize drift across multi-cloud environments.
- Manage AWS infrastructure using code (Terraform, Chef, etc..).
- Administering applications running in Linux operating system.
- Enable application and system monitoring for better observability.
- Application and infrastructure support for ETL jobs and data pipelines including participating in an on-call rotation for after-hours emergencies.
- Collaborate with platform and Dev teams to plan and deploy product releases and patch Linux/ECS clusters.
- Ability to participate in design reviews, code reviews, and troubleshooting incidents.
- Ability to operate in a high-pressure environment and troubleshoot complex issues quickly while successfully handling multiple priorities.
- Ability to record, write, and review RCAs.
Requirements
What you’ll need- Bachelor's Degree and at least 8+ years of experience managing Big Data technologies and Data Pipelines.
- Sound knowledge and experience in Linux administration and troubleshooting.
- 5+ years of experience in managing cloud infrastructure and platforms, such as AWS and Azure.
- Familiar with the current engineering landscape in the generative AI space and have a strong interest in AI and related technologies.
- Strong expertise in MLOps and production-grade LLM operations. Proven track record in managing high-availability model inference clusters, automating model lifecycle management, and implementing advanced observability (latency, throughput, and error rate monitoring) specifically for AI workloads.
- Have Bash or Python scripting experience.
- Experience with containerization, Amazon ECS, EKS/ Azure AKS.
- Experience with tools like Chef, Ansible, Jenkins, Rundeck, or equivalent.
- Experience with source control systems such as Git and operating in complex branching strategies.
- Experience with Infrastructure as Code products like Terraform, helm charts.
- Good understanding of DNS and Load balancers setup and troubleshooting.
- Experience in Big Data platforms/Data lakes and managing Business Intelligence tools (like looker..).
- Knowledge in ApacheSpark architecture and troubleshooting Java applications.
- Basic understanding of MySQL Server and general database knowledge.
- Excellent written and verbal communication with a passion for solving the problem.
- Confidence in your ability to own and deliver projects and issues to resolution on your own & can think and act globally.
- Deep experience in Day 2 cloud operations, including automated incident remediation, capacity planning, and managing large-scale production cloud environments with a focus on performance and reliability.
Benefits
Comp & perks- 🌐 Worldwide ❌ Jobs You've Hidden ⭐️ Saved Jobs ✅ Applied Jobs ✉️ Email Alerts 👤 Account Coupa Software Website LinkedIn All Job Openings 1001 - 5000 employees Founded 2006 ☁️ SaaS 💸 Finance 🛍️ eCommerce SaaS
- Finance
- eCommerce Coupa Software is a leading provider of business spend management solutions. Their platform focuses on optimizing and transforming direct and indirect spend across procurement, finance, supply chain, and IT. Coupa leverages AI and extensive data insights to drive cost efficiencies, manage supplier relationships, and mitigate risks. With products covering areas such as invoicing, payments, expense management, and supply chain collaboration, Coupa serves a wide range of industries including automotive, healthcare, retail, and more. Their comprehensive community and partner ecosystem enable organizations to unlock hidden savings and improve compliance, promoting growth and resilience in a changing economic climate. Lead Data Platform Engineer Job not on LinkedIn 🔥 8 minutes ago 🗽 New York – Remote 💵 $125k - $174.3k / year ⏰ Full Time 🟠 Senior 🏗️ Platform Engineer 🦅 H1B Visa Sponsor Amazon Redshift Ansible AWS Azure Chef Cloud DNS Docker ETL Java Jenkins Linux MySQL Python Terraform Apply Now Find Hiring Managers Customize resume + cover letter Report problem ☆ Save ☑️ Mark as applied ❌ Hide 📋 Description
- Manage end-to-end **Data pipeline **(ETL jobs) within agreed SLAs.
- Manage AWS core and **big data services** (S3, IAM, EMR, Redshift, etc..).
- Running applications in containers (ECS, Docker).
- Lead Day 2 operational lifecycle for ML and GenAI infrastructure. This includes designing, deploying, and maintaining high-availability production LLM serving platforms, implementing automated scaling, self-healing, and infrastructure-as-code patterns. Focus on proactive reliability, model performance observability, and continuous cost optimization for high-compute AI workloads.
- Collaborate closely with our product development and engineering teams to create AI-driven features.
- Drive cloud operations consistency by automating platform maintenance, standardizing infrastructure configurations (IaC), and implementing robust release management processes to minimize drift across multi-cloud environments.
- Manage AWS infrastructure using code (Terraform, Chef, etc..).
- Administering applications running in Linux operating system.
- Enable application and system monitoring for better observability.
- Application and infrastructure support for ETL jobs and data pipelines including participating in an on-call rotation for after-hours emergencies.
- Collaborate with platform and Dev teams to plan and deploy product releases and patch Linux/ECS clusters.
- Ability to participate in design reviews, code reviews, and troubleshooting incidents.
- Ability to operate in a high-pressure environment and troubleshoot complex issues quickly while successfully handling multiple priorities.
- Ability to record, write, and review RCAs. 🎯 Requirements
- Bachelor's Degree and at least 8+ years of experience managing Big Data technologies and Data Pipelines.
- Sound knowledge and experience in Linux administration and troubleshooting.
- 5+ years of experience in managing cloud infrastructure and platforms, such as AWS and Azure.
- Familiar with the current engineering landscape in the generative AI space and have a strong interest in AI and related technologies.
- Strong expertise in MLOps and production-grade LLM operations. Proven track record in managing high-availability model inference clusters, automating model lifecycle management, and implementing advanced observability (latency, throughput, and error rate monitoring) specifically for AI workloads.
- Have Bash or Python scripting experience.
- Experience with containerization, Amazon ECS, EKS/ Azure AKS.
- Experience with tools like Chef, Ansible, Jenkins, Rundeck, or equivalent.
- Experience with source control systems such as Git and operating in complex branching strategies.
- Experience with Infrastructure as Code products like Terraform, helm charts.
- Good understanding of DNS and Load balancers setup and troubleshooting.
- Experience in Big Data platforms/Data lakes and managing Business Intelligence tools (like looker..).
- Knowledge in ApacheSpark architecture and troubleshooting Java applications.
- Basic understanding of MySQL Server and general database knowledge.
- Excellent written and verbal communication with a passion for solving the problem.
- Confidence in your ability to own and deliver projects and issues to resolution on your own & can think and act globally.
- Deep experience in Day 2 cloud operations, including automated incident remediation, capacity planning, and managing large-scale production cloud environments with a focus on performance and reliability. Apply Now 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score Similar Jobs Senior Platform Engineer 🔥 6 hours ago SitusAMC 5001 - 10000 🏠 Real Estate 💸 Finance 🤝 B2B Website LinkedIn All Job Openings Platform Engineer specializing in CI/CD automation and cloud technologies within SitusAMC's Cloud development team. Requires extensive experience with AWS, Kubernetes, and Release automation. 🇺🇸 United States – Remote 💵 $135k - $165k / year 💰 Private equity on 2020-05 ⏰ Full Time 🟡 Mid-level 🟠 Senior 🏗️ Platform Engineer 🦅 H1B Visa Sponsor AWS Azure Cloud Kubernetes Terraform Senior Platform Engineer 🔥 11 hours ago Cytora 51 - 200 Website LinkedIn All Job Openings Senior Platform Engineer developing Cytora’s serverless Underwriting Productivity Suite for insurance applications. Collaborating with platform team on infrastructure and CI/CD pipelines. 🇺🇸 United States – Remote 💵 $100k - $140k / year 💰 Series B on 2019-04 ⏰ Full Time 🟠 Senior 🏗️ Platform Engineer AWS BigQuery Google Cloud Platform Postgres Python Terraform Senior Frontend Engineer – Platform 🕒 Yesterday EvenUp 51 - 200 🤖 Artificial Intelligence ☁️ SaaS Website LinkedIn All Job Openings Senior Frontend Engineer leading development of scalable AI systems at EvenUp, a vertical SaaS company. Collaborating with cross-functional teams to deliver quality software solutions. 🇺🇸 United States – Remote 💵 $190k - $249k / year ⏰ Full Time 🟠 Senior 🏗️ Platform Engineer Kubernetes Senior HPC Support Engineer – Compute and GPU Platform 🕒 Yesterday NVIDIA 10,000+ employees 🤖 Artificial Intelligence 🎮 Gaming Website LinkedIn All Job Openings Senior HPC Support Engineer handling AI hardware and software solutions for NVIDIA's compute and GPU platforms. Resolving sophisticated customer issues and maintaining a high level of customer happiness. 🇺🇸 United States – Remote 💵 $108k - $207k / year ⏰ Full Time 🟠 Senior 🏗️ Platform Engineer 🦅 H1B Visa Sponsor Distributed Systems Docker Kubernetes Linux Senior Data Engineer, GCP, AI Platform 🕒 Yesterday nDeavour Consulting 1 - 10 🎯 Recruiter 👥 HR Tech 🤝 B2B Website LinkedIn All Job Openings Senior Data Engineer building analytical data platform on GCP for Mobile Wave Solutions. Collaborating on AI features and data governance for reliable data consumption. 🇺🇸 United States – Remote ⏰ Full Time 🟠 Senior 🏗️ Platform Engineer Airflow BigQuery Cloud ETL Google Cloud Platform Postgres Python SQL .NET View More Platform Engineer Jobs 🌐 Worldwide Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com Search Search Jobs by country Search jobs by city Search jobs by job title Search entry-level jobs Search junior-level jobs Search senior-level jobs Search jobs by tech stack Search jobs by contract type Search remote internships Search remote part-time jobs Remote jobs Anywhere in the World Companies Hiring Anywhere in the World Companies Hiring Sales People Anywhere in the World Companies Hiring Software Engineers Anywhere in the World Resources Advice Tips for finding remote jobs Interview questions and answers Resume examples Cover letter examples Post a job Affiliates Privacy policy Terms of service Job board SEO course AI Apply Copilot OpenClaw job finder Jobs by Country Remote jobs anywhere in the world (Worldwide remote jobs) Remote jobs United States Remote jobs Australia Remote jobs Brazil Remote jobs Canada Remote jobs France Remote jobs Ireland Remote jobs Germany Remote jobs Netherlands Remote jobs Spain Remote jobs UK Popular Jobs Remote data analyst jobs Remote customer support jobs Remote executive assistant jobs Remote marketing jobs Remote product designer jobs Remote product manager jobs Remote project manager jobs Remote recruiter jobs Remote sales jobs Remote software engineer jobs Jobs by Type Remote full-time jobs Remote part-time jobs Remote contract jobs Remote internship jobs Remote entry-level jobs Remote jobs with no experience required Remote junior jobs (1-3 years of experience) Digital nomad jobs Remote jobs with no degree required Freelance remote jobs Temporary remote jobs Remote jobs hiring now Stay at home mom jobs
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Data pipelineETLAWSbig data servicesMLOpsLinux administrationBashPythonInfrastructure as CodeApache Spark
Soft Skills
communicationproblem-solvingproject ownershiptroubleshootingcollaborationhigh-pressure environment managementdesign reviewscode reviewsincident managementcapacity planning
Certifications
Bachelor's Degree