Data Platform, Cloud Infrastructure Engineer – Senior

Domus Global

Full-time

Location Type: Remote

Location: United States

About the role

  • Design, deploy and maintain cloud infrastructure for data and ML workloads using Infrastructure as Code.
  • Manage and evolve AWS-based data platform components running on Kubernetes (EKS).
  • Provision and maintain services such as EMR on EKS, SageMaker, MWAA (Managed Airflow), Lambda, API Gateway and Step Functions.
  • Implement and maintain IAM roles, permissions and governance policies aligned with compliance requirements.
  • Support orchestration and transformation frameworks used by data teams (dbt, Airflow, Step Functions).
  • Collaborate with data engineers to troubleshoot infrastructure or platform issues affecting pipelines.
  • Participate in platform observability initiatives (metrics, logging and monitoring).
  • Maintain Terraform modules and deployment pipelines.
  • Support platform migrations and organizational AWS changes when required.
  • Contribute to platform reliability, scalability and operational excellence.

Requirements

  • 3+ years of experience working with AWS cloud infrastructure
  • Strong experience with Terraform or similar Infrastructure as Code tools
  • Experience deploying and operating containerized workloads on Kubernetes / EKS
  • Solid understanding of AWS IAM, roles and security best practices
  • Experience with serverless architectures (Lambda, API Gateway, Step Functions)
  • Experience supporting data or ML platforms from an infrastructure perspective
  • DevOps mindset and experience managing CI/CD or infrastructure automation
  • Strong troubleshooting skills across distributed systems

Benefits

  • Remote
  • Professional development opportunities

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AWS, Terraform, Kubernetes, EKS, EMR, SageMaker, MWAA, Lambda, API Gateway, Step Functions
Soft Skills
troubleshooting, collaboration, operational excellence, scalability, reliability, communication, problem-solving, organizational skills, DevOps mindset, support