
Infrastructure Engineer
A.P. Moller - Maersk
full-time
Posted on:
Location Type: Hybrid
Location: Bangalore • 🇮🇳 India
Visit company websiteJob Level
Mid-LevelSenior
Tech Stack
AWSCloudDistributed SystemsDockerKubernetesPrometheusPythonScalaSparkSQLTerraform
About the role
- Design and maintain ingestion frameworks for high-volume structured and unstructured data from operational systems, APIs, file drops, and events
- Support streaming and batch use cases across latency windows
- Develop transformation logic using SQL, Python, Spark, and declarative tools like dbt or sqlmesh
- Handle deduplication, windowing, watermarking, and late-arriving data
- Collaborate with domain teams to annotate datasets with metadata, ownership, PII classification, and usage lineage
- Enforce naming standards, partitioning schemes, and schema evolution policies
- Work within a modern lakehouse architecture leveraging Delta Lake, S3, Glue, and EMR to ensure performance and queryability
- Instrument pipelines with quality checks, cost visibility, and lineage hooks; integrate with OpenMetadata, Prometheus, or OpenLineage
- Support deployment workflows via GitHub Actions, Terraform, and IaC patterns to enable production-readiness and multi-tenant safety
- Create reusable platform primitives, scaffolding for dataset onboarding, and codify data engineering standards
Requirements
- Strong foundation in data engineering: distributed systems, columnar storage formats (Parquet, Avro), data lake performance tuning, and schema evolution
- Hands-on cloud experience with AWS-native services like Glue, EMR, Athena, Lambda, and object storage (S3)
- Fluency in Python and SQL
- Experience with Spark and modern declarative tools like dbt or sqlmesh
- Familiarity with GitOps, containerized workflows (Docker, K8s), and CI/CD pipelines
- Experience with Terraform and IaC is highly valued
- Experience with instrumentation and observability tools (OpenMetadata, Prometheus, OpenLineage)
- Knowledge of Parquet/Avro, deduplication, windowing, watermarking, and late-arriving data handling
- Bonus: experience with Databricks, Snowflake, or Trino; Scala; Jinja-templated SQL; DSL-based modeling frameworks
- Collaboration skills: work with ML engineers, platform architects, security teams, and domain data owners
- Ability to communicate clearly and write clean, documented code
Benefits
- Diverse and inclusive workplace embracing different styles of thinking
- Support for adjustments and accommodations during application and hiring process (accommodationrequests@maersk.com)
- Freedom to experiment and challenge the status quo
- Greenfield project with executive visibility and cross-domain exposure
- Impact at global scale across 130+ countries and $4T+ in global trade
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
SQLPythonSparkdbtsqlmeshParquetAvroTerraformGitHub Actionsdata engineering
Soft skills
collaborationcommunicationclean code writing