d-Matrix

Manufacturing Infrastructure Engineer

contract

Location Type: Hybrid

Location: Santa Clara • California • 🇺🇸 United States

Job Level

Senior, Lead

Tech Stack

Ansible, AWS, Azure, Chef, Cloud, Docker, Google Cloud Platform, Grafana, Jenkins, Kubernetes, Linux, Postgres, Prometheus, Puppet, Python, Shell Scripting, Terraform

About the role

  • Design, implement, and maintain scalable infrastructure supporting manufacturing operations
  • Ensure system reliability, security, and performance for production-critical applications
  • Troubleshoot complex technical issues while minimizing downtime
  • Collaborate with manufacturing, quality, and engineering teams to optimize system integration and data flow
  • Develop and maintain automation tools, monitoring systems, and recovery procedures
  • Implement and maintain database high-availability setups, migrations, and backup/recovery procedures
  • Participate in disaster recovery planning and serve on 24/7 on-call rotations for critical systems
  • Stay current with emerging technologies and manufacturing industry best practices

Requirements

  • Advanced Linux proficiency (RHEL, CentOS, Ubuntu) including system installation, configuration, and optimization
  • Command-line expertise with scripting (Bash, Python) for automation
  • System monitoring, performance tuning, and capacity planning
  • Package management and dependency resolution
  • File system management, storage solutions, and backup strategies
  • PostgreSQL administration including installation, configuration, and optimization
  • Database performance monitoring, query optimization, indexing, backup and recovery, high availability setups, and migrations
  • Containerization technologies (Docker, Kubernetes)
  • Configuration management tools (Ansible, Puppet, Chef)
  • Infrastructure as Code (Terraform, CloudFormation)
  • CI/CD pipeline setup and maintenance (Jenkins, GitLab CI, GitHub Actions)
  • Version control systems (Git)
  • Monitoring and logging solutions (Prometheus, Grafana, ELK stack)
  • Scripting languages: Python, Bash, PowerShell
  • High availability system design with disaster recovery planning and implementation
  • Network troubleshooting and optimization; understanding of industrial networking (Ethernet/IP, Profinet)
  • Cloud platform experience (AWS, Azure, GCP) and edge computing deployment
  • 24/7 on-call availability and ability to work in industrial environments (factory floors, clean rooms)
  • Minimum 7 years of industry experience; 3+ years in manufacturing, industrial automation, or similar environments preferred
  • Preferred certifications: Linux (RHCE, LPIC), PostgreSQL CE, AWS Solutions Architect, Azure Administrator
  • Legal authorization to work in the United States

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
Linux, shell scripting, PostgreSQL administration, containerization, configuration management, Infrastructure as Code, CI/CD pipeline, network troubleshooting, cloud platform experience, high availability system design
Soft skills
collaboration, troubleshooting, system reliability, performance tuning, capacity planning, disaster recovery planning, on-call availability, adaptability, communication, problem-solving
Certifications
RHCE, LPIC, PostgreSQL CE, AWS Solutions Architect, Azure Administrator