Tech Stack
AnsibleAWSAzureChefCloudDockerGoogle Cloud PlatformGrafanaJenkinsKubernetesLinuxPostgresPrometheusPuppetPythonShell ScriptingTerraform
About the role
- Design, implement, and maintain scalable infrastructure supporting manufacturing operations
- Ensure system reliability, security, and performance for production-critical applications
- Troubleshoot complex technical issues with minimal downtime impact
- Collaborate with manufacturing, quality, and engineering teams to optimize system integration and data flow
- Develop and maintain automation tools, monitoring systems, and recovery procedures
- Implement and maintain database high-availability setups, migrations, and backup/recovery procedures
- Participate in disaster recovery planning and serve on 24/7 on-call rotations for critical systems
- Stay current with emerging technologies and manufacturing industry best practices
Requirements
- Advanced Linux proficiency (RHEL, CentOS, Ubuntu) including system installation, configuration, and optimization
- Command-line expertise with shell scripting (Bash, Python) for automation
- System monitoring, performance tuning, and capacity planning
- Package management and dependency resolution
- File system management, storage solutions, and backup strategies
- PostgreSQL administration including installation, configuration, and optimization
- Database performance monitoring, query optimization, indexing, backup and recovery, high availability setups, and migrations
- Containerization technologies (Docker, Kubernetes)
- Configuration management tools (Ansible, Puppet, Chef)
- Infrastructure as Code (Terraform, CloudFormation)
- CI/CD pipeline setup and maintenance (Jenkins, GitLab CI, GitHub Actions)
- Version control systems (Git)
- Monitoring and logging solutions (Prometheus, Grafana, ELK stack)
- Scripting languages: Python, Bash, PowerShell
- High availability system design with disaster recovery planning and implementation
- Network troubleshooting and optimization; understanding of industrial networking (Ethernet/IP, Profinet)
- Cloud platform experience (AWS, Azure, GCP) and edge computing deployment
- 24/7 on-call availability and ability to work in industrial environments (factory floors, clean rooms)
- Minimum 7 yrs of industry experience; preferred 3+ years in manufacturing, industrial automation, or similar environments
- Preferred certifications: Linux (RHCE, LPIC), PostgreSQL CE, AWS Solutions Architect, Azure Administrator
- Legal authorization to work in the United States
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Linuxshell scriptingPostgreSQL administrationcontainerizationconfiguration managementInfrastructure as CodeCI/CD pipelinenetwork troubleshootingcloud platform experiencehigh availability system design
Soft skills
collaborationtroubleshootingsystem reliabilityperformance tuningcapacity planningdisaster recovery planningon-call availabilityadaptabilitycommunicationproblem-solving
Certifications
RHCELPICPostgreSQL CEAWS Solutions ArchitectAzure Administrator