
Principal Infrastructure Engineer
Voxel51
full-time
Location Type: Remote
Location: United States
Salary
💰 $250,000 - $280,000 per year
About the role
- Shape the architecture and evolution of Voxel51’s infrastructure to support deployments ranging from individual researchers to Fortune 500 enterprises
- Design, build, and scale deployment systems across cloud (GCP, AWS, Azure) and on-premises environments, ensuring reliability, security, and repeatability
- Partner with enterprise customers to deliver and support production-grade deployments in their environments, guiding them through installation, troubleshooting, and scaling
- Lead infrastructure initiatives across engineering teams, enabling peers to develop, test, and ship features faster with robust internal tooling and automation
- Drive best practices in CI/CD, evolving our pipelines (currently GitHub Actions + Google Cloud Build) and introducing new approaches where they add value
- Develop and maintain deployment solutions for Voxel51-hosted environments (GKE) as well as customer on-prem installations (K8s or Docker Compose)
- Champion developer productivity, improving workflows for development and automated cloud deployments
- Troubleshoot and resolve complex infrastructure issues, spanning build failures, runtime failures, and customer deployment challenges
- Anticipate and prevent failures by designing monitoring, alerting, and predictive solutions for both internal and customer environments
- Mentor engineers and set technical direction, ensuring Voxel51’s infrastructure remains ahead of customer needs and industry trends
Requirements
- Deep experience with containerized environments
- Infrastructure as Code expertise (Terraform, Ansible, or equivalent)
- Scripting and automation skills (Bash or similar)
- Python expertise, including build and environment management, packaging/distribution, release management, and dependency debugging
- CI/CD systems experience, ideally GitHub Actions (we use this today)
- Cloud infrastructure knowledge, especially GCP (IAM, VPC, load balancing, ingress/egress routing, proxies, firewall rules)
- Database fundamentals, ideally MongoDB or similar NoSQL systems
- Observability skills, including designing meaningful monitors, logging, tracing, and alerting
- Security best practices, including certificates, service accounts, least privilege, and role assumptions
- Troubleshooting ability across complex, distributed systems (including with customers in the loop)
- Testing mindset: comfortable with designing and applying different types of tests to validate functionality
- Strong communication skills, with the ability to work directly with enterprise customers and across teams in a remote-first, collaborative environment
- Adaptability and curiosity, with the ability to ramp quickly on unfamiliar concepts and technologies
Benefits
- Equity in the form of options
- A variety of benefits
- Opportunity to grow in an exciting and collaborative environment
Applicant Tracking System Keywords
Hard Skills & Tools
containerized environments, Infrastructure as Code, Terraform, Ansible, scripting, automation, Bash, Python, CI/CD, cloud infrastructure
Soft Skills
troubleshooting, strong communication, adaptability, curiosity, mentoring, collaboration, developer productivity, best practices, testing mindset, anticipation of failures