Salary
💰 $155,000 - $210,000 per year
Tech Stack
AirflowApacheAWSAzureBigQueryCloudCyber SecurityDockerGoogle Cloud PlatformGrafanaKubernetesPrometheusPythonTerraform
About the role
- Design, implement, and maintain scalable and reliable GCP infrastructure using Infrastructure as Code (IaC) principles, primarily with Terraform.
- Manage and optimize BigQuery & k8s logging configurations to ensure efficient data ingestion, analysis, and cost control.
- Develop and maintain FinOps dashboards to provide visibility into cloud spending and identify cost optimization opportunities (we currently use Cloudzero as a cloud-costs vendor).
- Provide and deliver deep, hands-on technical support to developers, acting as a trusted advisor to help optimize workflows, troubleshoot complex issues, and improve development practices.
- Implement and manage monitoring, alerting, and logging solutions for GCP services.
- Troubleshoot and resolve infrastructure-related issues, ensuring high availability and performance.
- Collaborate with Devops, IT, and InfoSec teams to leverage technology effectively while enhancing security and performance.
- Participate in on-call rotations as needed.
- Document infrastructure designs, configurations, and procedures.
- Build interfaces and connections to operations and capacity planning software.
Requirements
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- Proven experience as a DevOps Engineer or similar role, with a strong focus on cloud infrastructure and deep, hands-on developer support in complex environments.
- In-depth knowledge and hands-on experience with Google Cloud Platform (GCP) services, including but not limited to:
- BigQuery
- Compute Engine
- Cloud Storage
- VPC
- IAM
- In-depth knowledge of networking principles.
- Expertise in Terraform for infrastructure provisioning and management.
- Experience with FinOps practices and tools for cloud cost management.
- Proficiency in scripting languages (e.g., Python, Bash).
- Expertise in Git, GitHub workflows, CI/CD (CircleCI) automation, and efficient release pipelines.
- Experience with monitoring and logging tools (e.g., Cloud Monitoring, Cloud Logging, Grafana, Prometheus).
- Strong problem-solving skills and a collaborative mindset for working across cross-functional teams.
- Experience with backend orchestration tools like Apache Airflow.
- Familiarity with containerization technologies (e.g., Docker, Kubernetes).
- Experience with other cloud providers (AWS, Azure).
- Proven ability to manage regulated data and ensure security and compliance.