Salary
💰 $84,300 - $140,500 per year
Tech Stack
AWSAzureCloudConsulDockerGoogle Cloud PlatformKubernetesMicroservicesTerraformVault
About the role
- Support and maintain reliability of McKesson cloud infrastructure and services
- Deploy and manage workloads on Kubernetes using deployment tools like ArgoCD
- Implement and manage Infrastructure as Code with Terraform and Terraform Cloud
- Manage secrets and configuration using tools such as Hashicorp Vault and Consul
- Instrument and monitor systems using observability tools (Datadog) and perform root cause analysis
- Build and maintain CI/CD pipelines and automation tooling
- Ensure cloud security and compliance within infrastructure
- Collaborate with development teams, architects, and stakeholders to deliver reliable services
- Diagnose and resolve complex system issues and implement corrective actions
- Continuously learn and adopt cloud-native best practices and tools
Requirements
- Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience
- 1+ years of experience in a Site Reliability Engineering, DevOps or related technical role
- Exposure to cloud platforms (Azure, AWS, GCP) is a plus
- Experience with containerization, scripting, or automation in any environment
- Familiarity with Kubernetes and deployment tools like ArgoCD
- Experience with Infrastructure as Code (IaC) using Terraform, specifically with Terraform Cloud
- Exposure to secrets management tools such as Hashicorp Vault
- Experience with monitoring and observability tools, particularly Datadog, or a desire to learn observability best practices
- An understanding of service discovery and configuration management concepts (e.g., Consul) is a plus
- Knowledge of CI/CD pipelines and automation tools
- Basic understanding of cloud-native architectures and best practices
- Knowledge of containerization technologies like Docker
- Knowledge of microservices architecture and its implementation in a cloud environment
- Familiarity with security best practices in cloud environments
- Experience in implementing and managing compliance standards within cloud infrastructure
- Strong analytical and problem-solving skills to diagnose and resolve complex system issues
- Ability to perform root cause analysis and implement corrective actions
- Excellent collaboration skills to work effectively with development teams, architects, and other stakeholders
- Strong verbal and written communication skills for documenting processes and procedures
- Willingness to stay updated with the latest industry trends and technologies in cloud computing and SRE practices
- Ability to quickly adapt to new tools and technologies as needed
- Experience with other cloud providers (AWS, GCP) is a plus
- Familiarity with agile methodologies and DevOps culture