
Principal Cloud Reliability Engineer
T. Rowe Price
full-time
Posted on:
Location Type: Hybrid
Location: Owings Mills • Colorado • Maryland • United States
Visit company websiteExplore more
Salary
💰 $159,000 - $339,000 per year
Job Level
About the role
- Owns the architecture, implementation, and reliability outcomes of enterprise cloud platforms
- Balances strategic leadership with direct contribution, driving reliability standards while remaining hands‑on in critical areas of platform engineering and incident response
- Lead the design, architecture, and evolution of enterprise cloud platforms, including AWS Landing Zone (ALZ)
- Ensure cloud platforms are designed for high availability, fault tolerance, and operational resilience
- Design and implement cloud solutions using AWS services such as EC2, S3, ECS, EKS, ELB, RDS, Route 53, Lambda, and API Gateway
- Design and automate advanced AWS networking solutions including VPCs, Transit Gateway, VPC Peering, PrivateLink, and Direct Connect
- Build and maintain Infrastructure as Code (IaC) using Terraform, CloudFormation, Ansible, and Git‑based workflows
- Be accountable for Site Reliability Engineering (SRE) outcomes across cloud platforms and drive adoption of SRE best practices
- Standardize reusable modules and patterns that promote consistent, reliable deployments at scale
- Define and enforce reliability standards, including availability targets, recovery expectations, and resilience patterns
- Ensure instrumentation, monitoring, logging, and alerting are embedded into platforms and services
- Act as an escalation point for complex incidents, driving root‑cause analysis and long‑term remediation
- Design and implement guardrails that enable secure, reliable, self‑service cloud adoption
- Enable teams to own end‑to‑end services while meeting reliability and operational standards
- Participate in a Agile delivery model, contributing to stories and epics, tracking work in Jira, and supporting sprint releases
- Partner closely with Application Development, Development Services, Enterprise Architecture, and Enterprise Security teams
- Mentor engineers while promoting a culture of operational excellence and tech modernization to drive client value.
Requirements
- Bachelor's degree or the equivalent combination of education and relevant experience
- 10+ years of experience designing and operating cloud infrastructure with senior‑level impact
- Deep hands‑on experience with AWS
- Expert knowledge of: Cloud infrastructure, container platforms and serverless deployments, Reliability engineering concepts (availability, resilience, observability), Operating systems, networking, identity and access management, Infrastructure automation, CI/CD pipelines, and DevSecOps practices
- Proven ability to design, build, and operate enterprise‑scale, highly reliable cloud platforms
- Strong troubleshooting skills across infrastructure, platform, and reliability layers
- Experience mentoring engineers and influencing teams without formal line management.
Benefits
- Competitive compensation
- Annual bonus eligibility
- A generous retirement plan
- Hybrid work schedule
- Health and wellness benefits, including online therapy
- Paid time off for vacation, illness, medical appointments, and volunteering days
- Family care resources, including fertility and adoption benefits
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSEC2S3ECSEKSELBRDSRoute 53LambdaAPI Gateway
Soft Skills
strategic leadershipmentoringtroubleshootinginfluencing teamsoperational excellence
Certifications
Bachelor's degree