FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Principal Cloud Engineer – AI
NBCUniversalPrincipal Cloud Engineer designing and improving cloud platforms that support AI-enabled systems at Comcast. Responsible for cloud infrastructure, platform reliability, and automation.
Tech Stack
Tools & technologiesAnsibleAWSAzureCloudGoogle Cloud PlatformGrafanaKubernetesPackerPrometheusPythonShell ScriptingSplunkTerraform
About the role
Key responsibilities & impact- Designing, implementing, and operating cloud infrastructure supporting scalable, highly available AI‑enabled and Agentic platforms
- Apply Infrastructure as Code (IaC) practices (e.g., Terraform, Packer, Ansible) to provision and manage cloud resources consistently and securely
- Monitor platform health using metrics, logs, dashboards, and alerts, applying critical thinking to distinguish infrastructure, application, and AI‑driven failures
- Lead troubleshooting and resolution of complex cloud and platform issues, including distributed system and integration failures
- Develop and maintain automation and tooling (primarily Python and shell scripting) to improve reliability, diagnostics, and operational efficiency
- Implement and evolve observability solutions (e.g., Prometheus, Splunk, Grafana, CloudWatch) to improve system transparency and alert quality
- Collaborate with application, data, and AI teams to ensure platforms are operable, observable, and scalable prior to and after release
- Support CI/CD pipelines and release workflows, enabling safe and repeatable deployments across environments
- Ensure secure cloud operations, including secrets management, access controls, encryption, and secure networking practices
- Apply foundational AI literacy to understand agent lifecycles, orchestration patterns, and non‑deterministic execution behaviors that impact operations
- Use AI‑assisted tools to enhance investigation, documentation, and optimization, while validating outputs with sound engineering judgment
- Contribute to and maintain runbooks, platform documentation, and operational standards
- Participate in on‑call rotations and act as an escalation point during production incidents
Requirements
What you’ll need- 10+ Years relevant work experience
- Bachelor's Degree
- Cloud Infrastructure Engineering (AWS, Azure, or GCP)
- Kubernetes & Container Platforms
- Infrastructure as Code (Terraform, Packer, Ansible, or equivalent)
- Python Programming for automation and operational tooling
- Monitoring, Logging, and Alerting Systems
- CI/CD Tools and Release Automation
- Security Fundamentals (IAM, secrets management, encryption, network security)
Benefits
Comp & perks- array of options
- expert guidance
- always-on tools that are personalized to meet the needs of your reality—to help support you physically, financially and emotionally through the big milestones and in your everyday life
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Infrastructure as CodeTerraformPackerAnsiblePythonShell scriptingKubernetesCI/CDMonitoring systemsLogging systems
Soft Skills
Critical thinkingTroubleshootingCollaborationOperational efficiencyDocumentation
Certifications
Bachelor's Degree