Senior Platform Engineer – AI Agentic
NBCUniversal
Platform Engineer who designs, builds, and operates AI-enabled infrastructure for Comcast. Leads the implementation of cloud solutions, ensuring scalability, reliability, and integration with application and data teams.
Tech Stack
Tools & technologies
Ansible, AWS, Azure, Cloud, Google Cloud Platform, Grafana, Packer, Prometheus, Python, Shell Scripting, Splunk, Terraform
About the role
Key responsibilities & impact
- Design, implement, and operate cloud infrastructure supporting scalable, highly available AI‑enabled and Agentic platforms.
- Apply Infrastructure as Code (IaC) practices (e.g., Terraform, Packer, Ansible) to provision and manage cloud resources consistently and securely.
- Monitor platform health using metrics, logs, dashboards, and alerts, applying critical thinking to distinguish infrastructure, application, and AI‑driven failures.
- Lead troubleshooting and resolution of complex cloud and platform issues, including distributed system and integration failures.
- Develop and maintain automation and tooling (primarily Python and shell scripting) to improve reliability, diagnostics, and operational efficiency.
- Implement and evolve observability solutions (e.g., Prometheus, Splunk, Grafana, CloudWatch) to improve system transparency and alert quality.
- Collaborate with application, data, and AI teams to ensure platforms are operable, observable, and scalable prior to and after release.
- Support CI/CD pipelines and release workflows, enabling safe and repeatable deployments across environments.
- Ensure secure cloud operations, including secrets management, access controls, encryption, and secure networking practices.
- Apply foundational AI literacy to understand agent lifecycles, orchestration patterns, and non‑deterministic execution behaviors that impact operations.
- Use AI‑assisted tools to enhance investigation, documentation, and optimization, while validating outputs with sound engineering judgment.
- Contribute to and maintain runbooks, platform documentation, and operational standards.
- Participate in on‑call rotations and act as an escalation point during production incidents.
Requirements
What you’ll need
- 7-10 years of relevant experience
- Bachelor's degree
- Cloud Infrastructure Engineering (AWS, Azure, or GCP)
- Infrastructure as Code (Terraform, Packer, Ansible, or equivalent)
- Python Programming for automation and operational tooling
- Monitoring, Logging, and Alerting Systems
- CI/CD Tools and Release Automation
- Security Fundamentals (IAM, secrets management, encryption, network security)
Benefits
Comp & perks
- health insurance
- retirement plans
- paid time off
- flexible work arrangements
- professional development opportunities
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Cloud Infrastructure Engineering, Infrastructure as Code, Terraform, Packer, Ansible, Python Programming, Monitoring Systems, Logging Systems, Alerting Systems, CI/CD Tools
Soft Skills
Critical Thinking, Troubleshooting, Collaboration, Operational Efficiency, Documentation, Problem Solving, Communication, Leadership
Certifications
Bachelor's Degree