
Senior Cloud Platform Engineer
Capgemini
full-time
Posted on:
Location Type: Hybrid
Location: Hyderabad • India
Visit company websiteExplore more
Job Level
About the role
- Ensure availability, scalability, reliability and security of cloud platforms and services.
- Design, deploy, and govern AI-powered agents (e.g., using Azure Copilot /AWS Bedrock) to achieve autonomous self-healing capabilities and automated resource management.
- Handle regular operational requests with hands-on experience using Terraform for EC2 changes, S3 updates, user access management, and managed services like SageMaker, Bedrock, Storage Gateway, RDS, and Transfer Family etc.
- Supervise and refine AI-generated Infrastructure-as-Code (IaC) (Terraform/Ansible) for Developing and maintaining complex and scalable Terraform/ansible/CloudBees (Jenkins) automation pipelines to provision, deploy, patch and manage cloud infrastructure.
- Implementing AI based automation solutions for Cloud Operations to Monitor performance, scalability and respond to incidents and operational issues autonomously.
- Implement GenAI tools to perform real-time Root Cause Analysis (RCA), correlate complex event data (logs, metrics), and auto-generate runbooks and incident summaries.
- Develop and train predictive ML models to analyze historical telemetry and forecast potential system outages or performance bottlenecks and configure proactive monitoring and alerting for critical services.
- Manage security and compliance by utilizing AI agents to detect configuration drift and auto-generate compliant updates for IAM, network, and security policies.
- Manage security and compliance by remediating vulnerabilities, configuring notifications, supporting audits, and maintaining certifications and governance standards.
- Collaborate with application, architecture, AIOps, FinOps and security teams to ensure production readiness.
- Lead proof-of-concepts, implement new cloud services and Adapt quickly to new cloud releases and features to enhance operational capabilities.
- Understand middleware components to provide end-to-end production support and troubleshoot complex operational scenarios.
- Works with application teams, analyzing logs and data, opens service requests, works with vendors, partners, and drives problem resolution.
- Reviews architectural designs for applications to ensure reliable and performant design patterns are implemented.
- Hands-on experience in deploying applications, workloads, and data to the cloud environment, often involving migration from on-premises infrastructure or other cloud providers.
- Advanced experience working with Finance and procurement team and implement cost optimization strategies based on changing workload patterns, business requirements, and new offerings from cloud providers.
Requirements
- Public cloud operations (AWS, Azure, GCP), with a strong focus on AIOps integration.
- Expertise in Infrastructure as Code (Terraform, CloudFormation), Ansible, and CI/CD pipelines
- Deep, demonstrable expertise designing and operationalizing solutions leveraging AWS Bedrock/Agent Frameworks and Azure Copilot for Cloud Operations.
- Proficiency in scripting languages (Python, Bash)
Benefits
- Competitive salary and performance-based bonuses
- Comprehensive benefits package
- Career development and training opportunities
- Flexible work arrangements (remote and/or office-based)
- Dynamic and inclusive work culture within a globally renowned group
- Private Health Insurance
- Pension Plan
- Paid Time Off
- Training & Development
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
TerraformAnsibleAWS BedrockAzure CopilotCI/CD pipelinesPythonBashInfrastructure as CodeMachine LearningRoot Cause Analysis
Soft Skills
collaborationleadershipproblem resolutionadaptabilitycommunication