Tech Stack
AWSAzureCloudDockerEC2FirewallsGroovyJavaJMeterKubernetesLinuxNoSQLPythonSplunkVMware
About the role
- Design, implement, and maintain scalable and reliable infrastructure on AWS.
- Monitor system performance and availability using tools like New Relic and Splunk and AWS CloudWatch.
- Conduct performance testing using tools such as JMeter, Performance Center to identify bottlenecks and optimize system performance.
- Develop and maintain CI/CD pipelines to support rapid and reliable software delivery.
- Implement and manage incident response processes, including root cause analysis and postmortems.
- Created dashboards and configured alerts in Splunk, New Relic, and AWS CloudWatch to monitor system performance, availability, and application health.
- Collaborate with development teams to improve system reliability and performance.
- Ensure security and compliance standards are met across infrastructure and applications.
- Continuously improve observability and alerting systems.
Requirements
- 8-9+ years of experience as an SRE, DevOps Engineer, Performance engineer or similar role.
- Strong hands-on experience with AWS services (EC2, EBS, Lambda, RDS, S3, ECS, EKS etc.).
- Proficiency in performance testing tools like JMeter, Performance Center and methodologies.
- Experience with containerization and orchestration (Docker, Kubernetes).
- Experience with monitoring and logging tools like CloudWatch, New Relic and Splunk.
- Strong scripting skills (Bash, Groovy, Java Script, Python).
- Experience with performance diagnostics, tuning, and JVM/JBoss EAP on Linux and on AWS EKS.
- Working knowledge of Agile development practices.
- Excellent problem-solving and communication skills.
- Candidates must be able to obtain and maintain a Public Trust clearance
- Candidates must have lived in the United States 3 out of the past 5 years
- Bachelor’s degree in computer science, Information Technology or equivalent
- AWS Certifications are preferred