
Production Engineer III
Veeam Software
full-time
Location Type: Remote
Location: India
About the role
- Own complex and escalated production issues from support, and drive long-term fixes in collaboration with engineering, including code, configuration, and architecture changes.
- Proactively identify and address risks uncovered during problem solving.
- Lead production efficiency initiatives, and develop and maintain processes, runbooks, and knowledge base integrity.
- Define, build and maintain production monitoring systems.
- Continuously improve alerting to minimize noise and ensure actionable, well-documented runbooks.
- Define and maintain SLIs/SLOs for key services, and use error budgets to guide operational and product decisions.
- Turn manual processes into automation.
- Own and drive the post-mortem review process and the actions arising from incident analysis.
- Serve as an escalation point for the support organization, and feed back knowledge and improvement recommendations.
- Collaborate with developers throughout the lifecycle of changes, from design through rollout and patch delivery, ensuring safe deployments and efficient incident mitigation.
- Participate in design reviews to ensure services are operable with minimal manual intervention in production (automation, safe deployments, clear runbooks), and share learnings through documentation and feedback.
Requirements
- 5+ years of experience in software engineering, site reliability, production engineering, or senior technical support roles operating distributed systems.
- Experience with log analysis and advanced troubleshooting.
- Basic programming experience (e.g., JavaScript, Go, TypeScript, Java, or C#).
- Experience deploying and troubleshooting systems on public cloud platforms (Azure preferred).
- Familiarity with observability tooling (e.g., Elastic, Prometheus, Grafana, OpenTelemetry).
- Understanding of distributed systems, networking, automation and CI/CD.
Benefits
- Make a high-impact contribution to the architecture and reliability of Veeam's first global SaaS product suite.
- Help shape a modern SRE organization from the ground up, influencing best practices, tooling, and culture.
- Collaborate with highly skilled teams across product, cloud engineering, security, and support.
- Access professional development resources including internal mentorship, technical training platforms, and volunteer days.
- Enjoy competitive compensation and benefits tailored to local markets in the US, Czechia, India, and Australia.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
log analysis, advanced troubleshooting, programming, JavaScript, Go, TypeScript, Java, C#, automation, CI/CD
Soft Skills
problem solving, collaboration, leadership, process development, documentation, risk management, incident analysis, communication, efficiency improvement, feedback