
Reliability and Performance Engineer
Broadvoice
full-time
Location Type: Hybrid
Location: Aveiro • 🇵🇹 Portugal
Job Level
Mid-Level, Senior
Tech Stack
AWS, Cloud, Distributed Systems, Go, Grafana, Prometheus, Python, VoIP
About the role
- Design and implement comprehensive reliability strategies so that infrastructure and applications remain resilient and scalable and deliver consistently high performance
- Own the optimization of system performance by proactively identifying bottlenecks and anticipating challenges to minimize customer impact and maintain seamless service
- Work closely with Infrastructure and Product teams to define and align SLOs and SLIs that reflect both technical goals and exceptional user experience in a real-time communications environment
- Lead root cause analyses and post-incident reviews to drive continuous improvements in system reliability, availability, and operational maturity
- Develop automation and tooling to empower teams in monitoring, testing, and enhancing reliability at scale across platforms
- Manage capacity planning and forecasting efforts to ensure the communication platforms scale efficiently alongside business growth and increasing customer demand
- Participate actively in on-call rotations, contributing to a culture of operational excellence, rapid incident response, and minimal downtime in a mission-critical environment
Requirements
- 3+ years of experience in SRE, performance engineering, or infrastructure roles
- Strong understanding of distributed systems, cloud infrastructure (AWS preferred), and networking
- Proficiency in observability tools (e.g., Prometheus, Grafana, Datadog)
- Experience with performance profiling, load testing, and benchmarking
- Solid scripting or programming skills (e.g., Python, Go, Bash)
- Familiarity with SIP/VoIP systems and real-time communications
- Experience working in a SaaS or multi-region infrastructure environment
- Knowledge of incident management and reliability frameworks (e.g., Google SRE model)
Benefits
- Grow Your Career
- Enjoy Flexibility
- Community & Culture
- Make an Impact
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
SRE, performance engineering, infrastructure roles, distributed systems, cloud infrastructure, observability tools, performance profiling, load testing, scripting, programming
Soft skills
leadership, communication, problem-solving, collaboration, analytical thinking, continuous improvement, operational excellence