Salary
💰 $130,000 - $150,000 per year
Tech Stack
Angular, Azure, Grafana, .NET, Prometheus, SQL
About the role
- Wellfit is the dental industry’s fintech solution, breaking down financial barriers so patients, providers, employers, and payors can access better care.
- Site Reliability Engineer (SRE) role focused on monitoring, debugging, and optimizing Azure App Services.
- Design, implement, and fine-tune monitoring systems for Azure-based applications using Application Insights and Azure Monitor.
- Build custom dashboards and analyze logs, metrics, and traces to proactively troubleshoot performance and reliability issues.
- Apply code-level debugging for C#, .NET, Angular, and SQL to resolve application issues.
- Optimize, configure, manage, and scale App Service environments for multiple applications.
- Use Diagnose and Troubleshoot Tools, Kudu, and PowerShell scripting to resolve application and infrastructure issues.
- Automate monitoring, alerting, and remediation workflows to improve reliability and reduce toil.
- Use APM tools like Grafana and Prometheus and quickly learn new monitoring tools as needed.
- Partner with developers and operations, document findings, and produce clear, high-impact root cause analysis (RCA) reports.
Requirements
- Bachelor’s degree in Computer Science, IT, or related field (or equivalent experience).
- Proven SRE experience with a focus on monitoring, debugging, and incident response.
- Extensive hands-on work with Azure App Services, Application Insights, and Azure Monitor.
- Skilled with Diagnose and Troubleshoot Tools, Kudu, and PowerShell scripting.
- Strong programming fundamentals with the ability to read and troubleshoot .NET/C# and Angular code.
- Experience in on-call operations, incident response, and RCA writing.
- Bonus: Experience with Grafana/Prometheus, Datadog/Dynatrace, Azure Front Door, CDN, Function Apps, WebJobs, Service Bus, or Event Hubs.
- Excellent communication, collaboration, and problem-solving skills.
- Azure certifications are a strong plus.