Tech Stack
AWSAzureCloudGoogle Cloud PlatformJavaPythonSQL
About the role
- First-line owner for API incidents and customer requests; triage and resolve API issues.
- Work closely with Web/API Engineering and Enterprise/Platform teams to ensure platform resilience and observability.
- Analyze requests/responses, authentication, throttling, data mismatches, and configuration.
- Debug with code-level visibility: read and step through C# services, produce minimal repros, and propose fixes or small PRs.
- Operate across the stack: Azure API Management (APIM), Azure Functions, Service Bus/queues, SQL layer, identity/OAuth, and networking.
- Design and run load/security tests to identify performance and resilience improvements.
- Enhance Datadog dashboards, logs, and alerts to reduce MTTD/MTTR.
- Publish Postman collections and sample clients (C#, Python, Java) for developer experience.
- Create and maintain knowledge base articles and runbooks for common scenarios.
- Partner with Product and engineering teams on API enhancements and customer needs.
Requirements
- Ability to read and debug C# services; strong general programming fundamentals.
- Working knowledge of REST, OAuth 2.0/identity, and API best practices (rate limits, pagination, error patterns).
- Familiarity with cloud concepts (Azure/AWS/GCP) and deploying/operating APIs in the cloud.
- Experience with relational databases and SQL for diagnostics and data validation.
- Proven troubleshooting mindset and customer empathy; comfortable owning issues through resolution.
- (Nice-to-have) Experience with Azure APIM, Azure Functions, Service Bus, and Datadog (or comparable tools).
- (Nice-to-have) Prior work in load/security testing and performance diagnostics.
- (Nice-to-have) Ability to produce sample code in C#, Python, or Java for customer scenarios.