Tech Stack
AWSCloudDistributed SystemsDockerGoogle Cloud PlatformGrafanaJavaScriptLinuxNGINXNode.jsPrometheusRust
About the role
- Design, develop, and maintain robust monitoring and alerting tools for mission-critical systems
- Automate production testing to ensure reliability and scalability
- Test and validate solutions in real production environments
- Proactively identify, troubleshoot, and resolve performance and software issues
- Track, update, and drive resolution of technical problems across teams
- Suggest and implement architectural enhancements and process improvements
- Research, evaluate, and recommend new technologies and vendor solutions
- Safeguard critical systems by applying best-in-class security practices and solutions
- Collaborate closely with Development, QA, and DevOps teams to ensure system reliability and performance
Requirements
- 5+ years of experience as a Node.js / JavaScript developer, ideally with a background in monitoring and supporting large-scale production systems
- Strong knowledge of REST API design and WebSockets
- Hands-on experience with cloud platforms such as AWS or GCP
- Solid understanding of distributed systems at scale
- Proficiency working in a Linux environment
- Excellent communication skills in English, with the ability to explain complex concepts clearly and logically
- Optional: Familiarity with tools such as Docker, Nginx, Prometheus, Grafana
- Optional: Experience working with time-series databases
- Optional: Understanding of low-level programming languages (e.g., C, C++, Rust)
- Optional: Previous contributions to open-source projects