Tech Stack
Azure, Cloud, Docker, ElasticSearch, Go, Grafana, JavaScript, Kubernetes, Node.js, Prometheus, Rust
About the role
- Ensure the application is running at all times in production
- Manage non-production environments such as pre-production/staging
- Maintain a strong understanding of application features and all supported blockchains
- Work closely with application and infrastructure engineers to prepare production deployments
- Apply fixes, improvements, and optimizations to applications and infrastructure
- Identify delivery issues impacting production uptime, scalability, or responsiveness
- Monitor, troubleshoot, and triage application alerts and issues
- Perform incident and problem management and post-resolution follow-ups
- Continuously improve monitoring tools and processes
- Coordinate disaster recovery (DR) testing and audits
- Maintain and support blockchain infrastructure (DevOps knowledge advantageous)
- Communicate with stakeholders on application issues (for senior individuals)
Requirements
- Degree in Computer Science, Engineering, Information Technology, or a similar discipline
- Knowledge of programming languages such as Rust, JavaScript (Node.js), or Go
- Prior work experience supporting applications
- Attention to detail with an analytical mindset
- Comfortable analysing logs, with a keen eye for anomalies and inconsistencies
- Strong curiosity and a desire to identify problems and root causes, and the ability to communicate findings effectively to the rest of the team
- Experience deploying to various cloud platforms (Azure, Google Cloud, AWS)
- Experience building, deploying, and maintaining monitoring tools (Grafana, Prometheus, etc.)
- Experience deploying and maintaining container infrastructure (Kubernetes, docker-compose, etc.)
- Experience building, deploying, and maintaining logging tools (Fluentd, ElasticSearch, Kibana, etc.)