Tech Stack
AzureCloudDockerElasticSearchGoGrafanaJavaScriptKubernetesNode.jsPrometheusRust
About the role
- Ensure the application is running at all times in Production
- Manage non-production environments such as pre-production/staging
- Maintain a strong understanding of application features and all supported blockchains
- Work closely with application and infrastructure engineers on production deployments
- Apply fixes, improvements, or optimizations to applications and infrastructure
- Identify issues from deliveries that may impact production uptime, scalability, or responses
- Monitor, troubleshoot, and triage application alerts and issues
- Perform incident and problem management and post-resolution follow-ups
- Continuously improve monitoring tools and processes
- Coordinate disaster recovery testing and audits
- Maintain and support blockchain infrastructure (some DevOps knowledge advantageous)
- Communicate with stakeholders on application issues (for senior individuals)
Requirements
- Degree in Computer Science or Engineering or Information Technology or similar discipline
- Knowledge in programming languages such as Rust or Javascript or Node.js or Golang
- Past job experiences in supporting applications
- Attention to detail with an analytical mindset
- Comfortable with analysing logs and possess a keen eye for anomalies and inconsistencies
- Strong curiosity and desire to identify problems or root causes, and effectively communicate findings to the rest of the team
- Experience in deploying in various cloud (Azure, Google, Amazon)
- Experience in building, deploying and maintaining monitoring tools (Grafana, Prometheus, etc)
- Experience in deploying and maintaining container infrastructures (Kubernetes, docker-compose, etc)
- Experience in building, deploying and maintaining logging tools (Fluentd, ElasticSearch, Kibana, etc)