Tech Stack
Cloud, Go, IoT, Java, Microservices, Python
About the role
- Design and implement comprehensive data center infrastructure management (DCIM) solutions, including digital twin capabilities, real-time power monitoring, and environmental tracking systems
- Lead the development of asset maintenance management systems that track hardware lifecycle, predictive maintenance schedules, and automated alerting workflows
- Build workforce management tools that optimize technician scheduling, work order routing, and capacity planning across multiple data center locations
- Create automation-first solutions that reduce manual intervention in routine data center operations, with a focus on API-driven integrations and self-healing systems
- Collaborate closely with the infrastructure team to identify opportunities for tooling improvements and contribute written reports on automation initiatives
Requirements
- 5+ years of software engineering experience, including at least 2 years focused on DCIM, IoT, or infrastructure management systems
- Strong programming skills in Python, Go, or Java with experience building scalable microservices and RESTful APIs
- Deep understanding of data center operations including power distribution, cooling systems, rack management, and environmental monitoring protocols
- Experience with time-series databases, real-time data processing, and building monitoring dashboards for critical infrastructure
- Experience with network source-of-truth tooling (NetBox, ...)
- Experience with digital twin technologies and 3D visualization frameworks for data center modeling
- Previous experience in hyperscale data center environments or managing infrastructure for AI/GPU workloads