Tech Stack
AWSAzureCloudDockerGoogle Cloud PlatformJenkinsKubernetesLinuxOpenStackOraclePython
About the role
- Work on the integration of Base Command with public cloud providers such as Amazon Web Services, Microsoft Azure, Oracle Cloud Infrastructure and Google Cloud Platform
- Develop features that let organizations seamlessly deploy their workload into one or multiple public clouds
- Add many new features to allow Base Command clusters to be used for a wider range of workloads and to allow clusters to scale to hundreds of thousands of nodes
- Follow the latest developments in cloud technology and assist the support team with cloud specific support requests
- Work with the latest hardware (e.g. GPUs, accelerators, InfiniBand, Spectrum X) and software technologies (parallel file-systems, OpenStack, Docker, Kubernetes, Jupyter and Ceph)
- Opportunity to develop skills and take initiative
Requirements
- Degree in Computer Science or related field (or equivalent experience)
- 8+ years of industry experience in software development, on Linux platforms, preferably with a focus on infrastructure
- Fluent in C++ and/or Python and intimately familiar with object oriented software design, design patterns, and concurrent programming techniques
- Expertise in system-level programming and ability to develop robust tools and automation at scale
- Proficiency in automated testing of code, using CI/CD systems
- Strong familiarity with the Linux operating system, networking concepts in Linux, and common Linux server software
- Eagerness to learn and use new technologies
- Knowledge of AWS, Azure, OCI, GCP, OpenStack, Kubernetes, Jenkins and distributed programming beneficial
- Experience with high-performance computing and system administration would be an asset