
AI Platform & Systems Engineer
BNY
full-time
Posted on:
Location Type: Hybrid
Location: Lake Mary • Florida, Pennsylvania • 🇺🇸 United States
Visit company websiteJob Level
SeniorLead
Tech Stack
AnsibleAWSAzureCloudDistributed SystemsDockerGoogle Cloud PlatformJenkinsKubernetesLinuxPythonShell ScriptingTerraform
About the role
- Provide hands-on operational support and incident management for GPU-based compute infrastructure across hybrid and on-prem environments.
- Deploy, monitor, and troubleshoot containerized AI workloads using Kubernetes, Docker, and GPU orchestration tools such as Run:AI, Volcano, or Kubeflow.
- Automate infrastructure processes and workload provisioning using Python, Bash, and configuration management tools.
- Maintain and scale training/inference workloads using GitOps tools like Helm, ArgoCD, and integrate with CI/CD pipelines (GitLab, Jenkins).
Requirements
- Bachelor's degree in computer science or a related discipline, or equivalent work experience required; advanced degree preferred
- 8-10 years of related experience required; experience in the securities or financial services industry is a plus.
- Experience with Linux administration (RHEL/Ubuntu), shell scripting, and system-level debugging.
- Proven experience running distributed systems in Kubernetes and containerized environments using Docker.
- Familiarity with GPU resource management, including NVIDIA GPU Operator and device plugin lifecycle.
- Experience with CI/CD workflows and infrastructure automation tools such as GitLab CI, Jenkins, Terraform, Helm, or Ansible.
- Knowledge of networking fundamentals and persistent storage systems.
- Exposure to cloud platforms (AWS, GCP, Azure) and hybrid GPU environments.
- Ability to read and support Python code focused on ML/AI pipeline integration.
- Strong analytical and troubleshooting skills with a collaborative mindset.
- Effective communication skills and proactive ownership of platform reliability and performance.
Benefits
- BNY offers highly competitive compensation, benefits, and wellbeing programs rooted in a strong culture of excellence and our pay-for-performance philosophy.
- We provide access to flexible global resources and tools for your life’s journey.
- Focus on your health, foster your personal resilience, and reach your financial goals as a valued member of our team, along with generous paid leaves, including paid volunteer time, that can support you and your family through moments that matter.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
KubernetesDockerPythonBashGitOpsCI/CDLinux administrationGPU resource managementTerraformAnsible
Soft skills
analytical skillstroubleshooting skillscollaborative mindseteffective communicationproactive ownership