Salary
💰 $102,000 - $190,000 per year
Tech Stack
AnsibleAWSAzureCloudDockerGoGoogle Cloud PlatformGrafanaJavaKubernetesPrometheusPythonSplunkTerraform
About the role
- <p>At BNY, our culture allows us to run our company better and enables employees’ growth and success. As a leading global financial services company at the heart of the global financial system, we influence nearly 20% of the world’s investible assets. Every day, our teams harness cutting-edge AI and breakthrough technologies to collaborate with clients, driving transformative solutions that redefine industries and uplift communities worldwide.</p>\n<p>Recognized as a top destination for innovators and champions of inclusion, BNY is where bold ideas meet advanced technology and exceptional talent. Together, we power the future of finance – and this is what #LifeAtBNY is all about. Join us and be part of something extraordinary.</p>\n<p>We’re seeking a future team member for the role of <strong>Site Reliability Engineer</strong> to join our <strong>Wealth Services Platform</strong> team. This role is located in <strong>Jersey City, NJ</strong></p>\n<p><strong>In this role, you’ll make an impact in the following ways: </strong></p>\n<ul>\n <li>Drive reliability and performance by defining Service Level Objectives (SLOs) and Service Level Indicators (SLIs), improving observability, and proactively identifying and resolving system bottlenecks across cloud environments.</li>\n <li>Perform end-to-end identification and resolution of customer friction points, improving the entire lifecycle of services from inception, design, deployment, operation, to sustainment.</li>\n <li>Automate infrastructure and operations using tools such as Terraform, Kubernetes, CI/CD pipelines, and configuration management (e.g., Ansible) to eliminate toil, enable scalable and fault-tolerant deployments, and sustainably scale systems.</li>\n <li>Build and maintain monitoring and alerting systems using tools like Prometheus, Grafana, AppDynamics, Splunk, and custom dashboards to support real-time alerting, latency measurement, root cause analysis, and continuous health checks of applications and infrastructure.</li>\n <li>Lead incident management by participating in on-call rotations, conducting postmortems, and implementing automated recovery mechanisms to minimize downtime.</li>\n <li>Collaborate cross-functionally with product, infrastructure, DevOps, and operations teams to reduce incidents, build resilient services, and ensure architectural clarity.</li>\n <li>Develop platform tooling and pipelines for container orchestration, third-party integrations, and cloud-native operations to improve system efficiency and reliability.</li>\n <li>Mentor engineers and champion SRE best practices, embedding a reliability-first culture and ensuring technical excellence across engineering teams.</li>\n <li>Leverage KPIs and monitoring feedback to understand service performance, identify corrective actions, and contribute to the achievement of related teams' objectives.</li>\n <li>Understand the organization's technical ecosystem to recognize systemic interrelationships and perform gap identification, pushing for changes that improve reliability and velocity.</li>\n</ul>\n<p><strong>To be successful in this role, we’re seeking the following: :</strong></p>\n<ul>\n <li>Advanced knowledge of Site Reliability Engineering principles, including SLAs, SLOs, error budgets, postmortems, and reliability-focused system design.</li>\n <li>Strong expertise in cloud infrastructure platforms such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP).</li>\n <li>Proficiency with containerization technologies like Docker and Kubernetes.</li>\n <li>Experience with Infrastructure as Code tools such as Terraform and Helm.</li>\n <li>Fluency in DevOps-centric automation tools and CI/CD technologies for non-production and production environments.</li>\n <li>Solid programming and scripting skills in languages such as Python, Java, or Go, with a focus on automation, tooling, and system integration.</li>\n <li>Familiarity with monitoring and observability tools including Prometheus, Grafana, AppDynamics, Datadog, and Splunk.</li>\n <li>Strong collaboration and communication skills, with experience working in Agile environments and partnering with cross-functional engineering, product, and operations teams.</li>\n</ul>\n<p><strong>Qualifications:</strong></p>\n<ul>\n <li>Bachelor’s degree in computer science or a related discipline, or equivalent work experience; advanced degree preferred.</li>\n <li>8-10 years of related experience, with prior success in technical engineering roles.</li>\n <li>Experience in the securities or financial services industry is a plus.</li>\n <li>Coding experience beyond simple scripting is required.<br/> </li>\n</ul>\n<p>At BNY, our culture speaks for itself, check out the latest BNY news at:</p>\n<p><a href=\"https://www.bnymellon.com/newsroom\" target=\"_blank\">BNY Newsroom</a><br/><a href=\"https://www.linkedin.com/company/bny-mellon\" target=\"_blank\">BNY LinkedIn</a></p>\n<p>Here’s a few of our recent awards:</p>\n<ul>\n <li>America’s Most Innovative Companies, Fortune, 2025</li>\n <li>World’s Most Admired Companies, Fortune 2025</li>\n <li>“Most Just Companies”, Just Capital and CNBC, 2025</li>\n</ul>\n<p>Our Benefits and Rewards:</p>\n<p>BNY offers highly competitive compensation, benefits, and wellbeing programs rooted in a strong culture of excellence and our pay-for-performance philosophy. We provide access to flexible global resources and tools for your life’s journey. Focus on your health, foster your personal resilience, and reach your financial goals as a valued member of our team, along with generous paid leaves, including paid volunteer time, that can support you and your family through moments that matter.</p>\n<p>BNY is an Equal Employment Opportunity/Affirmative Action Employer - Underrepresented racial and ethnic groups/Females/Individuals with Disabilities/Protected Veterans.</p>\n<p><br/>BNY assesses market data to ensure a competitive compensation package for our employees. The base salary for this position is expected to be between $102,000 and $190,000 per year at the commencement of employment. However, base salary if hired will be determined on an individualized basis, including as to experience and market location, and is only part of the BNY total compensation package, which, depending on the position, may also include commission earnings, discretionary bonuses, short and long-term incentive packages, and Company-sponsored benefit programs. <br/> </p>\n<p>This position is at-will and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation) at any time, including for reasons related to individual performance, change in geographic location, Company or individual department/team performance, and market factors.</p>
Requirements
- Advanced knowledge of Site Reliability Engineering principles, including SLAs, SLOs, error budgets, postmortems, and reliability-focused system design. Strong expertise in cloud infrastructure platforms such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP). Proficiency with containerization technologies like Docker and Kubernetes. Experience with Infrastructure as Code tools such as Terraform and Helm. Fluency in DevOps-centric automation tools and CI/CD technologies for non-production and production environments. Solid programming and scripting skills in languages such as Python, Java, or Go, with a focus on automation, tooling, and system integration. Familiarity with monitoring and observability tools including Prometheus, Grafana, AppDynamics, Datadog, and Splunk. Strong collaboration and communication skills, with experience working in Agile environments and partnering with cross-functional engineering, product, and operations teams.