DXS - Direct Expansion Solutions

Platform Architect

DXS - Direct Expansion Solutions

full-time

Posted on:

Origin:  • 🇺🇸 United States • Virginia

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

AnsibleCloudDistributed SystemsElasticSearchKubernetesLinuxPostgresTerraform

About the role

  • Define and own the cloud-native platform architecture to support multiple SaaS products.
  • Develop patterns for multi-product, multi-tenant deployment models.
  • Build scaling strategies to ensure resiliency, high availability, and performance.
  • Establish long-term standards for security, upgrades, lifecycle management, and governance in a containerized environment.
  • Define and implement CI/CD pipelines and automation strategies, leveraging Atlassian Bamboo, Bitbucket, and Jira where applicable.
  • Collaborate with DevOps to manage application deployment, scaling, monitoring, and upgrades.
  • Establish best practices for backup, recovery, and disaster recovery planning.
  • Ensure observability and monitoring frameworks (metrics, logging, tracing) are in place for proactive management.
  • Drive automation of infrastructure and platform provisioning with Infrastructure-as-Code.
  • Partner with product teams to ensure platform services align with business and customer needs.
  • Support product delivery teams by providing standardized platform services and tooling.
  • Directly contribute to customer success and faster product delivery by ensuring platform reliability and efficiency.
  • Serve as subject-matter expert for the SaaS platform and mentor engineers.
  • Collaborate with product teams, security, and operations to ensure platform alignment with business objectives.
  • Apply critical-thinking and problem-solving skills to challenge assumptions and refine strategies.

Requirements

  • Hands-on expertise with Kubernetes and a proven track record designing and running containerized services in SaaS or PaaS environments.
  • Deep knowledge of scalability, multi-tenancy, high availability, and security in distributed systems.
  • Ability to define RBAC in Kubernetes to manage SOD for internal teams and control access to platform resources for customer access.
  • Strong background in backup/restore, lifecycle management, and upgrade strategies.
  • Experience with automation/IaC (Terraform, Helm, Ansible, GitOps).
  • Solid foundation in Linux, networking, and observability.
  • Experience with SAP BTP, Kyma, and Kubernetes in enterprise contexts.
  • Experience managing BTP services integrated with SAP BTP Kyma runtime (e.g., Cloud Logging, Connectivity/Destination Services, PostgreSQL).
  • Experience building platforms in multi-product SaaS/PaaS environments.
  • Experience migrating legacy enterprise workloads to cloud-native architectures.
  • Knowledge of compliance frameworks (SOC 2, ISO 27001) for SaaS platform operations.
  • Expertise in OpenSearch (or Elasticsearch) for AI/ML data processing and search workloads.
  • Familiarity with AI/ML integration patterns using search/indexing technologies.
  • Experience working with Atlassian Jira, Bitbucket, and Bamboo for issue management and CI/CD.
  • Resume must be uploaded in English.
Red Cell Partners

Senior Site Reliability Engineer

Red Cell Partners
Seniorfull-time🇺🇸 United States
Posted: 36 days agoSource: boards.greenhouse.io
AnsibleAWSAzureCloudDistributed SystemsDockerGoogle Cloud PlatformGrafanaKubernetesPrometheusPythonTerraform
GitLab

Senior Site Reliability Engineer, Environment Automation

GitLab
Seniorfull-time$151k–$266k / yearCalifornia, Colorado, District of Columbia, Hawaii, Illinois · 🇺🇸 United States
Posted: 38 days agoSource: boards.greenhouse.io
AnsibleAWSCloudDistributed SystemsGoogle Cloud PlatformKubernetesLinuxPrometheusRubyRuby on RailsSDLCTerraform
Sinch

Senior Site Reliability Engineer

Sinch
Seniorfull-time$143k–$179k / yearColorado, Illinois · 🇺🇸 United States
Posted: 16 days agoSource: apply.workable.com
AnsibleAWSCassandraCloudDistributed SystemsElasticSearchGoGoogle Cloud PlatformGrafanaLinuxMicroservicesPrometheus+2 more
Poppulo

Software Architect

Poppulo
Senior · Leadfull-time🇮🇳 India
Posted: 16 hours agoSource: boards.greenhouse.io
AWSDistributed SystemsJavaScriptKafkaMicroservicesNode.jsReactTerraformTypeScript
Docusign

Principal Product Manager - Site Reliability

Docusign
Leadfull-time$174k–$328k / year🇺🇸 United States
Posted: 35 days agoSource: uscareers-docusign.icims.com
AnsibleAWSAzureCloudDistributed SystemsGoogle Cloud PlatformGrafanaKubernetesPrometheusTerraform