
Principal Software Engineer – Next-Generation API, AI Platform Architecture
SpyCloud
full-time
Location Type: Remote
Location: Texas • United States
About the role
- Architect and implement customer-facing data and intelligence APIs with clear versioning, SLAs, and backward compatibility.
- Define and enforce API contracts for low-latency, high-throughput endpoints; agentic workflows (tools, actions, state transitions, handoffs); and context retrieval and enrichment.
- Design APIs as durable platform interfaces, not thin wrappers over storage systems.
- Establish standards for schema governance, payload design, and service boundaries.
- Architect service layers that support agentic workflows, including orchestration interfaces, guardrails, and runtime observability.
- Build API surfaces that expose model outputs safely and consistently.
- Collaborate with MLEs and Data Scientists to move models from experimentation into robust inference services.
- Ensure proper lifecycle management, monitoring, and rollback mechanisms for ML-powered endpoints.
- Design and implement service layers that sit on top of the gold-layer system of record.
- Build and evolve context hubs derived from authoritative systems of record, enabling consistent entity, identity, or risk context for downstream services.
- Ensure APIs are aligned with medallion architecture principles and platform data contracts.
- Prevent fragmentation by maintaining strong coupling to canonical schemas and governed data models.
- Design high-throughput, low-latency services supporting batch and streaming workloads.
- Implement scalable infrastructure patterns on AWS.
- Ensure strong observability, error handling, performance tuning, and cost awareness.
- Drive infrastructure-as-code practices and production readiness standards.
- Serve as a senior architectural voice in design reviews.
- Raise the bar on code quality, system design, and operational rigor.
- Mentor engineers on API design, distributed systems, and ML integration patterns.
- Influence platform direction through technical depth rather than organizational authority.
Requirements
- 10+ years of experience building distributed systems, APIs, or large-scale data platforms.
- Deep experience designing and implementing customer-facing APIs as long-lived products.
- Strong understanding of schema governance, versioning strategies, and contract design.
- Experience productionizing ML models and exposing inference services.
- Experience designing or integrating agentic or autonomous workflows.
- Experience building service layers on top of systems of record or semantic data layers.
- Strong background in distributed systems reliability, observability, and scalability.
- Proficiency in Go and Python (and/or other backend service languages).
- Hands-on experience with Databricks/Snowflake.
- Strong experience with AWS infrastructure and infrastructure-as-code tools.
Benefits
- 401(k) with Employer Contribution
- Health, Vision, and Dental Insurance
- Health Savings Account (HSA) available with Employer Contribution
- Employer Paid Life, Short-term, and Long-term Disability Insurance
- Generous PTO Plan and 16 paid holidays per year
- Retirement Savings Plan with Employer Contribution
- Employer Provided Private Health Insurance and Healthcare Cashplan
- Employer Paid Life Insurance and Income Replacement
- Generous Holiday Plan and 14 paid holidays per year
Applicant Tracking System Keywords
Hard Skills & Tools
API design, schema governance, versioning strategies, contract design, distributed systems, ML model productionization, agentic workflows, service layer architecture, observability, scalability
Soft Skills
mentoring, collaboration, influence, code quality, system design, operational rigor