FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

DevOps / Infrastructure Engineer
MLabsInfrastructure & DevOps Engineer responsible for deployment and reliability of AI trading agent fleet. Managing secure networking and scalability in a high-frequency trading environment.
Posted 6/1/2026full-timeRemote • 🇺🇸 United StatesMid-LevelSenior💰 $100,000 - $130,000 per yearWebsite
Tech Stack
Tools & technologiesAWSCloudDockerNode.jsWeb3
About the role
Key responsibilities & impact- Fleet Orchestration & Scaling: Architect, provision, and scale the core user agent fleet across a hybrid Railway and AWS ecosystem, ensuring each user retains an isolated, secure, and predictable containerized process with optimized cost tracking and precise lifecycle hooks.
- Secure Network Engineering: Establish, manage, and continuously harden private overlay networks using Tailscale in production, linking disparate user agents securely with core Model Context Protocol (MCP) servers and the underlying live trading runtimes.
- Automated User Provisioning: Design and construct an end-to-end, zero-touch deployment pipeline utilizing advanced infrastructure-as-code and CI/CD best practices, enabling seamless, single-click automated provisioning of containers, secrets management, and environmental configurations for new users.
- Operational Resilience & SRE: Define, build, and maintain comprehensive monitoring, telemetry, alerting, and automated incident response frameworks to guarantee graceful state retention, preserving live in-flight transaction states across sudden host restarts, scheduled key rotations, or regional cloud outages.
- Incident Management: Oversee system health and participate in direct real-incident response and on-call rotations to maintain strict operational continuity for the live global fleet.
Requirements
What you’ll need- Container PaaS Orchestration: Proven professional experience deploying, monitoring, and scaling complex architectures in production utilizing Railway, or equivalent containerized platform-as-a-service frameworks (such as Fly.io, Render, or Northflank).
- Advanced AWS Proficiency: In-depth technical mastery of Amazon Web Services (AWS), with practical expertise spanning Virtual Private Clouds (VPC), Identity & Access Management (IAM), Secrets Manager, and elastic scaling frameworks (ECS / AWS Lambda).
- Production-Grade Tailscale Networking: Demonstrated experience implementing Tailscale within a high-security production environment, with distinct competence configuring Access Control Lists (ACLs), complex subnet routing, and ephemeral node lifecycles.
- Modern Infrastructure & CI/CD: Mastery of Docker containerization, comprehensive CI/CD deployment pipelines, and modern Infrastructure-as-Code (IaC) paradigms.
- Blockchain & Onchain Context: Technical familiarity with blockchain mechanics, smart contract interactions, or web3 infrastructure paradigms to support decentralized application layers.
- High-Availability / Financial SRE Background: A proven professional history managing environments where system stability impacts critical financial outcomes, paired with total comfort managing on-call duties and live incident response.
Benefits
Comp & perks- Highly competitive compensation package
- The flexibility of a fully remote operating environment with an immediate start timeline.
- The opportunity to shape the architectural foundation of a cutting-edge technical ecosystem intersecting Artificial Intelligence and decentralized financial infrastructure.
- Access to top-tier modern tooling, modern infrastructure frameworks, and a highly streamlined, zero-red-tape development culture.
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
container orchestrationAWSTailscaleInfrastructure-as-Code (IaC)CI/CDDockermonitoringtelemetryincident responseblockchain
Soft Skills
operational resilienceincident managementproblem-solvingcommunicationteam collaborationleadership