Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Rox Partner

AI Ops Engineer – Backend Developer, Python

Rox Partner

Senior AI Ops Engineer responsible for deploying and maintaining AI applications at Rox Partner. Engaging with cross-functional teams and managing complex data systems for high availability.

Posted 5/14/2026full-timeRemote • 🇧🇷 BrazilMid-LevelSeniorWebsite

Tech Stack

Tools & technologies
AWSCloudDjangoFlaskGrafanaGraphQLPython

About the role

Key responsibilities & impact
  • Deploy, orchestrate, and maintain AI applications in high-availability production environments
  • Develop and maintain complex system integrations using REST/GraphQL APIs, API Gateways, and Load Balancers
  • Build data pipelines and processing workflows for AI applications using Python in AWS environments
  • Design, optimize, and support architectures based on Agentic AI and Agent-to-Agent (A2A) communication
  • Implement and support integrations using MCP (Model Context Protocol)
  • Evolve and maintain OCR pipelines and unstructured data extraction workflows using LLMs
  • Create, manage, and enhance advanced monitoring and observability dashboards to ensure model health, inference performance, application availability, and operational cost control using Grafana, Rancher, and related tools
  • Monitor and optimize AI applications in production to ensure scalability, stability, and operational efficiency
  • Identify and resolve performance issues, bottlenecks, and failures in AI pipelines
  • Collaborate with cross-functional engineering, data, architecture, and product teams

Requirements

What you’ll need
  • Strong experience in backend development with Python (FastAPI, Flask, Django, or similar)
  • Extensive experience with AWS and cloud infrastructure services
  • Practical experience deploying and supporting Generative AI applications in production
  • Experience with integrations via REST/GraphQL APIs
  • Advanced knowledge of API Gateways, Load Balancers, and distributed architecture
  • Experience with monitoring, observability, and troubleshooting in production
  • Experience with tools such as Grafana, Datadog, ELK Stack, or similar
  • Hands-on knowledge of LangChain, LangFlow, and LangGraph
  • Experience with Agentic AI and A2A communication flows
  • Knowledge and implementation experience with MCP (Model Context Protocol)
  • Experience with OCR, document processing, and unstructured data extraction
  • Experience with CI/CD, DevOps, and/or MLOps
  • Experience with Git and automated deployment pipelines
  • Knowledge of containers, clusters, and management via Rancher

Benefits

Comp & perks
  • Remote work – Monday to Friday (09:00–18:00)
  • Home-office allowance – Credit on iFood card for meals/food worth R$ 300.00 per month
  • Birthday – Rox gifts you a voucher and a day off so you can enjoy your day
  • Courses – Full access to RoxSchool, Alura, Pluralsight, and O'Reilly (books and talks)
  • Certifications – Reimbursement of up to R$ 300.00 for technology certifications + a R$ 300.00 bonus per certification achieved from these providers
  • Psychologist support – Two psychotherapy sessions covered monthly by ROX with partner psychologists
  • Feedz partnership – A gamified platform to stay connected, improve communication, and track sentiments, engagement, feedback, development plans (PDI), and performance
  • WellHub (Gympass) – Partnership with gyms and health and wellness apps
  • Work equipment provided

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonFastAPIFlaskDjangoAWSREST APIsGraphQL APIsMCP (Model Context Protocol)OCRCI/CD
Soft Skills
collaborationtroubleshootingproblem-solvingcommunicationmonitoringobservabilityperformance optimizationscalabilitystabilityoperational efficiency