
Senior AI Platform Engineer
myneva
full-time
Posted on:
Location Type: Remote
Location: Portugal
Visit company websiteExplore more
Job Level
Tech Stack
About the role
- We are looking for a Go Platform Engineer who thrives at the intersection of infrastructure, AI systems, and DevOps.
- In this role, you will architect and scale the backbone of our AI Platform: Ensuring high availability, low latency, and seamless integration of machine learning capabilities into production.
- You will own the microservices that power AI inference, build robust multi-tenant infrastructure, and support our Data & AI team with production-grade DevOps practices.
- Design, build, and maintain Go microservices that handle AI model inference, data processing pipelines, and real-time streaming workflows.
- Architect scalable APIs (gRPC/REST) that serve as the bridge between AI models and production applications.
- Own the Kubernetes infrastructure (EKS), including deployments, autoscaling policies, service mesh, and cluster health monitoring.
- Implement service-to-service communication using gRPC and message queues (RabbitMQ/SQS) for asynchronous processing.
- Integrate with cloud AI services (AWS Bedrock, OpenAI, Anthropic) and manage model serving infrastructure.
- Build multi-tenant capabilities including authentication (JWT/JWKS), rate limiting, usage tracking, and tenant isolation.
- Partner with the Data & AI team to productionize machine learning models—wrapping them in production-ready services with proper health checks, circuit breakers, and graceful degradation.
- Build comprehensive observability: structured logging, metrics (Prometheus), distributed tracing (Jaeger/Tempo), and alerting.
- Implement CI/CD pipelines and infrastructure-as-code (Terraform) for automated deployments and disaster recovery.
- Ensure high availability through proper monitoring, incident response, and post-mortem analysis.
- Optimize resource utilization for GPU workloads and cost-efficient scaling strategies.
Requirements
- Go Expertise: 3+ years of professional Go development experience with strong understanding of concurrency patterns, interfaces, channels, and error handling.
- Kubernetes Production Experience: 3+ years managing production Kubernetes clusters, including deployments, services, ingress controllers, resource management, and troubleshooting.
- Distributed Systems Knowledge: Deep understanding of CAP theorem, eventual consistency, idempotency, circuit breakers, and fault-tolerant design.
- gRPC & Async Messaging: Hands-on experience with gRPC/Protocol Buffers and message queues (RabbitMQ, SQS, Kafka) in production systems.
- Cloud Platform Experience: Strong experience with AWS services (EKS, S3, DynamoDB, Lambda) or equivalent cloud providers.
- DevOps Mindset: Experience with Docker, CI/CD pipelines, infrastructure-as-code, and GitOps workflows.
- Spoken language: You communicate confidently in English (C1 level); German skills are a plus.
Benefits
- A remote working time model to keep your everyday life flexible
- Exciting, challenging tasks in a dynamic, future-oriented environment
- A culture of appreciation and a harmonious working atmosphere in a growing, international company with opportunities to get involved
- A creative working environment, flat hierarchies and short decision-making processes
- Attractive remuneration models, a permanent employment contract
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
GoKubernetesgRPCRabbitMQSQSAWSTerraformCI/CDDockerDistributed Systems
Soft Skills
communicationcollaborationproblem-solvingincident responsepost-mortem analysis