Salary
💰 $115,000 - $230,000 per year
Tech Stack
AirflowAWSAzureCassandraCloudJavaKafkaKubernetesMongoDBNoSQLOpen SourcePostgresPythonRayRedisSDLCSpark
About the role
- Own design, development and maintenance of high-performance, durable and scalable platform components that jointly power large sets of end-to-end GenAI agentic workflows (e.g., AI agent building & deployment, prompt management, workflow orchestration, MCP registry)
- Build key multi-tenant services that power the building, testing, simulation, deployment and hosting of LLM-based agents
- Contribute to the selection, evaluation, and implementation of software platforms, tools, and frameworks, balancing build vs. buy, speed to market, maintainability
- Collaborate with cross-functional teams, including data scientists, ML engineers, software engineers, product managers, designers to gather requirements, define project scope and prioritize feature backlogs
- Establish pragmatic technical visions & roadmaps that balance business outcome, product release timelines and engineering excellence
- Assist in the planning and estimation of software development projects, ensuring the efficient allocation of resources and timely delivery of solutions
- Mentor and guide junior engineers via code reviews and design sessions, fostering a collaborative and high-performance team culture, elevating AI engineering best practices across the company
Requirements
- 5+ years of professional software development experience with at least two general-purpose programming languages such as Java, C++, Python or C#
- 4+ years of experience designing and building AIML platform and systems utilizing open-source /cloud-agnostic components such as search engine (e.g. elastic search, Qdrant), data warehouse (e.g. snowflake), streaming platform (e.g. Kafka), relational database (e.g. postgresql), Nosql (e.g. MongoDB, Cassandra), distributed processing (e.g. Spark, Ray), workflow management (e.g. Airflow, Temporal), memory management (e.g. Redis)
- 3+ years experience managing end-to-end software development life cycle (version control, CICD pipelines, Kubernetes clusters, testing, monitoring & alerting, production support)
- 3+ years experience building training, finetuning, real-time/batch inferencing and evaluation systems for AIML models and LLMs, esp. utilizing GPU-powered infrastructure
- Bachelors degree or above in Computer Science, Engineering, Statistics or a related field
- Preferred: 3+ years experience with cloud providers such as Azure and AWS
- Preferred: 2+ years experience building conversational experiences and agentic workflows, leveraging open source LLMs (e.g. Llama, Mistral, Claude)
- Preferred: Experience with LLM observability software such as Langsmith, Arize Phoenix
- Preferred: Experience building LLM-based AI agent workflows via both no code/low code and traditional high-code development environments
- Strong communication and problem - solving skills
- GEICO will consider sponsoring a new qualified applicant for employment authorization for this position