Cohere

Solutions Architect

full-time

Location Type: Remote

Location: Remote • California • 🇺🇸 United States

Job Level

Mid-Level • Senior

Tech Stack

AWS • Azure • Cloud • Docker • Kubernetes • Python

About the role

  • Develop and deliver cutting-edge agentic AI solutions utilizing Cohere’s foundation models and Agentic AI Foundry - North.
  • Architect scalable, secure, and customizable NLP and generative AI solutions tailored to enterprise customer needs.
  • Collaborate with customers to understand complex workflows, design pilots, and translate business requirements into technical solutions encompassing model fine-tuning, custom agents, and agent orchestration.
  • Support deployment and integration of large language models (LLMs) and custom solutions into production environments using Kubernetes, Docker, and cloud infrastructures, ensuring high performance and security.
  • Lead technical engagements, including deep dives into AI architectures, workshop facilitation, and establishing best practices for agent-based AI systems and model customization.
  • Work with product development to provide customer feedback on agentic AI capabilities, contribute to product enhancements, and help shape future features.

Requirements

  • 5+ years of experience in AI/ML solution architecture, with demonstrated expertise in agentic AI, model customization, and deploying tailored AI models in enterprise contexts.
  • Strong hands-on skills with Python, Jupyter Notebooks, and cloud-native deployment tools such as Kubernetes and Docker, as well as cloud-managed AI services such as AWS SageMaker, Bedrock, Azure AI Foundry, or Google Vertex AI.
  • Experience designing and deploying “agentified” AI workflows that involve multiple interconnected models or agents to solve business challenges.
  • Hands-on experience building on agent orchestration frameworks like Cohere North and deploying custom agents to production.
  • Familiarity with model fine-tuning methodologies and with developing AI agents optimized for specific workflows and enterprise needs.
  • In-depth understanding of the strengths, weaknesses, and operational considerations of generative LLMs, with experience in customizing and orchestrating these models.
  • Excellent communication skills to articulate complex AI architectures to both technical stakeholders and executive audiences.

Benefits

  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)

Applicant Tracking System Keywords

Hard skills
AI solution architecture • agentic AI • model customization • Python • Jupyter Notebooks • Kubernetes • Docker • AWS SageMaker • Azure AI Foundry • Google Vertex AI
Soft skills
communication skills