Design and implement a high-performance, scalable and reliable LLM service using Python, LLM Services (including OpenAI, Antropic’s Claude, and Meta’s LLaMA) and Cloud technologies to power applications and business needs including building services that can serve hundreds of concurrent users in a second.
Design and develop RAG pipelines Building Search Engines and Recommendation engines on domain specific corpus.
Collaborate with business leaders to understand their needs and challenges, and identify potential LLM use cases.
Monitor LLM performance and make improvements to ensure acceptable response times based on use cases.
Participate in code reviews and provide guidance to junior developers.
Mentor LLM and AI Engineers to help deliver the AI use cases across the company. Perform LLM modeling, fine tuning and deployment of advanced AI technology solutions.
Collaborate with cross functional teams to understand how to build LLM software solutions, including fine-tuning LLM models.
Utilize strong communication skills (written and verbal) to convey technical concepts and collaborate technical concepts and collaborating with cross-functional teams.
Collaborate with other developers and data scientists within the organization to share knowledge, ideas, and best practices.
Design, develop and deploy Large Language Models (LLM) software solutions to facilitate AI needs for all enterprise and business use cases.
Implement software best practices to ensure seamless development, testing, and deployment of LLM software solutions.
Collaborate with business leaders to understand their needs and challenges, and identify potential LLM use cases that provide best ROI opportunities.
Stay up-to-date with the latest advancements in LLM technologies, and continuously explore ways to integrate these advancements into existing solutions.
Develop and maintain comprehensive documentation, including technical specifications, user guides, and best practices.
Ensure data privacy and security compliance while working with LLMs.
Provide support and training to internal teams as needed.
Requirements
MINIMUM REQUIREMENTS: Bachelor’s degree or U.S. equivalent in Computer Engineering, Computer Science, Data Science, or related field, plus 5 years of professional experience as a Software Developer, Software Engineer, or any occupation/position/job title involving developing Large Language Model (LLM) applications.
In lieu of a Bachelor's degree plus 5 years of experience, the employer will accept a Master's degree or U.S. equivalent in Computer Engineering, Computer Science, Data Science, or related field, plus 3 years of professional experience as a Software Developer, Software Engineer, or any occupation/position/job title involving developing Large Language Model (LLM) applications.
3 years of professional experience utilizing Python programming, building ETL pipelines, and working with Transformer architecture models
3 years of professional experience working with Cloud Platforms (including AWS, GCP and Azure) and Vector databases (including MongoDB)
3 years of professional experience utilizing LLM concepts including model fine-tuning and transfer learning
3 years of professional experience utilizing LLM frameworks (including Hugging Face Transformers) and integrating with Python
3 years of professional experience automating and scaling web frameworks (including FastAPI and Django) for building production ready applications
3 years of professional experience building conversational chatbots using LLM's as per business needs
3 years of professional experience presenting complex concepts to both technical and non-technical audiences
3 years of professional experience working in a cross-functional team and collaborating with stakeholders at various levels of the organization
3 years of professional experience with DevOps toolsets and working with RDBMS databases including MySQL, SQL Server, and PotstgreSQL
3 years of professional experience utilizing Docker, Kubernetes and CI/CD tools and writing clean, maintainable, and testable code following best practices and industry standards
3 years of professional experience building machine learning models including Random Forest, Logistic Regression, Linear Regression, Support Vector Machines (SVM), XGBoost, and LightGBM
3 years of professional experience working with clustering models, including K-Means, DBSCAN, Gaussian Mixture Models, and Agglomerative Clustering
Benefits
Position allows telecommuting from anywhere in the U.S. and reports to HQ in New York.
Medical, Dental, Vision - multiple packages available based on your individualized needs
Life/AD&D Insurance - basic coverage at 100% company paid, additional supplemental available
Supplemental Short-term and Long-term Disability
FSA: Medical and Dependant Care
401K
Equity packages for each role
Time Off: Vacation, Sick and Parental Leave
EAP (Employee Assistance Program)
Pet Insurance
Home Office Stipend
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
communication skillscollaborationmentoringguidanceproblem-solvingdocumentationtrainingcross-functional teamworkpresentation skillsleadership
Certifications
Bachelor's degree in Computer EngineeringBachelor's degree in Computer ScienceBachelor's degree in Data ScienceMaster's degree in Computer EngineeringMaster's degree in Computer ScienceMaster's degree in Data Science