
Data Research Engineer – AI/ML
NextHire
full-time
Location Type: Remote
Location: Remote • 🇮🇳 India
Job Level
Mid-Level, Senior
Tech Stack
ETL, Python, PyTorch, SQL, TensorFlow
About the role
- Develop and implement methods to leverage AI and LLMs (Large Language Models) for process automation and data research efficiency.
- Proactively identify new AI/LLM-based solutions to streamline operations and improve data workflows.
- Act as a visionary for AI/LLM adoption, anticipating future technological developments and preparing the team to capitalize on them early.
- Assist in acquiring and integrating data from multiple sources, including web crawling, APIs, and other data pipelines.
- Design and optimize ETL workflows to ensure high-quality data availability for downstream analysis.
- Explore and evaluate third-party tools for modernizing legacy data systems and enhancing scalability.
- Collaborate cross-functionally with content, research, and analytics teams to understand and fulfill data requirements.
- Ensure timely delivery of project milestones in a fast-paced, dynamic environment.
- Support and collaborate with fellow engineers on the Data Research and Extraction Team.
- Utilize online technical resources effectively (e.g., StackOverflow, ChatGPT, Bard) while understanding their limitations.
Requirements
- Bachelor’s degree in Computer Science, Data Science, Engineering, or a related field (advanced degree is a plus).
- At least 4 years of experience in Data Engineering, AI/ML Engineering, or related fields.
- Strong proficiency in Python for data manipulation, automation, and API integration.
- Experience in AI/ML engineering and data extraction workflows.
- Proficiency in implementing Retrieval-Augmented Generation (RAG) pipelines using tools such as ChromaDB or Pinecone.
- Experience with agentic AI platforms (e.g., CrewAI, LangChain) for modular and autonomous task execution.
- Hands-on experience working with LLMs, including prompt engineering and mitigation of model “hallucinations.”
- Familiarity with machine learning frameworks such as TensorFlow or PyTorch.
- Exposure to NLP frameworks (spaCy, NLTK, Hugging Face, etc.).
- Understanding of SQL and data querying (a plus).
- Familiarity with web crawling techniques and API integration (a plus).
- Experience using version control tools such as Git for collaborative development.
- Strong problem-solving, analytical, and critical-thinking skills.
- Excellent communication and teamwork abilities.
- Ability to thrive in a high-growth environment with shifting priorities.
- Experience managing or mentoring AI/LLM-focused teams is a plus.
- Familiarity with Agile development methodologies (a plus).
Benefits
- Day off on the 3rd Friday of every month (one long weekend each month)
- Monthly Wellness Reimbursement Program to promote health and well-being
- Monthly Office Commutation Reimbursement Program
- Paid Paternity and Maternity Leave
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Python, AI/ML engineering, data extraction workflows, Retrieval-Augmented Generation (RAG), ChromaDB, Pinecone, agentic AI platforms, CrewAI, LangChain, LLMs
Soft skills
problem-solving, analytical skills, critical thinking, communication, teamwork, adaptability, mentoring, collaboration, visionary thinking, time management
Certifications
Bachelor’s degree in Computer Science, Bachelor’s degree in Data Science, Bachelor’s degree in Engineering