
Data Scientist
Irth Solutions
full-time
Posted on:
Location Type: Remote
Location: India
Visit company websiteExplore more
About the role
- Contribute to medallion architecture pipelines (Bronze, Silver, Gold) and ensure data quality and governance.
- Support lineage tracking, data contracts, and governance processes using Unity Catalog.
- Apply data residency and governance policies, ensuring secure and compliant handling of sensitive data.
- Develop and deploy machine learning models including forecasting, anomaly detection, NLP, and predictive analytics.
- Design and implement GenAI solutions such as RAG pipelines, LLM-powered assistants, and intelligent automation.
- Package and manage models using Unity Catalog and model registries.
- Build scalable workflows using Databricks, Spark, and cloud-native tools.
- Implement monitoring, model evaluation, and performance tracking to ensure reliability.
- Apply cost optimization and performance tuning best practices.
- Support enterprise semantic layers for BI tools such as Power BI and Databricks AI/BI.
- Collaborate with engineering and product teams to enable scalable, production-ready data products.
- Document models, pipelines, and analytical workflows.
- Implement access control policies (RBAC/ABAC), secure data access, and compliance controls.
- Support regulatory compliance including SOC 2, GDPR, ISO 27001, and PIPEDA.
- Maintain documentation supporting governance, audit, and disaster recovery requirements.
Requirements
- 3–6 years of experience in Data Science, Machine Learning, or applied AI roles.
- Strong experience with Python, SQL, and distributed processing frameworks such as Spark.
- Experience with Databricks, Delta Lake, and ML model deployment workflows.
- Experience with ML lifecycle including feature engineering, model training, evaluation, and monitoring.
- Hands-on experience with GenAI technologies including prompt engineering, RAG pipelines, and LLM integration.
- Experience with CI/CD pipelines, version control, and automated deployments.
- Strong understanding of data governance, security, and data quality practices.
- Strong analytical thinking, problem-solving, and communication skills.
- Experience with Azure (ADLS, Entra ID, Power BI) or AWS (S3, Secrets Manager, KMS) preferred.
- Experience with MLflow, streaming pipelines, and model serving preferred.
- Knowledge of geospatial analytics, infrastructure, or utility domain data preferred.
- Experience with observability, monitoring, and performance optimization tools preferred.
- Relevant cloud or Databricks certifications preferred.
- Bachelor’s or master’s degree in computer science, Software Engineering, or a related field, or equivalent professional experience.
Benefits
- Being an integral part of a dynamic, growing company that is well respected in its industry.
- Competitive pay based on experience.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonSQLSparkMachine LearningGenAIData GovernanceData QualityFeature EngineeringModel TrainingModel Evaluation
Soft Skills
Analytical ThinkingProblem-SolvingCommunication
Certifications
Cloud CertificationsDatabricks Certifications