Salary
💰 $150,000 - $215,000 per year
Tech Stack
AirflowApacheAWSAzureCloudDockerETLGoogle Cloud PlatformKubernetesPandasPythonPyTorchSparkSQLTensorflow
About the role
- Build and maintain scalable ETL pipelines for processing large, diverse image datasets collected from our tractor-mounted camera systems in farms
- Stay up-to-date with current literature in computer vision models and architectures, and apply relevant advancements to our systems
- Develop, deploy, and monitor infrastructure for model training, evaluation, and inference, both in the cloud and on edge devices
- Design and implement intelligent active sampling infrastructure to optimize data collection and improve model performance
- Collaborate with a multidisciplinary team to integrate ML solutions into production robotics systems
- Work closely with agronomists and farmers to understand crop biology and translate domain knowledge into actionable ML features
- Be a generalist, supporting different parts of our software stack as needed
Requirements
- 3-5+ years of experience building production-grade data pipelines and ML infrastructure
- Proficiency in Python and experience with ML frameworks (e.g., TensorFlow, PyTorch)
- Strong experience with data engineering tools (e.g., Pandas, SQL, Apache Airflow, Spark)
- Familiarity with cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes)
- Experience working with massive amounts of real-world training data
- Familiarity with MLops software and data engineering to ensure consistent deployment of ML models
- Ability to work independently, learn quickly, and operate in a dynamic environment
- Enthusiasm for taking on multiple roles and responsibilities as our company grows