Tech Stack
AWSCloudDockerNumpyPostGISPostgresPythonRayRemote SensingSQL
About the role
- Execution, support and improvement of data pipelines that process large volumes of geospatial raster data Assist in orchestrating and automating workflows with tools such as Prefect Learn to work with relational databases (e.g. PostgreSQL) and help ensure data integrity and performance Contribute to the continuous improvement of our data processing frameworks by exploring challenges in large-scale Machine Learning pipelines and satellite image processing (e.g. data selection, observability, versioning, automated QA) Collaborate closely with experienced engineers and researchers, gaining hands-on exposure to industry-standard tools and practices
Requirements
- Python scripting experience Able to understand complex data flows Open mind, respectful attitude towards others and their work and good communication skills are a must You are fluent in English (written and spoken) Understanding of geospatial and remote sensing data and processing techniques is a big plus Interest in Machine Learning (Operations) and related tooling is a plus Knowledge of SQL or relational databases is a plus Geospatial knowledge and related tools/libraries such as GDAL, QGIS or rasterio is a plus Some exposure to cloud platforms, especially AWS is a plus Curiosity about the latest developments in Large Language Models (LLMs), computer vision models, and Vision-Language Models (VLMs) is a plus