Tech Stack
AWSAzureCloudMySQLOraclePostgresPythonSQL
About the role
- Develop tools and applications in Python to perform data processing, data analysis, modelling, forecasting.
- Utilize statistical, mathematical, and algorithmic principles in data solutions.
- Prototype, test, and deploy end-to-end data operation pipelines.
- Develop web-based applications using Streamlit for data visualization and reporting.
- Design, build, and optimize Python/Snowflake-based data operation pipelines for ingesting, cleaning, transforming, and loading data.
- Collaborate with global technical and analyst teams to gather requirements.
- Ensure quality and performance of large, time-series market and vendor datasets used by IDC's global products.
- Develop tools and systems to streamline processing, validate data quality, and integrate diverse data sources.
Requirements
- BA/BS in Computer Science, Mathematics, Statistics, Data Engineering or equivalent experience (3-5 years) in related fields.
- 1-2 years of relevant work experience in a technical role, preferably with experience managing individuals for team outcomes.
- Proficiency with Python and SQL-based environments, preferably Snowflake, or similar databases (e.g., MySQL, PostgreSQL, Oracle, or Microsoft SQL).
- Experience developing tools and applications in Python to perform data processing, data analysis, modelling, forecasting.
- Experience developing web-based applications using Streamlit for data visualization and reporting.
- Knowledge of statistical, mathematical, and algorithmic principles.
- Experience prototyping, testing, and deploying end-to-end data operation pipelines.
- Experience designing, building, and optimizing data operation pipelines for ingesting, cleaning, transforming, and loading data.
- Ability to collaborate with global technical and analyst teams to gather requirements.
- Nice to haves: Experience with cloud platforms like AWS, Azure, or Google Cloud; experience developing algorithms for data quality checks and outlier detection; familiarity with machine learning and Gen AI.