
Data Consultant
Particle41
full-time
Posted on:
Location Type: Remote
Location: Remote • 🇺🇸 United States
Visit company websiteJob Level
Mid-LevelSenior
Tech Stack
AWSAzureCloudETLFlaskGoogle Cloud PlatformLinuxMongoDBMySQLNoSQLPostgresPySparkPythonRedisScikit-LearnSparkSQL
About the role
- Design, develop, and maintain scalable ETL (Extract, Transform, Load) pipelines to process large volumes of data from diverse sources.
- Build and optimize data storage solutions, such as data lakes and data warehouses, to ensure efficient data retrieval and processing.
- Integrate structured and unstructured data from various internal and external systems to create a unified view for analysis.
- Ensure data accuracy, consistency, and completeness through rigorous validation, cleansing, and transformation processes.
- Maintain comprehensive documentation for data processes, tools, and systems while promoting best practices for efficient workflows.
- Collaborate with product managers, and other stakeholders to gather requirements and translate them into technical solutions.
- Participate in requirement analysis sessions to understand business needs and user requirements.
- Provide technical insights and recommendations during the requirements-gathering process.
- Participate in Agile development processes, including sprint planning, daily stand-ups, and sprint reviews.
- Work closely with Agile teams to deliver software solutions on time and within scope.
- Adapt to changing priorities and requirements in a fast-paced Agile environment.
- Conduct thorough testing and debugging to ensure the reliability, security, and performance of applications.
- Write unit tests and validate the functionality of developed features and individual elements.
- Identify and resolve software defects, code smells, and performance bottlenecks.
- Stay updated with the latest technologies and trends in full-stack development.
- Propose innovative solutions to improve the performance, security, scalability, and maintainability of applications.
- Collaborate effectively with cross-functional teams, including testers, and product managers.
Requirements
- Bachelor's degree in Computer Science, Engineering, or related field.
- Proven experience as a Data Engineering, with over of 3 years of experience.
- Proficiency in Python programming language.
- Experience with database technologies such as SQL (e.g., MySQL, PostgreSQL) and NoSQL (e.g., MongoDB) databases.
- Strong understanding of Programming Libraries/Frameworks and technologies such as Flask, API frameworks, datawarehousing/lakehouse, principles, database and ORM, data analysis databricks, panda's, Spark, Pyspark, Machine learning, OpenCV, scikit-learn.
- Utilities & Tools: logging, requests, subprocess, regex, pytest
- ELK stack, Redis, distributed task queues
- Strong understanding of data warehousing/lakehousing principles and concurrent/parallel processing concepts.
- Familiarity with at least one cloud data engineering stack (Azure, AWS, or GCP) and the ability to quickly learn and adapt to new ETL/ELT tools across various cloud providers.
- Familiarity with version control systems like Git and collaborative development workflows.
- Competence in working on Linux OS and creating shell scripts.
- Solid understanding of software engineering principles, design patterns, and best practices.
- Excellent problem-solving and analytical skills, with a keen attention to detail.
- Effective communication skills, both written and verbal, and the ability to collaborate in a team environment.
- Adaptability and willingness to learn new technologies and tools as needed.
Benefits
- Health insurance
- 401(k) matching
- Flexible work hours
- Professional development opportunities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
ETLPythonSQLNoSQLFlaskdata warehousingPysparkMachine learningOpenCVdata analysis
Soft skills
problem-solvinganalytical skillscommunication skillscollaborationadaptabilityattention to detailtechnical insightsrequirement analysisteam environmentinnovation
Certifications
Bachelor's degree in Computer ScienceBachelor's degree in Engineering