Help build and optimize an AWS-based data lake to support AI/ML initiatives and advanced analytics
Design and implement scalable data ingestion pipelines for both batch and real-time data from diverse structured and unstructured sources
Perform extensive data profiling, transformation, and enrichment to prepare clean, ML-ready datasets for data scientists and analysts.
Develop custom reports and data visualizations to support analytics and decision-making across business and technical teams
Collaborate with data scientists and business teams to deliver curated datasets and meet reporting needs for ML and analytics.
Support delivery of data lake and data warehouse/BI projects for external and internal clients, including partnering with ICF subject matter experts on project execution
Requirements
Bachelor’s degree (e.g., Computer Science, Engineering or related discipline)
6-8 years of experience in data engineering, with a strong background in pipeline development and data integration.
3+ years of hands-on experience with AWS data services, including AWS Glue, Lambda, S3, Step Functions, and Athena; familiarity with Redshift and Lake Formation is a plus.
6+ years of experience in SQL and programming, preferably in Python.
Experience with BI tools such as Tableau, Power BI, or Amazon QuickSight.
Experience with cloud integration tools such as Talend or Informatica
Excellent oral communication, thought leadership, and formal presentation skills
US Citizen or Lawful Permanent Resident (Green Card Holder).
Must be able to obtain and maintain a Public Trust clearance
Benefits
Reasonable Accommodations are available
Professional development
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
data engineering, pipeline development, data integration, AWS data services, SQL, Python, data profiling, data transformation, data enrichment, data visualization