
Database Developer – Pyspark
Supersourcing
full-time
Posted on:
Location Type: Remote
Location: Oregon • United States
Visit company websiteExplore more
About the role
- Design and develop a custom query builder for efficient JSON data processing using PySpark
- Collaborate with cross-functional teams, including data scientists and analysts
- Design and implement data models and database schemas for optimal storage and retrieval of JSON data
- Develop and maintain data pipelines and workflows, ensuring accuracy and reliability of processed data through validation and quality checks
Requirements
- Strong proficiency in Python programming language
- Extensive experience with PySpark and Apache Spark for data processing
- A solid understanding of JSON data structures, SQL, and database systems
- Experience with distributed computing frameworks and knowledge of data processing best practices
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonPySparkApache SparkJSONSQLdatabase systemsdata processingdata modelsdatabase schemasdata pipelines
Soft Skills
collaborationcross-functional teamworkcommunicationproblem-solvingattention to detail