FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesHadoopHBaseJenkinsKafkaMongoDBNoSQLPySparkPythonSparkYarn
About the role
Key responsibilities & impact- Ability to design and build Python-based code generation framework and runtime engine by reading Business Rules repository
- Generate PySpark runtime executable code for all business rules stitched together
- Orchestrate the pipeline of runtime executables as per standardization and curation needs on the Hadoop cluster
- Build Spark code generation optimizers which will factor-in the rule processing patterns
- Build PySpark based applications for both batch and streaming requirements
- Design a graph-based recursive model for capturing Business Rules metadata in a JSON format
- Build recursive parsers of JSON and XML documents and objects for metadata-driven code generation on PySpark
- Develop and execute data pipeline testing processes and validate business rules and policies
- Optimize performance of the built Spark applications in Hadoop
- Participate in the agile development process and document issues and bugs relative to data standards
- Create and maintain an integration and regression testing framework in Jenkins integrated with BitBucket and GIT repositories
- Pair up with other data engineers to develop analytic applications leveraging Big Data technologies: Hadoop, NoSQL, and In-memory Data Grids
Requirements
What you’ll need- Bachelor's degree in a quantitative field (such as Engineering, Computer Science, Statistics, Econometrics) and a minimum of 5 years of experience
- Minimum 5 years of extensive experience in design, build and deployment of Python-based applications
- Minimum 3 years of experience in build & deployment of Bigdata applications using SparkSQL, SparkStreaming in Python
- Expertise on graph algorithms and advanced recursion techniques
- Expertise in handling complex large-scale Big Data environments preferably (20Tb+)
- Minimum 3 years of experience in HIVE, YARN, Kafka, HBase, MongoDB
Benefits
Comp & perks- Health insurance
- Professional development
- Flexible work arrangements
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonPySparkSparkSQLSparkStreaminggraph algorithmsrecursion techniquesdata pipeline testingperformance optimizationJSONXML
Soft Skills
collaborationproblem-solvingdocumentationagile development
