Python Developer

EXL

Senior Spark solution developer to design and build code generation frameworks for a Business Rules Engine on Hadoop, integrating data across sources and providing analytical insights.

Posted 5/26/2026 • full-time • Chennai • 🇮🇳 India • Mid-Level, Senior

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills

Python, PySpark, SparkSQL, SparkStreaming, graph algorithms, recursion techniques, data pipeline testing, performance optimization, JSON, XML

Soft Skills

collaboration, problem-solving, documentation, agile development

Tools & Technologies

Hadoop, Jenkins, BitBucket, GIT, HIVE, YARN, Kafka, HBase, MongoDB, In-memory Data Grids

Industry Keywords

Big Data, data standards, business rules, metadata, data engineering

Tech Stack

Tools & technologies

Hadoop, HBase, Jenkins, Kafka, MongoDB, NoSQL, PySpark, Python, Spark, Yarn

About the role

Key responsibilities & impact

Design and build a Python-based code generation framework and runtime engine that reads the Business Rules repository
Generate PySpark runtime executable code for all business rules stitched together
Orchestrate the pipeline of runtime executables as per standardization and curation needs on the Hadoop cluster
Build Spark code generation optimizers that factor in the rule processing patterns
Build PySpark based applications for both batch and streaming requirements
Design a graph-based recursive model for capturing Business Rules metadata in a JSON format
Build recursive parsers of JSON and XML documents and objects for metadata-driven code generation on PySpark
Develop and execute data pipeline testing processes and validate business rules and policies
Optimize performance of the built Spark applications in Hadoop
Participate in the agile development process and document issues and bugs related to data standards
Create and maintain an integration and regression testing framework in Jenkins integrated with BitBucket and GIT repositories
Pair up with other data engineers to develop analytic applications leveraging Big Data technologies: Hadoop, NoSQL, and In-memory Data Grids

Requirements

What you’ll need

Bachelor's degree in a quantitative field (such as Engineering, Computer Science, Statistics, Econometrics) and a minimum of 5 years of experience
Minimum 5 years of extensive experience in design, build and deployment of Python-based applications
Minimum 3 years of experience in building and deploying Big Data applications using SparkSQL and SparkStreaming in Python
Expertise in graph algorithms and advanced recursion techniques
Expertise in handling complex, large-scale Big Data environments (preferably 20 TB+)
Minimum 3 years of experience in HIVE, YARN, Kafka, HBase, MongoDB

Benefits

Comp & perks

Health insurance
Professional development
Flexible work arrangements