Salary
💰 $170,000 - $185,000 per year
Tech Stack
AirflowAWSCloudDynamoDBETLGoJavaKafkaPythonRedisSparkSQL
About the role
- At Socure—verify identities and eliminate fraud using ML and predictive analytics.
- Design, build, and maintain scalable data pipelines to collect, clean, transform, and aggregate large-scale device and behavioral signals.
- Build and support real-time velocity systems that compute session-level and user-level aggregations with sub-second latency.
- Develop and manage data infrastructure such as data lakes and data warehouses.
- Optimize data architectures for performance, scalability, cost efficiency, and maintainability.
- Collaborate with ML engineers, data scientists, and product stakeholders to ensure data availability, reliability, and privacy compliance.
- Maintain and enhance orchestration workflows; automate workflows and monitoring systems.
- Ensure data quality, consistency, and security across systems; implement best practices in data modeling, schema evolution, and pipeline observability.
- Document systems and processes for internal use and compliance.
Requirements
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related technical field.
- 5+ years of experience building and maintaining large-scale data systems and pipelines in cloud environments.
- Strong programming experience in Python, Go, or Java.
- Proficiency in SQL.
- Understanding of data modeling, ETL/ELT processes, and data warehousing.
- Deep experience with distributed processing frameworks such as Spark, Kafka, and Flink.
- Hands-on experience with AWS services including Glue, Kinesis, EMR, Lambda, and DynamoDB.
- Experience with orchestration tools such as Airflow or AWS Step Functions.
- Understanding of real-time data processing, event-driven architectures, and stream processing patterns.
- Familiarity with machine learning workflows and data requirements.
- Strong problem-solving and communication skills.