Daxx is looking for a Lead Big Data Engineer
Project description
Our client is one of the world’s leading hardware manufacturers. It has over 250,000 employees across 30 countries. It is focused on high end technical and design products, manufacturing know-hows, creating supply chain solutions for the worlds leading brands.
Responsibilities
- Create and improve data pipeline architecture
- Writing and optimizing new ETL processes in SQL
- Preparing training datasets for data analytics team
- Build new data workflows based on AWS Step functions
Requirements
- Python
- PySpark
- Bash
- Hadoop (HDFS, Hive/Impala, YARN)
- SQL (optimizing SQL queries, writing stored procedures and functions, debugging SQL)
- AWS S3
- Git, AWS CodeCommit/CodePipeline
Will be a plus
- AWS Aurora
- AWS Glue
- AWS SQS/SNS
- AWS Step Functions workflow management
Daxx offers
- Direct cooperation with the customer
- Dedicated HR / Client Manager
- Regular performance reviews
- Competitive salary, medical insurance, 20 working vacation days
- Regular corporate events, team buildings, etc.