Data Architect | Scrabble & Jigsaw
Job Description
Primary Responsibilities
● Collaborate with the Tech and Analytics teams to build and maintain the infrastructure required for optimal extraction, transformation, and loading of data from a variety of data sources
● Create and maintain scalable ETL pipelines that feed organization-wide data
● Design and maintain a table architecture that ensures optimal query performance in relational databases
● Mentor the data engineers in the team on best practices and projects.
● Create and maintain connectors that expose the data securely for consumption by downstream systems and services in near real-time.
● Help build ML pipelines and integrate ML models into Zepto applications
● Build data governance and security protocols and monitor adherence
What Are We Looking For?
● 6-10 years of experience in data engineering: designing databases, building data pipelines, and maintaining data governance protocols on cloud platforms
● Hands-on experience with Python, ETL pipelines, and advanced SQL
● Understanding of AWS services: Redshift, Lambda, Glue, Athena, and security protocols
● Experience with any cloud data warehouse (Redshift, Snowflake, or BigQuery) and with data layer solutions such as Apache Hudi, Delta Lake, and Apache Iceberg
● Experience setting up real-time data processing systems with Apache Spark, Apache Flink, or PySpark.
● Experience with design, test-driven development, and code review, and with implementing CI/CD using GitHub, GitLab, and Docker
● Experience gathering and processing raw data at scale, including writing scripts and Spark jobs.
● Comfortable setting up query engines such as Presto and Trino.
● Strong data modelling and database design experience with Redshift or other relational databases.
