Data Architect | Scrabble & Jigsaw

Posted on 11-11-2022

Job Description

Primary Responsibilities

● Collaborate with Tech and Analytics team to build and maintain the infrastructure required for optimal extraction, transformation, and loading of data from a variety of data sources
● Create and maintain scalable ETL pipelines that feed organization-wide data
● Design and maintain ideal architecture for data tables to ensure optimal

querying performance in relational databases
● Mentor the data engineers in the team on best practices and projects.
● Create and maintain connectors that expose the data securely for

consumption by downstream systems and services in near real-time.
● Help build the ML pipelines and integrate ML models in Zepto applications
● Build data governance and security protocols and monitor adherence

What Are We Looking For?

● 6 - 10 years of experience in Data Engineering - Designing databases, building data pipelines, and maintaining data governance protocols in cloud platforms
● Hands-on working experience with Python, ETL pipelines, advanced SQL
● Understanding of AWS Services - Redshift, Lambda, Glue, Athena, security

protocols
● Experience in any Cloud DW Redshift/Snowflake/BigQuery and working with

data layer solutions like Apache Hudi, DeltaLake, iceberg
● Experience in setting up a real time data processing system with Apache

Spark/ Apache Flink , pySpark.
● Design, Test-driven development, code review and implement CICD using

Github/Gitlab/Docker
● Experience in gathering and processing raw data at scale including writing

scripts and spark jobs.
● Comfortable to setup query engines like Presto, Trino, etc.
● Strong data Modelling and database design experience with Redshift or other

relational databases.