HumanBit Logo

Data Architect | Scrabble & Jigsaw

Posted on November 11, 2022

Job Description

Primary Responsibilities

  • ●  Collaborate with Tech and Analytics team to build and maintain the infrastructure required for optimal extraction, transformation, and loading of data from a variety of data sources

  • ●  Create and maintain scalable ETL pipelines that feed organization-wide data

  • ●  Design and maintain ideal architecture for data tables to ensure optimal

    querying performance in relational databases

  • ●  Mentor the data engineers in the team on best practices and projects.

  • ●  Create and maintain connectors that expose the data securely for

    consumption by downstream systems and services in near real-time.

  • ●  Help build the ML pipelines and integrate ML models in Zepto applications

  • ●  Build data governance and security protocols and monitor adherence

    What Are We Looking For?

  • ●  6 - 10 years of experience in Data Engineering - Designing databases, building data pipelines, and maintaining data governance protocols in cloud platforms

  • ●  Hands-on working experience with Python, ETL pipelines, advanced SQL

  • ●  Understanding of AWS Services - Redshift, Lambda, Glue, Athena, security

    protocols

  • ●  Experience in any Cloud DW Redshift/Snowflake/BigQuery and working with

    data layer solutions like Apache Hudi, DeltaLake, iceberg

  • ●  Experience in setting up a real time data processing system with Apache

    Spark/ Apache Flink , pySpark.

  • ●  Design, Test-driven development, code review and implement CICD using

    Github/Gitlab/Docker

  • ●  Experience in gathering and processing raw data at scale including writing

    scripts and spark jobs.

  • ●  Comfortable to setup query engines like Presto, Trino, etc.

  • ●  Strong data Modelling and database design experience with Redshift or other

    relational databases.

Powered by
HumanBit Logo