Data Architect | Codersbrain

full-time
Posted on September 24, 2025

Job Description

Data Architect

Company Overview

[Company information not provided.]

Job Summary

The Data Architect will be responsible for designing and implementing scalable, secure, and high-performance data architectures to support AI/ML workloads, analytics, and real-time data processing. This role plays a critical part in driving the organization's data strategy to meet analytical goals and business objectives.

Responsibilities

  • Architectural Leadership Skills

    • Design and implement scalable, secure, and high-performance data architectures to support AI/ML workloads, analytics, and real-time data processing.
    • Architect data platforms that enable experimentation, model training, and deployment for classical ML, deep learning, natural language processing (NLP), and Generative AI use cases.
    • Lead the development of data lakes, feature stores, and analytical data marts optimized for both operational and research use cases.
    • Define and enforce architectural standards, data governance policies, and best practices across data science, engineering, and analytics teams.
    • Collaborate with cross-functional teams to align data architecture with business strategy and analytical goals.
  • Technical Expertise & Experience

    • Translate analytical and AI/ML requirements into scalable data infrastructure and pipelines.
    • Develop and support end-to-end ML pipelines from data ingestion and feature engineering to model training, deployment, and monitoring.
    • Apply hands-on experience with classical ML algorithms (e.g., regression, decision trees, ensemble methods), deep learning architectures (e.g., CNNs, RNNs, Transformers), and NLP models (e.g., BERT, GPT, T5).
    • Architect and integrate Generative AI solutions using large language models (LLMs), prompt engineering, and retrieval-augmented generation (RAG) techniques.
    • Implement DevOps practices including CI/CD pipelines, infrastructure as code, automated testing, and monitoring.
    • Ensure high standards of code quality, modularity, and reusability across data and ML pipelines.
    • Integrate modern data technologies such as Apache Spark, Apache Kafka, Delta Lake, and NoSQL databases.
    • Collaborate with data scientists to design feature stores, model registries, and scalable ML infrastructure.
    • Build and optimize data solutions on Microsoft Azure, including Microsoft Copilot Studio, Azure AI Foundry, Azure Data Factory, Azure Synapse Analytics, Azure Databricks, Azure Blob Storage, and Azure Machine Learning.

Qualifications

  • 12+ years of experience in data architecture, data engineering, and data science.
  • Strong programming skills in Python, SQL, and relevant scripting languages.
  • Deep understanding of data modeling, ETL/ELT processes, and distributed computing.
  • Hands-on experience with machine learning frameworks (e.g., TensorFlow, PyTorch), NLP libraries (e.g., spaCy, Hugging Face), and Generative AI tools (e.g., LangChain, LLMs).
  • Proven experience with DevOps tools (Azure DevOps, Git, Terraform, Kubernetes).
  • Expertise in Microsoft Azure preferred.
  • Familiarity with MLOps practices and model lifecycle management.

Preferred Skills

  • Azure certifications (e.g., Azure Solutions Architect, Azure Data Scientist Associate).
  • Experience with data mesh, real-time analytics, and streaming architectures.
  • Knowledge of data privacy regulations (e.g., GDPR, HIPAA).
  • Exposure to AI/ML infrastructure and Generative AI safety frameworks.

Experience

  • Minimum 12 years of relevant experience in data architecture, data engineering, and data science.

Environment

  • Hybrid work environment: three days in the office in Gurgaon, with the remainder remote.

Salary

  • Budgeted salary of INR 2.9 lakhs.

Growth Opportunities

[Career advancement opportunities not provided.]

Benefits

[Benefits information not provided.]
