HumanBit Logo

Site Reliability Engineer | Codersbrain

full-time
Posted on August 29, 2025

Job Description

SRE

Company Overview

Company details are not specified.

Job Summary

The Site Reliability Engineer (SRE) plays a critical role in maintaining and improving the reliability, availability, and performance of our systems. This position is responsible for the design, implementation, and maintenance of infrastructure and services, helping to ensure that they are available and performant at all times. The SRE will collaborate with various teams to enhance system architecture and to implement best practices in software development and operational processes.

Responsibilities

  • Design, implement, and manage scalable infrastructures for production environments.
  • Automate system deployments and manage CI/CD pipelines to streamline development processes.
  • Monitor system performance using various observability tools and address any issues proactively.
  • Collaborate with development teams to improve application reliability and troubleshoot production issues.
  • Implement and manage container orchestration using tools such as Docker and Kubernetes.
  • Develop scripts and automation tools to enhance operational efficiency and reduce manual workload.

Qualifications

  • Education: Bachelor’s degree in Computer Science, Engineering, or related field.
  • Technical Skills:
    • Proficiency in programming languages such as Python, Bash, or Java.
    • Deep understanding of Linux/Windows operating systems and networking concepts.
    • Experience with AWS & Azure including services, architecture, and best practices.
    • Hands-on experience with Docker, Kubernetes, and related containerization tools.
    • Familiarity with Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Azure CLI.
    • Knowledge of monitoring and observability tools such as Splunk, New Relic, or Azure Monitoring.
    • Experience with continuous integration and continuous delivery pipelines, including GitHub and GitHub Actions.
    • Familiarity with supporting Azure ML, Databricks, and other related SaaS tools.

Preferred Skills

  • Additional experience with microservices architecture and cloud-native application development.
  • Knowledge of security best practices for cloud services.

Experience

  • Minimum of 6 years of relevant experience in site reliability engineering or a similar role.

Environment

  • Work location is in Mumbai, specifically in Chembur and Bandra. Additional work environment details are not specified.

Salary

Salary details are not specified.

Growth Opportunities

Opportunities for career advancement within the company are not specified.

Benefits

Benefits offered by the company are not specified.

Powered by
HumanBit Logo