Site Reliability Engineer (SRE) | Codersbrain
Job Description
Site Reliability Engineer (SRE)
Job Overview:
We are seeking a skilled Site Reliability Engineer (SRE) with 3 to 5 years of experience in
alert monitoring, application monitoring, and automation. The ideal candidate should have
hands-on experience in any programming language such as Java, Python, or PHP and a
strong understanding of AWS microservices architecture. Knowledge of Nginx, Apache,
MySQL, and MongoDB will be an added advantage.
Job Title – Site Reliability Engineer (SRE)
Total Exp – 3+ yrs
Location –Thane (Mumbai)– Onsite -WFO
Mandatory Skills
SRE/ELK/Python.
Key Responsibilities:
[Must Have] Design, implement, and maintain monitoring solutions for applications and
infrastructure.
[Must Have] Develop and maintain alerting mechanisms to proactively detect and resolve
system issues.
[Must Have] Automate operational tasks using scripting and programming languages.
[Must Have] Troubleshoot production issues, perform root cause analysis, and ensure quick
resolution.
[Must Have] Implement logging, tracing, and observability solutions for microservices.
Work with cloud-based architectures, particularly AWS, to optimize system performance.
Ensure reliability, availability, and performance of applications through effective monitoring.
Collaborate with development and operations teams to implement SRE best practices.
Optimise database performance and troubleshoot issues in MySQL and MongoDB.
Manage and configure web servers like Nginx and Apache for scalability and performance.
Required Skills & Qualifications:
3 to 5 years of experience in Site Reliability Engineering (SRE), DevOps, or related roles.
Proficiency in any one programming language: Java, Python, or PHP.
Hands-on experience with AWS services and microservices architecture.
Strong understanding of application and infrastructure monitoring tools.
Experience with alerting and observability platforms like Prometheus, Grafana, ELK, or
Datadog.
Knowledge of containerization and orchestration (Docker, Kubernetes is a plus).
Familiarity with CI/CD pipelines and automation tools.
Experience in managing and optimizing databases such as MySQL and MongoDB.
Exposure to Nginx and Apache configuration and troubleshooting.
Strong problem-solving skills and ability to work in a fast-paced environment.
Good to Have:
Certification in AWS, Kubernetes, or DevOps practices.
Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
Familiarity with security best practices for cloud and application monitoringent
The role is based in Thane (Mumbai) and requires working in an office environment. Collaborative team setting with cross-functional teams is expected for efficient operation.
Deadline
Applications are open until April 29, 2025. The selected candidate will ideally start within 15 days from acceptance.
Growth Opportunities
The position offers the potential for career advancement into senior engineering roles or technical leadership positions, based on performance and expertise.