HumanBit Logo

GCP Site Reliability Engineer | Codersbrain

Contractual
Posted on July 21, 2025

Job Description

GCP Site Reliability Engineer

Company Overview

[Company details not provided.]

Job Summary

The GCP Site Reliability Engineer (SRE) is responsible for the reliability, availability, and performance of applications and infrastructure on Google Cloud Platform (GCP). The role applies SRE principles to automate processes, reduce toil, and manage incidents effectively, improving the efficiency of the organization's cloud-based services.

Responsibilities

  • Design, deploy, and manage applications and infrastructure on Google Cloud Platform, ensuring optimal performance and scalability.
  • Implement automation strategies and practices to reduce operational toil and improve service reliability.
  • Monitor systems and applications using tools like Datadog and GCP Cloud Logging, responding swiftly to incidents.
  • Collaborate with development teams to integrate CI/CD practices using tools such as GitLab and GitHub for version control.
  • Provide on-call support and incident management using PagerDuty to ensure continuous availability of services.
  • Troubleshoot complex distributed systems and resolve issues effectively.

Qualifications

  • Education: Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field.
  • Experience:
    • 5+ years in a Site Reliability Engineering, DevOps, or similar role.
    • Significant hands-on experience with Google Cloud Platform, especially with Google Cloud Spanner.
    • Proficiency in at least one scripting language (e.g., Python, Bash).
    • Extensive experience with HashiCorp Terraform for infrastructure-as-code.
    • Experience with containerization (Docker) and orchestration technologies (Kubernetes, preferably GKE).
    • Strong understanding of Linux operating systems and proficiency with command-line tools (e.g., terraform, kubectl, helm).
  • Skills:
    • Excellent problem-solving and troubleshooting skills.
    • Strong communication and collaboration abilities.

Preferred Skills

  • Google Cloud certifications (e.g., Professional Cloud Architect, Professional Cloud DevOps Engineer).
  • Experience with database administration and optimization on Google Cloud Spanner.
  • Familiarity with networking concepts within GCP (e.g., VPC, Load Balancing, Cloud DNS).
  • Knowledge of security best practices in cloud environments, ideally including secrets management with HashiCorp Vault.

Experience

  • 5 to 8 years of relevant experience in Site Reliability Engineering or related fields.

Environment

[Work setting, location, and physical conditions not provided.]

Salary

[Salary information not provided.]

Growth Opportunities

[Career advancement opportunities not provided.]

Benefits

[Benefits information not provided.]
