Principal Platform Engineer - R30702 | ScaleneWorks INC
Job Description
Principal Platform Engineer
Company Overview
[Company overview is not provided.]
Job Summary
The Technical Lead is responsible for ensuring that the team operates at a high level of technical proficiency. This role acts as a leader within the area, guiding team members and potentially providing formal training to Specialists and Senior Specialists. The Technical Lead will assess complex problems that impact the organization, recommend solutions, and improve processes while effectively communicating complex information.
Responsibilities
- Produce high-quality, efficient code throughout the entire product development cycle, including creating technical requirements, leading feasibility studies, project planning, and identifying dependencies.
- Investigate and analyze root causes of complex software and system defects, providing troubleshooting solutions in a timely manner.
- Apply best practices for code quality and security, including non-functional requirements mastery, code reviews, and unit testing.
- Suggest and facilitate the evolution of components, managing code debt, and improving technical aspects of project delivery.
- Build cross-functional teams and ensure knowledge sharing through effective communication, collaboration, and engagement in projects.
- Contribute to the community through participation in events and cultural transformations within the research and development team.
Qualifications
- Proficient in Cloud platforms (Azure/Google).
- Strong experience with Terraform/Terragrunt at scale, including registry management and policy/test automation.
- Knowledge of AKS/OpenShift, including Helm/Kustomize and operators (Azure Service Operator/Crossplane).
- Expertise in Python/Go for platform tooling (SDK/CLI/API integrations).
- Understanding of Security-by-default principles.
- Familiarity with DORA/Site Reliability Engineering (SRE) fundamentals and incident leadership.
- Successful track record in cross-functional collaboration and team engagement.
Preferred Skills
- Experience with event-driven automation (RabbitMQ/Kafka) and workflow engines.
- Knowledge of AIOps and Large Language Models (LLM) for incident triage and audit/guardrails.
- Proficiency in FinOps practices, including tagging, budgets, right-sizing, and performance/cost trade-offs.
- Understanding of disaster recovery/multi-region patterns and chaos testing methodologies.
- Compliance-by-design understanding.
Experience
Years of experience and specific types of relevant experience are not provided but a significant expertise level is implied.
Environment
The typical work setting, including location and environmental conditions, is not specified.
Salary
Salary details are not provided.
Growth Opportunities
Opportunities for career advancement, including potential leadership roles and further specialization, are implied but not specified.
Benefits
Specific benefits offered by the company are not mentioned.