Position Overview:
TechTiera is seeking an experienced and technically strong Engineering Manager – Site Reliability to lead the development and enhancement of infrastructure, CI/CD systems, observability frameworks, and production readiness across multiple engineering teams. This role offers a unique opportunity to collaborate across platform, infrastructure, ML, search, data, DevOps, and frontend teams to enable rapid, safe, and scalable software delivery.
The ideal candidate will combine strong hands-on experience in software engineering or Site Reliability Engineering (SRE) with the leadership skills to manage high-impact initiatives and mentor adjacent teams, especially those based in Bengaluru.
Key Responsibilities:
- Lead the design, implementation, and maintenance of high-availability infrastructure and CI/CD pipelines that accelerate secure and reliable software delivery.
- Drive improvements in deployment strategies, including blue/green deployments, canary releases, and rollback mechanisms, to minimize production risk.
- Enhance observability and monitoring by developing or improving tools and frameworks for system reliability, performance monitoring, and automated alerting.
- Develop tools to increase debuggability and operational insights across production environments.
- Collaborate with cross-functional teams to ensure operational excellence and lead incident response when critical issues arise.
- Architect and lead projects aimed at increasing the reliability, scalability, and maintainability of systems and services.
- Act as a technical leader and mentor to teams in Bengaluru, providing guidance, code reviews, and support in solving complex engineering challenges.
Required Qualifications:
- Bachelor’s degree in Computer Science, Engineering, or a related technical discipline from a premier institution such as an IIT, NIT, BITS, or similar.
- 8+ years of experience in software engineering and/or Site Reliability Engineering (SRE), with deep expertise in Python and/or Go.
- At least 2 years of experience in technical leadership, leading projects and solving problems in distributed systems.
- Hands-on experience building and managing cloud infrastructure on AWS, including container orchestration, CI/CD, and infrastructure as code, using tools such as Jenkins, Terraform, Ansible, and Helm.