ISS STOXX is looking for a Senior Site Reliability Engineer to join their team in Mumbai, India. The role involves driving the architecture, development, and operation of Stoxx's SRE efforts, mentoring junior team members, and working with cross-functional teams to implement SRE principles and observability solutions.
Requirements
- 5 years of total experience and at least 3 years' of relevant experience in Site Reliability Engineering or production management
- Good understanding of SRE principles
- Experience implementing observability stacks such as ELK, Prometheus/Grafana, Splunk, Data Dog or other scalable solution
- Expertise in creating SLO dashboards using multiple data sources
- Strong experience of cloud-native ways of working
- Experience with the development and deployment of large-scale, complex technology platforms
- Deep understanding of cloud technology across database, serverless, containerization and API
- Advanced level expertise in Terraform
- Extensive experience in designing and implementing SRE practices
- Experience with one or more CI/CD solutions
- Experience coaching and mentoring high-performing teams
- Excellent knowledge of integrating incident management tooling such as Rootly, blameless, ServiceNow or incident-io.
- Pragmatic experience using agile to deliver incremental value
- Experience working in a global or multinational team setting
- Strong knowledge management, documentation, communication and collaboration skills
- Proven ability to drive innovation and continuous improvement initiatives
- Focus on simplicity, automation and data
- Expertise in Python, GitHub Actions, Apigee, Airflow
- Bachelor's or Master's degree in Computer Science, Mathematics, Physics or related field