Lead Site Reliability Engineer will be hands-on and provide mentorship to other team members on core SRE principles and tools. The role requires a highly skilled technology professional with excellent communication skills, strategic mindset, strong analytical and troubleshooting skills on AWS Cloud Platform.
Requirements
- 5+ years of experience in Site Reliability Engineering or related position in AWS Cloud Platform
- At least 2 AWS Certifications (AWS Sysops Admin and Architects certifications preferred)
- Deep experience with AWS, Docker and Kubernetes, CloudFormation, CloudWatch, CodeDeploy, DynamoDB, Lambda, SQS, Amazon FSX, Elastic Search and networking concepts
- Program at a high level in at least one language (Java, C#, Javascript, Python or Ruby)
- Integration experience with PagerDuty, ServiceNow, Datadog, CloudWatch
- Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation
Benefits
- Flexible vacation
- Two company-wide Mental Health Days off
- Access to the Headspace app
- Retirement savings
- Tuition reimbursement
- Employee incentive programs
- Resources for mental, physical, and financial wellbeing