The company is seeking a Site Reliability Engineer to manage its multi-cloud infrastructure, improve services through testing and release procedures, and create sustainable systems through automation. The role involves configuring, deploying, maintaining, troubleshooting, and monitoring container orchestration on AWS. The ideal candidate has a strong understanding of container orchestration, Linux Server, AD, LDAP, DNS, Network Storage, and AWS Compute services.
Requirements
- Bachelor's degree in computer science or related discipline
- At least 7 years of experience
- Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
- Strong interpersonal, analytical, and problem-solving skills, along with strong written and verbal communication
- Solid understanding and hands-on experience with container orchestration
- Ability to communicate ideas in both technical and non-technical ways
- Strong capacity for teamwork and a sense of ownership
- Hands-on experience with Linux Server, AD, LDAP, DNS, Network Storage, and AWS Compute services (EC2, FSx, Managed AD, Route 53, etc.)
- Ability to script and automate using tools and languages such as PowerShell, Python, Ansible, Terraform, and Bash
- Familiarity with ITSM processes like Incident, Problem, and Change Management using ServiceNow
Benefits
- Health & Wellness
- Flexible Downtime
- Continuous Learning
- Invest in Your Future
- Family Friendly Perks
- Beyond the Basics