Homecare Homebase is hiring a Platform Reliability Engineer to manage the complex challenges of scale in their healthcare software development. The role involves practice sustainable incident response, operationalization of services, and platform engineering and automation to maintain scale and reliability of systems.
Requirements
- 3+ years' experience in a 24x7 production enterprise-class environment as an SRE or comparable role
- 1+ year Kubernetes administration/support in a production environment
- 1+ year Azure or comparable cloud PaaS, IaaS, and resource administration/support in a production environment
- Strong written and verbal interpersonal skills
- Excellent problem solving and analytical skills with attention to detail and driving issues to resolution
- Experience solving problems via automation using orchestration platforms such as JAMS, Ansible, Azure Automation, and ServiceNow Flows
- Proficient with scripting languages (multiple preferred): Bash, PowerShell, Python, and JavaScript
- Proficient with data tier languages: TSQL and GraphQL
- Proficient with the following monitoring solutions (multiple preferred): Splunk, Prometheus/Grafana, Application Insights, Azure Monitor, and Microsoft SCOM
Benefits
- Blameless postmortems
- Operationalization of services
- Platform engineering and automation