We are looking for a Service Reliability Engineer to join our Tech@Lilly Enterprise Automation team. In this role, you will improve system reliability and resiliency, reduce manual effort, and prevent operational incidents. You will build automation to reduce the support footprint, create reusable application components, and take part in software development and systems engineering activities.
Requirements
- System Reliability & Performance
- Incident Management & Response
- Automation & Infrastructure as Code (IaC)
- Strong problem-solving and analytical skills, and the ability to adapt to changing circumstances
- Experience with reliability engineering and monitoring practices, environments, and tools (e.g., a cloud ecosystem, preferably AWS; monitoring/observability; configuration management)
- Experience with programming and scripting languages (e.g., Java, JavaScript, Python) in the context of automation tools
- Experience with large-scale databases and with data movement and analytics tools (e.g., RDS, DynamoDB)
- Experience with the ITIL 4 framework and processes, and with the tools that support them (e.g., ServiceNow)
- Experience with testing frameworks and methodologies (unit, integration, and end-to-end testing) for ensuring application quality and reliability
- Knowledge of debugging and performance optimization tools for diagnosing and resolving issues
Benefits
- Comprehensive benefits program for eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision, and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities)