Highly Skilled Reliability Engineer Wanted - Kraków - beBeeReliability
Posted 2 months ago
Job title: Site Reliability Engineering Lead @
Description
Site Reliability Engineering Lead
Key Challenges and Objectives:
The Site Reliability Engineering Lead drives reliability engineering efforts to improve platform stability and identify patterns of failure. The role requires a strong analytical mindset and the ability to interpret complex data sets.
Responsibilities:
- Lead data-driven reliability engineering initiatives to improve platform stability and identify patterns of failure.
- Partner with engineering and operations teams to drive proactive practices across distributed and mainframe-based environments.
- Analyse incident and change data to identify root causes and systemic risks.
- Define and track service health metrics, including MTTR and failure rates.
Requirements:
- Experience in service reliability, production support, or platform operations at scale.
- Strong analytical mindset and the ability to interpret complex data sets.
- Familiarity with automation/orchestration platforms such as Control-M or IBM SFG.
- Understanding of ITIL principles and DevOps culture.
- Experience with monitoring tools, dashboards, and reporting solutions.
- Knowledge of distributed systems, batch processing, and file transfer workflows.
Personal Qualities and Skills:
This role requires a proactive communicator with strong documentation and facilitation skills, a passion for continuous improvement, and a user-oriented reliability mindset.