img
نوع العقددوام كامل
طبيعة الوظيفةبالموقع
الموقعالرياض

وصف الوظيفة

About the Principal Site Reliability Engineer Role

SimCorp is seeking a Principal Site Reliability Engineer to join its team in Riyadh. This full-time position is integral to ensuring the continuous availability, reliability, and performance of mission-critical systems and services within the organization. The role requires a proactive and skilled individual to contribute to the evolution of financial technology.

Role Context and Importance

The Principal Site Reliability Engineer, at the IC6 grade, is responsible for the robust design, implementation, and management of systems that underpin SimCorp's operations. This role operates with a high degree of autonomy, focusing on creating resilient solutions, automating complex processes, and anticipating potential system issues. Close collaboration with development, operations, and IT teams is essential to uphold service delivery standards and achieve operational excellence.

Key Responsibilities

  • Lead the design and implementation of systems and solutions to enhance reliability and scalability.
  • Automate operational tasks and processes to improve efficiency and system performance.
  • Monitor system performance and availability, proactively identifying and resolving issues.
  • Collaborate with development teams to embed reliability practices into the software development lifecycle.
  • Conduct root cause analysis for incidents and implement long-term preventive measures.
  • Develop and maintain infrastructure monitoring, alerting, and logging systems.
  • Ensure the security, performance, and scalability of cloud infrastructure using best practices.
  • Provide technical guidance and mentorship to junior engineers.
  • Engage with cross-functional teams to prioritize reliability and performance across systems.

Scope of Work and Engagement

This role involves leading initiatives to improve the reliability, scalability, and availability of critical systems. It includes close collaboration with development teams to ensure systems are built with reliability as a core consideration. The work also encompasses automating monitoring, incident management, and infrastructure provisioning, as well as analyzing performance metrics to identify areas for improvement. Participation in incident response and on-call rotations is expected, alongside building and managing infrastructure as code, engaging in capacity planning, and evaluating emerging technologies.

Qualifications and Experience

Candidates should possess 5-10 years of experience in site reliability engineering, cloud infrastructure, or a related field. The ability to work independently, design robust solutions, and automate processes is essential. A strong understanding of cloud-based infrastructure, configuration management, and deployment best practices is required.

Work Environment and Location

This is a full-time position based in Riyadh. SimCorp fosters a people-centered organization that emphasizes skills development, relationship building, and client success, aiming to cultivate an environment where team members feel heard, valued, and empowered.


متطلبات الوظيفة

  • للسعوديين فقط
  • تتطلب ٥-١٠ سنوات خبرة

وظائف مشابهة