Senior SRE

Remote | United Kingdom, London, City of, London | Infrastructure

Job description

Join us at this pivotal time for an exciting challenge to shape fintech’s future.

Oval has always worked following two key principles: financial innovation and inclusion. We are still led by them and now have an expanded team and powerful new technologies.

Oval was founded in 2016 and has helped tens of thousands of people learn how to take control of their finances, from their spending to their saving and investment habits, with our award-winning app.

Thanks to the integration with ETX Capital, a leading global financial services firm with a long legacy in the financial markets, our users will be able to access a variety of investment opportunities, from spread betting to thousands of global CFD markets that include forex, commodities, and shares.

Spread across three offices in the UK, Italy, and Cyprus, we are one global team of 180+ people with a unified vision for the future of finance. We are currently looking for talented people keen to define the fintech revolution and help the brand shift from a start-up to a substantial industry presence. If you want to make an impact, then we can’t wait to have you with us for the journey!

Senior Site Reliability Engineer
As a Senior Site Reliability Engineer, you will have the opportunity to improve the way technology is used within the company and by clients. You will join a team responsible for developing, supporting and monitoring Oval’s suite of trading systems.

You will work closely across all areas of technology and be responsible for the availability and continuous reliability of our trading systems. This includes analysing platform behaviour, continuously building and improving monitoring, diagnosing the root cause of issues, and working with technology teams to develop and deploy changes that improve the service.

This is a broad-spectrum role, and the successful candidate must be extremely passionate, knowledgeable, dedicated and above all pragmatic to ensure the constant availability and improvement of these platforms.

You are likely to have worked previously in either a development or system administration role, to be highly proficient in a variety of technical skills, and to have the hunger to learn and be the custodian of the systems.

You will be required to assist with out-of-hours emergency response if an issue occurs with the bespoke trading systems that cannot be left until the next working day.

Main responsibilities include:

  • Be the primary custodian of all live service platforms.
  • Analyse the functionality of bespoke software applications and work closely with product, IT operational and development teams.
  • Debug and diagnose software issues/bugs in the platforms and work with development/product teams to resolve them and funnel fixes into development.
  • Establish the root cause of application errors and deal with high-priority issues alongside development teams.
  • Be part of the team responsible for code deployments and updates.
  • Consult with software development teams and internal users/clients with a view to improving platform performance.
  • Document and log application performance issues in the company incident management system for analytics and metrics.
  • Work with the Jira-based helpdesk system to manage issues through their lifecycle.
  • Manage and implement monitoring systems.

This is not intended to be an exhaustive or exclusive list of duties. You may be required to carry out any other associated tasks to ensure the successful delivery of the company’s objectives.

Job requirements

Skills and experience:

  • Previous experience in either a development or SysAdmin role.
  • Demonstrable experience and understanding of software languages (C#, SQL-92).
  • Demonstrable experience and understanding of Python, PowerShell and Bash.
  • Strong experience of working with analysis and monitoring tools such as Grafana and Splunk.
  • Ability to trace network-level issues and an understanding of DNS.
  • Previous experience with cloud technologies such as AWS or Microsoft Azure, beyond basic server maintenance.
  • Advanced proficiency in determining the cause of application errors and ability to read and diagnose error logs.
  • Excellent written and oral communication skills are required, with the ability to comprehend and communicate issues succinctly and without ambiguity using the appropriate technical vocabulary.
  • Fluent in English.

