The association for the people and businesses of Sheffield's digital industries.

DevOps – Site Reliability Engineering (SRE)

  • Full Time
  • Sheffield
  • £44,720 - £52,130 GBP / Year

Home Office

Job summary

The Reliability Enablement team helps Data Services & Analytics (DSA) teams improve their product and service reliability by providing observability and embedding Site Reliability Engineering (SRE) principles. You will be a key part of the team, working on engagements with product teams and helping grow SRE culture within the organisation.

Job description

As a DevOps (SRE) engineer, you will be responsible for improving the reliability of our platforms and services. The role is proactive: you will ensure that relevant metrics are being measured and that reliability improvements are identified and implemented when necessary, so that services stay reliable and available for users.

You will also advise developers on how to use platforms and tools effectively, reviewing and advising on their use of CI/CD pipelines and observability tooling. You may also work to deliver new platform tooling.

Recruitment events

We are hosting an online Engineering recruitment event on Thursday 6th February 2025 from 12:00pm to 1:00pm, where you can find out more about our roles, working for the organisation and how to apply. Register your interest here: Home Office Events | Eventbrite

Tools and Technologies we use:

We are keen for engineers to continue learning new technologies. We use a wide range in the Home Office, including:

  • Backend: Java, Node.js, C#, Python, PHP, Scala, Power Platform
  • Frontend: React, JavaScript, TypeScript, Angular
  • Data: PostgreSQL, Microsoft SQL Server, MongoDB, Apache Kafka, Neo4j, Amazon Athena
  • DevOps: AWS, Kubernetes, Azure, Jenkins, Docker, Ansible, Terraform, Dynatrace

What you will do

Your main day-to-day responsibilities will be:

  • supporting teams to effectively build, improve and deploy reliable and secure services
  • building new or improved shared tooling to help teams automate and maximise reliability
  • spotting instances where teams are not using best practice and advising on how to improve
  • supporting engineers to design new services; helping to define test and deployment pipelines
  • helping teams improve their integration approaches; increasing reliability and the value delivered to users

Like many organisations, we need to maintain our services 24/7. On occasion you may therefore be required to work out of hours, for which you will be paid an additional allowance.

Person specification

UK residency and security requirements – You need to have lived in the UK for the past 5 years.

Essential Criteria

As a DevOps (SRE), you will have experience of:

  • Designing and implementing reliable cloud solutions using AWS or Azure according to best practices. (Software design – SWDN)
  • Implementing automated testing, scanning and code analysis tooling, according to best practices. (Testing – TEST)
  • Implementing and using application monitoring tooling to identify and respond to problems early. (Application support – ASUP)
  • Designing, coding, testing, maintaining and documenting scripts and infrastructure-as-code definitions to automate build and deployment activities. (Programming/software development – PROG)
  • Implementing and promoting use of CI/CD pipelines according to best practices. (Systems integration and build – SINT)
  • Implementing data management best practices for cloud resources, such as naming, tagging, metadata, backups, and documentation. (Data management – DATM)

To apply for this job please visit www.aplitrak.com.