Platform Engineer (HPC)
- London, UK
- Employees can work remotely
- Department: Engineering
- Office: London
Genomics England successfully led the world-leading 100,000 Genomes Project, which compared and analysed individuals’ genetic codes to help diagnose, treat and prevent illness.
We're now accelerating our impact, working with the NHS to further develop and embed genomic healthcare and research in Britain. Our next chapter involves working with patients, doctors, scientists, government and industry to improve genomic testing, and help researchers access the health data and technology they need to make new medical discoveries and create more effective, targeted medicines for everybody.
The HPC Platform Engineer will be responsible for maintaining configuration and infrastructure to optimise high performance compute (HPC) clusters for bioinformatician and researchers, building functional systems to help us keep our environments running smoothly. Platform Engineers responsibilities will include continuous integration and delivery pipeline, cloud and datacentre infrastructure and site reliability. You must have a keen eye for detail, problem-solving skills and be a team player.
You will own the platform for our HPC clusters and associated deployment pipelines, whilst designing, building and maintaining core infrastructure. You will champion the DevOps culture by example, with an overview of the set-up and management of monitoring and alerting processes on the symptoms and not outages. The ideal candidate will improve and automate the deployment process to make it as straightforward and efficient as possible alongside documenting activity and observations so your findings turn into repeatable actions – and then into automation. You will have responsibility of debugging platform issues across services and levels of the stack, whilst also planning the growth of our HPC infrastructure to support, products, services and compute strategy.
Skills and Experience for Success
We anticipate the ideal candidate having a strong background with High Performance Computing Environments (HPC) and hands on experience supporting end users to troubleshoot and optimise their jobs on HPC clusters. You will possess experience of working with Linux and with server configuration and automation tools like Ansible, Chef, Puppet. Ideally you will also have a working background of using CI/CD tools and platforms.
Strong communication skills and able to work collaboratively, where required as part of a team are key, as are good problem solving skills.
AWA Lustre & AWS Batch hands on experience is highly desirable. We will also look for experience with Virtualisation, storage attached networks, networking & security. A demonstrable background of working hands-on in moving products & infrastructure from traditional data centre to cloud, is also sought after.
It's great if you also have experience with GitLab, Terraform and with using automated testing framework.
Originally conceived as a project, Genomics England has transformed to meet the opportunities created by our scientific breakthroughs in understanding the Human Genome. Being part of this journey is a reward in itself, however we're pleased to offer our colleagues a great benefits package including:
- competitive salary
- 30 days holiday
- generous pension scheme
- individual learning budgets for every colleague
- a raft of other benefits
Talk to our Talent Team and find out how a career with Genomics England will benefit you.