Skip to main content
Search roles

Technical Lead (HPC Infrastructure) - SCP

Location Cambridge, England, United Kingdom Job ID R-245042 Date posted 01/02/2026

Job Title: Technical Lead (HPC Infrastructure) - SCP

Location: Cambridge

Salary: Competitive

Introduction to role:

Are you ready to lead high-performance computing infrastructure that accelerates discovery and brings life-changing medicines to patients faster? This is your opportunity to set the direction for a mission-critical platform that underpins data science, computational biology and AI/ML workloads across the enterprise.

You will steer a modern HPC environment spanning on-premises and cloud, enabling scientists to run at scale and speed while maintaining reliability and cost efficiency. Do you thrive at the intersection of engineering rigor and scientific impact, where your decisions unlock faster insights and smarter experimentation?

In this hands-on leadership role, you will be empowered to take ownership, challenge the status quo and orchestrate new possibilities—partnering closely with researchers, pushing boundaries in hackathons and ensuring our platform evolves with the demands of cutting-edge science.

Accountabilities:

  • Platform Roadmap: Define and own the strategic roadmap for the HPC infrastructure, aligning capacity, architecture and capabilities to scientific priorities and business outcomes.

  • Operational Excellence: Drive continuous improvement of platform stability, efficiency and performance; set clear metrics and ensure reliability for large-scale workloads and time-critical studies.

  • Hybrid Delivery: Lead delivery across on-premises and cloud environments, optimizing for speed, scalability and cost while ensuring seamless user experience.

  • Scientific Partnership: Work with scientific users to understand their needs and translate them into robust solutions, enabling faster models, simulations and analyses.

  • Technology Foresight: Scan the horizon to identify emerging technologies that keep the platform innovative and competitive, and guide timely adoption.

  • Backlog Leadership: Prioritize the team’s work based on scientific impact, balancing quick wins with longer-term investments to maximize value.

  • People Development: Mentor and coach engineers, build capabilities and foster a high-performance, collaborative engineering culture.

  • Incident Leadership: Investigate and resolve complex operational incidents, lead root-cause analysis and implement preventative improvements that strengthen the platform.

Essential Skills/Experience:

  • Defining the roadmap for the platform’s HPC infrastructure

  • Drive continuous improvement of the stability and efficiency of the platform

  • Ensuring delivery of team objectives, both on-premises and in the cloud

  • Working with scientific users to understand their needs, and develop solutions

  • Horizon scanning, identifying the future technologies needed to stay innovative

  • Prioritising the work backlog for the team according to scientific needs

  • Mentoring and coaching engineers

  • Investigating and resolving complex operational incidents

Desirable Skills/Experience:

  • Experience with HPC schedulers and workload managers (e.g., Slurm, PBS, Grid Engine) and job orchestration at scale

  • Hands-on knowledge of cloud-based HPC services and architectures (e.g., Azure, AWS, GCP), hybrid networking and cost optimization

  • Containerization and orchestration for scientific workloads (e.g., Docker, Singularity, Kubernetes)

  • Infrastructure as Code and automation (e.g., Terraform, Ansible, CI/CD), plus strong scripting skills in Python and Bash

  • Performance tuning and profiling of compute and storage, including GPUs, accelerators and parallel filesystems (e.g., Lustre, Spectrum Scale)

  • Workflow management tools (e.g., Nextflow, Snakemake) and data pipelines supporting AI/ML and bioinformatics

  • Robust approach to reliability engineering, observability and security/compliance for sensitive research data

  • Proven stakeholder engagement across research, engineering and product teams, with the ability to balance speed, quality and sustainability

Why AstraZeneca:

Here, technology and science move together with purpose. You will join a community of entrepreneurial self-starters who experiment boldly—through hackathons, collaborations with leading academics and partnerships across the enterprise—to deliver digital capabilities that scale fast and unlock the potential of research. We value kindness alongside ambition, expect high standards and back them with investment, so your contribution directly fuels our progress toward a truly data-led organization that changes lives.

So, what’s next?

Complete your application before the below closing date.

We welcome your application no later than 15th  January 2026

Where can I find out more?

Follow AstraZeneca on LinkedIn https://www.linkedin.com/company/1603/

Follow AstraZeneca on Facebook https://www.facebook.com/astrazenecacareers/

Follow AstraZeneca on Instagram https://www.instagram.com/astrazeneca_careers/?hl=en

Date Posted

02-Feb-2026

Closing Date

15-Feb-2026

Our mission is to build an inclusive and equitable environment. We want people to feel they belong at AstraZeneca and Alexion, starting with our recruitment process. We welcome and consider applications from all qualified candidates, regardless of characteristics. We offer reasonable adjustments/accommodations to help all candidates to perform at their best. If you have a need for any adjustments/accommodations, please complete the section in the application form.

Join our Talent Network

Be the first to receive job updates and news from AstraZeneca

Sign up
Glassdoor logo Rated four stars on Glassdoor

Great culture, great work assignments, supportive management. Rotation opportunity within the company. They value our people.