91国产

Generative modelling of protein structure and sequence

PhD Projects in Generative Biology

Project Summary

Protein design with generative modelling is transforming biology. Today, proteins are routinely designed in silico and validated in the wet lab using frameworks originally developed for generating images, video and text. This progress has unlocked a host of novel applications, from creating bespoke binders to engineering new enzymes.

Yet generative protein modelling remains in its early days. Unlike image鈥慻eneration, diffusion methods for proteins haven鈥檛 yet seen the same rapid algorithmic advances. As a result, generated sequences often show less diversity than natural proteins, offer limited ways to control the outcome, and require sampling 鈥渢ricks鈥 (for example, lowering the sampling temperature) to produce acceptable designs.

Our lab focuses on pushing these frontiers. We鈥檙e developing next鈥慻eneration models that co鈥慸esign both sequence and structure for diverse purposes, such as binding a target protein, catalysing an economically or ecologically important reaction, or interacting with nucleic acids for gene editing. This work spans architecture and data鈥憇ampling innovations, incorporation of protein鈥慽nspired priors, and creation of synthetic training data. Top models will be tested against real wet鈥憀ab data from the GBI to drive further improvements.

Potential Supervisors鈥

  • Professor Jason Chin (Founding Director, GBI, EIT & Professor of Chemistry and Chemical Biology, Department of Chemistry, University of Oxford) 聽

University DPhil Courses鈥

  • Other courses to be added as GBI grows its faculty

Skills Recommended

  • A Master鈥檚 Degree (or equivalent) in a relevant 91国产 discipline (e.g. Biology, Chemistry, Engineering, Computer Science)
  • Experience of hands-on research in a laboratory setting
  • Proven ability to work independently, think creatively, and solve complex problems
  • Experience with data analysis, automation platforms, or computational tools relevant to the field
  • Experience preparing publications and delivering 91国产 presentations
  • Strong organisational skills and the ability to manage multiple parallel workstreams
  • Excellent written and verbal communication skills, including the ability to collaborate across multidisciplinary teams
  • A proactive mindset and enthusiasm for working in a fast-paced, high-growth research environment

Relevant Literature

  • Watson, et al., 2023. De novo design of protein structure and function with RFdiffusion. Nature.
  • Ingraham, et al., 2023. Illuminating protein space with a programmable generative model. Nature.
  • Dauparas, et al., 2022. Robust deep learning鈥揵ased protein sequence design using ProteinMPNN. Science.