1. Academic Validation
  2. Ultra-high-throughput mapping of genetic design space

Ultra-high-throughput mapping of genetic design space

  • Nature. 2026 Feb;650(8103):1035-1044. doi: 10.1038/s41586-025-09933-9.
Kshitij Rai # 1 2 Ronan W O'Connell # 1 3 Trenton C Piepergerdes 1 3 Yiduo Wang 1 3 Lucas B C Brown 1 2 Kian D Samra 1 Jack A Wilson 1 Shujian Lin 1 Thomas H Zhang 1 Eduardo M Ramos 1 Andrew Sun 1 Bryce Kille 4 Kristen D Curry 4 Jason W Rocks 5 Todd J Treangen 1 4 5 6 Pankaj Mehta 7 8 9 Caleb J Bashor 10 11 12 13
Affiliations

Affiliations

  • 1 Department of Bioengineering, Rice University, Houston, TX, USA.
  • 2 Graduate Program in Systems, Synthetic and Physical Biology, Rice University, Houston, TX, USA.
  • 3 Graduate Program in Bioengineering, Rice University, Houston, TX, USA.
  • 4 Department of Computer Science, Rice University, Houston, TX, USA.
  • 5 Ken Kennedy Institute, Rice University, Houston, TX, USA.
  • 6 Rice Synthetic Biology Institute, Houston, TX, USA.
  • 7 Department of Physics, Boston University, Boston, MA, USA.
  • 8 Biological Design Center, Boston University, Boston, MA, USA.
  • 9 Faculty of Computing and Data Science, Boston University, Boston, MA, USA.
  • 10 Department of Bioengineering, Rice University, Houston, TX, USA. [email protected].
  • 11 Ken Kennedy Institute, Rice University, Houston, TX, USA. [email protected].
  • 12 Rice Synthetic Biology Institute, Houston, TX, USA. [email protected].
  • 13 Department of Biosciences, Rice University, Houston, TX, USA. [email protected].
  • # Contributed equally.
Abstract

Massively parallel genetic screens have been used to map sequence-to-function relationships for a variety of genetic elements1-5. However, as these approaches interrogate only short sequences, it remains challenging to perform high-throughput assays on constructs containing combinations of multiple sequence elements arranged across multi-kb length scales. Overcoming this barrier could accelerate synthetic biology; by screening diverse gene circuit designs and learning 'composition to function' mappings, genetic part composability rules could be revealed, enabling rapid identification of behaviour-optimized design variants6,7. Here we introduce CLASSIC (combining long- and short-range Sequencing to investigate genetic complexity), a genetic screening platform that combines long- and short-read next-generation Sequencing (NGS) modalities to quantitatively assess pools of constructs of arbitrary length containing diverse genetic part compositions. We show that CLASSIC can measure expression profiles of over 105 gene circuit designs (from 5-20 kb) in a single experiment in human cells. The resulting datasets can be used to train machine-learning models that accurately predict circuit behaviour across expansive circuit design landscapes, revealing part composability rules that govern circuit performance. Our study shows that, by expanding the throughput of each design-build-test-learn cycle, CLASSIC enhances the pace and scale of synthetic biology and establishes an experimental basis for data-driven design of complex genetic systems.

Figures
Products