1. Academic Validation
  2. Machine Learning Modeling of Zebrafish Toxicity Endpoints After Exposure to PROTACs

Machine Learning Modeling of Zebrafish Toxicity Endpoints After Exposure to PROTACs

  • Toxicol Sci. 2025 Nov 19:kfaf162. doi: 10.1093/toxsci/kfaf162.
Christopher Yogodzinski 1 Joshua S Harris 1 Thomas R Lane 1 Morgan Barnes 1 Patricia A Vignaux 1 Renuka Raman 1 Lisa Truong 2 Robyn L Tanguy 2 Seth Kullman 3 Sean Ekins 1
Affiliations

Affiliations

  • 1 Collaborations Pharmaceuticals, Inc, 840 Main Campus Drive, Lab 3510, Raleigh, NC, 27606, USA.
  • 2 Department of Environmental and Molecular Toxicology, Sinnhuber Aquatic Research Laboratory, Oregon State University, 28645 E Hwy 34, Corvallis, Oregon, 97333, USA.
  • 3 Toxicology Program, Department of Biological Sciences, North Carolina State University, Raleigh, Raleigh, NC, 27695-7633, USA Campus Box 7633.
Abstract

Zebrafish (danio rerio) are an ideal system for understanding developmental toxicity as they display similar toxicity outcomes to Other vertebrates. Further, many molecules have been tested for developmental toxicity in zebrafish providing an opportunity for machine learning model development. We curated 1345 small molecules from ToxCast, flame retardant compounds, per- and polyfluoroalkyl substances (PFAS), and industrial chemicals published by the Superfund Research Program (SRP). Following curation, we trained machine learning models on the zebrafish toxicity endpoints ANY_ = any effect including mortality, ANY_BUT_MORT = any effect excluding mortality, MORT = mortality ie did the embryo die, EDEM = did an edema form, CRAN = Craniofacial malformation. We demonstrated that these models were better than random when compared to shuffled data. We also fine-tuned the molecular SMILES encoder MolBART to predict on all zebrafish toxicity endpoints and found it generally matched the performance of classical machine learning models for ANY_BUT_MORT, CRAN, and EDEM endpoints. We present new toxicity data for Proteolysis Targeting Chimeras (PROTACs) in Zebrafish and machine learning models for these data by fingerprinting different parts of the molecule individually, yielding predictive performance (AUROC 0.6-0.7). If we are to reduce animal testing with new approach methodologies (NAMs) like these Zebrafish toxicity models they need to be able adapt to new molecular classes like PROTACs.

Keywords

Machine learning; MolBART; PROTACs; Zebrafish.

Figures
Products