Englander Institute for Precision Medicine

BEACON: predicting side effects and therapeutics outcomes to drugs by Bridging knowlEdge grAph with CONtextual language model.

TitleBEACON: predicting side effects and therapeutics outcomes to drugs by Bridging knowlEdge grAph with CONtextual language model.
Publication TypeJournal Article
Year of Publication2026
AuthorsXu C, Xu J, Bulusu KC, Pan H, Elemento O
JournalbioRxiv
Date Published2026 Jan 30
ISSN2692-8205
Abstract

Biomedical knowledge graphs encode millions of relationships between drugs, proteins, pathways, and diseases, yet translating this structured knowledge into accurate predictions remains challenging. Existing deep learning approaches, including graph neural networks and knowledge graph embeddings, assign fixed representations to entities regardless of biological context, limiting their ability to capture how the same gene or pathway functions differently across scenarios. These methods also lack interpretability and often fail when applied to novel drugs outside their training distribution. Here we present BEACON (Bridging knowlEdge grAph with CONtextual language model), a framework that transforms knowledge graphs into contextual sentence representations processable by language models. BEACON converts biomedical entities into tokens and relationships into syntactic dependencies, creating "sentence trees" that preserve graph structure while enabling contextual processing. A visibility matrix ensures that attention patterns respect the underlying knowledge graph topology, and a perturbation-based evaluation module identifies the specific genes, enzymes, and pathways driving each prediction. We demonstrate BEACON's versatility through two clinically important applications. For drug sensitivity prediction in cancer cell lines, BEACON achieves 0.941 AUROC and Spearman ρ = 0.919, outperforming existing methods (DrugCell, DeepCDR and DeepTTA). For drug-drug interaction (DDI) prediction, BEACON achieves 0.964 AUROC on the TwoSIDES benchmark and 0.84 AUROC on temporally held-out FDA adverse event data (2013-2023), demonstrating robust generalization to newly approved drugs. Applying BEACON to the BTK inhibitor acalabrutinib revealed that predicted interactions are enriched for drugs metabolized by CYP3A enzymes (OR = 3.01, P = 4.3 × 10-4), a mechanism validated through network proximity analysis. BEACON provides a unified, interpretable approach to knowledge graph-enhanced biomedical prediction.

DOI10.64898/2026.01.29.702277
Alternate JournalbioRxiv
PubMed ID41659468
PubMed Central IDPMC12874035
Grant ListP01 CA214274 / CA / NCI NIH HHS / United States
R01 CA194547 / CA / NCI NIH HHS / United States
UG3 CA244697 / CA / NCI NIH HHS / United States
UL1 TR002384 / TR / NCATS NIH HHS / United States

Weill Cornell Medicine Englander Institute for Precision Medicine 413 E 69th Street
Belfer Research Building
New York, NY 10021