Loading...
Thumbnail Image
Publication

BAMBI integrates biostatistical and artificial intelligence methods to improve RNA biomarker discovery

Zhou, Peng
Li, Zixiu
Liu, Feifan
Kwon, Euijin
Hsieh, Tien-Chan
Ye, Shangyuan
Vasudevan, Shobha
Lee, Jung Ae
Tran, Khanh-Van
Zhou, Chan
Embargo Expiration Date
Abstract

RNA biomarkers enable early and precise disease diagnosis, monitoring, and prognosis, facilitating personalized medicine and targeted therapeutic strategies. However, identification of RNA biomarkers is hindered by the challenge of analyzing relatively small yet high-dimensional transcriptomics datasets, typically comprising fewer than 1000 biospecimens but encompassing hundreds of thousands of RNAs, especially noncoding RNAs. This complexity leads to several limitations in existing methods, such as poor reproducibility on independent datasets, inability to directly process omics data, and difficulty in identifying noncoding RNAs as biomarkers. Additionally, these methods often yield results that lack biological interpretation and clinical utility. To overcome these challenges, we present BAMBI (Biostatistical and Artificial-intelligence Methods for Biomarker Identification), a computational tool integrating biostatistical approaches and machine-learning algorithms. By initially reducing high dimensionality through biologically informed statistical methods followed by machine learning-based feature selection, BAMBI significantly enhances the accuracy and clinical utility of identified RNA biomarkers and also includes noncoding RNA biomarkers that existing methods may overlook. BAMBI outperformed existing methods on both real and simulated datasets by identifying individual and panel biomarkers with fewer RNAs while still ensuring superior prediction accuracy. BAMBI was benchmarked on multiple transcriptomics datasets across diseases, including breast cancer, psoriasis, and leukemia. The prognostic biomarkers for acute myeloid leukemia discovered by BAMBI showed significant correlations with patient survival rates in an independent cohort, highlighting its potential for enhancing clinical outcomes. The software is available on GitHub (https://github.com/CZhouLab/BAMBI).

Source

Zhou P, Li Z, Liu F, Kwon E, Hsieh TC, Ye S, Vasudevan S, Lee JA, Tran KV, Zhou C. BAMBI integrates biostatistical and artificial intelligence methods to improve RNA biomarker discovery. Brief Bioinform. 2025 Mar 4;26(2):bbaf073. doi: 10.1093/bib/bbaf073. PMID: 40121554; PMCID: PMC11929966.

Year of Medical School at Time of Visit
Sponsors
Dates of Travel
DOI
10.1093/bib/bbaf073
PubMed ID
40121554
Other Identifiers
Notes
Funding and Acknowledgements
Corresponding Author
Related Resources
Related Resources
Repository Citation
Rights
© The Author(s) 2025. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/ licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com