TY - JOUR
T1 - NetProphet 3
T2 - a machine learning framework for transcription factor network mapping and multi-omics integration
AU - Abid, Dhoha
AU - Brent, Michael R.
N1 - Publisher Copyright:
© The Author(s) 2023. Published by Oxford University Press.
PY - 2023/2/1
Y1 - 2023/2/1
N2 - Motivation: Many methods have been proposed for mapping the targets of transcription factors (TFs) from gene expression data. It is known that combining outputs from multiple methods can improve performance. To date, outputs have been combined by using either simplistic formulae, such as geometric mean, or carefully hand-tuned formulae that may not generalize well to new inputs. Finally, the evaluation of accuracy has been challenging due to the lack of genome-scale, ground-truth networks. Results: We developed NetProphet3, which combines scores from multiple analyses automatically, using a tree boosting algorithm trained on TF binding location data. We also developed three independent, genome-scale evaluation metrics. By these metrics, NetProphet3 is more accurate than other commonly used packages, including NetProphet 2.0, when gene expression data from direct TF perturbations are available. Furthermore, its integration mode can forge a consensus network from gene expression data and TF binding location data.
AB - Motivation: Many methods have been proposed for mapping the targets of transcription factors (TFs) from gene expression data. It is known that combining outputs from multiple methods can improve performance. To date, outputs have been combined by using either simplistic formulae, such as geometric mean, or carefully hand-tuned formulae that may not generalize well to new inputs. Finally, the evaluation of accuracy has been challenging due to the lack of genome-scale, ground-truth networks. Results: We developed NetProphet3, which combines scores from multiple analyses automatically, using a tree boosting algorithm trained on TF binding location data. We also developed three independent, genome-scale evaluation metrics. By these metrics, NetProphet3 is more accurate than other commonly used packages, including NetProphet 2.0, when gene expression data from direct TF perturbations are available. Furthermore, its integration mode can forge a consensus network from gene expression data and TF binding location data.
UR - http://www.scopus.com/inward/record.url?scp=85147834079&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btad038
DO - 10.1093/bioinformatics/btad038
M3 - Article
C2 - 36692138
AN - SCOPUS:85147834079
SN - 1367-4803
VL - 39
JO - Bioinformatics
JF - Bioinformatics
IS - 2
M1 - btad038
ER -