TY - JOUR
T1 - Inferring TF activities and activity regulators from gene expression data with constraints from TF perturbation data
AU - Ma, Cynthia Z.
AU - Brent, Michael R.
N1 - Publisher Copyright:
© 2020 The Author(s) 2020. Published by Oxford University Press.
PY - 2021/5/1
Y1 - 2021/5/1
N2 - Motivation: The activity of a transcription factor (TF) in a sample of cells is the extent to which it is exerting its regulatory potential. Many methods of inferring TF activity from gene expression data have been described, but due to the lack of appropriate large-scale datasets, systematic and objective validation has not been possible until now. Results: We systematically evaluate and optimize the approach to TF activity inference in which a gene expression matrix is factored into a condition-independent matrix of control strengths and a condition-dependent matrix of TF activity levels. We find that expression data in which the activities of individual TFs have been perturbed are both necessary and sufficient for obtaining good performance. To a considerable extent, control strengths inferred using expression data from one growth condition carry over to other conditions, so the control strength matrices derived here can be used by others. Finally, we apply these methods to gain insight into the upstream factors that regulate the activities of yeast TFs Gcr2, Gln3, Gcn4 and Msn2.
AB - Motivation: The activity of a transcription factor (TF) in a sample of cells is the extent to which it is exerting its regulatory potential. Many methods of inferring TF activity from gene expression data have been described, but due to the lack of appropriate large-scale datasets, systematic and objective validation has not been possible until now. Results: We systematically evaluate and optimize the approach to TF activity inference in which a gene expression matrix is factored into a condition-independent matrix of control strengths and a condition-dependent matrix of TF activity levels. We find that expression data in which the activities of individual TFs have been perturbed are both necessary and sufficient for obtaining good performance. To a considerable extent, control strengths inferred using expression data from one growth condition carry over to other conditions, so the control strength matrices derived here can be used by others. Finally, we apply these methods to gain insight into the upstream factors that regulate the activities of yeast TFs Gcr2, Gln3, Gcn4 and Msn2.
UR - http://www.scopus.com/inward/record.url?scp=85108026710&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btaa947
DO - 10.1093/bioinformatics/btaa947
M3 - Article
C2 - 33135076
AN - SCOPUS:85108026710
SN - 1367-4803
VL - 37
SP - 1234
EP - 1245
JO - Bioinformatics
JF - Bioinformatics
IS - 9
ER -