TY - JOUR
T1 - LNCipedia
T2 - A database for annotated human IncRNA transcript sequences and structures
AU - Volders, Pieter Jan
AU - Helsens, Kenny
AU - Wang, Xiaowei
AU - Menten, Björn
AU - Martens, Lennart
AU - Gevaert, Kris
AU - Vandesompele, Jo
AU - Mestdagh, Pieter
N1 - Funding Information:
Ghent University Multidisciplinary Research Partnership ‘Bioinformatics: from nucleotides to networks’ (to P.J.V., L.M., K.G., J.V.); National Institutes of Health [R01GM089784 to X.W.]; Flemish Fund for Scientific Research Flanders (FWO) (to P.M.); Ghent University Special Research Fund (BOF) (to J.V.). Funding for open access charge: Ghent University.
PY - 2013/1/1
Y1 - 2013/1/1
N2 - Here, we present LNCipedia (http://www.lncipedia.org), a novel database for human long non-coding RNA (IncRNA) transcripts and genes. LncRNAs constitute a large and diverse class of non-coding RNA genes. Although several IncRNAs have been functionally annotated, the majority remains to be characterized. Different high-throughput methods to identify new IncRNAs (including RNA sequencing and annotation of chromatin-state maps) have been applied in various studies resulting in multiple unrelated IncRNA data sets. LNCipedia offers 21488 annotated human IncRNA transcripts obtained from different sources. In addition to basic transcript information and gene structure, several statistics are determined for each entry in the database, such as secondary structure information, protein coding potential and microRNA binding sites. Our analyses suggest that, much like microRNAs, many IncRNAs have a significant secondary structure, in-line with their presumed association with proteins or protein complexes. Available literature on specific IncRNAs is linked, and users or authors can submit articles through a web interface. Protein coding potential is assessed by two different prediction algorithms: Coding Potential Calculator and HMMER. In addition, a novel strategy has been integrated for detecting potentially coding IncRNAs by automatically re-analysing the large body of publicly available mass spectrometry data in the PRIDE database. LNCipedia is publicly available and allows users to query and download IncRNA sequences and structures based on different search criteria. The database may serve as a resource to initiate small- and large-scale IncRNA studies. As an example, the LNCipedia content was used to develop a custom microarray for expression profiling of all available IncRNAs.
AB - Here, we present LNCipedia (http://www.lncipedia.org), a novel database for human long non-coding RNA (IncRNA) transcripts and genes. LncRNAs constitute a large and diverse class of non-coding RNA genes. Although several IncRNAs have been functionally annotated, the majority remains to be characterized. Different high-throughput methods to identify new IncRNAs (including RNA sequencing and annotation of chromatin-state maps) have been applied in various studies resulting in multiple unrelated IncRNA data sets. LNCipedia offers 21488 annotated human IncRNA transcripts obtained from different sources. In addition to basic transcript information and gene structure, several statistics are determined for each entry in the database, such as secondary structure information, protein coding potential and microRNA binding sites. Our analyses suggest that, much like microRNAs, many IncRNAs have a significant secondary structure, in-line with their presumed association with proteins or protein complexes. Available literature on specific IncRNAs is linked, and users or authors can submit articles through a web interface. Protein coding potential is assessed by two different prediction algorithms: Coding Potential Calculator and HMMER. In addition, a novel strategy has been integrated for detecting potentially coding IncRNAs by automatically re-analysing the large body of publicly available mass spectrometry data in the PRIDE database. LNCipedia is publicly available and allows users to query and download IncRNA sequences and structures based on different search criteria. The database may serve as a resource to initiate small- and large-scale IncRNA studies. As an example, the LNCipedia content was used to develop a custom microarray for expression profiling of all available IncRNAs.
UR - http://www.scopus.com/inward/record.url?scp=84875530540&partnerID=8YFLogxK
U2 - 10.1093/nar/gks915
DO - 10.1093/nar/gks915
M3 - Article
C2 - 23042674
AN - SCOPUS:84875530540
SN - 0305-1048
VL - 41
SP - D246-D251
JO - Nucleic acids research
JF - Nucleic acids research
IS - D1
ER -