TY - JOUR
T1 - Novel viral splicing events and open reading frames revealed by long-read direct RNA sequencing of adenovirus transcripts
AU - Price, Alexander M.
AU - Steinbock, Robert T.
AU - Lauman, Richard
AU - Charman, Matthew
AU - Hayer, Katharina E.
AU - Kumar, Namrata
AU - Halko, Edwin
AU - Lum, Krystal K.
AU - Wei, Monica
AU - Wilson, Angus C.
AU - Garcia, Benjamin A.
AU - Depledge, Daniel P.
AU - Weitzman, Matthew D.
N1 - Publisher Copyright:
© 2022 Public Library of Science. All rights reserved.
PY - 2022/9
Y1 - 2022/9
N2 - Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and polyadenylation sites have been characterized using low-Throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and RNA cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts that meet stringent criteria for expression. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORFs), six novel ORF-containing transcripts, and 15 transcripts encoding for messages that could alter protein functions through truncation or fusion of canonical ORFs. In addition, we detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking distinct gene transcription units. Among these chimeric proteins we detected an evolutionarily conserved protein containing the N-Terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies combined with mass spectrometry can reveal further complexity within viral transcriptomes and resulting proteomes.
AB - Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and polyadenylation sites have been characterized using low-Throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and RNA cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts that meet stringent criteria for expression. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORFs), six novel ORF-containing transcripts, and 15 transcripts encoding for messages that could alter protein functions through truncation or fusion of canonical ORFs. In addition, we detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking distinct gene transcription units. Among these chimeric proteins we detected an evolutionarily conserved protein containing the N-Terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies combined with mass spectrometry can reveal further complexity within viral transcriptomes and resulting proteomes.
UR - http://www.scopus.com/inward/record.url?scp=85138134382&partnerID=8YFLogxK
U2 - 10.1371/journal.ppat.1010797
DO - 10.1371/journal.ppat.1010797
M3 - Article
C2 - 36095031
AN - SCOPUS:85138134382
SN - 1553-7366
VL - 18
JO - PLoS pathogens
JF - PLoS pathogens
IS - 9
M1 - e1010797
ER -