Previous Article | Next Article ![]()
Journal of Virology, October 2005, p. 13094-13104, Vol. 79, No. 20
0022-538X/05/$08.00+0 doi:10.1128/JVI.79.20.13094-13104.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Christopher S. Sullivan,2
Achintya Pal,3
K. W. C. Peden,3
Andrew M. Lewis,3 and
James M. Pipas1*
Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania 15260,1 Howard Hughes Medical Institute and Departments of Microbiology and Medicine, G. W. Hooper Foundation, University of California, San Francisco, California 94143,2 Division of Viral Products, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, Maryland 208923
Received 26 May 2005/ Accepted 1 August 2005
|
|
|---|
|
|
|---|
Although over 11 polyomaviruses have been identified thus far, SV40 and murine polyomavirus are the best characterized. This is because they were the first identified members of the family and because both viruses are easily propagated in culture, where they undergo a well-defined productive infectious cycle. Following attachment to a specific receptor and transport of virions to the nucleus, viral chromatin is released and the early viral promoter is activated by the cellular transcription apparatus. This results in the synthesis of the viral early proteins, the tumor antigens or T antigens. The multifunctional T antigens function to drive the infected host cells into S phase, activate transcription of the viral late promoter, initiate and maintain viral DNA replication, and inactivate host defense systems. Consequently, new viral genomes are replicated and the virion proteins (VP1, VP2, and VP3) are expressed. The result is the assembly of about 300 infectious virions per cell.
SV40 is about 50 to 60% identical to murine polyomavirus DNA, and protein coding region identity ranges from 50% to 70% for the two viruses. SV40 is most closely related to BKV and JCV, with protein coding region identity ranging from 60 to 70% for JCV and 70 to 80% for BKV. While the protein coding regions of SV40, BKV, and JCV are very similar, each of these viruses has a distinct regulatory region. Specifically, the transcriptional enhancers that lie to the late side of the viral origin of DNA replication (ori) are virus specific. This observation leads to the speculation that polyomavirus host range might be determined by the compatibility of viral promoter elements and the cellular transcription apparatus.
Simian agent 12 (SA12) was first isolated from cultured kidney cells derived from Cercopithecus pygerythrus, the South African vervet monkey (20). Subsequently, it was found that a high percentage of Papio ursinus monkeys, chacma baboons, were seropositive for SA12 (41). Like other polyomaviruses, SA12 transforms cells in culture (40). Early studies demonstrated that SA12 is a polyomavirus closely related to SV40, JCV, and BKV (3, 19, 23, 32, 33). SA12 is more distantly related to a second nonhuman primate polyomavirus, lymphotropic polyomavirus (LPV) (24). In fact, limited sequence analysis indicated that SA12 is most closely related to BKV (8). Since that time, BKV and JCV have emerged as important human pathogens. Thus, as a step toward gaining a better understanding of the genetic elements that govern polyomavirus host range and to develop reagents that distinguish SA12 DNA from other polyomaviruses, especially BKV, we determined the complete sequence of a laboratory strain of SA12.
|
|
|---|
Mutagenesis. Nucleotide (nt) 10 of the SA12 genome was converted from an A to a T by using the QuikChange XL site-directed mutagenesis kit (Stratagene, La Jolla, CA) and by following the manufacturer's directions. pUC119-SA12 acted as the template in the mutagenesis reaction using the primers 5'-GAGGCGGCCTCGGCCTCTTATATATTATAAAAAAAAAGGC and 5'-GCCTTTTTTTTTATAATATATAAGAGGCCGAGGCCGCCTC. The base pair change in the resulting plasmid called pUC119-SA12-T10A was verified by sequencing.
Cell culture and virus growth assays. SA12 stocks were prepared from cloned viral DNA following transfection of BSC40 cells as described previously (8, 25). Briefly, pUC119-SA12 was digested with PstI to liberate the viral genome from the vector. Following inactivation of the restriction enzyme, the digest mixture was diluted in ligation buffer to a concentration of 3 µg/ml, and 1 unit of T4 DNA ligase added. The ligation reaction was carried out at 37°C for 4 h. Next the ligated DNA mixture was introduced into BSC40 cells by DEAE-dextran-mediated transfection as previously described (38). Transfected cells were maintained at 37°C in minimal essential medium supplemented with 10% fetal bovine serum. At 7 days posttransfection, the dishes were placed at 20°C until frozen. Following three freeze-thaws cycles and clarification by low-speed centrifugation, the virus stock was serially diluted and subjected to a plaque assay (8). In some cases, transfected cells were directly overlaid with agar for a plaque assay. Virus stocks were prepared by amplification of the virus present in individual plaques in BSC40 cells. Typically, titers of 3 x 107 PFU/ml were obtained.
Viral DNA replication. Freshly confluent BSC40 cells in a 6-cm dish were infected with SV40, SA12, or SA12-T10A at an multiplicity of infection (MOI) of 10. At various times postinfection, viral DNA was isolated by following the QIAprep spin miniprep kit protocol as described previously (44). After the monolayer was washed once with phosphate-buffered saline, the cells were lysed by adding 300 µl of buffers P1 and P2 (QIAGEN) to the dish and incubating it for 5 min at room temperature. With a rubber policeman, the cell lysate was transferred to a 1.5-ml polypropylene microcentrifuge tube. Next the lysate was digested by adding proteinase K to a final concentration of 800 µg/ml and incubating the tube at 55°C for 2 h. After the addition of 350 µl of buffer N3 and incubation on ice for 5 min, cellular DNA was precipitated by centrifugation for 10 min at 17,900 x g. Next the supernatant was applied to a QIAprep spin column and centrifuged for 1 min. The column was washed with 0.5 ml and 0.75 ml of buffers PB and PE, respectively, and samples were centrifuged after each addition of buffer for 1 min. Finally, DNA was eluted from the column by addition of 50 µl of elution buffer, incubation for 5 min, and centrifugation for 1 min. The DNA was then digested with BamHI or PstI to linearize the viral DNA. Following gel electrophoresis (0.7% agarose) the viral DNA was visualized by staining with GelStar (Cambrex Bio Science).
Mapping early mRNA splice sites. BSC40 cells were infected with SA12 at an MOI of 5 and maintained in minimal essential medium-10% fetal bovine serum at 37°C. At 24 and 72 h postinfection, cells were harvested using trypsin, and the cell pellet was washed three times with cold phosphate-buffered saline-EDTA. The cell pellet was frozen at 80°C until used. Total RNA was isolated from the cells by using the RNeasy kit (QIAGEN) with optional on-column DNase digestion. The isolation yielded on average 50 µg of total RNA per 10-cm tissue culture dish. Next, 1 µg of total RNA was used in a first-strand cDNA synthesis reaction mixture using Superscript II reverse transcriptase (Invitrogen) by following the recommendations of the manufacturer except that 50 units of enzyme was used. Primer 15 (5'-GGTGGGGTTGAGTGTTGAGAATC) was used in conjunction with primer 16 (5'-GATGGAGCAGGATGTAAAGGTAGC) and primer 17 (5'-TGGAGAAACACCCTTCAGAGA), in separate reactions, to flank the putative large T (LT) and small t (ST) early-region splice sites, respectively. Thirty-five cycles of PCR were performed on 1 µl of cDNA using 0.75 units of Taq DNA polymerase (Invitrogen) in a buffer containing 1.5 mM MgCl2, 0.2 mM deoxynucleoside triphosphates, and 0.4 µM of each primer. PCR products were resolved through a 1.5% agarose gel in 1x Tris-acetate-EDTA and stained with GelStar (Cambrex Bio Science). PCR bands were purified from the gel by using the QIAQuick gel extraction kit (QIAGEN) according the manufacturer's protocol and sequenced using primer 15.
miRNA analysis. RNA secondary-structure prediction showing predicted pre-microRNA (pre-miRNA) (36) was performed with mFOLD (45). Total RNA was isolated using RNABee (Tel-Test Inc., Friendswood, TX). RNA was fractionated on a denaturing urea 15% polyacrylamide gel, electroblot transferred to a ZetaProbe-GT membrane (Bio-Rad), hybridized in 12 ml of hybridization buffer (0.2 M sodium phosphate, 7% sodium dodecyl sulfate), and then probed overnight at 42°C. Membranes were washed, exposed to autoradiography film with an intensifying screen (Kodak), and developed after 1 to 3 days. Partially double- or single-stranded, radioactive oligonucleotide probes were generated by performing a Klenow fill-in reaction (Klenow exo-; New England Biolabs) with radioactive dCTP. Prior to the fill-in reaction, a universal linker [5'-(G)36ACCTGC] was annealed to the following oligonucleotides, which are antisense to portions of the predicted SA12 pre-miRNA hairpin: the 5' probe, 5'-CAGTGCTTTTCCCAAGCCTCAGATACCTCAGGCTCTGGCAGGTCG, and the 3' probe, 5'-CAGAAACTGAAGACTCTGGACATGGATCAAGCACTGGCAGGTCG.
Real-time PCR primer design. The sequence of SA12 was aligned using the ClustalW program with those of the primate polyomaviruses (SV40, BKV, JCV, and LPV) to identify genomic regions that were not conserved among these viruses so that primers and probes developed for SA12 would not be expected to react with the genomes of the other viruses. Two regions were identifiedone in the T antigen gene and the other in the region common to the VP2 and VP3 genes. Primers and probes were selected from these two regions using the Primer Express program (PE Applied Biosystems, Foster City, CA). The sequences of the primers/probe sets are shown in Table 1.
|
View this table: [in a new window] |
TABLE 1. Sequences and locations of the primers and probes for SA12
|
TaqMan data analysis. Standard curves were generated using cloned SA12 DNA. A standard curve was prepared for each experiment using 10-fold dilutions of purified viral DNA, representing 101 to 108 copies of DNA in 10 µl. All dilutions were prepared in 1x Tris-EDTA containing pUC19 DNA at 100 ng/ml as a stabilizer. TaqMan data were analyzed using Sequence Detection Systems v.1.7.0 (PE Applied Biosystems). Threshold values and baseline parameters for analysis of the raw data were selected according to the cycle threshold values for each experiment and by following the manufacturer's guidelines. The number of copies of viral genomes in a given sample is calculated by reference to the standard curve.
Nucleotide sequence accession number. The sequence of SA12 was submitted to GenBank and was assigned accession number AY614708.
|
|
|---|
Nucleotide sequence of SA12. The complete nucleotide sequence reveals that SA12 has a genome organization typical of the Polyomaviridae (Fig. 1). The viral regulatory region containing the promoter for early region transcription, the origin of viral DNA replication (ori), and the promoter for late region transcription and transcriptional enhancer sequences is flanked by coding sequences for large T and small t antigen and for an agnoprotein and the capsid proteins VP1, VP2, and VP3. We found no evidence for a middle T antigen, a protein encoded by murine polyomavirus and hamster polyomavirus. We also did not find any evidence for the expression of a T*-like protein; however, the sequence analysis and limited RNA studies that we performed do not exclude the possibility that one or more such proteins are produced. The organization of the late region is the same as is found in SV40, BKV, LPV, and JCV. This includes the position of the agnoprotein coding sequences, the overlap of VP2 and VP3 sequences, and the overlap of VP2/VP3 with VP1 coding sequences. Like SV40, SA12 encodes microRNAs on the late coding strand, overlapping the 3' end of early mRNA (this study).
![]() View larger version (34K): [in a new window] |
FIG. 1. SA12 genome. The SA12 genome is represented by the circle, with the numbers indicating base pair positions. The base pairs are numbered in a clockwise direction with base pair 1 being the GC bp of the third GAGGC repeat within the origin of DNA replication. Positions of the coding sequences for large T antigen, small t antigen, agnoprotein, and the viral capsid proteins VP1, VP2, and VP3 are indicated. The positions of the early mRNA introns are indicated. The position of the putative SA12-encoded microRNA is also shown.
|
Regulatory region. For the purposes of discussion, we define the viral regulatory region as the sequences that lie between the large T/small t antigen start codon and the agnoprotein initiation codon. This encompasses SA12 nucleotides 5124 to 294 (Fig. 2). Our previous studies revealed that the organization of the SA12 regulatory region is typical of polyomaviruses (8). The core ori contains four potential T antigen binding sequences, two on each strand, separated from each other by a single base pair (nucleotides 5219 to 11). Three of these pentanucleotides match the consensus T antigen binding sequence 5'-GAGGC-3', while the fourth has the sequence 5'-GTGGC-3'. A 27-bp imperfect palindrome lies to the early side of these T antigen binding sequences, while the late side is flanked by a 20-bp A/T-rich region. Based on studies with SV40, we hypothesize that these three elements (imperfect palindrome, T antigen binding pentanucleotides, and A/T-rich region) constitute the minimal ori (10). Two additional T antigen binding sites lie to the early side of the core ori. These correspond to the SV40 site I regulatory sequences involved in the autoregulation of T antigen transcription in SV40. The 263 bp that lies to the late side of the A/T-rich region contains several repeated sequences and transcription factor binding sites. Based on similarity to other polyomaviruses, these sequences most likely contain the late promoter and transcriptional enhancers. As we previously reported, the SA12 regulatory region is most closely related to BKV.
|
View larger version (26K): [in a new window] |
FIG. 2. SA12 regulatory region. The 401-bp regulatory region of SA12 extends from nucleotides 5124 to 294. Important regulatory elements are highlighted in color. Light green indicates the consensus pentanucleotide T antigen binding sequence, dark green indicates the nonconsensus pentanucleotide T antigen binding sequence, orange indicates the imperfect palindrome, and blue indicates the A/T-rich region. The enhancer region has been described elsewhere (8).
|
![]() View larger version (99K): [in a new window] |
FIG. 3. SA12 protein coding regions. The cDNA sequence of (A) large T antigen, (B) small t antigen, (C) VP1, (D) VP2, or (E) VP3 is displayed as the bottom sequence, and the translated protein sequence is displayed on top. The lengths of the proteins in numbers of amino acids (aa) are shown. Numbering for both the protein and the DNA sequence is on the left. An * in the protein sequence corresponds to the stop codon.
|
![]() View larger version (35K): [in a new window] |
FIG. 4. Sequence alignments of the variable regions and host range domains. The host range domains from SA12, BKV, and JCV were aligned (A) with and (B) without the host range domain of SV40. The variable regions from BKV, SA12, and JCV were aligned (C) with and (D) without the variable region of SV40. All alignments were done with MultAlin (7). Conserved residues are highlighted in red; partial conservation is indicated with blue, and no conservation is indicated with black. A consensus sequence is shown below each alignment, where conserved residues are represented as capital letters, partially conserved residues are in lowercase, and sites that show no homology are marked with a period.
|
Construction of an SA12 mutant with a consensus polyomavirus origin of DNA replication sequence. The large T antigen encoded by SV40 includes a DNA binding domain (origin binding domain) which recognizes the sequence 5'-GAGGC-3'. Four copies of this sequence are present in the SV40 core ori. Similarly, 9 of the previously sequenced 11 polyomaviruses have four copies of this pentanucleotide in their core ori regions, oriented in the same manner as SV40. The remaining two polyomaviruses, BFDV and GHFV, do not have this sequence in their ori regions. Presumably, the large T antigens encoded by these viruses recognize a different DNA sequence.
We previously reported that SA12 has three 5'-GAGGC-3' pentanucleotides in its ori region but that the fourth has the sequence 5'-GTGGC-3' (8). We hypothesized that the lower rate of viral DNA replication and small plaque size exhibited by SA12 might be due to the presence of this nonconsensus pentanucleotide. To test this hypothesis, we constructed a mutant, SA12-T10A, in which this pentanucleotide was changed to the consensus 5'-GAGGC-3' sequence. As shown in Fig. 5, SA12-T10A exhibited the same plaque size (data not shown), growth kinetics, and virus yield as wild-type SA12. Furthermore, the time course of viral DNA replication was the same for SA12-T10A and wild-type SA12. However, SA12-T10A replicated DNA to a somewhat lower level than wild-type SA12. Thus, conversion of the SA12 ori to the consensus does not enhance viral DNA replication or virus yield.
![]() View larger version (36K): [in a new window] |
FIG. 5. Mutation of the SA12 core ori to a consensus sequence does not enhance viral DNA replication or yield. (A) The sequence of the SA12 core ori is shown. Putative T antigen binding sites are underlined. The position of a T10A mutant that changes the left-most binding site to a consensus GAGGC is shown. (B) BSC40 cells were either mock infected or infected with SV40, SA12, or SA12-T10A, and at the indicated times postinfection (days), cells were harvested and lysed. Viral DNA was purified and digested with the appropriate enzyme and resolved through a 0.7% agarose gel. Digested DNAs from SV40, SA12, SA12-T10A, and mock reactions were loaded in lanes 4 to 9, lanes 10 to 15, lanes 16 to 21, and lanes 22 to 23, respectively. As a positive control, pSVB3 was digested with BamHI (lane 2) and pUC119-SA12 was digested with PstI (lane 3). DNA was visualized by staining with GelStar. Replicated viral DNA is indicated by an arrow. (C) Virus growth curve and yield. BSC40 cells were infected with SA12 wild-type wt100 (WT SA12) or SA12-T10A at an MOI of 5. At various times postinfection, the cells were frozen and thawed three times. The virus titer was then determined for each time point by a standard plaque assay.
|
![]() View larger version (47K): [in a new window] |
FIG. 6. Early mRNA splice junctions. (A) Sequence homology of early RNA splice sites. Regions surrounding the early RNA splice sites (5' large T donor, 5' small t donor, and 3' acceptor) from SA12, SV40, JCV, and BKV were aligned using MultAlin (7). Completely conserved residues are represented as capital letters, and partially conserved residues are in lowercase. Sites that show no homology are marked with a period. The consensus 5' donor and 3' acceptor splice site sequences are enclosed in a rectangle. (B) Genomic organization of the early region of SA12. Two pairs of PCR primers were designed to flank the putative splice sites, as identified for panel A. The putative splicing sites are marked (DL, putative large T donor site; DS, putative small t donor site; A, putative acceptor site). Sequences shared by large T and small t mRNAs are represented by vertical lines. The unique sequence that is part of the small t mRNA is represented by diamonds. The predicted PCR product sizes for genomic SA12, large T antigen mRNA, and small t antigen mRNA are shown for both primer pairs. (C) PCR analysis of the early region of SA12. Thirty-five cycles of PCR, using primer pair 15/16, were performed on 10 pg of pUC119-SA12 (positive control), 1 µl of the indicated cDNA, or no DNA (negative control). Various amounts of the PCR products were resolved on a 1.5% agarose gel and stained with GelStar. The sizes of the PCR markers are indicated on the right in base pairs. The positions of SA12 and small t and large T PCR products are indicated on the left. (D) PCR analysis of the small t splice junction. Thirty-five cycles of PCR were performed on 1 µl cDNA from the 72-h time point using the indicated primer pairs. PCR products were resolved on a 1.5% agarose gel and stained with GelStar. The sizes of the PCR markers are indicated on the left in base pairs. The positions of SA12 and small t PCR products are indicated on the right. (E) Identification of the early-region splice sites. The early splice junctions of SA12 are shown as in panel B. The SA12 sequence of each splice site (DL, DS, and A) is shown. Large-T antigen mRNA processing is displayed on top, and small t antigen mRNA processing on bottom. The first and last two bases of the intron are underlined. The size of the intron is indicated in base pairs. The exon sequences are highlighted in bold type. Arrows from the exon sequences point to the sequence chromatograph where the exon sequences are boxed.
|
The predicted size of the PCR fragment from genomic SA12 is 549 bp, which agrees with the PCR product of pUC119-SA12, where one band about this size is visible (Fig. 6C, lane 2). The large T antigen cDNA PCR product is visible in the 24- and 72-h cDNA reactions, but the small t antigen cDNA PCR product is visible only in the 72-h cDNA reaction (Fig. 4C, lanes 4 and 6). The PCR amplification of the 24- and 72-h cDNA revealed a band that comigrated with the band in the pUC119-SA12 reaction. This fragment was later shown to be contaminating SA12 genomic DNA through sequencing (data not shown). Since subsequent sequencing of the small t antigen PCR product using primer pair 15/16 was ineffective, we performed PCR on 72-h cDNA using primer pair 15/17 (Fig. 6D, lane 3).
Since the 72-h cDNA amplification yielded the strongest signals for the small t and large T antigen PCR products, the bands corresponding to the small t (Fig. 6D, lane 3) and large T (Fig. 6C, lane 6) antigens from these amplifications were excised, purified, and sequenced. Sequencing revealed that both share the same 3' acceptor splice site, which ends with the sequence AG (where the G is nucleotide position 4880). The 5' donor splice site for large T antigen starts with a GT (where the G is nucleotide position 4533), and the 5' donor splice site for small t antigen starts with a GT (where the G is nucleotide position 4603). Therefore, processing of large T antigen and small t antigen pre-mRNAs results in the removal of 348 and 71 bases of RNA, respectively (Fig. 6E).
SA12 encoded microRNAs. It is becoming increasingly clear that miRNAs, originally identified in Caenorhabditis elegans, play a broad and diverse role in the control of gene function in most eukaryotes (1). miRNAs are approximately 22-nucleotide RNAs that function by binding to the mRNAs of target genes with various degrees of antisense complementarity. miRNAs inhibit protein expression of their target mRNAs by driving cleavage or translational inhibition (37). Several viruses have recently been described to encode microRNAs; however, as with their cellular complements, little is known about their function (2, 28, 29). We have recently identified a virally encoded pre-microRNA expressed late in SV40 infection that is processed into several miRNAs (36). These miRNAs function to down-regulate early gene expression and reduce T-cell-mediated cellular lysis of SV40-infected cells in vitro, perhaps by reducing the amount of T antigen presented by major histocompatibility complex class I. Importantly, this pre-miRNA is computationally predicted to be conserved in other polyomaviruses, including JCV, BKV, and SA12 (36) (Fig. 7A), which suggests a conserved function for the miRNAs generated among these viruses. Thus, determining whether SA12 actually expresses virally encoded miRNAs would have important implications for the replication cycle of SA12 as well as the validity of the in silico engine that generated these predictions.
![]() View larger version (25K): [in a new window] |
FIG. 7. Expression of SA12 miRNAs. (A) A secondary-structure prediction of the SA12 hairpin, starting at nucleotide 2785, which is conserved among several polyomaviruses containing the pre-miRNA, is shown. (B) Probe diagram showing the sequence of the hairpin and probes designed to detect miRNAs derived from either arm 5' or 3' of the "terminal loop." (C) Northern blot showing expression of 60-nt and 22-nt bands in infected cells. Blots were probed with either the 5' or 3' probe as indicated. Lanes with RNA from uninfected (U) or 90 h postinfection (90) are indicated. The migration positions of oligonucleotide markers are indicated on the right side. Arrows indicate the locations of the bands corresponding to the pre-miRNA and the miRNA. MW, molecular weight.
|
60 nt, corresponding well with the observed pre-miRNA reported for SV40. Importantly, we also detected an approximately 22-nt band in all lanes that express the 60-nt band, consistent with a model in which SA12 derives
22-nt miRNAs from its abundant pre-miRNA precursor, similarly to SV40. Thus, we conclude that SA12 is the second member of the Polyomaviridae confirmed to encode miRNAs. Quantitative PCR assay that distinguishes SA12 from other primate polyomaviruses. In order to be able to quantify SA12 DNA for in vitro and in vivo studies, we have developed real-time, quantitative PCR assays for this viral genome. Two sets of primers and probes were chosen based on the Primer Express software (Table 1), one set for each of the early and late regions. When tested against the SA12 genome, the reactions were sensitive down to between 1 and 10 copies per reaction (Table 2). When tested against 107 copies of the related primate polyomaviruses BKV, JCV, SV40, and LPV, no reaction was found, demonstrating the specificity of the assay (Table 2).
|
View this table: [in a new window] |
TABLE 2. Sensitivities and specificities of the SA12 primer/probe sets
|
|
|
|---|
SA12 genome organization. As with other members of the Polyomaviridae, the early and late genes are encoded on opposite DNA strands and are separated by a noncoding regulatory region that includes the transcriptional promoters and enhancers and ori. The polyadenylation signals for early and late transcripts are located near each other at the opposite side of the circular genome from the ori region.
Viral regulatory sequences. We have previously reported the sequence of the SA12 regulatory region (8). The ori regions of most characterized polyomaviruses consist of four GAGGC pentanucleotides flanked by A/T-rich sequences on the late side and an imperfect palindrome on the early side. The transcriptional enhancers lie to the late side of the A/T-rich sequences. Each of these features is present in the SA12 regulatory region, except that SA12 contains only three GAGGC repeats. The position where the fourth pentanucleotide should be is occupied by a GTGGC pentanucleotide instead. As we previously observed, SA12 is very closely related to the human virus BKV, including in the enhancer, a region thought to be important in determining host range specificity. Analysis of the complete SA12 sequence reveals a close relationship between SA12 and BKV in all coding regions of the viral genome. This is evident at both the DNA and protein level as calculated by global sequence alignment.
SA12 is the third example of a polyomavirus that does not contain four GAGGC pentanucleotides within its core ori. The other exceptions are BFDV and GHFV, which do not contain any GAGGC pentanucleotides, suggesting that these viral T antigens have entirely different DNA binding specificities than SA12 and the other polyomaviruses. We considered the possibility that the SA12 strain that we sequenced acquired a mutation in one pentanucleotide during laboratory passage. To test this, we generated a mutant of SA12 that converted the GTGGC sequence to a consensus GAGGC pentanucleotide. The time courses of viral DNA replication were similar for the mutant and the laboratory isolate, although the mutant did not replicate DNA to the same extent. In addition, the presence of four consensus pentanucleotides did not alter SA12 plaque size or virus yield. We still cannot exclude the possibility that the GTGGC pentanucleotide is an artifact of laboratory passage. Sequencing of natural SA12 isolates will be needed to address this issue.
Early region. We detected coding sequences for large T antigen and small t antigen in the viral early region. SA12 does not contain sequences related to the middle T antigens encoded by the murine or hamster polyomaviruses. We also did not detect any evidence for T*- or tiny T-related proteins encoded by SV40, JCV, or murine polyomavirus; however, our data do not exclude the possibility that such proteins are made.
We previously reported that SA12 encodes an ST protein that consists of two regions: an amino-terminal J domain and a carboxy-terminal domain that contains spatially conserved cysteine repeats (8). This is typical of the ST antigens encoded by other polyomaviruses.
SA12 encodes an LT protein that consists of multiple domains that are related to the LT proteins encoded by other polyomaviruses. The amino-terminal J domain is followed by a putative flexible region (56 amino acids in SA12) that includes an LXCXE motif that governs interaction with the retinoblastoma family of tumor suppressor proteins and a nuclear localization signal. Amino acids that are phosphorylated in SV40 LT antigen as well as residues shown to be important for the interaction of SV40 LT antigen with CUL7 and BUB1 are conserved in SA12. These sequences are followed by the putative origin-binding domain, Zn-binding domain, and ATPase domain.
The LT proteins of many polyomaviruses terminate at the end of the ATPase domain. However, like SV40, BKV, and JCV, the SA12 LT antigen includes additional sequences at its carboxy terminus. These sequences consist of a variable region and the HR domain (27). The SV40 HR function maps to the carboxy-terminal 38 amino acids of LT antigen. There is good alignment between this region of SV40 LT antigen and the LT proteins of SA12, BKV, and JCV. The variable region of SV40 is not well conserved with the corresponding regions of the SA12, BKV, and JCV T antigens. Surprisingly we found an excellent alignment of the putative variable regions of SA12, BKV, and JCV. Thus, this region of the SV40 LT antigen has diverged significantly from the corresponding region of these other polyomavirus T antigens. The close sequence conservation of the variable regions of the SA12, BKV, and JCV T antigens makes it likely that this part of LT antigen serves an as-yet-unknown function.
Like SV40, SA12 encodes microRNAs on the late coding strand opposite the LT variable region (36). BKV and JCV are also predicted to encode microRNAs in this region, while other polyomaviruses are not predicted to encode microRNAs in this general region. So far all polyomaviruses that possess a variable region and an HR domain encode microRNAs. One intriguing possibility that would explain the close genetic linkage between the presence of a variable region/HR domain and microRNAs is that this region of LT antigen performs some function associated with perturbing microRNA biogenesis or function.
Late region. The SA12 late-coding region encodes an agnoprotein, as well as VP2, VP3, and VP1. The organization of the late coding sequences is similar to that found in other polyomaviruses, including overlapping VP2 and VP3 coding sequences, and the overlap of VP2/VP3 coding sequences with those of VP1. Known functional elements of the capsid proteins, such as the VP2/VP1 interaction surface, and key VP1 cysteine residues are conserved in SA12.
SA12 encodes microRNAs.
Like SV40, we have found that SA12 encodes microRNAs. The SA12 miRNAs share a remarkable degree of similarity with the SV40 miRNAs. First, the miRNAs derived from both viruses are complementary to the early transcripts, are encoded in the late orientation past the 3' polyadenylation cleavage site, and are expressed at higher levels as productive infection progresses late into the viral life cycle (data not shown). Second, both SA12 and SV40 display a pre-miRNA/miRNA ratio that is significantly higher than for most other reported viral or cellular microRNAs, with the pre-miRNA being more abundant than the miRNA (Fig. 7C). Third, the miRNAs derived from the pre-miRNA are readily detected from both arms of the pre-miRNA (Fig. 7C; note the
22-nt band in both the left and right panels), which is an atypical although not completely uncommon finding. This is consistent with miRNAs from both arms having complementarity to, and thus the ability to target, the early transcript. The congruity of miRNA attributes between SV40 and SA12 strongly supports a conserved function. Thus, it is likely that SA12, like SV40 (36), utilizes miRNAs to autoregulate early gene expression late in infection. Confirming that SA12 expresses this pre-miRNA lends credence to our earlier in silico predictions that also suggested that JCV and BKV express homologs of this pre-miRNA. Expression of these miRNAs may lead to reduced antigen presentation of the early gene products and perhaps contribute to less T-cell activation, as has been demonstrated for SV40 in vitro (36). In vivo, it is possible that miRNA-mediated autoregulation contributes to tamping down virus production during the course of a lifelong, persistent, subclinical infection that is common among these polyomaviruses.
SA12-specific quantitative PCR. The known primate polyomaviruses (SV40, BKV, JCV, LPV, and SA12) are closely related, with BKV, SV40, JCV, and SA12 all being remarkably similar to one another. The amino acid sequences of SA12 and BKV VP1 are 83% identical, and the overlapping VP2/VP3 coding sequences of SA12 and BKV are 82% identical. Therefore, it may be very difficult to distinguish these viruses by serology, as has been the case for SV40 and BKV (5, 22, 42). These viruses are also closely related at the nucleotide sequence level in that the amino acid sequences of SV40 and BKV VP1 are 82% identical and the overlapping VP2/VP3 coding sequences of SV40 and BKV are 72% identical.
Little is known about SA12 infections in nature. The fact that SA12 is so closely related to BKV raises the possibility that SA12 might infect humans. In order to follow the course of SA12 infection in baboons, the presumed natural host, and to distinguish SA12 infections from BKV infections, we developed a quantitative PCR assay that distinguishes SA12 from the other primate polyomaviruses. In an earlier study, we had developed quantitative PCR assays against SV40, BKV, and JCV (A. Pal et al., submitted for publication) and against LPV (unpublished). These primers and probes did not react against SA12, and thus specific quantitative PCR assays exist for all four of these primate polyomavirus genomes.
Polyomavirus diversity. The known primate polyomaviruses encode very closely related proteins. However, each possesses a unique regulatory region. Thus, it is hypothesized, though not yet proven, that the cell type tropism and host range exhibited by these viruses stems largely from their regulatory regions. While SA12 possesses a distinct regulatory region, it is closely related to the regulatory region of the human virus BKV. Based on the similarity of these two viruses, it will be interesting to determine if SA12 can infect humans. One intriguing possibility is that some polyomaviruses are continually crossing between human and nonhuman primates. However, the controversy that SV40 is a human infectious agent may indicate how difficult this will be to document. Finally we note that only 11 polyomaviruses have been sequenced to date and that each possesses common as well as unique features. This variation in polyomavirus sequences suggests that many other members of this family are circulating in the biosphere.
We thank Don Ganem for miRNA reagents and helpful discussions.
Present address: University of Pennsylvania School of Veterinary Medicine, 3800 Spruce St., Philadelphia, PA 19104. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»