Previous Article | Next Article ![]()
Journal of Virology, November 2008, p. 11247-11262, Vol. 82, No. 22
0022-538X/08/$08.00+0 doi:10.1128/JVI.00897-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.

Center for Pathogen Genomics, National Institute of Infectious Diseases, Tokyo 208-0011, Japan,1 Department of Virology II, National Institute of Infectious Diseases, Tokyo 208-0011, Japan,2 Sakai City Institute of Public Health, Osaka 590-0953, Japan3
Received 30 April 2008/ Accepted 25 August 2008
|
|
|---|
|
|
|---|
NoV is a nonenveloped virus of the Caliciviridae family. NoV has a single-stranded, positive-sense, polyadenylated RNA genome that encodes three open reading frames (ORFs): ORF1, ORF2, and ORF3 (51). ORF1 encodes a large polyprotein that is cleaved by the viral proteinase into six nonstructural proteins. ORF2 encodes a viral capsid protein (40). ORF3 encodes a VP2 protein that may function as a minor capsid protein for genome packaging (14). As is common in the RNA viruses, NoV in nature is genetically and antigenically highly diverse (16, 20, 21). NoV is tentatively divided into five genogroups (GI to GV) and more than 25 genotypes based on similarities among ORF2 sequences (1, 21). Among them, genogroup II genotype 4 (GII/4) is particularly important in public health, because it is the leading cause of NoV-associated acute gastroenteritis in humans worldwide since the mid-1990s (3, 12, 17, 28, 29, 32, 46).
Notably, the GII/4 epidemic has been augmented periodically during the past
15 years. Four NoV pandemics occurred in association with the emergence of new GII/4 variants in the winters of 1995-1996, 2002-2003, 2004-2005, and 2006-2007 (46). In the 2006-2007 epidemic, two variants, 2006a and 2006b, emerged and displaced the resident GII/4 variants in Europe (46). The emergence was associated with an atypical NoV epidemic in Europe (46) and Hong Kong (17). These and other chronological GII/4 variants that caused the global NoV epidemic had amino acid substitutions in the capsid protein (28, 46). Thus, host population immunity may play a role in the evolution of new GII/4 epidemic strains, as seen in the antigenic drift in the influenza virus (28, 46).
In Japan, the GII/4 strains are again the most prevalent since the 2002-2003 epidemic (35-38) and in natural environments (31, 41, 44). As seen in many countries, the rate of NoV infection in Japan increases in the winter. Notably, an atypically high level of NoV activity was noticed in early autumn and was sustained in the winter in 2006-2007 in Japan. The number of outbreaks of NoV infections reported by a nationwide surveillance group increased unusually rapidly in October (n = 72) compared to the previous year (n = 10), reaching peaks in November and December 2006 (n = 428 and 345, respectively), and decreasing during January and February 2007 (n = 164 and 82, respectively) (Infectious Disease Surveillance Center, http://idsc.nih.go.jp/iasr/prompt/graph-ke.html) (43, 52). The total number of infection cases during October and December in 2006 showed an
5-fold increase compared to the same season in 2005. We suspected the emergence of new NoV GII/4 variants, because the increase was not linked to changes in the surveillance system. Indeed, studies of the 5' capsid sequences suggested that a new GII/4 variant strain was circulating in some prefectures in Japan in 2006-2007 (26, 34, 50).
We report here comprehensive information on the near-full-length genomes of the NoV GII/4 variant strains in the atypical Japanese epidemic of 2006-2007. This information was used to examine the molecular mechanisms of the new NoV GII/4 epidemic. Our evolutionary and structural studies support a model of antigenic drift with tuning of multiple viral proteins for the periodic outgrowth of new NoV GII/4 variants.
|
|
|---|
![]() View larger version (26K): [in a new window] |
FIG. 1. (A) Geographic locations of 11 sample collection sites in Japan. (B) Schematic illustration of the locations of the primers used in the present study.
|
RNA extraction, reverse transcription, and long-range PCR. A 10% fecal suspension (wt/vol) was prepared with phosphate-buffered saline (PBS; pH 7.2) and centrifuged at 10,000 x g for 10 min. Viral RNA was extracted from the suspension by using a QIAamp viral RNA kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. The RNA extracts were stored at –80°C. Reverse transcription was performed with SuperScript III cDNA synthesis kit (Invitrogen, Carlsbad, CA). Seven microliters of the purified RNA was added to a reaction mixture (final, 20 µl) containing 50 pmol of random hexamar, 25 mM MgCl2 buffer, 10 mM deoxynucleoside triphosphate, 10x RT buffer, 0.1 M dithiothreitol, and 200 U SuperScript III RT. The mixture was incubated at 42°C for 1 h, after which 10 U of RNase H was added at 37°C for 20 min. Long-range PCR was performed with the Expand Long Template PCR System (Roche, Basel, Switzerland). Three microliters of cDNA was added to a reaction mixture (final, 50 µl) containing 350 µM deoxynucleoside triphosphate, 1.75 mM Mg2+, 300 nM PCR primers, and 3.5 U of Expand Long template enzyme. Amplification was performed for 25 to 30 cycles, at 94°C for 10 s, 60°C for 30 s, and 68°C for 5 min per cycle, and a final extension at 68°C for 8 min. Three microliters of the first PCR products was added to the reaction mixture (final 50 µl), and the same PCR condition was used for the second-round amplification.
DNA sequencing. The PCR products were purified with the QIAquick PCR purification kit and in some cases with a QIAquick gel extraction kit (Qiagen). The purified DNAs were used as a template for direct sequencing. The sequencing reaction was performed in a 96-well microplate using dye-terminator chemistry with BigDye version 3.1 (Applied Biosystems, Foster City, CA) according to the manufacturer's instructions. The reaction products were purified with the Montage SEQ96 sequencing reaction cleanup kit (Millipore, Bedford, MA) and sequenced on an automated ABI 3730 xl DNA analyzer (Applied Biosystems). To obtain full-length genome sequences, fragment sequences of 0.9, 5.2, and 2.5 from the same individual were aligned at an overlapping region by using the Staden Package (http://staden.sourceforge.net).
Phylogenetic analysis. Nucleotide sequences were aligned with an outgroup by using CLUSTAL W version 1.4 (49). A distance matrix of nucleotide substitutions per site was estimated from the alignment according to Kimura's two-parameter method (23). Neighbor-joining trees (42), maximum-likelihood trees, and UPGMA (unweighted pair-group method with arithmetic averages) trees were generated with 100 bootstrap replicates (10) from the matrix numbers by using MEGA version 3.0 (25). For the phylogenetic analysis of NoV ORF2 complete nucleotide sequences, we included sequences of well-recognized strains identified in the global GII/4 epidemic. They are the <1996 variants (Lordsdale strain [7], GenBank accession no. X86557; Bristol strain [15], accession no. X76716), the 1995-1996 epidemic variants (Grimsby strain, accession no. AJ004864; 95/96-US strain [32], accession no. AF080549; Camberwell strain [5], accession no. U46500), the 2002-2003 epidemic variants (Farmington Hills strain [53], accession no. AY502023; a United Kingdom strain [6], accession no. AY587990), and the 2004-2005 epidemic variants (Hunter strain [3], accession no. DQ078794; a Netherlands strain OB2004-083 [46], accession no. AB303941). We also included the reference sequences of the 2006/2007 epidemic variants, two Netherlands 2006a strains (Terneuzen70/2006/NL and Yerseke38/2006/NL [46], accession nos. EF126964 and EF126963, respectively), two Netherlands 2006b strains (DenHaag89/2006/NL and Nijmegen115/2006/NL [46], accession nos. EF126965 and EF126966, respectively), and a 2006b strain from Kobe, Japan (Kobe034/2006/JP, accession no. AB291542). All other reference sequences used in the present study were obtained from GenBank.
Analysis of amino acid variation.
Amino acid variations at individual positions of viral proteins were calculated according to the method described by Huang et al. (18) on the basis of Shannon's equation (45):
![]() |
Molecular modeling.
The crystal structure of the NoV capsid P domain of the GII/4 VA387 strain at a resolution of 2.00 Å (PDB code: 2OBS (4) was used as the modeling template. P domains of the GII/4 2006b strains have sequence similarities of greater than 90% to that of VA387, which are high enough to construct models with a root mean square distance of
1 Å for the main chain between the predicted and actual structures (2). A three-dimensional (3-D) model of the P domain monomer of the earliest 2006b strain in Japan (Aich3/2006/JP) was constructed by using MOE-Align and MOE-Homology in the Molecular Operating Environment (MOE; Chemical Computing Group, Inc., Montreal, Quebec, Canada) as described previously (33, 47). The 3-D structure was thermodynamically optimized by energy minimization using the MOE and an AMBER99 force field (39). A physically unacceptable local structure of the optimized 3-D model was further refined on the basis of Ramachandran plot evaluation by using MOE. The 3-D model of the P-domain dimer was constructed by the superimposition the structures of the P-domain monomer on chains A and B of the NoV capsid oligomer (PDB code 1IHM) (40).
Nucleotide sequence accession numbers. The DDBJ database accession numbers for the nucleotide sequences reported here are AB447427 to AB447463.
|
|
|---|
At least one genome sequence was obtained at each of the 11 collection sites (Fig. 1A and Table 1). The 35 genome sequences were collected in May, October, November, and December 2006 and in January 2007 (n = 1, 8, 12, 13, and 1, respectively). The collection months of two specimens were not recorded but were in the study period. The 28 sequences were collected from different age groups (n = 6, 3, 1, 4, 1, 4, 3, 2, and 4 for ages 0 to 9, 20 to 29, 30 to 39, 40 to 49, 50 to 59, 60 to 69, 70 to 79, 80 to 89, and 90 to 99 years old, respectively). The age groups of nine specimens were unavailable.
|
View this table: [in a new window] |
TABLE 1. Characteristics of NoV specimens and patients
|
15 years in The Netherlands (46). They referred to the two most newly emerged GII/4 epidemic variants in Europe as 2006a and 2006b (46). The genetic relationships of these and the new sequences obtained in the present study were examined by phylogenetic analysis. For the analysis, we included sequences of well-recognized reference strains from the global GII/4 epidemic during the past
15 years (see Materials and Methods for the strain names and accession numbers). In addition, we included GII/4 sequences identified in Japan during 1997 and 2006. A representative neighbor-joining tree shows that the nucleotide sequences of the complete capsid region of the Japanese 2006-2007 strains are divisible into three distinct lineage groups (Fig. 2A, yellow boxes). The first group, cluster I, included 33 of the 37 new sequences (89%) (Fig. 2A, cluster I, yellow boxes, bootstrap value 100/100). The cluster I specimens were from various age groups in all 11 sampling sites between May 2006 and January 2007. The second group, cluster II, included 3 of the 37 (8%) sequences. These were collected in the north of Japan (Aomori1/2006/JP and Aomori2/2006/JP) in December 2006 and in the south of Japan (Saga5/2006/JP) in November 2006 (Fig. 2A, cluster II, yellow boxes, bootstrap value 100/100). The third group, cluster III, included a single sequence collected in central Japan (Osaka2/2006/JP) in October 2006 (Fig. 2A, cluster III, a yellow box, bootstrap value 100/100).
![]() ![]() ![]() View larger version (98K): [in a new window] |
FIG. 2. Neighbor-joining trees of the nucleotide sequences of the complete ORF2 region ( 1.6 kb) (A) and the 5'-capsid N/S region (228 bases) (B and C) of NoV GII/4. Bootstrap values with 100/100 are indicated at the nodes of the tree. Sequence names are shown in the distinct color boxes on the basis of sampling sites and periods. Yellow boxes, 2006-2007 sequences obtained in the present study; green boxes, 2006/2007 sequences obtained in Europe, China, and Japan in other studies; orange boxes, <2006 sequences in Japan; pink boxes, 2005-2006 sequences in Japan: blue boxes, sequences from the global GII/4 epidemics pre-1996, 1995-1996, 2002-2003, and 2004-2005. Red letters with asterisks indicate sequences of the GII/4 2006a and 2006b epidemic strains in winter 2006-2007 in Europe (46).
|
The 5'-end segment of the VP1 gene, which encodes the capsid N-terminal/shell (N/S) domain (228 bases) (Fig. 1B) (20, 21), has been used to monitor the genotypes of NoV strains in many countries, including Japan. Therefore, we used published sequence information on the N/S region to verify the 2006b dominance in Japan in 2006-2007. A representative neighbor-joining tree shows that 36 of 42 (86%) of the sequences collected during 2006 and 2007 in Japan (26, 34, 50) were grouped with sequences from the cluster I 2006b strains (Fig. 2B, cluster I, green boxes). Only 3 (7%) and 3 (7%) sequences were grouped with sequences from cluster II 2006a and cluster III, respectively (Fig. 2B, clusters II and III, green boxes). Bootstrap values at the branching points of the groups were relatively low, at less than 50/100, probably because the sequences used for the analysis were relatively short. However, the monophyletic relationships among the sequences of clusters I, II, and III were reproducible when the tree was constructed with different algorithms, and the results were consistent with the phylogeny of the complete capsid sequences (Fig. 2A). Together with the sequences obtained in the present study, 69 of the 79 (87%) 5'-capsid sequences obtained across Japan during 2006-2007 were classified as 2006b strains. Only 6 (8%) and 4 (5%) of the 79 sequences were related to the 2006a and 2004-2005 epidemic variant strains, respectively.
Ozawa et al. (36) reported the N/S region sequences obtained across Japan during the winter of 2005-2006. These sequences were used to examine whether or not the 2006a and 2006b strains existed before the atypical Japanese epidemic in 2006-2007. A representative neighbor-joining tree shows that 25 of 36 (69%) of the 2005-2006 sequences (36) were grouped with sequences from the cluster III sequences that emerged in winter 2004-2005 in Japan (Fig. 2C, cluster III, pink boxes). Notably, 2 (6%) and 9 (25%) sequences of the 2005-2006 sequences were grouped with sequences from the cluster I 2006b and cluster II 2006a, respectively (Fig. 2C, clusters I and II, pink boxes). The monophyletic relationships of clusters I, II, and III sequences were reproducible when the tree was constructed with different algorithms. Together, these data suggest that the 2006a and 2006b strains were present in Japan as minorities in winter 2005-2006 and that only the Cluster I 2006b strains dominated over the resident GII/4 and 2006a strains during 2006. There were two conserved amino acid substitutions (S9N and T15A) in the N terminus of the capsid shell domain of the 2006b strains in winter 2006-2007 compared to the 2006b source in Japan in winter 2005-2006.
Genetic links between NoV ORF1, ORF3, and genome among patients in the 2006-2007 epidemics. Although partial sequences of the 3Dpol region have been reported (46), the molecular phylogenies of full-length ORF1 and ORF3 sequences of the 2006-2007 epidemic variants are unclear. Using phylogenetic analysis, we examined the genetic relationships between the new ORF1 and ORF3 sequences obtained in the present study and those of past epidemic GII/4 variant. For the analysis, we included sequences from Japan, China, Europe, and the United States, which were verified to be GII/4 strains by the genotyping of the ORF2 region.
Representative neighbor-joining trees show that the ORF1, ORF3, and genome sequences of the Japanese 2006-2007 variants again are divisable into three distinct lineage groups. All 33 sequences within cluster I in the ORF2 trees formed a monophyletic group (Fig. 3A, B, and C, cluster I, bootstrap value 100/100). Cluster I included a sequence from China in 2006 in the ORF3 tree (Duan/Beijing/2006/CHN). Cluster I was most closely related to the 02/03 cluster sequences in ORF1 and the genome, whereas in ORF3 it was most closely related to cluster II. The three sequences within cluster II in the ORF2 trees (Aomori1, Aomori2, and Saga5) also formed a unique group in all of the trees (Fig. 3A, B, and C, cluster II, bootstrap value 100/100). Finally, the single cluster III sequence in the ORF2 trees (Osaka2) was again grouped with the 2004-2005 epidemic variants in Japan and China (Fig. 3A, B, and C, cluster III, bootstrap value 100/100). The monophyletic relationships among clusters I, II, and III sequences were reproducible when the tree was constructed with different algorithms.
![]() ![]() ![]() View larger version (97K): [in a new window] |
FIG. 3. Neighbor-joining trees of the nucleotide sequences of the complete ORF1 (about 5.1 kb) (A), ORF3 (about 0.8 kb) (B), and near-full-length genome ( 7.5 kb) (C) regions of NoV GII/4. Bootstrap values with 100/100 are indicated at the nodes of the tree. Colored boxes indicate the sequences described in Fig. 2.
|
Amino acid signatures of the 2006-2007 GII/4 epidemic strains.
The amino acid substitutions specific to the cluster I 2006b strains were examined. The deduced amino acid sequences of ORF1, ORF2, and ORF3 of the 33 cluster I 2006b obtained in the present study were aligned with those of the past
15 years of epidemic strains described in Fig. 2 and 3. Amino acids specific to clusters I and II were extracted and referred to as amino acid signatures of the 2006b and 2006a epidemic variants, respectively.
Twenty-six amino acid signatures were identified for the cluster I 2006b strains (Fig. 4A). The numbers of the signatures were 6, 1, 2, 1, 1, 3, 7, and 5 in the N-term, NTPase, 3A, VPg, 3Cpro, 3Dpol, VP1 (capsid), and VP2 protein regions, respectively. All seven signatures of the capsid protein were mapped in the protruding P2 domain. Siebenga et al. (46) referred to 48 amino acid positions in the capsid protein as informative sites when at least two epidemic strains had an identical amino acid. Of the 48 positions 7 were perfectly preserved only in the 2006b strains in Japan. The cluster II 2006a strains each had a distinct signature pattern (Fig. 4B). The numbers of 2006a signatures were 5, 4, 1, 6, 3, and 2 in the N-term, NTPase, 3A, 3Dpol, VP1, and VP2 protein regions, respectively. VPg and 3Cpro of 2006a had no signatures.
![]() ![]() View larger version (44K): [in a new window] |
FIG. 4. Unique amino acid substitutions of the 2006-2007 GII/4 epidemic strains. The deduced amino acids of NoV GII/4 ORF1s, ORF2s, and ORF3s of the 2006b and 2006a strains were aligned with those of the past chronological epidemic strains. Amino acids are each indicated by one letter code. The positions in the ORFs of the Lordsdale strain are used for the amino acid numbering. Dots indicate amino acids identical to the Lordsdale strain. Amino acid substitutions specific to the 2006b (A) and 2006a (B) strains are shown in boldface italic letters.
|
![]() View larger version (24K): [in a new window] |
FIG. 5. Amino acid variation among the GII/4 epidemic variant proteins. Shannon entropy scores representing variations at individual amino acid positions (18) were calculated using NoV GII/4 ORF1, ORF2, and ORF3 sequences described in Fig. 3A, 2A, and 3B, respectively. The distribution of entropy scores in ORF1, ORF2, and ORF3 is shown along with the amino acid positions. An entropy score of 0 indicates absolute conservation, whereas that of 4.4 indicates complete randomness. Blue bars indicate regions rich in 2006b-specific changes.
|
Structural insights into roles of capsid amino acid signatures. An NoV capsid protein consists of a shell (S) and two protruding (P) domains: P1 and P2 (40). The P domain is exposed on the surface of the capsomer (40). Therefore, the P domain should play critical roles in virus entry into the target and host immune responses, although the precise roles have not been elucidated due to the lack of a tissue culture system to support effective NoV replication. To obtain structural insights into the roles of the amino acid substitutions specific to the 2006b capsid protein, we constructed a 3-D structure model of the VP1 P domain. The X-ray crystal structure of the P domain dimer of VA387 (4) was used for the modeling template, and the structure of the P domain dimer of a 2006b strain, Aichi3/2006/JP, was constructed by homology modeling. The model predicts that the folding of the P2 dimer is identical between the 2006b and the 1995-1996 epidemic strains.
The Shannon entropy scores with the GII/4 capsid P domain in Fig. 5 were expressed on the 3-D structure model (Fig. 6A). Notably, all seven amino acid signatures specific to the capsid domain of the 2006b strains were mapped on the variable P2 domain. Six of the seven signatures—L306, A356, P357, E372, H378, and A412—were positioned on the loops of the P2 domain, whereas the seventh signature, Y352, was positioned on the β-sheet of the P2 domain (Fig. 6A, side view). The side chains of six residues—Y352, A356, P357, E372, H378, and A412—formed accessible outer surfaces of the P2 domain, whereas the side chain of the seventh residue, L306, was positioned toward the P1 domain and was enclosed by neighboring amino acids (Fig. 6A, top view). E372 was located near the β2-β3 loop, which is one of the protruding loops on P2 and is extended by insertion mutation in the GII/3 (24). Thus, the 2006b-specific substitutions in the capsid protein were mostly clustered around regions with which host proteins such as an infection receptor(s) and antibodies can directly interact.
![]() View larger version (101K): [in a new window] |
FIG. 6. Structural model of the VP1 P domain dimer of the NoV GII/4 2006b strain. The model was constructed by homology modeling using the X-ray crystal structure of the P domain dimer of the 1995-1996 epidemic GII/4 strain (4). (A) Shannon entropy scores expressed on the P domain model. (B) Side and top views of the P domain model. Reported functional sites for virus entry into the cells are highlighted. Yellow dot circles, the fucose ring binding sites formed by P-domain dimer (4); cyan chain, an RGD motif (48) on the β2 sheet of the P domain; orange chain indicates an additional RGD-like motif, KGD (46), on the tip of the β4-β5 loop of the P domain. Red sticks indicate side chains of amino acids unique to the 2006b strains.
|
Interestingly, a 2006b-specific amino acid, H378, was positioned near the RGD motif and the newly created KGD motif (Fig. 6, cyan and orange ribbons, respectively). The H378 was also located near the binding site of the fucose ring (4), a part of the putative NoV infection receptor (Fig. 6, yellow dot circles). Consequently, H378, the RGD motif, and the new KGD motif each formed a protein surface rich in charged residues. Y352, A356, P357, and N412 were arranged linearly. These amino acids formed a putative accessible binding cleft on the P2 domain.
Siebenga et al. (46) have also reported the 3-D location of the variable sites using a structural model of the GII/4 capsid protein. They searched the sites where at least two strains had an identical amino acid substitution within a given sequence alignment (informative sites). This method helps one to identify mutations unique to some strains within a variant population. Using an alignment of complete capsid sequences of past epidemic variants, two 2006a strains, and two 2006b strains in Europe, they showed that many informative sites of the GII/4 capsid protein mapped to the surface of the P2 domain. Importantly, all 7 capsid amino acid signatures unique to the 2006b strains identified in the present study were included in the 48 informative sites obtained with the European strains.
|
|
|---|
Phylogenetic study of genomic and partial sequences provides basic information on the evolutionary histories of the GII/4 2006b epidemic variant strains (Fig. 2 and 3). The strong monophyly of the 2006b sequences out of past epidemic variants within the GII/4 cluster indicates that the 2006b strains were from an unidentified GII/4 variant source in nature. The branching order of the 2006b sequence cluster suggests that the 2006b source evolved from a common ancestor of the GII/4 02/03 strains, which is consistent with a previous conclusion (46). The constant monophyly of the 2006b sequences across distinct genomic regions within the GII/4 cluster indicates that the 2006b source is not the intergenogroup recombinant.
Notably, the 2006b source was present in Japan in winter 2005-2006 before notification of the atypical epidemic in winter 2006-2007 (Fig. 2C). In the 2005-2006 season, the 2006b source was rather minor compared to the 2004-2005 and 2006a strains (Fig. 2C). However, the minor 2006b strains dominated the resident GII/4 variants in Japan during 2006 (Fig. 2 and 3). Similar replacement of the resident GII/4 variants with the 2006b strains was observed in Europe (46) and Hong Kong (17) during 2006 as well. These data provide additional evidence for the recent global epidemic of the 2006b strains and strongly suggest that the new variant strains had selective advantages over other GII/4 strains that had been circulating before 2006.
Similar replacement of resident GII/4 strains by new GII/4 variant strains has periodically occurred on a global scale during the past
15 years and has often been associated with atypical NoV epidemics, as seen in the 2006-2007 epidemic in Japan (3, 12, 17, 28, 29, 32, 46). As Siebenga et al. (46) have discussed, the putative enhancement in the physical stability of virion, viral infectivity, or replication capability in cells alone may not explain the periodic outgrowth of new GII/4 variant strains. Indeed, no studies have indicated that the infectivity of GII/4 or the viral load of GII/4 in a stool specimen is increasing year by year.
Siebenga et al. (46) proposed that host population immunity might possibly act as a selective pressure for the periodic outgrowth phenomenon. They based this hypothesis on their observation that the GII/4 epidemic variants in The Netherlands underwent periodic amino acid substitutions in the capsid proteins. The results of our study are consistent with that hypothesis. The present analysis allowed us to identify the seven conserved amino acid signatures in the capsid of the 2006b strains (Fig. 4A). Remarkably, all seven signatures were arranged on the most protruding P2 domain of the capsid protein. Moreover, six of the seven signatures were positioned on the exposed protruding loops (Fig. 6). Interestingly, the 2006a strain, a recessive new GII/4 strain that appeared in 2006 in Japan, had only two unique signatures in the capsid (Fig. 4). These results provide further evidence that antigenic drift is involved in the outgrowth of new GII/4 strains over circulating GII/4 variants (28, 46).
The 2006b capsid signatures were positioned around the binding sites to putative viral infection receptors (Fig. 6). These amino acids are arranged so that the electrostatic potential on the binding surface and the shape of the ligand-binding cavity can be influenced. The data may imply that some capsid signatures are involved in controlling viral binding properties to infection receptors and thus may provide new bases on which to study the interplay among mutations. It is possible that some antigenic variations on the exposed surface of the capsid P2 domain could result in the functional deterioration or a defect of the capsid protein; therefore, compensatory mutations are required for virus survival. Experiments addressing each of these issues are in progress.
Previous studies focused mostly on substitutions in the capsid protein. Therefore, haplotype and noncapsid genetic changes of the 2006b strains were unclear. In addition to the capsid signatures, the present study discloses 19 changes specific to the 2006b strains (Fig. 4A). The high level of conservation of these 2006b signatures at otherwise variable sites among the GII/4 strains suggest the positive roles of some signatures in the epidemic in humans. Some signatures may be required for the escape from host population immunity, as suggested in the capsid. However, strong immune responses other than those against the capsid have not been reported.
An alternative possible role of conservation may be the maintenance or augmentation of replication fitness of the capsid mutants within a mixed variant population. In this regard, the 2006b signatures outside of the capsid region were particularly prominent in the N-term, and VP2 (Fig. 4A). The N-term is thought to function as a scaffolding protein for assembly of the genome replication complex (9, 11). The VP2 is thought to function as a minor capsid protein for RNA genome packaging (14). Therefore, some amino acid changes in these regions could critically influence intracellular viral replication and might have positive effects on maintaining improved replication fitness of the 2006b strains within a circulating NoV variant population. Further study is necessary to clarify these possibilities.
Another important factor contributing to the NoV diversity is evolutionary bottlenecks. For example, the monomorphic and divergent genotype of the 2006b observed in the present study could have been generated by a bottleneck effect during virus transmission: a few variants might have been transmitted, replicated, and spread to human populations. Therefore, it is possible that some unique genetic changes observed in the 2006b strains were primarily derived from the bottlenecks. On the other hand, given the extremely high levels of NoV infectivity (8, 27), it is likely that competition and natural selection of NoV variants within a mixed virus population occur frequently during virus spread to human populations. Further study is necessary to clarify how bottlenecks and natural selection shape the genetic diversity of NoV global epidemic variants.
Our findings, together with the general observations described in the introduction section, are compatible with a simple scheme of the NoV GII/4 evolution in nature: covariation in the capsid and other proteins confers an advantage in maintaining replication fitness in the face of population immunity, thereby being fixed better in virus that spread in humans during epidemics. When we take into account the ubiquitous existence of NoV in living and natural environments (8, 31, 41, 44), along with high levels of NoV infectivity (8, 27), physical stability (22), and virus production in asymptomatic individuals (36), it appears that humans are likely to be exposed much more frequently to NoVs than had previously been supposed. If this is the case, a human population could maintain certain levels of immunity against resident NoV variants, leading to periodic outgrowth, more severe symptomatic infection, and thus more frequent detection of antigenic variants that restore growth abilities by compensatory mutations.
In summary, we used here viral genome analysis and computational approaches to provide a new insight into the periodic pandemic of the GII/4 strain. A combination of phylogenetic and entropy analyses revealed for the first time the monomorphic and divergent haplotypes of the new epidemic strains. The data and computer-assisted structural study support a model of antigenic drift with tuning of the structure and functions of multiple proteins for the global outgrowth of new GII/4 variants. The availability of information on genome sequences and unique changes of the most recent epidemic variant strains will support studies of diagnostic assays and molecular epidemiology of NoV epidemic variants. Moreover, such information will provide a molecular basis for studying the interplay of viral proteins during the replication and adaptive evolution of NoV in nature.
This study was supported by grants for Research on Emerging and Re-emerging Infectious Diseases and for Research on Food Safety from the Ministry of Health, Labor, and Welfare of Japan.
Published ahead of print on 4 September 2008. ![]()
Participants of the Norovirus Surveillance Group of Japan who contributed to this study include Shima Yoshizumi (Hokkaido Institute of Public Health, Hokkaido, Japan), Toshiyuki Mikami (Aomori Institute of Public Health, Aomori, Japan), Hiroyuki Saito (Akita Prefectural Research Center for Public Health and Environment, Akita, Japan), You Ueki (Miyagi Prefectural Institute of Public Health and Environment, Sendai, Japan), Takenori Takizawa (Toyama Institute of Public Health, Toyama, Japan), Kiyoko Uchino (Sakai City Institute of Public Health, Sakai, Osaka, Japan), Mamoru Noda (National Institute of Health Sciences, Tokyo, Japan), Reiko Kondo (Ehime Prefecture Institute of Public Health and Environmental Science, Ehime, Japan), Yumiko Matsuoka (Kumamoto City Institute of Public Health, Kumamoto, Japan), Sadayuki Funatsumaru (Saga Institute of Public Health, Saga, Japan), and Shinichi Kobayashi (Aichi Prefectural Institute of Public Health, Nagoya, Aichi, Japan). ![]()
|
|
|---|
s. J. Virol. 81:7280-7285.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»