- Nishimura, Yosuke;
- Watai, Hiroyasu;
- Honda, Takashi;
- Mihara, Tomoko;
- Omae, Kimiho;
- Roux, Simon;
- Blanc-Mathieu, Romain;
- Yamamoto, Keigo;
- Hingamp, Pascal;
- Sako, Yoshihiko;
- Sullivan, Matthew B;
- Goto, Susumu;
- Ogata, Hiroyuki;
- Yoshida, Takashi
- Editor(s): Tamaki, Hideyuki
Metagenomics has revealed the existence of numerous uncharacterized viral lineages, which are referred to as viral "dark matter." However, our knowledge regarding viral genomes is biased toward culturable viruses. In this study, we analyzed 1,600 (1,352 nonredundant) complete double-stranded DNA viral genomes (10 to 211 kb) assembled from 52 marine viromes. Together with 244 previously reported uncultured viral genomes, a genome-wide comparison delineated 617 genus-level operational taxonomic units (OTUs) for these environmental viral genomes (EVGs). Of these, 600 OTUs contained no representatives from known viruses, thus putatively corresponding to novel viral genera. Predicted hosts of the EVGs included major groups of marine prokaryotes, such as marine group II Euryarchaeota and SAR86, from which no viruses have been isolated to date, as well as Flavobacteriaceae and SAR116. Our analysis indicates that marine cyanophages are already well represented in genome databases and that one of the EVGs likely represents a new cyanophage lineage. Several EVGs encode many enzymes that appear to function for an efficient utilization of iron-sulfur clusters or to enhance host survival. This suggests that there is a selection pressure on these marine viruses to accumulate genes for specific viral propagation strategies. Finally, we revealed that EVGs contribute to a 4-fold increase in the recruitment of photic-zone viromes compared with the use of current reference viral genomes. IMPORTANCE Viruses are diverse and play significant ecological roles in marine ecosystems. However, our knowledge of genome-level diversity in viruses is biased toward those isolated from few culturable hosts. Here, we determined 1,352 nonredundant complete viral genomes from marine environments. Lifting the uncertainty that clouds short incomplete sequences, whole-genome-wide analysis suggests that these environmental genomes represent hundreds of putative novel viral genera. Predicted hosts include dominant groups of marine bacteria and archaea with no isolated viruses to date. Some of the viral genomes encode many functionally related enzymes, suggesting a strong selection pressure on these marine viruses to control cellular metabolisms by accumulating genes.