- Roux, Simon;
- Paul, Blair G;
- Bagby, Sarah C;
- Nayfach, Stephen;
- Allen, Michelle A;
- Attwood, Graeme;
- Cavicchioli, Ricardo;
- Chistoserdova, Ludmila;
- Gruninger, Robert J;
- Hallam, Steven J;
- Hernandez, Maria E;
- Hess, Matthias;
- Liu, Wen-Tso;
- McAllister, Tim A;
- O’Malley, Michelle A;
- Peng, Xuefeng;
- Rich, Virginia I;
- Saleska, Scott R;
- Eloe-Fadrosh, Emiley A
Changes in the sequence of an organism's genome, i.e., mutations, are the raw material of evolution. The frequency and location of mutations can be constrained by specific molecular mechanisms, such as diversity-generating retroelements (DGRs). DGRs have been characterized in cultivated bacteria and bacteriophages, where they perform error-prone reverse transcription that introduces mutations into specific target genes. DGR loci have also been identified in several metagenomes, but the ecological roles and evolutionary drivers of these DGRs remain poorly understood. Here, we analyze a dataset of >30,000 DGRs from public metagenomes, establish six major lineages of DGRs, including three primarily encoded by phages and seemingly used to diversify host attachment proteins, and demonstrate that DGRs are broadly active and responsible for >10% of all amino acid changes in some organisms. Overall, these results highlight the constraints under which DGRs evolve and elucidate several distinct roles these elements play in natural communities.