- Zhu, Qiyun;
- Mai, Uyen;
- Pfeiffer, Wayne;
- Janssen, Stefan;
- Asnicar, Francesco;
- Sanders, Jon G;
- Belda-Ferre, Pedro;
- Al-Ghalith, Gabriel A;
- Kopylova, Evguenia;
- McDonald, Daniel;
- Kosciolek, Tomasz;
- Yin, John B;
- Huang, Shi;
- Salam, Nimaichand;
- Jiao, Jian-Yu;
- Wu, Zijun;
- Xu, Zhenjiang Z;
- Cantrell, Kalen;
- Yang, Yimeng;
- Sayyari, Erfan;
- Rabiee, Maryam;
- Morton, James T;
- Podell, Sheila;
- Knights, Dan;
- Li, Wen-Jun;
- Huttenhower, Curtis;
- Segata, Nicola;
- Smarr, Larry;
- Mirarab, Siavash;
- Knight, Rob
Rapid growth of genome data provides opportunities for updating microbial evolutionary relationships, but this is challenged by the discordant evolution of individual genes. Here we build a reference phylogeny of 10,575 evenly-sampled bacterial and archaeal genomes, based on a comprehensive set of 381 markers, using multiple strategies. Our trees indicate remarkably closer evolutionary proximity between Archaea and Bacteria than previous estimates that were limited to fewer "core" genes, such as the ribosomal proteins. The robustness of the results was tested with respect to several variables, including taxon and site sampling, amino acid substitution heterogeneity and saturation, non-vertical evolution, and the impact of exclusion of candidate phyla radiation (CPR) taxa. Our results provide an updated view of domain-level relationships.