Skip to main content
eScholarship
Open Access Publications from the University of California
Cover page of Visualizing metagenomic and metatranscriptomic data: A comprehensive review.

Visualizing metagenomic and metatranscriptomic data: A comprehensive review.

(2024)

The fields of Metagenomics and Metatranscriptomics involve the examination of complete nucleotide sequences, gene identification, and analysis of potential biological functions within diverse organisms or environmental samples. Despite the vast opportunities for discovery in metagenomics, the sheer volume and complexity of sequence data often present challenges in processing analysis and visualization. This article highlights the critical role of advanced visualization tools in enabling effective exploration, querying, and analysis of these complex datasets. Emphasizing the importance of accessibility, the article categorizes various visualizers based on their intended applications and highlights their utility in empowering bioinformaticians and non-bioinformaticians to interpret and derive insights from meta-omics data effectively.

Cover page of kmerDB: A database encompassing the set of genomic and proteomic sequence information for each species.

kmerDB: A database encompassing the set of genomic and proteomic sequence information for each species.

(2024)

The decrease in sequencing expenses has facilitated the creation of reference genomes and proteomes for an expanding array of organisms. Nevertheless, no established repository that details organism-specific genomic and proteomic sequences of specific lengths, referred to as kmers, exists to our knowledge. In this article, we present kmerDB, a database accessible through an interactive web interface that provides kmer-based information from genomic and proteomic sequences in a systematic way. kmerDB currently contains 202,340,859,107 base pairs and 19,304,903,356 amino acids, spanning 54,039 and 21,865 reference genomes and proteomes, respectively, as well as 6,905,362 and 149,305,183 genomic and proteomic species-specific sequences, termed quasi-primes. Additionally, we provide access to 5,186,757 nucleic and 214,904,089 peptide sequences absent from every genome and proteome, termed primes. kmerDB features a user-friendly interface offering various search options and filters for easy parsing and searching. The service is available at: www.kmerdb.com.

Pre-hypertrophic chondrogenic enhancer landscape of limb and axial skeleton development

(2024)

Chondrocyte differentiation controls skeleton development and stature. Here we provide a comprehensive map of chondrocyte-specific enhancers and show that they provide a mechanistic framework through which non-coding genetic variants can influence skeletal development and human stature. Working with fetal chondrocytes isolated from mice bearing a Col2a1 fluorescent regulatory sensor, we identify 780 genes and 2'704 putative enhancers specifically active in chondrocytes using a combination of RNA-seq, ATAC-seq and H3K27ac ChIP-seq. Most of these enhancers (74%) show pan-chondrogenic activity, with smaller populations being restricted to limb (18%) or trunk (8%) chondrocytes only. Notably, genetic variations overlapping these enhancers better explain height differences than those overlapping non-chondrogenic enhancers. Finally, targeted deletions of identified enhancers at the Fgfr3, Col2a1, Hhip and, Nkx3-2 loci confirm their role in regulating cognate genes. This enhancer map provides a framework for understanding how genes and non-coding variations influence bone development and diseases.

Cover page of Genome resources for three modern cotton lines guide future breeding efforts.

Genome resources for three modern cotton lines guide future breeding efforts.

(2024)

Cotton (Gossypium hirsutum L.) is the key renewable fibre crop worldwide, yet its yield and fibre quality show high variability due to genotype-specific traits and complex interactions among cultivars, management practices and environmental factors. Modern breeding practices may limit future yield gains due to a narrow founding gene pool. Precision breeding and biotechnological approaches offer potential solutions, contingent on accurate cultivar-specific data. Here we address this need by generating high-quality reference genomes for three modern cotton cultivars (UGA230, UA48 and CSX8308) and updating the TM-1 cotton genetic standard reference. Despite hypothesized genetic uniformity, considerable sequence and structural variation was observed among the four genomes, which overlap with ancient and ongoing genomic introgressions from Pima cotton, gene regulatory mechanisms and phenotypic trait divergence. Differentially expressed genes across fibre development correlate with fibre production, potentially contributing to the distinctive fibre quality traits observed in modern cotton cultivars. These genomes and comparative analyses provide a valuable foundation for future genetic endeavours to enhance global cotton yield and sustainability.

Cover page of Evolutionary history of arbuscular mycorrhizal fungi and genomic signatures of obligate symbiosis.

Evolutionary history of arbuscular mycorrhizal fungi and genomic signatures of obligate symbiosis.

(2024)

BACKGROUND: The colonization of land and the diversification of terrestrial plants is intimately linked to the evolutionary history of their symbiotic fungal partners. Extant representatives of these fungal lineages include mutualistic plant symbionts, the arbuscular mycorrhizal (AM) fungi in Glomeromycota and fine root endophytes in Endogonales (Mucoromycota), as well as fungi with saprotrophic, pathogenic and endophytic lifestyles. These fungal groups separate into three monophyletic lineages but their evolutionary relationships remain enigmatic confounding ancestral reconstructions. Their taxonomic ranks are currently fluid. RESULTS: In this study, we recognize these three monophyletic linages as phyla, and use a balanced taxon sampling and broad taxonomic representation for phylogenomic analysis that rejects a hard polytomy and resolves Glomeromycota as sister to a clade composed of Mucoromycota and Mortierellomycota. Low copy numbers of genes associated with plant cell wall degradation could not be assigned to the transition to a plant symbiotic lifestyle but appears to be an ancestral phylogenetic signal. Both plant symbiotic lineages, Glomeromycota and Endogonales, lack numerous thiamine metabolism genes but the lack of fatty acid synthesis genes is specific to AM fungi. Many genes previously thought to be missing specifically in Glomeromycota are either missing in all analyzed phyla, or in some cases, are actually present in some of the analyzed AM fungal lineages, e.g. the high affinity phosphorus transporter Pho89. CONCLUSION: Based on a broad taxon sampling of fungal genomes we present a well-supported phylogeny for AM fungi and their sister lineages. We show that among these lineages, two independent evolutionary transitions to mutualistic plant symbiosis happened in a genomic background profoundly different from that known from the emergence of ectomycorrhizal fungi in Dikarya. These results call for further reevaluation of genomic signatures associated with plant symbiosis.

Conserved Cis-Acting Range Extender Element Mediates Extreme Long-Range Enhancer Activity in Mammals

(2024)

While most mammalian enhancers regulate their cognate promoters over moderate distances of tens of kilobases (kb), some enhancers act over distances in the megabase range. The sequence features enabling such extreme-distance enhancer-promoter interactions remain elusive. Here, we used in vivo enhancer replacement experiments in mice to show that short- and medium-range enhancers cannot initiate gene expression at extreme-distance range. We uncover a novel conserved cis-acting element, Range EXtender (REX), that confers extreme-distance regulatory activity and is located next to a long-range enhancer of Sall1. The REX element itself has no endogenous enhancer activity. However, addition of the REX to other short- and mid-range enhancers substantially increases their genomic interaction range. In the most extreme example observed, addition of the REX increased the range of an enhancer by an order of magnitude, from its native 71kb to 840kb. The REX element contains highly conserved [C/T]AATTA homeodomain motifs. These motifs are enriched around long-range limb enhancers genome-wide, including the ZRS, a benchmark long-range limb enhancer of Shh. Mutating the [C/T]AATTA motifs within the ZRS does not affect its limb-specific enhancer activity at short range, but selectively abolishes its long-range activity, resulting in severe limb reduction in knock-in mice. In summary, we identify a sequence signature globally associated with long-range enhancer-promoter interactions and describe a prototypical REX element that is necessary and sufficient to confer extreme-distance gene activation by remote enhancers.

Cover page of Identification and recombinant expression of a cutinase from Papiliotrema laurentii that hydrolyzes natural and synthetic polyesters.

Identification and recombinant expression of a cutinase from Papiliotrema laurentii that hydrolyzes natural and synthetic polyesters.

(2024)

Given the multitude of extracellular enzymes at their disposal, many of which are designed to degrade natures polymers (lignin, cutin, cellulose, etc.), fungi are adept at targeting synthetic polyesters with similar chemical composition. Microbial-influenced deterioration of xenobiotic polymeric surfaces is an area of interest for material scientists as these are important for the conservation of the underlying structural materials. Here, we describe the isolation and characterization of the Papiliotrema laurentii 5307AH (P. laurentii) cutinase, Plcut1. P. laurentii is basidiomycete yeast with the ability to disperse Impranil-DLN (Impranil), a colloidal polyester polyurethane, in agar plates. To test whether the fungal factor involved in this clearing was a secreted enzyme, we screened the ability of P. laurentii culture supernatants to disperse Impranil. Using size exclusion chromatography (SEC), we isolated fractions that contained Impranil-clearing activity. These fractions harbored a single ~22 kD band, which was excised and subjected to peptide sequencing. Homology searches using the peptide sequences identified, revealed that the protein Papla1 543643 (Plcut1) displays similarities to serine esterase and cutinase family of proteins. Biochemical assays using recombinant Plcut1 confirmed that this enzyme has the capability to hydrolyze Impranil, soluble esterase substrates, and apple cutin. Finally, we confirmed the presence of the Plcut1 in culture supernatants using a custom antibody that specifically recognizes this protein. The work shown here supports a major role for the Plcut1 in the fungal degradation of natural polyesters and xenobiotic polymer surfaces.IMPORTANCEFungi play a vital role in the execution of a broad range of biological processes that drive ecosystem function through production of a diverse arsenal of enzymes. However, the universal reactivity of these enzymes is a current problem for the built environment and the undesired degradation of polymeric materials in protective coatings. Here, we report the identification and characterization of a hydrolase from Papiliotrema laurentii 5307AH, an aircraft-derived fungal isolate found colonizing a biodeteriorated polymer-coated surface. We show that P. laurentii secretes a cutinase capable of hydrolyzing soluble esters as well as ester-based compounds forming solid surface coatings. These findings indicate that this fungus plays a significant role in biodeterioration through the production of a cutinase adept at degrading ester-based polymers, some of which form the backbone of protective surface coatings. The work shown here provides insights into the mechanisms employed by fungi to degrade xenobiotic polymers.

Identification of conserved skeletal enhancers associated with craniosynostosis risk genes

(2024)

Craniosynostosis, defined by premature fusion of one or multiple cranial sutures, is a common congenital defect affecting more than 1/2000 infants and results in restricted brain expansion. Single gene mutations account for 15%-20% of cases, largely as part of a syndrome, but the majority are nonsyndromic with complex underlying genetics. We hypothesized that the two noncoding genomic regions identified by a GWAS for craniosynostosis contain distal regulatory elements for the risk genes BMPER and BMP2. To identify such regulatory elements, we surveyed conserved noncoding sequences from both risk loci for enhancer activity in transgenic Danio rerio. We identified enhancers from both regions that direct expression to skeletal tissues, consistent with the endogenous expression of bmper and bmp2. For each locus, we also found a skeletal enhancer that also contains a sequence variant associated with craniosynostosis risk. We examined the activity of each enhancer during craniofacial development and found that the BMPER-associated enhancer is active in the restricted region of cartilage closely associated with frontal bone initiation. The same enhancer is active in mouse skeletal tissues, demonstrating evolutionarily conserved activity. Using enhanced yeast one-hybrid assays, we identified transcription factors that bind each enhancer and observed differential binding between alleles, implicating multiple signaling pathways. Our findings help unveil the genetic mechanism of the two craniosynostosis risk loci. More broadly, our combined in vivo approach is applicable to many complex genetic diseases to build a link between association studies and specific genetic mechanisms.

Cover page of Genomes of multicellular algal sisters to land plants illuminate signaling network evolution.

Genomes of multicellular algal sisters to land plants illuminate signaling network evolution.

(2024)

Zygnematophyceae are the algal sisters of land plants. Here we sequenced four genomes of filamentous Zygnematophyceae, including chromosome-scale assemblies for three strains of Zygnema circumcarinatum. We inferred traits in the ancestor of Zygnematophyceae and land plants that might have ushered in the conquest of land by plants: expanded genes for signaling cascades, environmental response, and multicellular growth. Zygnematophyceae and land plants share all the major enzymes for cell wall synthesis and remodifications, and gene gains shaped this toolkit. Co-expression network analyses uncover gene cohorts that unite environmental signaling with multicellular developmental programs. Our data shed light on a molecular chassis that balances environmental response and growth modulation across more than 600 million years of streptophyte evolution.

Cover page of A concept for international societally relevant microbiology education and microbiology knowledge promulgation in society.

A concept for international societally relevant microbiology education and microbiology knowledge promulgation in society.

(2024)

EXECUTIVE SUMMARY: Microbes are all pervasive in their distribution and influence on the functioning and well-being of humans, life in general and the planet. Microbially-based technologies contribute hugely to the supply of important goods and services we depend upon, such as the provision of food, medicines and clean water. They also offer mechanisms and strategies to mitigate and solve a wide range of problems and crises facing humanity at all levels, including those encapsulated in the sustainable development goals (SDGs) formulated by the United Nations. For example, microbial technologies can contribute in multiple ways to decarbonisation and hence confronting global warming, provide sanitation and clean water to the billions of people lacking them, improve soil fertility and hence food production and develop vaccines and other medicines to reduce and in some cases eliminate deadly infections. They are the foundation of biotechnology, an increasingly important and growing business sector and source of employment, and the centre of the bioeconomy, Green Deal, etc. But, because microbes are largely invisible, they are not familiar to most people, so opportunities they offer to effectively prevent and solve problems are often missed by decision-makers, with the negative consequences this entrains. To correct this lack of vital knowledge, the International Microbiology Literacy Initiative-the IMiLI-is recruiting from the global microbiology community and making freely available, teaching resources for a curriculum in societally relevant microbiology that can be used at all levels of learning. Its goal is the development of a society that is literate in relevant microbiology and, as a consequence, able to take full advantage of the potential of microbes and minimise the consequences of their negative activities. In addition to teaching about microbes, almost every lesson discusses the influence they have on sustainability and the SDGs and their ability to solve pressing problems of societal inequalities. The curriculum thus teaches about sustainability, societal needs and global citizenship. The lessons also reveal the impacts microbes and their activities have on our daily lives at the personal, family, community, national and global levels and their relevance for decisions at all levels. And, because effective, evidence-based decisions require not only relevant information but also critical and systems thinking, the resources also teach about these key generic aspects of deliberation. The IMiLI teaching resources are learner-centric, not academic microbiology-centric and deal with the microbiology of everyday issues. These span topics as diverse as owning and caring for a companion animal, the vast range of everyday foods that are produced via microbial processes, impressive geological formations created by microbes, childhood illnesses and how they are managed and how to reduce waste and pollution. They also leverage the exceptional excitement of exploration and discovery that typifies much progress in microbiology to capture the interest, inspire and motivate educators and learners alike. The IMiLI is establishing Regional Centres to translate the teaching resources into regional languages and adapt them to regional cultures, and to promote their use and assist educators employing them. Two of these are now operational. The Regional Centres constitute the interface between resource creators and educators-learners. As such, they will collect and analyse feedback from the end-users and transmit this to the resource creators so that teaching materials can be improved and refined, and new resources added in response to demand: educators and learners will thereby be directly involved in evolution of the teaching resources. The interactions between educators-learners and resource creators mediated by the Regional Centres will establish dynamic and synergistic relationships-a global societally relevant microbiology education ecosystem-in which creators also become learners, teaching resources are optimised and all players/stakeholders are empowered and their motivation increased. The IMiLI concept thus embraces the principle of teaching societally relevant microbiology embedded in the wider context of societal, biosphere and planetary needs, inequalities, the range of crises that confront us and the need for improved decisioning, which should ultimately lead to better citizenship and a humanity that is more sustainable and resilient. ABSTRACT: The biosphere of planet Earth is a microbial world: a vast reactor of countless microbially driven chemical transformations and energy transfers that push and pull many planetary geochemical processes, including the cycling of the elements of life, mitigate or amplify climate change (e.g., Nature Reviews Microbiology, 2019, 17, 569) and impact the well-being and activities of all organisms, including humans. Microbes are both our ancestors and creators of the planetary chemistry that allowed us to evolve (e.g., Lifes engines: How microbes made earth habitable, 2023). To understand how the biosphere functions, how humans can influence its development and live more sustainably with the other organisms sharing it, we need to understand the microbes. In a recent editorial (Environmental Microbiology, 2019, 21, 1513), we advocated for improved microbiology literacy in society. Our concept of microbiology literacy is not based on knowledge of the academic subject of microbiology, with its multitude of component topics, plus the growing number of additional topics from other disciplines that become vitally important elements of current microbiology. Rather it is focused on microbial activities that impact us-individuals/communities/nations/the human world-and the biosphere and that are key to reaching informed decisions on a multitude of issues that regularly confront us, ranging from personal issues to crises of global importance. In other words, it is knowledge and understanding essential for adulthood and the transition to it, knowledge and understanding that must be acquired early in life in school. The 2019 Editorial marked the launch of the International Microbiology Literacy Initiative, the IMiLI. HERE, WE PRESENT: our concept of how microbiology literacy may be achieved and the rationale underpinning it; the type of teaching resources being created to realise the concept and the framing of microbial activities treated in these resources in the context of sustainability, societal needs and responsibilities and decision-making; and the key role of Regional Centres that will translate the teaching resources into local languages, adapt them according to local cultural needs, interface with regional educators and develop and serve as hubs of microbiology literacy education networks. The topics featuring in teaching resources are learner-centric and have been selected for their inherent relevance, interest and ability to excite and engage. Importantly, the resources coherently integrate and emphasise the overarching issues of sustainability, stewardship and critical thinking and the pervasive interdependencies of processes. More broadly, the concept emphasises how the multifarious applications of microbial activities can be leveraged to promote human/animal, plant, environmental and planetary health, improve social equity, alleviate humanitarian deficits and causes of conflicts among peoples and increase understanding between peoples (Microbial Biotechnology, 2023, 16(6), 1091-1111). Importantly, although the primary target of the freely available (CC BY-NC 4.0) IMiLI teaching resources is schoolchildren and their educators, they and the teaching philosophy are intended for all ages, abilities and cultural spectra of learners worldwide: in university education, lifelong learning, curiosity-driven, web-based knowledge acquisition and public outreach. The IMiLI teaching resources aim to promote development of a global microbiology education ecosystem that democratises microbiology knowledge.