Periodicity in nucleotide sequences arises from regular repeating patterns that may reflect important structure and function. Although a three-base periodicity in coding regions has been known for some time and has been used as the basis for powerful gene prediction algorithms, its origins are still not fully understood. Here we show that, contrary to common belief, amino acid bias and codon usage bias are insucient to create base-three periodicity. This paper rigorously applies the method of spectral envelope to systematically characterize the contributions of codon bias, amino acid bias, and protein structural motifs to the three-base periodicity of coding sequences. Additionally, we show how the method of spectral envelope can be used to trace the evolution of viral genomes and monitor global sequence changes without having to align to previously known genomes. This method also detects reassortment events, such as those that led to the 2009 pandemic H1N1 virus.
Cis-regulatory element (cRE) mutations underlie hereditary cancers and may contribute significantly to sporadic cancer, but cancer genetics research to date has primarily focused on the protein coding exome. Non-coding mutations can contribute to tumorigenesis by altering transcription factor (TF) binding and the expression level of their target gene(s). Determining which cRE mutations are functional and act as drivers of a malignant phenotype in part depends on the cell type-specific chromatin state and the presence of relevant transcription factors. Only a fraction of cREs in the genome are active in a given cell type, and many show little conservation between species, making their identification difficult using comparative genomics alone. Recent advances in mapping histone modifications using ChIP-seq allows for the rapid identification of active cREs in any accessible tissue type. We generated these “chromatin state” maps from primary Glioblastoma (GBM) and adult normal brain tissues, and intersected publicly available GBM whole genome and transcriptome sequencing data to identify novel mutations in cREs. We identified the TERT promoter to be the most commonly mutated cRE in GBM patients. TERT gene expression is critical for the modulation of telomerase activity, the enzyme responsible for telomere maintenance, and TERT promoter mutations are among the most common genetic alterations observed across multiple cancer types. We identify the functional consequence of these mutations in GBM to be recruitment of the multimeric GABP transcription factor specifically to the mutant promoter. Allelic recruitment of GABP is consistently observed across four cancer types, highlighting a shared mechanism underlying TERT reactivation. This study provides an experimental and computational framework to identify driver cRE mutations that is widely applicable to most other tumor types.
Cookie SettingseScholarship uses cookies to ensure you have the best experience on our website. You can manage which cookies you want us to use.Our Privacy Statement includes more details on the cookies we use and how we protect your privacy.