Skip to main content
eScholarship
Open Access Publications from the University of California

Identification and characterization of constrained non-exonic bases lacking predictive epigenomic and transcription factor binding annotations.

  • Author(s): Grujic, Olivera
  • Phung, Tanya N
  • Kwon, Soo Bin
  • Arneson, Adriana
  • Lee, Yuju
  • Lohmueller, Kirk E
  • Ernst, Jason
  • et al.
Abstract

Annotations of evolutionary sequence constraint based on multi-species genome alignments and genome-wide maps of epigenomic marks and transcription factor binding provide important complementary information for understanding the human genome and genetic variation. Here we developed the Constrained Non-Exonic Predictor (CNEP) to quantify the evidence of each base in the genome being in an evolutionarily constrained non-exonic element from an input of over 60,000 epigenomic and transcription factor binding features. We find that the CNEP score outperforms baseline and related existing scores at predicting evolutionarily constrained non-exonic bases from such data. However, a subset of them are still not well predicted by CNEP. We developed a complementary Conservation Signature Score by CNEP (CSS-CNEP) that is predictive of those bases. We further characterize the nature of constrained non-exonic bases with low CNEP scores using additional types of information. CNEP and CSS-CNEP are resources for analyzing constrained non-exonic bases in the genome.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content
Current View