- Woo, Brian;
- Moussavi-Baygi, Ruhollah;
- Karner, Heather;
- Karimzadeh, Mehran;
- Yousefi, Hassan;
- Lee, Sean;
- Garcia, Kristle;
- Joshi, Tanvi;
- Yin, Keyi;
- Navickas, Albertas;
- Gilbert, Luke;
- Wang, Bo;
- Asgharian, Hosseinali;
- Feng, Felix;
- Goodarzi, Hani
Large-scale sequencing efforts have been undertaken to understand the mutational landscape of the coding genome. However, the vast majority of variants occur within non-coding genomic regions. We designed an integrative computational and experimental framework to identify recurrently mutated non-coding regulatory regions that drive tumor progression. Applying this framework to sequencing data from a large prostate cancer patient cohort revealed a large set of candidate drivers. We used (1) in silico analyses, (2) massively parallel reporter assays, and (3) in vivo CRISPR interference screens to systematically validate metastatic castration-resistant prostate cancer (mCRPC) drivers. One identified enhancer region, GH22I030351, acts on a bidirectional promoter to simultaneously modulate expression of the U2-associated splicing factor SF3A1 and chromosomal protein CCDC157. SF3A1 and CCDC157 promote tumor growth in vivo. We nominated a number of transcription factors, notably SOX6, to regulate expression of SF3A1 and CCDC157. Our integrative approach enables the systematic detection of non-coding regulatory regions that drive human cancers.