- Klopfenstein, DV;
- Zhang, Liangsheng;
- Pedersen, Brent S;
- Ramírez, Fidel;
- Warwick Vesztrocy, Alex;
- Naldi, Aurélien;
- Mungall, Christopher J;
- Yunes, Jeffrey M;
- Botvinnik, Olga;
- Weigel, Mark;
- Dampier, Will;
- Dessimoz, Christophe;
- Flick, Patrick;
- Tang, Haibao
The biological interpretation of gene lists with interesting shared properties, such as up- or down-regulation in a particular experiment, is typically accomplished using gene ontology enrichment analysis tools. Given a list of genes, a gene ontology (GO) enrichment analysis may return hundreds of statistically significant GO results in a "flat" list, which can be challenging to summarize. It can also be difficult to keep pace with rapidly expanding biological knowledge, which often results in daily changes to any of the over 47,000 gene ontologies that describe biological knowledge. GOATOOLS, a Python-based library, makes it more efficient to stay current with the latest ontologies and annotations, perform gene ontology enrichment analyses to determine over- and under-represented terms, and organize results for greater clarity and easier interpretation using a novel GOATOOLS GO grouping method. We performed functional analyses on both stochastic simulation data and real data from a published RNA-seq study to compare the enrichment results from GOATOOLS to two other popular tools: DAVID and GOstats. GOATOOLS is freely available through GitHub: https://github.com/tanghaibao/goatools .