- Rupp, Oliver;
- MacDonald, Madolyn L;
- Li, Shangzhong;
- Dhiman, Heena;
- Polson, Shawn;
- Griep, Sven;
- Heffner, Kelley;
- Hernandez, Inmaculada;
- Brinkrolf, Karina;
- Jadhav, Vaibhav;
- Samoudi, Mojtaba;
- Hao, Haiping;
- Kingham, Brewster;
- Goesmann, Alexander;
- Betenbaugh, Michael J;
- Lewis, Nathan E;
- Borth, Nicole;
- Lee, Kelvin H
Accurate and complete genome sequences are essential in biotechnology to facilitate genome-based cell engineering efforts. The current genome assemblies for Cricetulus griseus, the Chinese hamster, are fragmented and replete with gap sequences and misassemblies, consistent with most short-read-based assemblies. Here, we completely resequenced C. griseus using single molecule real time sequencing and merged this with Illumina-based assemblies. This generated a more contiguous and complete genome assembly than either technology alone, reducing the number of scaffolds by >28-fold, with 90% of the sequence in the 122 longest scaffolds. Most genes are now found in single scaffolds, including up- and downstream regulatory elements, enabling improved study of noncoding regions. With >95% of the gap sequence filled, important Chinese hamster ovary cell mutations have been detected in draft assembly gaps. This new assembly will be an invaluable resource for continued basic and pharmaceutical research.