Conference Dates

May 8-13, 2016

Abstract

Recent advances in next-generation sequencing technologies have led to the emergence of RNA-Seq as the preferred transcriptomic tool in the biopharmaceutical industry. However, an important challenge with deploying RNA-Seq to characterize CHO cells is the absence of a common genomic reference for this species. In most published CHO cell transcriptomic studies, RNA-Seq reads are assembled into de novo genomic references which were subsequently used for mapping of the constituent reads. Such an approach makes it difficult to compare results across studies due to the incomplete and non-universal nature of those assemblies. To address this challenge, we evaluated two publicly available genomes and their derived transcriptomes at the NCBI Reference Sequence Database (RefSeq), including CHO-K1 genome (GCF_000223135.1) and Chinese hamster genome (GCF_000419365.1). When applied for a diverse set of 60 RNA-Seq samples, each with approximately 40 million reads, both genomes showed significantly better mapping rates (~75%) compared to their derived transcriptomes (53-63%). Despite similar annotation, gene content, and KEGG pathway coverage level in both genomes, only 69% of overlapping genes between these two genomes had consistent quantification (i.e., read count) across 60 RNA-Seq samples. Examining genes with quantification discrepancies in a genome browser provides an effective avenue to identify targets for potential genome improvement. Two metrics were proposed to assess the genome-specific difference (consistency) and the sample-specific difference (stringency). Genes with low stringency can introduce biases during the identification of differentially expressed genes and pathways. Given that both genomes for CHO cells are still incomplete, we propose utilization of both in RNA-Seq data analyses until a universal reference with refined genome assembly and gene model annotation is generated.

Recommended Citation

Huong Le, Chun Chen, and Chetan Goudar, "Evaluation of public genome references for RNA-Seq data analysis in Chinese Hamster ovary cells" in "Cell Culture Engineering XV", Robert Kiss, Genentech Sarah Harcum, Clemson University Jeff Chalmers, Ohio State University Eds, ECI Symposium Series, (2016). https://dc.engconfintl.org/cellculture_xv/169

Download

Included in

Biomedical Engineering and Bioengineering Commons

COinS

Cell Culture Engineering XV

Evaluation of public genome references for RNA-Seq data analysis in Chinese Hamster ovary cells

Conference Dates

Abstract

Recommended Citation

Included in

Search

Browse

Additional Information

Cell Culture Engineering XV

Evaluation of public genome references for RNA-Seq data analysis in Chinese Hamster ovary cells

Authors

Conference Dates

Abstract

Recommended Citation

Included in

Share

Search

Browse

Additional Information