"Evaluation of public genome references for RNA-seq data analysis in Ch" by Huong Le, Chun Chen et al.

Cell Culture Engineering XV

Title

Evaluation of public genome references for RNA-seq data analysis in Chinese hamster ovary cells

Authors

Huong Le, AmgenFollow
Chun Chen, Amgen
Chetan Goudar, Amgen

Conference Dates

May 8-13, 2016

Abstract

Recent advances in next-generation sequencing technologies have led to the emergence of RNA-Seq as the preferred transcriptomic tool in the biopharmaceutical industry. However, an important challenge with deploying RNA-Seq to characterize CHO cells is the absence of a common genomic reference for this species. In most published CHO cell transcriptomic studies, RNA-Seq reads are assembled into de novo genomic references which were subsequently used for mapping of the constituent reads. Such an approach makes it difficult to compare results across studies due to the incomplete and non-universal nature of those assemblies. To address this challenge, we evaluated two publicly available genomes and their derived transcriptomes at the NCBI Reference Sequence Database (RefSeq), including CHO-K1 genome (GCF_000223135.1) and Chinese hamster genome (GCF_000419365.1). When applied for a diverse set of 60 RNA-Seq samples, each with approximately 40 million reads, both genomes showed significantly better mapping rates (~75%) compared to their derived transcriptomes (53-63%). Despite similar annotation, gene content, and KEGG pathway coverage level in both genomes, only 69% of overlapping genes between these two genomes had consistent quantification (i.e., read count) across 60 RNA-Seq samples. Examining genes with quantification discrepancies in a genome browser provides an effective avenue to identify targets for potential genome improvement. Two metrics were proposed to assess the genome-specific difference (consistency) and the sample-specific difference (stringency). Genes with low stringency can introduce biases during the identification of differentially expressed genes and pathways. Given that both genomes for CHO cells are still incomplete, we propose utilization of both in RNA-Seq data analyses until a universal reference with refined genome assembly and gene model annotation is generated.

Recommended Citation

Huong Le, Chun Chen, and Chetan Goudar, "Evaluation of public genome references for RNA-seq data analysis in Chinese hamster ovary cells" in "Cell Culture Engineering XV", Robert Kiss, Genentech Sarah Harcum, Clemson University Jeff Chalmers, Ohio State University Eds, ECI Symposium Series, (2016). https://dc.engconfintl.org/cellculture_xv/35

Download

Included in

Biomedical Engineering and Bioengineering Commons

COinS

ECI Digital Archives

Cell Culture Engineering XV

Title

Authors

Conference Dates

Abstract

Recommended Citation

Included in

Search

Browse

Additional Information

ECI Digital Archives

Cell Culture Engineering XV

Title

Authors

Conference Dates

Abstract

Recommended Citation

Included in

Share

Search

Browse

Additional Information