An iteration normalization and test method for differential expression analysis of RNA-seq data

Yan Zhou, Nan Lin, Baoxue Zhang

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Background: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these measures across samples or libraries, we first need to normalize read counts to adjust for varying sample sequencing depths and other potentially technical effects. Results: In this paper, we develop a normalization method based on iterating median of M-values (IMM) for detecting the differentially expressed (DE) genes. Compared to a previous approach TMM, the IMM method improves the accuracy of DE detection. Simulation studies show that the IMM method outperforms other methods for the sample normalization. We also look into the real data and find that the genes detected by IMM but not by TMM are much more accurate than the genes detected by TMM but not by IMM. What's more, we discovered that gene UNC5C is highly associated with kidney cancer and so on.

Original languageEnglish
Article number15
JournalBioData Mining
Volume7
Issue number1
DOIs
StatePublished - Aug 13 2014

Keywords

  • Expression level
  • IMM
  • Normalize
  • RNA-seq
  • TMM

Fingerprint

Dive into the research topics of 'An iteration normalization and test method for differential expression analysis of RNA-seq data'. Together they form a unique fingerprint.

Cite this