Diagenode

Systematic bias in high-throughput sequencing data and its correction by BEADS.


Cheung MS, Down TA, Latorre I, Ahringer J

Genomic sequences obtained through high-throughput sequencing are not uniformly distributed across the genome. For example, sequencing data of total genomic DNA show significant, yet unexpected enrichments on promoters and exons. This systematic bias is a particular problem for techniques such as chromatin immunoprecipitation, where the signal for a target factor is plotted across genomic features. We have focused on data obtained from Illumina's Genome Analyzer platform, where at least three factors contribute to sequence bias: GC content, mappability of sequencing reads, and regional biases that might be generated by local structure. We show that relying on input control as a normalizer is not generally appropriate due to sample-to-sample variation in bias. To correct sequence bias, we present BEADS (bias elimination algorithm for deep sequencing), a simple three-step normalization scheme that successfully unmasks real binding patterns in ChIP-seq data. We suggest that this procedure be done routinely prior to data interpretation and downstream analyses.
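
To make the first two bias factors named above concrete, the following is a minimal sketch of how GC content and mappability could be corrected on per-window read counts. It is an illustration under stated assumptions, not the published BEADS implementation: the function names (gc_correct, mappability_correct), the stratum count, the 0.25 mappability cutoff, and the toy numbers are all hypothetical, and the third factor (regional bias) is not modelled here.

```python
from collections import defaultdict


def gc_correct(counts, gc, n_strata=10):
    """Rescale per-window read counts so every GC stratum has the same mean.

    Hypothetical helper for illustration; not taken from BEADS itself.
    counts: read count per genomic window
    gc:     GC fraction (0..1) of the same windows
    """
    if len(counts) != len(gc):
        raise ValueError("counts and gc must have the same length")
    # Assign each window to a GC stratum.
    strata = [min(int(g * n_strata), n_strata - 1) for g in gc]
    totals = defaultdict(float)
    sizes = defaultdict(int)
    for s, c in zip(strata, counts):
        totals[s] += c
        sizes[s] += 1
    global_mean = sum(counts) / len(counts)
    corrected = []
    for s, c in zip(strata, counts):
        stratum_mean = totals[s] / sizes[s]
        # A stratum with no reads carries no signal; leave it at zero.
        corrected.append(c * global_mean / stratum_mean if stratum_mean > 0 else 0.0)
    return corrected


def mappability_correct(counts, mappability, min_mappability=0.25):
    """Divide counts by the mappable fraction of each window.

    Windows below the (assumed) cutoff are masked with None rather than
    inflated by a very small divisor.
    """
    return [c / m if m >= min_mappability else None
            for c, m in zip(counts, mappability)]


# Toy data: the two GC-rich windows are over-represented threefold on average.
counts = [10, 20, 30, 60]
gc = [0.35, 0.35, 0.65, 0.65]
mappability = [1.0, 0.5, 0.1, 1.0]

gc_fixed = gc_correct(counts, gc)
final = mappability_correct(gc_fixed, mappability)
print(gc_fixed)  # [20.0, 40.0, 20.0, 40.0]
print(final)     # [20.0, 80.0, None, 40.0]
```

After the GC step both strata share the same mean, so the remaining differences between windows no longer track GC content. The mappability step then scales up the half-mappable window and masks the window that is almost entirely unmappable.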

Tags
Bioruptor
Chromatin Shearing
ChIP-seq

Published
August, 2011
