Background: Recent studies have demonstrated an unexpected complexity of transcription in eukaryotes. The majority of the genome is transcribed and only a little fraction of these transcripts is annotated as protein coding genes and their splice variants. Indeed, most transcripts are the result of antisense, overlapping and non-coding RNA expression. In this frame, one of the key aims of high throughput transcriptome sequencing is the detection of all RNA species present in the cell and the first crucial step for RNA-seq users is represented by the choice of the strategy for cDNA library construction. The protocols developed so far provide the utilization of the entire library for a single sequencing run with a specific platform. Results: We set up a unique protocol to generate and amplify a strand-specific cDNA library representative of all RNA species that may be implemented with all major platforms currently available on the market (Roche 454, Illumina, ABI/SOLiD). Our method is reproducible, fast, easy-to-perform and even allows to start from low input total RNA. Furthermore, we provide a suitable bioinformatics tool for the analysis of the sequences produced following this protocol. Conclusion: We tested the efficiency of our strategy, showing that our method is platform-independent, thus allowing the simultaneous analysis of the same sample with different NGS technologies, and providing an accurate quantitative and qualitative portrait of complex whole transcriptomes.
A platform independent RNA-Seq protocol for the detection of transcriptome complexity
KURELAC, IVANA;GASPARRE, GIUSEPPE;PORCELLI, ANNA MARIA;
2013
Abstract
Background: Recent studies have demonstrated an unexpected complexity of transcription in eukaryotes. The majority of the genome is transcribed and only a little fraction of these transcripts is annotated as protein coding genes and their splice variants. Indeed, most transcripts are the result of antisense, overlapping and non-coding RNA expression. In this frame, one of the key aims of high throughput transcriptome sequencing is the detection of all RNA species present in the cell and the first crucial step for RNA-seq users is represented by the choice of the strategy for cDNA library construction. The protocols developed so far provide the utilization of the entire library for a single sequencing run with a specific platform. Results: We set up a unique protocol to generate and amplify a strand-specific cDNA library representative of all RNA species that may be implemented with all major platforms currently available on the market (Roche 454, Illumina, ABI/SOLiD). Our method is reproducible, fast, easy-to-perform and even allows to start from low input total RNA. Furthermore, we provide a suitable bioinformatics tool for the analysis of the sequences produced following this protocol. Conclusion: We tested the efficiency of our strategy, showing that our method is platform-independent, thus allowing the simultaneous analysis of the same sample with different NGS technologies, and providing an accurate quantitative and qualitative portrait of complex whole transcriptomes.File | Dimensione | Formato | |
---|---|---|---|
1471-2164-14-855.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
1.01 MB
Formato
Adobe PDF
|
1.01 MB | Adobe PDF | Visualizza/Apri |
12864_2013_5558_MOESM1_ESM.doc
accesso aperto
Descrizione: This file contains (I) Reads length distribution within the two OST samples; (II) Type of tags found within the two samples; (III) Comparison between 454 read length distributions obtained with the Roche standard cDNA library preparation and our protocol; (V) Tags distribution among all the reads sequenced; (VI) Real Time PCR primers sequences.
Tipo:
File Supplementare
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
2.83 MB
Formato
Microsoft Word
|
2.83 MB | Microsoft Word | Visualizza/Apri |
12864_2013_5558_MOESM2_ESM.xls
accesso aperto
Descrizione: The file contains 6 sheets: (2a) Respiratory Chain genes found differentially expressed and related RPKM values; (2b-2c) Number of genes detected with both 454 and MiSeq, in both the OST samples; (2d-2e) Number of histone genes detected with both 454 and MiSeq, in both the OST samples; (2f) Number of genic loci with at least one read mapped in antisense.
Tipo:
File Supplementare
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
6.02 MB
Formato
Microsoft Excel
|
6.02 MB | Microsoft Excel | Visualizza/Apri |
12864_2013_5558_MOESM3_ESM.doc
accesso aperto
Descrizione: This file contains (I) Tag Find input and usage; (II) Tag Find output description; (III) 454 simulated dataset composition and Tag Find accuracy
Tipo:
File Supplementare
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
31.5 kB
Formato
Microsoft Word
|
31.5 kB | Microsoft Word | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.