The "5' end mRNA artifact" issue refers to the incorrect assignment of the first AUG codon in an mRNA, due to the incomplete determination of its 5' end sequence. We performed a systematic identification of coding regions at the 5' end of all human known mRNAs, using an automated expressed sequence tag (EST)-based approach. Following parsing of more than 7million BLAT alignments, we found 477 human loci, out of 18,665 analyzed, in which an extension of the mRNA 5' coding region was identified. Proof-of-concept confirmation was obtained by in vitro cloning and sequencing for GNB2L1, QARS and TDP2 cDNAs, and the consequences for the functional studies of these loci are discussed. We also generated a list of 20,775 human mRNAs where the presence of an in-frame stop codon upstream of the known start codon indicates completeness of the coding sequence at 5' in the current form.

Genome-scale analysis of human mRNA 5' coding sequences based on expressed sequence tag (EST) database / Casadei R.; Piovesan A.; Vitale L.; Facchin F.; Pelleri M.C.; Canaider S.; Bianconi E.; Frabetti F.; Strippoli P.. - In: GENOMICS. - ISSN 0888-7543. - STAMPA. - 100:2(2012), pp. 125-130. [10.1016/j.ygeno.2012.05.012]

Genome-scale analysis of human mRNA 5' coding sequences based on expressed sequence tag (EST) database.

CASADEI, RAFFAELLA;PIOVESAN, ALLISON;VITALE, LORENZA;FACCHIN, FEDERICA;PELLERI, MARIA CHIARA;CANAIDER, SILVIA;BIANCONI, EVA;FRABETTI, FLAVIA;STRIPPOLI, PIERLUIGI
2012

Abstract

The "5' end mRNA artifact" issue refers to the incorrect assignment of the first AUG codon in an mRNA, due to the incomplete determination of its 5' end sequence. We performed a systematic identification of coding regions at the 5' end of all human known mRNAs, using an automated expressed sequence tag (EST)-based approach. Following parsing of more than 7million BLAT alignments, we found 477 human loci, out of 18,665 analyzed, in which an extension of the mRNA 5' coding region was identified. Proof-of-concept confirmation was obtained by in vitro cloning and sequencing for GNB2L1, QARS and TDP2 cDNAs, and the consequences for the functional studies of these loci are discussed. We also generated a list of 20,775 human mRNAs where the presence of an in-frame stop codon upstream of the known start codon indicates completeness of the coding sequence at 5' in the current form.
2012
Genome-scale analysis of human mRNA 5' coding sequences based on expressed sequence tag (EST) database / Casadei R.; Piovesan A.; Vitale L.; Facchin F.; Pelleri M.C.; Canaider S.; Bianconi E.; Frabetti F.; Strippoli P.. - In: GENOMICS. - ISSN 0888-7543. - STAMPA. - 100:2(2012), pp. 125-130. [10.1016/j.ygeno.2012.05.012]
Casadei R.; Piovesan A.; Vitale L.; Facchin F.; Pelleri M.C.; Canaider S.; Bianconi E.; Frabetti F.; Strippoli P.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/119984
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? 7
  • Scopus 12
  • ???jsp.display-item.citation.isi??? 11
social impact