dc.contributorSistema FMUSP-HC: Faculdade de Medicina da Universidade de São Paulo (FMUSP) e Hospital das Clínicas da FMUSP-
dc.contributor.authorTAVARES, Raphael-
dc.contributor.authorRENAUD, Gabriel-
dc.contributor.authorOLIVEIRA, Paulo Sergio Lopes-
dc.contributor.authorFERREIRA, Carlos G.-
dc.contributor.authorDIAS-NETO, Emmanuel-
dc.contributor.authorPASSETTI, Fabio-
dc.identifier.citationCOMPUTATIONAL BIOLOGY AND CHEMISTRY, v.36, p.55-61, 2012-
dc.description.abstractIntron splicing is one of the most important steps in the maturation of a pre-mRNA. Although the sequence profiles around the splice sites have been studied extensively, the levels of sequence identity between the exonic sequences preceding the donor sites and the intronic sequences preceding the acceptor sites have not been examined as thoroughly. In this study we investigated identity patterns between the last 15 nucleotides of the exonic sequence preceding the 5' splice site and the intronic sequence preceding the 3' splice site in a set of human protein-coding genes that do not exhibit intron retention. We found that almost 60% of consecutive exons and introns in human protein-coding genes share at least two identical nucleotides at their 3' ends and that, on average, the sequence identity length is 2.47 nucleotides. Based on our findings we conclude that the 3' ends of exons and introns tend to have longer identical sequences when taken from the same gene than when taken from different genes. Our results hold even if the pairs are non-consecutive in the transcription order.-
dc.description.sponsorshipCNPq [382791/2009-6]-
dc.description.sponsorshipDECIT/SCTIE/MS [577593/2008-0, 312733/2009-7]-
dc.description.sponsorshipSwiss Bridge Foundation-
dc.description.sponsorshipFundacao do Cancer-
dc.description.sponsorshipAssociacao Beneficente Alzira Denise Hertzog da Silva (ABADHS) [FMUSP-HC/LIM-27]-
dc.publisherELSEVIER SCI LTD-
dc.relation.ispartofComputational Biology and Chemistry-
dc.subjectSequence analysis-
dc.subject.otherspliceosomal introns-
dc.titleIdentical sequence patterns in the ends of exons and introns of human protein-coding genes-
dc.rights.holderCopyright ELSEVIER SCI LTD-
dc.subject.wosComputer Science, Interdisciplinary Applications-
dc.type.categoryoriginal article-
dc.type.versionpublishedVersion-
TAVARES, Raphael:Inst Nacl Canc INCA, Bioinformat Unit, BR-20231050 Rio De Janeiro, RJ, Brazil-
RENAUD, Gabriel:Inst Nacl Canc INCA, Bioinformat Unit, BR-20231050 Rio De Janeiro, RJ, Brazil-
OLIVEIRA, Paulo Sergio Lopes:Lab Nacl Biociencias, BR-13083970 Campinas, SP, Brazil-
PASSETTI, Fabio:Inst Nacl Canc INCA, Bioinformat Unit, BR-20231050 Rio De Janeiro, RJ, Brazil-
hcfmusp.remissive.sponsorshipMinistério da Saúde-
