RNA Accessibility in cubic time
7 pages
English

Découvre YouScribe en t'inscrivant gratuitement

Je m'inscris

RNA Accessibility in cubic time

-

Découvre YouScribe en t'inscrivant gratuitement

Je m'inscris
Obtenez un accès à la bibliothèque pour le consulter en ligne
En savoir plus
7 pages
English
Obtenez un accès à la bibliothèque pour le consulter en ligne
En savoir plus

Description

The accessibility of RNA binding motifs controls the efficacy of many biological processes. Examples are the binding of miRNA, siRNA or bacterial sRNA to their respective targets. Similarly, the accessibility of the Shine-Dalgarno sequence is essential for translation to start in prokaryotes. Furthermore, many classes of RNA binding proteins require the binding site to be single-stranded. Results We introduce a way to compute the accessibility of all intervals within an RNA sequence in ( n 3 ) time. This improves on previous implementations where only intervals of one defined length were computed in the same time. While the algorithm is in the same efficiency class as sampling approaches, the results, especially if the probabilities get small, are much more exact. Conclusions Our algorithm significantly speeds up methods for the prediction of RNA-RNA interactions and other applications that require the accessibility of RNA molecules. The algorithm is already available in the program RNAplfold of the ViennaRNA package.

Informations

Publié par
Publié le 01 janvier 2011
Nombre de lectures 244
Langue English

Extrait

Bernhartet al.Algorithms for Molecular Biology2011,6:3 http://www.almob.org/content/6/1/3
R E S E A R C H RNA Accessibility in cubic time * Stephan H Bernhart , Ullrike Mückstein, Ivo L Hofacker
Open Access
Abstract Background:The accessibility of RNA binding motifs controls the efficacy of many biological processes. Examples are the binding of miRNA, siRNA or bacterial sRNA to their respective targets. Similarly, the accessibility of the ShineDalgarno sequence is essential for translation to start in prokaryotes. Furthermore, many classes of RNA binding proteins require the binding site to be singlestranded. 3 Results:We introduce a way to compute the accessibility of all intervals within an RNA sequence inO(n) time. This improves on previous implementations where only intervals of one defined length were computed in the same time. While the algorithm is in the same efficiency class as sampling approaches, the results, especially if the probabilities get small, are much more exact. Conclusions:Our algorithm significantly speeds up methods for the prediction of RNARNA interactions and other applications that require the accessibility of RNA molecules. The algorithm is already available in the program RNAplfold of the ViennaRNA package.
Background The importance of RNA within living cells has been rea lized in the last two decades. RNA provides a layer of regulation in eukaryotes, e.g. via miRNA, but also in prokaryotes via small RNAs (sRNAs) and riboswitches. Many of these regulatory functions are mediated by RNA interactions. These interactions are mainly realized through WatsonCrick or wobble base pairing between two RNA molecules. For the initialization of these inter actions, a part of the interacting molecules has to be singlestranded. The tendency to be singlestranded is thus also important for the quality of putative target sites of miRNAs [1], siRNAs [2] and most probably sRNAs. Furthermore, the accessibilities of the Shine Dalgarno sequence and the start codons are indicators of translational efficacy [3]. In addition, RNA accessibil ity will also influence the efficacy of singlestrand bind ing proteins likeHuR[4]. As it is not known how big exactly a putative target site is, and where it is located, it is best to know the accessibilities of all possible inter vals within a RNA molecule. In particular, programs like RNAup [5,6] or IntaRNA [7] predict RNARNA interac tions by computing a total binding energyδGtot=δGint +δGopen, composed of a stabilizing energy for the
* Correspondence: berni@tbi.univie.ac.at Theoretical Biochemistry group, Institute for theoretical chemistry, University of Vienna, Währingerstrasse 17, Vienna, Austria
intermolecular duplexδGintand the cost of opening the binding sitesδGopen. The opening energy can be com puted from the accessibility, defined as the probability u pthat the binding site is unpaired in equilibrium, via u δGopen= RTln(p). The most naive approach to compute the accessibility of a certain stretch of bases is to use a constrained fold ing where no base pairs are allowed within a certain stretch of bases and dividing the respective restricted partition function by the unrestricted one. This is done for example in the miRNA target predictor PITA [8]. 2 However, doing this for allnpossible intervals requires 5 O(n) time. Ding and Lawrence [9] proposed to com pute accessibilities by stochastically sampling structures from the Boltzmann ensemble. Sampling structures can 3 be done inO(n), but necessarily introduces sampling errors which become large if the accessibilities get small, as is necessarily the case for longer regions. In [10], we introduced an algorithm that computes the accessibil ities of all intervals of a given lengthlin cubic time. 4 This leads to aO(n) algorithm when applied to inter vals of all possible lengths. In addition, the algorithm could be used as a scanning algorithm that considers only local structures of a maximum lengthLand runs 2 inO(nL l). Here we introduce an algorithm to compute the accessibilities or singlestrandedness of all intervals of
© 2011 Bernhart et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • Univers Univers
  • Ebooks Ebooks
  • Livres audio Livres audio
  • Presse Presse
  • Podcasts Podcasts
  • BD BD
  • Documents Documents