“NaRCiSuS”

Noncoding RNA Comparative Searching System

Marie Curie European Reintegration Grants (ERG)

Call: FP7-PEOPLE-RG-2009

What are noncoding RNAs?

• RNA transcripts

• without a long translatable open reading frame

• highly structured

• performing their functions as RNA molecules

What do they do?

• RNA-protein machines:
– Transfer RNA (tRNA).
– Ribosomal RNA (rRNA).
– Small nuclear RNAs (snRNAs) in the spliceosome.

• Catalytic RNAs (ribozymes): catalyze chemical reactions.

• Micro RNAs (miRNAs): regulatory roles.

• Small interfering RNAs (siRNAs): RNA silencing.
– The genome’s immune system. [Plasterk, Science (2002)]
– Breakthrough of the Year in Science magazine, 2002.

Protein coding gene prediction

1. Prokaryota

• translation initiation site
• ribosome binding site (Shine-Dalgarno sequence)
• ORF features: length, similarity to known proteins, codon usage, coding capacity

2. Eukaryota

• poly-A site
• 3’ and 5’ UTRs
• exon-intron location, splice sites

ncRNA gene prediction?

• no translation initiation site

• no ribosome binding site

• no Kozak sequence

• no significant ORF

ncRNA gene prediction?

• base composition statistics

• simple translated regions search

• secondary structure based search

• combined comparative approach

RNA secondary structure

• ncRNA is not a random sequence.

• Most RNAs fold into a particular base-paired secondary structure.

• Canonical basepairs:
– Watson-Crick basepairs:
  • G - C
  • A - U
– Wobble basepair:
  • G - U

• Stacks: contiguous nested basepairs (energetically favorable).

• Non-basepaired loops:
– Hairpin loop.
– Bulge.
– Internal loop.
– Multiloop.

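• Most basepairs are non-crossing basepairs.
– Any two pairs (i, j) and (i’, j’) are either nested (i < i’ < j’ < j or i’ < i < j < j’) or disjoint (i < j < i’ < j’ or i’ < j’ < i < j).

• Pseudoknots are formed by crossing basepairs (i < i’ < j < j’ or i’ < i < j’ < j).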
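The non-crossing condition can be checked mechanically. The following is a minimal sketch, assuming base pairs are given as tuples of positions; the function names `pairs_cross` and `has_pseudoknot` are illustrative and do not come from the slides.

```python
def pairs_cross(p, q):
    """Return True if base pairs p and q cross (i < i' < j < j')."""
    # Normalise each pair so that i < j, then order the two pairs by left end.
    (i, j), (i2, j2) = sorted((tuple(sorted(p)), tuple(sorted(q))))
    return i < i2 < j < j2


def has_pseudoknot(pairs):
    """Return True if any two pairs in the structure cross."""
    pairs = list(pairs)
    return any(
        pairs_cross(pairs[a], pairs[b])
        for a in range(len(pairs))
        for b in range(a + 1, len(pairs))
    )


# Nested and disjoint pairs do not cross; interleaved pairs do.
print(has_pseudoknot([(1, 10), (2, 9), (3, 8)]))  # False
print(has_pseudoknot([(1, 6), (4, 9)]))           # True
```

The pairwise test is quadratic in the number of base pairs, which is enough for illustration.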

Pseudoknots

• Pseudoknots are important for certain ncRNAs.
• They violate the non-crossing assumption.
• Pseudoknots make most problems harder.
• We assume there are no pseudoknots unless otherwise noted.

[Rivas and Eddy (1999)]

ncRNA evolution is constrained by its secondary structure

http://www.sanger.ac.uk/Software/Rfam/

• Drastic sequence changes can be tolerated.

• Compensatory mutations are very common.
– One basepair mutates into another basepair.
– The secondary structure does not change.

• In this talk: ncRNA = conserved structured RNA.

[Figure: tRNA1 and tRNA2 compared, with a compensatory mutation marked]
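A compensatory mutation can be detected by comparing two aligned sequences at the positions of a known base pair. The sketch below assumes gap-free aligned sequences of equal length and a list of 0-based paired positions. The function name `compensatory_pairs` and the example sequences are made up for illustration and are not real tRNAs.

```python
# Canonical pairs from the slides: Watson-Crick (G-C, A-U) and wobble (G-U).
CANONICAL = {("G", "C"), ("C", "G"), ("A", "U"),
             ("U", "A"), ("G", "U"), ("U", "G")}


def compensatory_pairs(seq1, seq2, pairs):
    """List base pairs where both bases changed but pairing is preserved."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    found = []
    for i, j in pairs:
        p1 = (seq1[i], seq1[j])
        p2 = (seq2[i], seq2[j])
        both_changed = p1[0] != p2[0] and p1[1] != p2[1]
        if both_changed and p1 in CANONICAL and p2 in CANONICAL:
            found.append((i, j, p1, p2))
    return found


# Hypothetical stem of three pairs; the middle pair mutates G-C -> A-U.
stem = [(0, 9), (1, 8), (2, 7)]
print(compensatory_pairs("GGCAAAAGCC", "GACAAAAGUC", stem))
# [(1, 8, ('G', 'C'), ('A', 'U'))]
```

A change at only one side of a pair (for example G-C to G-U) also keeps the structure, but it is not counted here because only one base mutated.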

Secondary Structure alone is not enough

• de novo prediction:

– Find stable secondary structures in the genome. [Shapiro et al. (1990)]
  • The stability of ncRNA secondary structure is not sufficiently different from the predicted stability of a random sequence. [Rivas and Eddy (2000)]

– Look for transcript signals. [Wassarman et al. (2001), Argaman et al. (2000)]
  • ncRNA transcript signals are not strong, unlike protein coding gene signals (open reading frame, promoter). [Rivas and Eddy (2000)]

RNA secondary structure prediction

• It is a basic issue in ncRNA analysis

• It provides important information for biologists.

• Searching and alignment algorithms are based on these models.

• RNA secondary structure -- a set of non-crossing base pairs.

Base pair maximization problem

• A simple energy model: maximize the number of basepairs in order to minimize the free energy. [Waterman (1978), Nussinov et al. (1978), Waterman and Smith (1978)]

• G – C, A – U, and G – U are treated as equally stable.

• Contributions of stacking are ignored.

A dynamic programming solution

• Let s[1…n] be an RNA sequence.

• δ(i,j) = 1 if s[i] and s[j] form a complementary base pair, else δ(i,j) = 0.

• M(i,j) is the maximum number of base pairs in s[i…j].

[Nussinov (1980)]

A dynamic programming solution

• M(1,n) is the number of base pairs in the optimal basepaired structure for s[1…n].

• All these basepairs can be found by tracing back through the matrix M.

• Filling M needs O(n^3) time.
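The recurrence itself did not survive in the transcript. The sketch below uses the standard textbook form of base pair maximization, M(i,j) = max{ M(i+1,j), M(i,j-1), M(i+1,j-1) + δ(i,j), max over i < k < j of M(i,k) + M(k+1,j) }, which may differ in detail from the slide. Indices are 0-based, no minimum hairpin loop length is enforced, and the function names are illustrative.

```python
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"),
         ("C", "G"), ("G", "U"), ("U", "G")}


def delta(a, b):
    """delta = 1 if the two bases can pair, else 0."""
    return 1 if (a, b) in PAIRS else 0


def fill_matrix(s):
    """Fill M[i][j] = maximum number of base pairs in s[i..j]; O(n^3) time."""
    n = len(s)
    M = [[0] * n for _ in range(n)]  # entries with i >= j stay 0
    for span in range(1, n):
        for i in range(n - span):
            j = i + span
            best = max(M[i + 1][j], M[i][j - 1],
                       M[i + 1][j - 1] + delta(s[i], s[j]))
            for k in range(i + 1, j):  # bifurcation into two substructures
                best = max(best, M[i][k] + M[k + 1][j])
            M[i][j] = best
    return M


def traceback(M, s, i, j, pairs):
    """Recover one optimal set of base pairs by tracing back through M."""
    if i >= j:
        return
    if M[i][j] == M[i + 1][j]:
        traceback(M, s, i + 1, j, pairs)
    elif M[i][j] == M[i][j - 1]:
        traceback(M, s, i, j - 1, pairs)
    elif delta(s[i], s[j]) and M[i][j] == M[i + 1][j - 1] + 1:
        pairs.append((i, j))
        traceback(M, s, i + 1, j - 1, pairs)
    else:
        for k in range(i + 1, j):
            if M[i][j] == M[i][k] + M[k + 1][j]:
                traceback(M, s, i, k, pairs)
                traceback(M, s, k + 1, j, pairs)
                return


def fold(s):
    """Return (maximum number of pairs, sorted list of 0-based pairs)."""
    if len(s) < 2:
        return 0, []
    M = fill_matrix(s)
    pairs = []
    traceback(M, s, 0, len(s) - 1, pairs)
    return M[0][len(s) - 1], sorted(pairs)


print(fold("ACGAUU"))  # (3, [(0, 5), (1, 2), (3, 4)])
```

For the sequence ACGAUU this gives three base pairs: A1-U6, C2-G3 and A4-U5 in 1-based positions.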

RNA structure: example

[Figure: matrix M(i,j) for the sequence ACGAUU (positions 1 to 6). Legible entries: M(1,2…6) = 0, 1, 1, 2, 3 and M(2,3…6) = 1, 1, 2, 2, so the optimum is M(1,6) = 3.]

Zuker-Sankoff minimum energy model

• Stacks (contiguous nested base pairs) are the dominant stabilizing force: they contribute negative energy.

• Unpaired bases form loops, which contribute positive energy.
– Hairpin loops, bulge/internal loops, and multiloops.

• Zuker-Sankoff minimum energy model. [Zuker and Sankoff (1984), Sankoff (1985)]

• Mfold and ViennaRNA are both based on this model. (It is also called the mfold model.)

Zuker-Sankoff minimum energy model

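[Figure: energy terms of the model, after Lyngsø (1999)]

• Hairpin loop closed by (i,j): eH(i,j)
• Stacked pairs (i,j) and (i+1,j-1): eS(i,j,i+1,j-1)
• Bulge/internal loop closed by (i,j) and (i’,j’): eL(i,j,i’,j’)
• Multiloop (example in the figure): a+3*b+4*c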

RNA minimum energy problem

• This problem can be solved by a dynamic programming algorithm in O(n^4) time.

• Lyngsø et al. (1999) revised the energy function for internal loops and proposed an O(n^3) time solution.

Zuker-Sankoff model

Recursive functions

• W(i) holds the minimum energy of a structure on s[1…i].

• V(i,j) holds the minimum energy of a structure on s[i…j] with s[i] and s[j] forming a basepair.

• WM(i,j) holds the minimum energy of a structure on s[i…j] that is part of a multiloop.
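The three tables can be tied together in a short sketch. The recursion equations are missing from the transcript, so the code below follows the commonly published form of the W, V, WM recursions and may not match the slides exactly. All energy constants are toy values chosen for illustration, not Mfold or ViennaRNA parameters. Reading the multiloop constants as a (offset), b (per branch) and c (per unpaired base) is also an assumption. Indices are 0-based.

```python
from functools import lru_cache

INF = float("inf")
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"),
         ("C", "G"), ("G", "U"), ("U", "G")}

# Toy parameters (assumptions for illustration only).
STACK = -3.0                # eS: one stacked pair
HAIRPIN = 4.0               # eH: hairpin loop with at least 3 unpaired bases
LOOP_BASE = 2.0             # eL: bulge/internal loop offset
LOOP_PER_BASE = 0.5         # eL: cost per unpaired base in the loop
A, B, C = 3.0, 0.5, 0.2     # multiloop: offset, per branch, per unpaired base


def min_energy(s):
    """Minimum energy of s under the toy model; naive O(n^4) internal loops."""
    n = len(s)

    def eL(i, j, ip, jp):
        return LOOP_BASE + LOOP_PER_BASE * ((ip - i - 1) + (j - jp - 1))

    @lru_cache(maxsize=None)
    def V(i, j):
        # s[i] and s[j] must pair and enclose at least 3 bases.
        if j - i < 4 or (s[i], s[j]) not in PAIRS:
            return INF
        best = min(HAIRPIN, STACK + V(i + 1, j - 1))
        for ip in range(i + 1, j - 1):          # bulge / internal loop
            for jp in range(ip + 1, j):
                if (ip, jp) != (i + 1, j - 1):
                    best = min(best, eL(i, j, ip, jp) + V(ip, jp))
        for k in range(i + 2, j):               # multiloop closed by (i, j)
            best = min(best, A + B + WM(i + 1, k - 1) + WM(k, j - 1))
        return best

    @lru_cache(maxsize=None)
    def WM(i, j):
        # Part of a multiloop on s[i..j] containing at least one branch.
        if i >= j:
            return INF
        best = min(V(i, j) + B, WM(i, j - 1) + C, WM(i + 1, j) + C)
        for k in range(i + 1, j + 1):
            best = min(best, WM(i, k - 1) + WM(k, j))
        return best

    W = [0.0] * (n + 1)     # W[i]: minimum energy of the prefix of length i
    for i in range(1, n + 1):
        W[i] = W[i - 1]     # last base unpaired
        for k in range(i):  # last base paired with position k
            W[i] = min(W[i], W[k] + V(k, i - 1))
    return W[n]


print(min_energy("GGGAAACCC"))  # -2.0: one hairpin (4.0) plus two stacks (-3.0 each)
```

The memoised recursion is only meant for short sequences; a production implementation would fill the tables iteratively and use measured energy parameters.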


Idea of the project

• An integrated bioinformatics platform specifically designed for detecting, verifying, and classifying noncoding RNAs.

• This complex approach to "Computational RNomics" will provide a pipeline capable of detecting RNA motifs with low sequence conservation.

• It will also integrate RNA motif prediction, which should significantly improve the quality of the RNA homolog search.

• The secondary structure of the RNA can be represented as a graph.

• The graph representation can distinguish RNAs of known families from the Rfam database.

• Expressed sequence tags and a profile-profile, protein-like method are used to predict ORFs.

• The candidates will be folded “ab initio” using the method proposed by Zuker (Mfold).

• The secondary structures of the candidates can then be compared to sequences from Rfam or an ncRNA database to find novel ncRNAs or to detect missing members of already known families.
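One simple way to represent a secondary structure as a graph is sketched below. The slides do not say which graph encoding the project uses, so this one is an assumption: nodes are nucleotides, and edges are backbone links plus base pairs. The dot-bracket input format and the function name `dot_bracket_to_graph` are also illustrative choices.

```python
def dot_bracket_to_graph(structure):
    """Build an adjacency map from a pseudoknot-free dot-bracket string.

    Nodes are 0-based nucleotide positions. Edges connect backbone
    neighbours (i, i+1) and base-paired positions.
    """
    n = len(structure)
    adjacency = {i: set() for i in range(n)}
    for i in range(n - 1):              # backbone edges
        adjacency[i].add(i + 1)
        adjacency[i + 1].add(i)

    stack, pairs = [], []
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced ')' at position %d" % i)
            j = stack.pop()             # matching opening bracket
            adjacency[i].add(j)
            adjacency[j].add(i)
            pairs.append((j, i))
        elif ch != ".":
            raise ValueError("unexpected character %r" % ch)
    if stack:
        raise ValueError("unbalanced '(' at position %d" % stack[-1])
    return adjacency, sorted(pairs)


adjacency, pairs = dot_bracket_to_graph("((...))")
print(pairs)         # [(0, 6), (1, 5)]
print(adjacency[1])  # {0, 2, 5}
```

Coarser encodings, where each stem or loop becomes a single node, are also common when comparing structures across families.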

Service Features

• Prediction of missing members of already defined RNA families
• Description of ncRNA properties and a measure of confidence in their functional importance
• Prediction of novel ncRNAs “ab initio”
• “Online” visualization of secondary and tertiary structure
• Exchange formats for RNA annotation
• Identification of interactions with coding genes
• Design of novel RNAs for therapy and drug delivery (RNAi)

SERVER

[Figure: pipeline flowchart. Labels shown: Sanger, Human, Mouse, RepeatMasker, BLASTz, tBLASTn, BLASTn, mRNA/EST, %GC, Loop Scanner, QRNA. Steps shown:]

• Remove repeats
• Search for sequences of high functionality
• Search for coding sequences
• Compare expression patterns
• Search for sequences with well-defined structure or containing compensatory mutations
• Search for remaining genes
