1. Modeling 3D structures of protein-RNA interactions
- Author
-
Čopar, Andrej and Curk, Tomaž
- Subjects
computer and information science ,računalništvo ,protein-RNA interactions ,napovedni model ,kombinatorična optimizacija ,computer science ,magisteriji ,bioinformatics ,molecular docking ,udc:004.85:575.112(043.2) ,strukturna analiza ,bioinformatika ,prediction model ,umestitev molekul ,računalništvo in informatika ,combinatorial optimization ,structural analysis ,master's degree ,interakcije protein-RNA - Abstract
Interakcije med proteini in RNA imajo ključno vlogo pri velikem številu celičnih procesov. Eksperimentalna analiza 3D struktur molekul je počasna in zahtevna, zato obstaja velika potreba po računskih metodah, ki uspešno napovedujejo mesta ter strukturo molekul v interakciji. V magistrskem delu smo definirali vrsto značilk, ki opisujejo lokalne lastnosti interakcij protein-RNA, na podlagi podatkov o 3D strukturah molekul protein-RNA. Razvili smo metodo, ki združuje strojno učenje in optimizacijski postopek za napovedovanje mesta interakcij med proteinom in RNA. Napovedi strojnega učenja se uporabijo za določanje začetnega stanja optimizacije. Optimizacijski postopek nato uporabi ocenjevalne funkcije osnovane na porazdelitvi 3D strukturnih značilk in tako predlaga najverjetnejšo pozicijo molekule RNA. Predlagani napovedni model dosega natančnost, ki je primerljiva z uspešnostjo najboljših obstoječih metod. Protein-RNA interactions have an essential role in many cellular processes. Experimental analysis of 3D molecular structure is slow and difficult process. Consequently, computational methods, which successfully predict interaction sites and molecular conformations are needed. In this thesis we have defined a number of attributes to describe local properties of protein-RNA interactions using data on 3D structure of protein-RNA molecules. We have implemented a method that uses machine learning and optimization algorithm for prediction of protein-RNA interaction sites. Machine learning predictions are used to generate initial positions for optimization. Optimization algorithm uses scoring functions based on the distribution of 3D structural attributes to identify most likely positions of the RNA molecule interacting with a given protein. The accuracy of the proposed prediction model is comparable to results obtained with best existing methods.
- Published
- 2015