Bioinformatics approach to unearthing bimolecular hammerhead ribozymes

No Thumbnail Available
Journal Title
Journal ISSN
Volume Title
Perustieteiden korkeakoulu | Master's thesis
Date
2019-03-11
Department
Major/Subject
Bioinformatics
Mcode
SCI3058
Degree programme
Master’s Programme in Life Science Technologies
Language
en
Pages
6 + 63
Series
Abstract
Background. Motif finding in large genomic datasets is one of the most powerful bioinformatics techniques in research. It acts as an important gateway to answering a wide variety of biological questions. However, it also remains one of bioinformatics' most challenging and computationally-expensive aspects. Motivation. In this work, we focus on small catalytic RNA molecules, i.e. hammerhead ribozymes, that over the past decades have been found to be present among the entire tree of life. Hammerhead ribozymes have been observed in many different forms and unusualities across biological samples and organisms of different origins. Nonetheless, bioinformatics searches have only concentrated on hammerhead ribozymes that are contiguous and wholly found on a single chromosome. In this work, we concentrate on hammerheads that are composed of two split RNA molecules each found on a different chromosome. Such hammerheads have been widely the focus of laboratory design and experiments due to their importance in medicine and therapeutics, and as of yet have not been reported as naturally-occurring motifs. Objective. Development and design of a bioinformatics tool that has the capability of locating bimolecular motifs that span two chromosomes; using hammerhead ribozymes and their consensus sequence structure to evaluate the tool on a certain model organism; discovering whether there is a theoretical probability that such motifs could occur in nature or they are truly restricted to laboratory design. Results. We have developed a hybrid pipeline in R, Perl, and Bash languages that utilizes regular expressions and BLAST for exact pattern matching of small motifs. We evaluated our tool on five chromosomes of Drosophila melanogaster and obtained thousands of results that match the structural requirements of a bimolecular discontiguous hammerhead ribozyme. These results showcase a high chance of occurrence for such motifs in nature. Moreover, many of the hits show characteristics of trans-acting and multiple-turnover hammerheads.
Description
Supervisor
Lähdesmäki, Harri
Thesis advisor
Curtis, Edward
Hoksza, David
Keywords
bioinformatics, rna motif discovery, bimolecular motif, sequence search, hammerhead ribozyme, computational biology
Other note
Citation