Algorithms for Order-Preserving Matching

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorTarhio, Jorma, Prof., Aalto University, Department of Computer Science, Finland
dc.contributor.authorChhabra, Tamanna
dc.contributor.departmentTietotekniikan laitosfi
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorTarhio, Jorma, Prof., Aalto University, Department of Computer Science, Finland
dc.date.accessioned2016-05-25T09:01:19Z
dc.date.available2016-05-25T09:01:19Z
dc.date.defence2016-06-10
dc.date.issued2016
dc.description.abstractString matching is a widely studied problem in Computer Science. There have been many recent developments in this field. One fascinating problem considered lately is the order-preserving matching (OPM) problem. The task is to find all the substrings in the text which have the same length and relative order as the pattern, where the relative order is the numerical order of the numbers in a string. The problem finds its applications in the areas involving time series or series of numbers. More specifically, it is useful for those who are interested in the relative order of the pattern and not in the pattern itself. For example, it can be used by analysts in a stock market to study movements of prices.  In addition to the OPM problem, we also studied its approximate variation. In approximate order-preserving matching, we search for those substrings in the text which have relative order similar to the pattern, i.e., relative order of the pattern matches with at most k mismatches. With respect to applications of order-preserving matching, approximate search is more meaningful than exact search. We developed various advanced solutions for the problem and its variant. Special emphasis was laid on the practical efficiency of the solutions. Particularly, we introduced a simple solution for the OPM problem using filtration. We proved experimentally that our method was effective and faster than the previous solutions for the problem. In addition, we combined the Single Instruction Multiple Data (SIMD) instruction set architecture with filtration to develop competent solutions which were faster than our previous solution. Moreover, we proposed another efficient solution without filtration using the SIMD architecture. We also presented an offline solution based on the FM-index scheme. Furthermore, we proposed practical solutions for the approximate order-preserving matching problem and one of the solutions was the first sublinear solution on average for the problem.en
dc.format.extent70 + app. 43
dc.format.mimetypeapplication/pdfen
dc.identifier.isbn978-952-60-6829-9 (electronic)
dc.identifier.isbn978-952-60-6828-2 (printed)
dc.identifier.issn1799-4942 (electronic)
dc.identifier.issn1799-4934 (printed)
dc.identifier.issn1799-4934 (ISSN-L)
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/20383
dc.identifier.urnURN:ISBN:978-952-60-6829-9
dc.language.isoenen
dc.opnWatson, Bruce, Prof., Stellenbosch University, South Africa
dc.publisherAalto Universityen
dc.publisherAalto-yliopistofi
dc.relation.haspart[Publication 1]: Tamanna Chhabra and Jorma Tarhio. A filtration method for order-preserving matching. Information Processing Letters, 116(2): 71–74, 2016. DOI: 10.1016/j.ipl.2015.10.005
dc.relation.haspart[Publication 2]: Tamanna Chhabra, M. Oguzhan Kulekci, and Jorma Tarhio. Alternative algorithms for order-preserving matching. In Proceedings of the Prague Stringology Conference, Prague, Czech Republic, 36–46, August 2015.
dc.relation.haspart[Publication 3]: Tamanna Chhabra, Simone Faro, and M. Oguzhan Kulekci. Engineering order-preserving pattern matching with SIMD parallelism. Software–Practice and Experience, 2015.
dc.relation.haspart[Publication 4]: Tamanna Chhabra, Emanuele Giaquinta, and Jorma Tarhio. Filtration algorithms for approximate order-preserving matching. In Proceedings of the String Processing and Information Retrieval – 22nd International Symposium, SPIRE, London, UK, Lecture Notes in Computer Science 9309: 177–187, September 2015. DOI: 10.1007/978-3-319-23826-5_18
dc.relation.ispartofseriesAalto University publication series DOCTORAL DISSERTATIONSen
dc.relation.ispartofseries101/2016
dc.revLecroq, Thierry, Prof., University of Rouen, France
dc.revKärkkäinen, Juha, University Lecturer, University of Helsinki, Finland
dc.subject.keywordstring matchingen
dc.subject.keywordindexingen
dc.subject.keywordSIMDen
dc.subject.keywordfiltrationen
dc.subject.otherComputer scienceen
dc.titleAlgorithms for Order-Preserving Matchingen
dc.typeG5 Artikkeliväitöskirjafi
dc.type.dcmitypetexten
dc.type.ontasotDoctoral dissertation (article-based)en
dc.type.ontasotVäitöskirja (artikkeli)fi
local.aalto.archiveyes
local.aalto.formfolder2016_05_25_klo_11_04

Files

Original bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
isbn9789526068299.pdf
Size:
506.76 KB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
Errata_chhabra_tamanna_P2_P3.pdf
Size:
119.06 KB
Format:
Adobe Portable Document Format