The problem of time series analysis and incomlete data: Real-world applications

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorLendasse, Amaury
dc.contributor.authorChistiakova, Tatiana
dc.contributor.departmentPerustieteiden korkeakoulufi
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorSimula, Olli
dc.date.accessioned2020-12-28T15:08:49Z
dc.date.available2020-12-28T15:08:49Z
dc.date.issued2013
dc.description.abstractOne of the characteristics of almost any data collection is the presence of outstanding series and missing values. The risk to get the incomplete and hard processed data increases especially if the data is characterized with a large size or collected manually. The presence of missing values in the data cannot be underestimated. In addition to containing important information, missing values are often correlated with other values. Furthermore, the predicted data allows analysing the data and performing future forecast on obtained results. In case of data analysis, it is essential to study data properties carefully. The data analysis occurs in every sphere, e.g. sociology, finance, environment, science, wherever there are issues to be studied and explored. Social networks have been always a reach topic to explore. Being highly dynamic objects, the issues require a deep and careful investigation. Moreover, due to their properties, like a small number of samples and a high amount of variables at the same time, online data seeks for additional methods to highlight and uncover interesting parts. The proposed methodology of a modified Forward-Backward algorithm aims to analyse social networks presented as time series data sets. All the time, people study deeply burning issues, related to climate and economy. Since these topics are of a particular interest, in the thesis, the imputations of missing values are performed on real-world data sets from climatology and financial areas. The application shows the possible variety and importance of predicting the missing values. There exist a large number of methods which allow imputing missing values. A number of promising algorithms is investigated and compared due to data sets difference -The EOF, the Ensemble of SOMs and the Mixture of Gaussians.en
dc.format.extent53
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/100821
dc.identifier.urnURN:NBN:fi:aalto-2020122859652
dc.language.isoenen
dc.programme.majorInformaatiotekniikkafi
dc.programme.mcodeT-115fi
dc.rights.accesslevelclosedAccess
dc.subject.keywordtime series predictionen
dc.subject.keywordsocial networksen
dc.subject.keywordvariable selectionen
dc.subject.keywordforward-backward algorithmen
dc.subject.keywordmissing valuesen
dc.subject.keywordwater temperature dataen
dc.subject.keywordimputationsen
dc.subject.keywordensemble of SOMsen
dc.subject.keywordEOFen
dc.subject.keywordmixture of gaussiansen
dc.titleThe problem of time series analysis and incomlete data: Real-world applicationsen
dc.type.okmG2 Pro gradu, diplomityö
dc.type.ontasotMaster's thesisen
dc.type.ontasotPro gradu -tutkielmafi
dc.type.publicationmasterThesis
local.aalto.digiauthask
local.aalto.digifolderAalto_10497
local.aalto.idinssi47025
local.aalto.inssiarchivenr8667
local.aalto.inssilocationP1 Ark Aalto
local.aalto.openaccessno

Files