PairGP: Gaussian process modeling of longitudinal data from paired multi-condition studies

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorVantini, Micheleen_US
dc.contributor.authorMannerström, Henriken_US
dc.contributor.authorRautio, Sinien_US
dc.contributor.authorAhlfors, Helenaen_US
dc.contributor.authorStockinger, Brigittaen_US
dc.contributor.authorLähdesmäki, Harrien_US
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorProfessorship Lähdesmäki Harrien
dc.contributor.groupauthorComputer Science Professorsen
dc.contributor.groupauthorComputer Science - Computational Life Sciences (CSLife) - Research areaen
dc.contributor.groupauthorComputer Science - Artificial Intelligence and Machine Learning (AIML) - Research areaen
dc.contributor.groupauthorHelsinki Institute for Information Technology (HIIT)en
dc.contributor.organizationDepartment of Computer Scienceen_US
dc.contributor.organizationFrancis Crick Instituteen_US
dc.date.accessioned2022-02-16T07:41:36Z
dc.date.available2022-02-16T07:41:36Z
dc.date.issued2022-04en_US
dc.descriptionPublisher Copyright: © 2022
dc.description.abstractHigh-throughput technologies produce gene expression time-series data that need fast and specialized algorithms to be processed. While current methods already deal with different aspects, such as the non-stationarity of the process and the temporal correlation, they often fail to take into account the pairing among replicates. We propose PairGP, a non-stationary Gaussian process method to compare gene expression time-series across several conditions that can account for paired longitudinal study designs and can identify groups of conditions that have different gene expression dynamics. We demonstrate the method on both simulated data and previously unpublished RNA sequencing (RNA-seq) time-series with five conditions. The results show the advantage of modeling the pairing effect to better identify groups of conditions with different dynamics. The pairing effect model displays good capabilities of selecting the most probable grouping of conditions even in the presence of a high number of conditions. The developed method is of general application and can be applied to any gene expression time series dataset. The model can identify common replicate effects among the samples coming from the same biological replicates and model those as separate components. Learning the pairing effect as a separate component, not only allows us to exclude it from the model to get better estimates of the condition effects, but also to improve the precision of the model selection process. The pairing effect that was accounted before as noise, is now identified as a separate component, resulting in more accurate and explanatory models of the data.en
dc.description.versionPeer revieweden
dc.format.extent7
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationVantini, M, Mannerström, H, Rautio, S, Ahlfors, H, Stockinger, B & Lähdesmäki, H 2022, 'PairGP: Gaussian process modeling of longitudinal data from paired multi-condition studies', Computers in Biology and Medicine, vol. 143, 105268, pp. 1-7. https://doi.org/10.1016/j.compbiomed.2022.105268en
dc.identifier.doi10.1016/j.compbiomed.2022.105268en_US
dc.identifier.issn0010-4825
dc.identifier.issn1879-0534
dc.identifier.otherPURE UUID: f4e7af16-3927-478d-aec1-d81dd1b9c2c6en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/f4e7af16-3927-478d-aec1-d81dd1b9c2c6en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/79392985/PairGP_Gaussian_process_modeling_of_longitudinal_data_from_paired_multi_condition_studies.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/113061
dc.identifier.urnURN:NBN:fi:aalto-202202161953
dc.language.isoenen
dc.publisherElsevier
dc.relation.fundinginfoThis work was supported by the Academy of Finland [grant numbers 292 660 , 313 271 ]
dc.relation.ispartofseriesComputers in Biology and Medicineen
dc.relation.ispartofseriesVolume 143, pp. 1-7en
dc.rightsopenAccessen
dc.subject.keywordDifferential condition analysisen_US
dc.subject.keywordGaussian processesen_US
dc.subject.keywordGene expressionsen_US
dc.subject.keywordPairing effecten_US
dc.subject.keywordTime-seriesen_US
dc.titlePairGP: Gaussian process modeling of longitudinal data from paired multi-condition studiesen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
PairGP_Gaussian_process_modeling_of_longitudinal_data_from_paired_multi_condition_studies.pdf
Size:
2.01 MB
Format:
Adobe Portable Document Format