LuxGLM: A probabilistic covariate model for quantification of DNA methylation modifications with complex experimental designs

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorÄijö, Tarmoen_US
dc.contributor.authorYue, Xiaojingen_US
dc.contributor.authorRao, Anjanaen_US
dc.contributor.authorLähdesmäki, Harrien_US
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorProfessorship Lähdesmäki Harrien
dc.contributor.groupauthorCentre of Excellence in Molecular Systems Immunology and Physiology Research Group, SyMMysen
dc.contributor.organizationLa Jolla Institute for Allergy and Immunologyen_US
dc.contributor.organizationSanford Consortium for Regenerative Medicineen_US
dc.date.accessioned2021-05-05T06:20:32Z
dc.date.available2021-05-05T06:20:32Z
dc.date.issued2016-09-01en_US
dc.description.abstractMotivation: 5-methylcytosine (5mC) is a widely studied epigenetic modification of DNA. The ten-eleven translocation (TET) dioxygenases oxidize 5mC into oxidized methylcytosines (oxi-mCs): 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC). DNA methylation modifications have multiple functions. For example, 5mC is shown to be associated with diseases and oxi-mC species are reported to have a role in active DNA demethylation through 5mC oxidation and DNA repair, among others, but the detailed mechanisms are poorly understood. Bisulphite sequencing and its various derivatives can be used to gain information about all methylation modifications at single nucleotide resolution. Analysis of bisulphite based sequencing data is complicated due to the convoluted read-outs and experiment-specific variation in biochemistry. Moreover, statistical analysis is often complicated by various confounding effects. How to analyse 5mC and oxi-mC data sets with arbitrary and complex experimental designs is an open and important problem. Results: We propose the first method to quantify oxi-mC species with arbitrary covariate structures from bisulphite based sequencing data. Our probabilistic modeling framework combines a previously proposed hierarchical generative model for oxi-mC-seq data and a general linear model component to account for confounding effects. We show that our method provides accurate methylation level estimates and accurate detection of differential methylation when compared with existing methods. Analysis of novel and published data gave insights into to the demethylation of the forkhead box P3 (Foxp3) locus during the induced T regulatory cell differentiation. We also demonstrate how our covariate model accurately predicts methylation levels of the Foxp3 locus. Collectively, LuxGLM method improves the analysis of DNA methylation modifications, particularly for oxi-mC species. Availability and Implementation: An implementation of the proposed method is available under MIT license at https://github.org/tare/LuxGLM/en
dc.description.versionPeer revieweden
dc.format.extenti511-i519
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationÄijö, T, Yue, X, Rao, A & Lähdesmäki, H 2016, ' LuxGLM : A probabilistic covariate model for quantification of DNA methylation modifications with complex experimental designs ', Bioinformatics, vol. 32, no. 17, pp. i511-i519 . https://doi.org/10.1093/bioinformatics/btw468en
dc.identifier.doi10.1093/bioinformatics/btw468en_US
dc.identifier.issn1367-4803
dc.identifier.issn1460-2059
dc.identifier.otherPURE UUID: fc4ad283-7fec-40c4-b15f-943bb7eb0461en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/fc4ad283-7fec-40c4-b15f-943bb7eb0461en_US
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=84991010778&partnerID=8YFLogxKen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/62350808/btw468.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/107267
dc.identifier.urnURN:NBN:fi:aalto-202105056531
dc.language.isoenen
dc.publisherOXFORD UNIV PRESS INC
dc.relation.ispartofseriesBioinformaticsen
dc.relation.ispartofseriesVolume 32, issue 17en
dc.rightsopenAccessen
dc.titleLuxGLM: A probabilistic covariate model for quantification of DNA methylation modifications with complex experimental designsen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files