Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison

dc.contributor: Aalto-yliopisto [fi]
dc.contributor: Aalto University [en]
dc.contributor.author: Säilynoja, Teemu [en_US]
dc.contributor.author: Bürkner, Paul Christian [en_US]
dc.contributor.author: Vehtari, Aki [en_US]
dc.contributor.department: Department of Computer Science [en]
dc.contributor.groupauthor: Probabilistic Machine Learning [en]
dc.contributor.groupauthor: Professorship Vehtari Aki [en]
dc.contributor.groupauthor: Computer Science Professors [en]
dc.contributor.groupauthor: Computer Science - Artificial Intelligence and Machine Learning (AIML) [en]
dc.contributor.groupauthor: Helsinki Institute for Information Technology (HIIT) [en]
dc.date.accessioned: 2022-04-28T08:08:24Z
dc.date.available: 2022-04-28T08:08:24Z
dc.date.issued: 2022-04-15 [en_US]
dc.description: Funding Information: We thank the Academy of Finland (grant 298742), the Finnish Center for Artificial Intelligence, and the Technology Industries of Finland Centennial Foundation (grant 70007503; Artificial Intelligence for Research and Development) for partial support of this research. We also acknowledge the computational resources provided by the Aalto Science-IT project. Publisher Copyright: © 2022, The Author(s).
dc.description.abstract: Assessing goodness of fit to a given distribution plays an important role in computational statistics. The probability integral transformation (PIT) can be used to convert the question of whether a given sample originates from a reference distribution into a problem of testing for uniformity. We present new simulation- and optimization-based methods to obtain simultaneous confidence bands for the whole empirical cumulative distribution function (ECDF) of the PIT values under the assumption of uniformity. Simultaneous confidence bands correspond to such confidence intervals at each point that jointly satisfy a desired coverage. These methods can also be applied in cases where the reference distribution is represented only by a finite sample, which is useful, for example, for simulation-based calibration. The confidence bands provide an intuitive ECDF-based graphical test for uniformity, which also provides useful information on the quality of the discrepancy. We further extend the simulation and optimization methods to determine simultaneous confidence bands for testing whether multiple samples come from the same underlying distribution. This multiple sample comparison test is useful, for example, as a complementary diagnostic in multi-chain Markov chain Monte Carlo (MCMC) convergence diagnostics, where most currently used convergence diagnostics provide a single diagnostic value, but do not usually offer insight into the nature of the deviation. We provide numerical experiments to assess the properties of the tests using both simulated and real-world data and give recommendations on their practical application in computational statistics workflows. [en]
dc.description.version: Peer reviewed [en]
dc.format.extent: 21
dc.format.mimetype: application/pdf [en_US]
dc.identifier.citation: Säilynoja, T, Bürkner, P C & Vehtari, A 2022, 'Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison', STATISTICS AND COMPUTING, vol. 32, no. 2, 32, pp. 1-21. https://doi.org/10.1007/s11222-022-10090-6 [en]
dc.identifier.doi: 10.1007/s11222-022-10090-6 [en_US]
dc.identifier.issn: 0960-3174
dc.identifier.other: PURE UUID: 06189a9d-32f2-4748-a092-cb85e7117059 [en_US]
dc.identifier.other: PURE ITEMURL: https://research.aalto.fi/en/publications/06189a9d-32f2-4748-a092-cb85e7117059 [en_US]
dc.identifier.other: PURE LINK: http://www.scopus.com/inward/record.url?scp=85127300829&partnerID=8YFLogxK
dc.identifier.other: PURE FILEURL: https://research.aalto.fi/files/82052543/Graphical_test_for_discrete_uniformity_and_its_applications_in_goodness_of_fit_evaluation_and_multiple_sample_comparison.pdf [en_US]
dc.identifier.uri: https://aaltodoc.aalto.fi/handle/123456789/114013
dc.identifier.urn: URN:NBN:fi:aalto-202204282899
dc.language.iso: en [en]
dc.publisher: Springer
dc.relation.ispartofseries: STATISTICS AND COMPUTING [en]
dc.relation.ispartofseries: Volume 32, issue 2, pp. 1-21 [en]
dc.rights: openAccess [en]
dc.subject.keyword: Statistics - Methodology [en_US]
dc.title: Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison [en]
dc.type: A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä [fi]
dc.type.version: publishedVersion
