Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Säilynoja, Teemu | en_US |
dc.contributor.author | Bürkner, Paul Christian | en_US |
dc.contributor.author | Vehtari, Aki | en_US |
dc.contributor.department | Department of Computer Science | en |
dc.contributor.groupauthor | Probabilistic Machine Learning | en |
dc.contributor.groupauthor | Professorship Vehtari Aki | en |
dc.contributor.groupauthor | Computer Science Professors | en |
dc.contributor.groupauthor | Computer Science - Artificial Intelligence and Machine Learning (AIML) | en |
dc.contributor.groupauthor | Helsinki Institute for Information Technology (HIIT) | en |
dc.date.accessioned | 2022-04-28T08:08:24Z | |
dc.date.available | 2022-04-28T08:08:24Z | |
dc.date.issued | 2022-04-15 | en_US |
dc.description | Funding Information: We thank the Academy of Finland (grant 298742), the Finnish Center for Artificial Intelligence, and the Technology Industries of Finland Centennial Foundation (grant 70007503; Artificial Intelligence for Research and Development) for partial support of this research. We also acknowledge the computational resources provided by the Aalto Science-IT project. Publisher Copyright: © 2022, The Author(s). | |
dc.description.abstract | Assessing goodness of fit to a given distribution plays an important role in computational statistics. The probability integral transformation (PIT) can be used to convert the question of whether a given sample originates from a reference distribution into a problem of testing for uniformity. We present new simulation- and optimization-based methods to obtain simultaneous confidence bands for the whole empirical cumulative distribution function (ECDF) of the PIT values under the assumption of uniformity. Simultaneous confidence bands correspond to such confidence intervals at each point that jointly satisfy a desired coverage. These methods can also be applied in cases where the reference distribution is represented only by a finite sample, which is useful, for example, for simulation-based calibration. The confidence bands provide an intuitive ECDF-based graphical test for uniformity, which also provides useful information on the quality of the discrepancy. We further extend the simulation and optimization methods to determine simultaneous confidence bands for testing whether multiple samples come from the same underlying distribution. This multiple sample comparison test is useful, for example, as a complementary diagnostic in multi-chain Markov chain Monte Carlo (MCMC) convergence diagnostics, where most currently used convergence diagnostics provide a single diagnostic value, but do not usually offer insight into the nature of the deviation. We provide numerical experiments to assess the properties of the tests using both simulated and real-world data and give recommendations on their practical application in computational statistics workflows. | en |
dc.description.version | Peer reviewed | en |
dc.format.extent | 21 | |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | Säilynoja, T, Bürkner, P C & Vehtari, A 2022, ' Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison ', STATISTICS AND COMPUTING, vol. 32, no. 2, 32, pp. 1-21 . https://doi.org/10.1007/s11222-022-10090-6 | en |
dc.identifier.doi | 10.1007/s11222-022-10090-6 | en_US |
dc.identifier.issn | 0960-3174 | |
dc.identifier.other | PURE UUID: 06189a9d-32f2-4748-a092-cb85e7117059 | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/06189a9d-32f2-4748-a092-cb85e7117059 | en_US |
dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85127300829&partnerID=8YFLogxK | |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/82052543/Graphical_test_for_discrete_uniformity_and_its_applications_in_goodness_of_fit_evaluation_and_multiple_sample_comparison.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/114013 | |
dc.identifier.urn | URN:NBN:fi:aalto-202204282899 | |
dc.language.iso | en | en |
dc.publisher | Springer | |
dc.relation.ispartofseries | STATISTICS AND COMPUTING | en |
dc.relation.ispartofseries | Volume 32, issue 2, pp. 1-21 | en |
dc.rights | openAccess | en |
dc.subject.keyword | Statistics - Methodology | en_US |
dc.title | Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison | en |
dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |
dc.type.version | publishedVersion |