Distributed Bayesian matrix factorization with limited communication
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.author | Qin, Xiangju | |
| dc.contributor.author | Blomstedt, Paul | |
| dc.contributor.author | Leppäaho, Eemeli | |
| dc.contributor.author | Parviainen, Pekka | |
| dc.contributor.author | Kaski, Samuel | |
| dc.contributor.department | Probabilistic Machine Learning | |
| dc.contributor.department | Centre of Excellence in Computational Inference, COIN | |
| dc.contributor.department | Department of Computer Science | |
| dc.date.accessioned | 2019-05-06T09:19:15Z | |
| dc.date.available | 2019-05-06T09:19:15Z | |
| dc.date.issued | 2019-01-01 | |
| dc.description | | openaire: EC/H2020/671555/EU//ExCAPE | |
| dc.description.abstract | Bayesian matrix factorization (BMF) is a powerful tool for producing low-rank representations of matrices and for predicting missing values and providing confidence intervals. Scaling up the posterior inference for massive-scale matrices is challenging and requires distributing both data and computation over many workers, making communication the main computational bottleneck. Embarrassingly parallel inference would remove the communication needed, by using completely independent computations on different data subsets, but it suffers from the inherent unidentifiability of BMF solutions. We introduce a hierarchical decomposition of the joint posterior distribution, which couples the subset inferences, allowing for embarrassingly parallel computations in a sequence of at most three stages. Using an efficient approximate implementation, we show improvements empirically on both real and simulated data. Our distributed approach is able to achieve a speed-up of almost an order of magnitude over the full posterior, with a negligible effect on predictive accuracy. Our method outperforms state-of-the-art embarrassingly parallel MCMC methods in accuracy, and achieves results competitive to other available distributed and parallel implementations of BMF. | en |
| dc.description.version | Peer reviewed | en |
| dc.format.extent | 1-26 | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Qin , X , Blomstedt , P , Leppäaho , E , Parviainen , P & Kaski , S 2019 , ' Distributed Bayesian matrix factorization with limited communication ' , Machine Learning , pp. 1-26 . https://doi.org/10.1007/s10994-019-05778-2 | en |
| dc.identifier.doi | 10.1007/s10994-019-05778-2 | |
| dc.identifier.issn | 0885-6125 | |
| dc.identifier.issn | 1573-0565 | |
| dc.identifier.other | PURE UUID: 8f440f9a-370a-48fd-8433-4318b7a976ba | |
| dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/distributed-bayesian-matrix-factorization-with-limited-communication(8f440f9a-370a-48fd-8433-4318b7a976ba).html | |
| dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85064242641&partnerID=8YFLogxK | |
| dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/33413162/Qin2019_Article_DistributedBayesianMatrixFacto.pdf | |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/37722 | |
| dc.identifier.urn | URN:NBN:fi:aalto-201905062840 | |
| dc.language.iso | en | en |
| dc.publisher | Springer Netherlands | |
| dc.relation | info:eu-repo/grantAgreement/EC/H2020/671555/EU//ExCAPE | |
| dc.relation.ispartofseries | Machine Learning | en |
| dc.rights | openAccess | en |
| dc.subject.keyword | Bayesian matrix factorization | |
| dc.subject.keyword | Distributed inference | |
| dc.subject.keyword | Embarrassingly parallel MCMC | |
| dc.subject.keyword | Posterior propagation | |
| dc.subject.keyword | Software | |
| dc.subject.keyword | Artificial Intelligence | |
| dc.subject.keyword | 113 Computer and information sciences | |
| dc.subject.other | Software | en |
| dc.subject.other | Artificial Intelligence | en |
| dc.subject.other | 113 Computer and information sciences | en |
| dc.title | Distributed Bayesian matrix factorization with limited communication | en |
| dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |
| dc.type.version | publishedVersion |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Qin2019_Article_DistributedBayesianMatrixFacto.pdf
- Size:
- 1.18 MB
- Format:
- Adobe Portable Document Format