Multi-node Training for StyleGAN2

Loading...
Thumbnail Image

Access rights

openAccess

URL

Journal Title

Journal ISSN

Volume Title

A4 Artikkeli konferenssijulkaisussa

Date

2021

Major/Subject

Mcode

Degree programme

Language

en

Pages

8
677-684

Series

Pattern Recognition. ICPR International Workshops and Challenges, 2021, Proceedings, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Volume 12661 LNCS

Abstract

StyleGAN2 is a Tensorflow-based Generative Adversarial Network (GAN) framework that represents the state-of-the-art in generative image modelling. The current release of StyleGAN2 implements multi-GPU training via Tensorflow’s device contexts which limits data parallelism to a single node. In this work, a data-parallel multi-node training capability is implemented in StyleGAN2 via Horovod which enables harnessing the compute capability of larger cluster architectures. We demonstrate that the new Horovod-based communication outperforms the previous context approach on a single node. Furthermore, we demonstrate that the multi-node training does not compromise the accuracy of StyleGAN2 for a constant effective batch size. Finally, we report strong and weak scaling of the new implementation up to 64 NVIDIA Tesla A100 GPUs distributed across eight NVIDIA DGX A100 nodes, demonstrating the utility of the approach at scale.

Description

Publisher Copyright: © 2021, Springer Nature Switzerland AG. Copyright: Copyright 2021 Elsevier B.V., All rights reserved.

Keywords

GAN, GPU, Massively parallel architectures, Multi-node training, StyleGAN2

Other note

Citation

Loppi, N & Kynkäänniemi, T 2021, Multi-node Training for StyleGAN2 . in A Del Bimbo, R Cucchiara, S Sclaroff, G M Farinella, T Mei, M Bertini, H J Escalante & R Vezzani (eds), Pattern Recognition. ICPR International Workshops and Challenges, 2021, Proceedings . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12661 LNCS, Springer, pp. 677-684, International Conference on Pattern Recognition, Milan, Italy, 10/01/2021 . https://doi.org/10.1007/978-3-030-68763-2_51