Exploring the structure in deep networks: Group, manifold and category theory
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.advisor | Schnoor, Ekkehard | |
| dc.contributor.author | Zheng, Bo | |
| dc.contributor.school | Perustieteiden korkeakoulu | fi |
| dc.contributor.school | School of Science | en |
| dc.contributor.supervisor | Jung, Alex | |
| dc.date.accessioned | 2026-01-22T18:04:41Z | |
| dc.date.available | 2026-01-22T18:04:41Z | |
| dc.date.issued | 2025-12-21 | |
| dc.description.abstract | Modern deep learning has achieved remarkable success in recent years, yet we lack a comprehensive understanding of why it performs well in some tasks while failing in others. This thesis develops a mathematical framework for understanding and designing neural networks through the lenses of group theory, differential geometry, and category theory. We begin by analyzing the symmetry structure of parameter spaces. For a traditional deep learning structure: linear layers + non-linear activation + regularization, we prove that the linear part possesses maximal $\mathrm{GL}_n(\mathbb{R})$ symmetry. Nonlinear activations break this symmetry to proper subgroups; we analyze ReLU and sigmoid, for example. Then we study how regularization with different norms affects symmetry, especially Schatten-$p$ norms and entry-wise $\ell_p$ norms. This work connects the choice of activation/regularization and the geometry of representations we want to learn. We then introduce Path Equivariant Networks (PENs), which generalize classical group equivariance from point-wise constraints $F(g \cdot x) = \rho(g) \cdot F(x)$ to path-wise constraints on manifolds. We prove that classical group equivariance arises as a special case under certain conditions. As an extension of this idea, we introduce content-pose decomposition, which factors the data manifolds into a symmetry-carrying pose (living in the group $G$) and a symmetry-free content (living in the quotient $U = X/G$). Finally, we provide a categorical formalization where equivariant maps are natural transformations between functors. The naturality condition captures the essence of symmetry-preserving computation. This work contributes to the theoretical foundation that the design of neural networks is fundamentally a choice of structures we want to retain in the data. | en |
| dc.format.extent | 77 | |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/142482 | |
| dc.identifier.urn | URN:NBN:fi:aalto-202601221854 | |
| dc.language.iso | en | en |
| dc.programme | Master's Programme in Computer, Communication and Information Sciences | en |
| dc.programme | Master's Programme in Computer, Communication and Information Sciences | fi |
| dc.programme | Master's Programme in Computer, Communication and Information Sciences | sv |
| dc.programme.major | Machine Learning, Data Science and Artificial Intelligence | en |
| dc.subject.keyword | geometric deep learning | en |
| dc.subject.keyword | group equivariance | en |
| dc.subject.keyword | symmetry | en |
| dc.subject.keyword | neural networks | en |
| dc.subject.keyword | differential geometry | en |
| dc.subject.keyword | category theory | en |
| dc.subject.keyword | regularization | en |
| dc.title | Exploring the structure in deep networks: Group, manifold and category theory | en |
| dc.type | G2 Pro gradu, diplomityö | fi |
| dc.type.ontasot | Master's thesis | en |
| dc.type.ontasot | Diplomityö | fi |
| local.aalto.electroniconly | yes | |
| local.aalto.openaccess | yes |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- master_Zheng_Bo_2026.pdf
- Size:
- 2.06 MB
- Format:
- Adobe Portable Document Format