Exploring the structure in deep networks: Group, manifold and category theory

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorSchnoor, Ekkehard
dc.contributor.authorZheng, Bo
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorJung, Alex
dc.date.accessioned2026-01-22T18:04:41Z
dc.date.available2026-01-22T18:04:41Z
dc.date.issued2025-12-21
dc.description.abstractModern deep learning has achieved remarkable success in recent years, yet we lack a comprehensive understanding of why it performs well in some tasks while failing in others. This thesis develops a mathematical framework for understanding and designing neural networks through the lenses of group theory, differential geometry, and category theory. We begin by analyzing the symmetry structure of parameter spaces. For a traditional deep learning structure: linear layers + non-linear activation + regularization, we prove that the linear part possesses maximal $\mathrm{GL}_n(\mathbb{R})$ symmetry. Nonlinear activations break this symmetry to proper subgroups; we analyze ReLU and sigmoid, for example. Then we study how regularization with different norms affects symmetry, especially Schatten-$p$ norms and entry-wise $\ell_p$ norms. This work connects the choice of activation/regularization and the geometry of representations we want to learn. We then introduce Path Equivariant Networks (PENs), which generalize classical group equivariance from point-wise constraints $F(g \cdot x) = \rho(g) \cdot F(x)$ to path-wise constraints on manifolds. We prove that classical group equivariance arises as a special case under certain conditions. As an extension of this idea, we introduce content-pose decomposition, which factors the data manifolds into a symmetry-carrying pose (living in the group $G$) and a symmetry-free content (living in the quotient $U = X/G$). Finally, we provide a categorical formalization where equivariant maps are natural transformations between functors. The naturality condition captures the essence of symmetry-preserving computation. This work contributes to the theoretical foundation that the design of neural networks is fundamentally a choice of structures we want to retain in the data.en
dc.format.extent77
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/142482
dc.identifier.urnURN:NBN:fi:aalto-202601221854
dc.language.isoenen
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesen
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesfi
dc.programmeMaster's Programme in Computer, Communication and Information Sciencessv
dc.programme.majorMachine Learning, Data Science and Artificial Intelligenceen
dc.subject.keywordgeometric deep learningen
dc.subject.keywordgroup equivarianceen
dc.subject.keywordsymmetryen
dc.subject.keywordneural networksen
dc.subject.keyworddifferential geometryen
dc.subject.keywordcategory theoryen
dc.subject.keywordregularizationen
dc.titleExploring the structure in deep networks: Group, manifold and category theoryen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Zheng_Bo_2026.pdf
Size:
2.06 MB
Format:
Adobe Portable Document Format