Bayesian optimization of multivariate multi-output biotechnological processes

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorOllila, Samuli
dc.contributor.authorTiainen, Vilma
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorLähdesmäki, Harri
dc.date.accessioned2026-01-22T18:11:22Z
dc.date.available2026-01-22T18:11:22Z
dc.date.issued2025-12-22
dc.description.abstractMany industrial biotechnology applications require the optimization of complex multi-output processes, often with limited data and costly, laborious experiments. Hence we have developed and evaluated a customizable Bayesian optimization (BO) framework, BAIO2, using AI surrogates for multivariate, multi-output biotechnological applications. We focus on BO frameworks and potential surrogate models for two application cases: media optimization and genetic engineering for production of desired molecules using micro-organisms. Predictive analyses and simulation experiments are used to assess surrogate model performance and guide experimental design. Performance of BAIO2 is experimentally evaluated for the media optimization case to reach a target lipid distribution. The predictive analyses and simulations show that surrogate model performance varies by data size and round: multi-layer perceptrons (MLP) and Gaussian processes (GP) were generally strong with good exploration capabilities, whereas XGBoost regression (XGBR) often excelled in prediction but struggled to explore the search space. Simulations provide further insights into optimal experimental design and model selection. We conclude that BAIO2 is effective for multi-output biotechnological optimization and adaptable to different experimental setups. Future work will expand on a wider variety of acquisition functions and improve uncertainty estimation.en
dc.format.extent62
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/142520
dc.identifier.urnURN:NBN:fi:aalto-202601221892
dc.language.isoenen
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesen
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesfi
dc.programmeMaster's Programme in Computer, Communication and Information Sciencessv
dc.programme.majorMachine Learning, Data Science and Artificial Intelligenceen
dc.subject.keywordBayesian optimizationen
dc.subject.keywordindustrial biotechnologyen
dc.subject.keywordartificial intelligenceen
dc.subject.keywordmulti-output optimizationen
dc.subject.keywordneural networksen
dc.subject.keywordmedia optimizationen
dc.titleBayesian optimization of multivariate multi-output biotechnological processesen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessno

Files