CuLao - Constructing Utilities of Large Language Models in Resource-Constrained Environments

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorTruong, Hong-Linh
dc.contributor.authorNhu Trang, Nguyen Ngoc
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorComputer Science Professorsen
dc.contributor.groupauthorComputer Science - Computing Systems (ComputingSystems) - Research areaen
dc.contributor.organizationDaienso Lab
dc.date.accessioned2025-01-15T06:32:42Z
dc.date.available2025-01-15T06:32:42Z
dc.date.issued2024-09-04
dc.description.abstractThe increasing development and utilization of Large Language Model (LLM) services have demonstrated many benefits in different contexts. However, LLM services are mainly available in the public cloud and require huge computing resources to operate, thus not accessible to many companies, organizations or communities with constrained resources. While research efforts have concentrated on LLMs quantization for resource-constrained computing environments like edge devices, to democratize the availability of LLM services as utilities for such communities requires much more than the optimization of LLM models. In this paper, we introduce CuLao - a framework for constructing utilities from LLMs in resource-constrained environments. Our framework focuses on key requirements of resource-constrained companies, organizations and communities by enabling the provisioning and coordination of LLMs as utilities, based on the availability of open-source LLMs. CuLao provides techniques and tools for abstracting LLMs as services with suitable APIs and coordinating them as utility ensembles in edge infrastructures.en
dc.description.versionPeer revieweden
dc.format.extent6
dc.format.mimetypeapplication/pdf
dc.identifier.citationTruong, H-L & Nhu Trang, N N 2024, CuLao - Constructing Utilities of Large Language Models in Resource-Constrained Environments. in GoodIT '24: Proceedings of the 2024 International Conference on Information Technology for Social Good. ACM, pp. 100-104, International Conference on Information Technology for Social Good, Bremen, Germany, 04/09/2024. https://doi.org/10.1145/3677525.3678648en
dc.identifier.doi10.1145/3677525.3678648
dc.identifier.isbn979-8-4007-1094-0
dc.identifier.otherPURE UUID: 934d4926-ade4-478a-b027-7deb66d70213
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/934d4926-ade4-478a-b027-7deb66d70213
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/170404413/SCI_Truong_etal_GoodIT_2024.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/132934
dc.identifier.urnURN:NBN:fi:aalto-202501151227
dc.language.isoenen
dc.relation.ispartofInternational Conference on Information Technology for Social Gooden
dc.relation.ispartofseriesGoodIT '24: Proceedings of the 2024 International Conference on Information Technology for Social Gooden
dc.relation.ispartofseriespp. 100-104en
dc.rightsopenAccessen
dc.titleCuLao - Constructing Utilities of Large Language Models in Resource-Constrained Environmentsen
dc.typeA4 Artikkeli konferenssijulkaisussafi
dc.type.versionacceptedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SCI_Truong_etal_GoodIT_2024.pdf
Size:
446.54 KB
Format:
Adobe Portable Document Format