CuLao - Constructing Utilities of Large Language Models in Resource-Constrained Environments
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.author | Truong, Hong-Linh | |
| dc.contributor.author | Nhu Trang, Nguyen Ngoc | |
| dc.contributor.department | Department of Computer Science | en |
| dc.contributor.groupauthor | Computer Science Professors | en |
| dc.contributor.groupauthor | Computer Science - Computing Systems (ComputingSystems) - Research area | en |
| dc.contributor.organization | Daienso Lab | |
| dc.date.accessioned | 2025-01-15T06:32:42Z | |
| dc.date.available | 2025-01-15T06:32:42Z | |
| dc.date.issued | 2024-09-04 | |
| dc.description.abstract | The increasing development and utilization of Large Language Model (LLM) services have demonstrated many benefits in different contexts. However, LLM services are mainly available in the public cloud and require huge computing resources to operate, and are thus not accessible to many companies, organizations or communities with constrained resources. While research efforts have concentrated on LLM quantization for resource-constrained computing environments like edge devices, democratizing the availability of LLM services as utilities for such communities requires much more than the optimization of the models themselves. In this paper, we introduce CuLao, a framework for constructing utilities from LLMs in resource-constrained environments. Our framework addresses key requirements of resource-constrained companies, organizations and communities by enabling the provisioning and coordination of LLMs as utilities, based on the availability of open-source LLMs. CuLao provides techniques and tools for abstracting LLMs as services with suitable APIs and coordinating them as utility ensembles in edge infrastructures. | en |
| dc.description.version | Peer reviewed | en |
| dc.format.extent | 6 | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Truong, H-L & Nhu Trang, N N 2024, CuLao - Constructing Utilities of Large Language Models in Resource-Constrained Environments. in GoodIT '24: Proceedings of the 2024 International Conference on Information Technology for Social Good. ACM, pp. 100-104, International Conference on Information Technology for Social Good, Bremen, Germany, 04/09/2024. https://doi.org/10.1145/3677525.3678648 | en |
| dc.identifier.doi | 10.1145/3677525.3678648 | |
| dc.identifier.isbn | 979-8-4007-1094-0 | |
| dc.identifier.other | PURE UUID: 934d4926-ade4-478a-b027-7deb66d70213 | |
| dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/934d4926-ade4-478a-b027-7deb66d70213 | |
| dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/170404413/SCI_Truong_etal_GoodIT_2024.pdf | |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/132934 | |
| dc.identifier.urn | URN:NBN:fi:aalto-202501151227 | |
| dc.language.iso | en | en |
| dc.relation.ispartof | International Conference on Information Technology for Social Good | en |
| dc.relation.ispartofseries | GoodIT '24: Proceedings of the 2024 International Conference on Information Technology for Social Good | en |
| dc.relation.ispartofseries | pp. 100-104 | en |
| dc.rights | openAccess | en |
| dc.title | CuLao - Constructing Utilities of Large Language Models in Resource-Constrained Environments | en |
| dc.type | A4 Article in conference proceedings | en |
| dc.type.version | acceptedVersion | |
Files
Original bundle
- Name: SCI_Truong_etal_GoodIT_2024.pdf
- Size: 446.54 KB
- Format: Adobe Portable Document Format