Exploring phage-host interactions in the human microbiome: Development of a dataset for benchmarking predictive models

School of Science | Master's thesis

Language

en

Pages

74

Abstract

Predicting bacteriophage hosts and phage-host dynamics in the human microbiome is critical for understanding the role of phages in modulating human health and for developing potential therapeutic treatments. While several bioinformatic tools are available for predicting phage-host interactions, their evaluation is complicated by the lack of high-quality, representative datasets. In this study, a comprehensive benchmark dataset was developed to assess the performance of phage-host prediction tools. The initial metagenomic assembly, obtained from the Pasolli et al. (2019) study, comprised 154,723 microbial genomes and was preprocessed before analysis. Viral sequences were identified in the metagenomic dataset using VirSorter2, and the quality and completeness of the resulting viral dataset were assessed using CheckV. The study produced a dataset for evaluating phage-host prediction tools, together with a metadata table and a visual analysis of both the initial metagenomic assembly and the viral dataset. Recommendations for tool optimization are also provided. Future work can use this dataset to benchmark widely used phage-host prediction tools.

Supervisor

Lähdesmäki, Harri

Thesis advisor

Vatanen, Tommi
