Estimating origin-destination matrices in Helsinki's public transport through multi-source data fusion

Loading...
Thumbnail Image

URL

Journal Title

Journal ISSN

Volume Title

School of Science | Master's thesis

Department

Mcode

Language

en

Pages

72

Series

Abstract

Accurate Origin-Destination (OD) matrices are essential for public transport planning, yet challenging to obtain in Proof-of-Payment systems without entry-exit records. This thesis presents a multi-source data fusion framework to estimate network-level OD matrix for Helsinki's bus network by integrating trajectories from HSL's mobile application (TravelSense) with Automated Passenger Count (APC) stop totals. Using Iterative Proportional Fitting (IPF), we balance sparse TravelSense seed matrices with APC marginals at route level and aggregate to a network-level OD matrix. A streaming pipeline enables the processing and fusion of the large-scale source datasets, and an interactive web visualization supports exploration of edge loads, stop net flows, and OD arcs. In a weekday-morning case study, route-level IPF achieves good marginal fit, showing logical OD patterns, while network aggregation recovers high trip coverage on major corridors. External validation against Telia CrowdInsights shows limited cell-wise agreement at fine granularity due to scope and spatial aggregation effects, but metropolitan scale patterns are consistent and expected to agree more under coarser aggregation and transit-likelihood filtering. Overall, the approach yields balanced route-level matrices that align with observed APC marginals and a network-wide OD that covers the bulk of scheduled service, with visible logical patterns. Results demonstrate that fusing emerging trajectory data with traditional counts yields operationally useful OD products in Proof-of-Payment (POP) systems and provides a practical foundation for routine OD estimation.

Description

Supervisor

Saramäki, Jari

Thesis advisor

Huang, Zhiren

Other note

Citation