Full Text:   <6623>

CLC number: U121; TP391

On-line Access: 2024-08-27

Received: 2023-10-17

Revision Accepted: 2024-05-08

Crosschecked: 2012-08-20

Cited: 22

Clicked: 10658

Citations:  Bibtex RefMan EndNote GB/T7714

-   Go to

Article info.
Open peer comments

Journal of Zhejiang University SCIENCE C 2012 Vol.13 No.10 P.750-760

http://doi.org/10.1631/jzus.C12a0049


Transit smart card data mining for passenger origin information extraction


Author(s):  Xiao-lei Ma, Yin-hai Wang, Feng Chen, Jian-feng Liu

Affiliation(s):  Department of Civil and Environmental Engineering, University of Washington, Seattle, WA 98195-2700, USA; more

Corresponding email(s):   xiaolm@uw.edu, yinhai@uw.edu

Key Words:  Transit smart card, Automated fare collection (AFC), Bayesian decision tree, Markov chain, Origin inference



Abstract: 
The automated fare collection (AFC) system, also known as the transit smart card (SC) system, has gained more and more popularity among transit agencies worldwide. Compared with the conventional manual fare collection system, an AFC system has its inherent advantages in low labor cost and high efficiency for fare collection and transaction data archival. Although it is possible to collect highly valuable data from transit SC transactions, substantial efforts and methodologies are needed for extracting such data because most AFC systems are not initially designed for data collection. This is true especially for the Beijing AFC system, where a passenger’s boarding stop (origin) on a flat-rate bus is not recorded on the check-in scan. To extract passengers’ origin data from recorded SC transaction information, a markov chain based bayesian decision tree algorithm is developed in this study. Using the time invariance property of the markov chain, the algorithm is further optimized and simplified to have a linear computational complexity. This algorithm is verified with transit vehicles equipped with global positioning system (GPS) data loggers. Our verification results demonstrated that the proposed algorithm is effective in extracting transit passengers’ origin information from SC transactions with a relatively high accuracy. Such transit origin data are highly valuable for transit system planning and route optimization.

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2025 Journal of Zhejiang University-SCIENCE