Human Mobility Modeling Based on Heterogeneous Urban Sensing Systems
Friday, March 27, 2020, 10:00am - 12:00pm
Speaker: Zhihan Fang
Location : Remote - Webex
Desheng Zhang (Rutgers CS, Committee Chair), Jie Gao (Rutgers CS), Yongfeng Zhang (Rutgers CS), Ruilin Liu (Facebook, external member)
Event Type: PhD Defense
Abstract: Recently, with an increasing number of people living in cities, it introduces new challengesin human mobility such as traffic congestion and energy consumption, which arecaused by a dense human population distribution, unbalanced infrastructure deployment,or insufficient understanding of travel demand. Thus, it is essential to improvethe mobility of urban residents on a daily basis, which can be achieved by accuratelymodeling human mobility with ubiquitous urban sensing data from heterogeneous urbansensing systems, e.g., on-board GPS systems including taxis, buses, personal vehicles,and portable device systems such as cellphones. Existing studies modeling human mobilityare mostly built upon single systems. However, people in cities take multipletransportation modalities on daily basis, where a single sensing system limits a comprehensiveunderstanding and modeling of human mobility.In the dissertation, we aim to model human mobility at metropolitan scale, by utilizingspatio-temporal data of heterogeneous sensing systems already collected for billingor management purposes. We design, implement and evaluate a urban sensing systemnamed urbanSense with three modules for human mobility modeling (e.g., travel distance,travel time, travel speed): (i) a sensing module to collect and preprocess human mobility sensing data from 8 urban sensing systems crossing 3 domains (i.e., transportation, communication, and payment); (ii) a measurement module where we present a measurement work named SysRep to measure the data bias of urban sensing systemsfor human mobility modeling. In SysRep, we quantify the data bias of urban sensingsystems as the representativeness of sensing systems. We analyze potential reasonsfor representativeness and found representativeness is highly correlated with contextual factors such as population, mobility, and Point of Interests. We further design a correctionmodel to improve representativeness of sensing systems. The evaluation results show the proposed correction model can improve the representativeness of singe systems by 45%. (iii) a prediction module to model human mobility from heterogeneous urban sensing systems. In particular, we present two works: a work named MultiCell for realtime population modeling and the other work named MAC for travel time prediction.In MultiCell, we design two techniques to model real-time population from multiple cellular networks: a spatial alignment technique to align different spatial partitions into a uniform spatial partition; a co-training technique to learn the relation betweenactive cellphone users of different networks and population distribution simultaneously.MultiCell is implemented with Call Detail Records (CDR) of three major networks in China in the same city covering 100% cellphone users. The evaluation results prove the effectiveness of MultiCell by reducing the modeling error by 27% compared with the start-of-the-art models. In MAC, we decompose travel time of multiple transportationsystems (i.e., subway, taxi, bus, and personal vehicle) into fine-grained travel time basedon different travel stages (e.g., walking, riding, waiting time). Moreover, we design a time-series model based on Long Short-Term Memory (LSTM) architect to predict the travel delay under the impact of different anomalies. We implement and evaluate MAC with data collected from 37 thousand vehicles and 5 million smart cards. The resultsshow MAC reduces the prediction error by 31% compared with the state-of-the-art methods. Finally, we discuss some lessons learned and potential applications of our framework.