计算机科学
人口
合成数据
匹配(统计)
贝叶斯概率
维数(图论)
微模拟
数据科学
数据挖掘
人工智能
工程类
运输工程
数学
社会学
人口学
统计
纯数学
作者
Meng Zhou,Jason Li,Rounaq Basu,Joseph Ferreira
标识
DOI:10.1016/j.compenvurbsys.2021.101717
摘要
• We propose an integrated framework to create spatially detailed and heterogeneous synthetic populations. • Households and individuals are synthesized using multiple Bayesian Networks and Generalized Raking. • Spatial entities in the built environment are constructed via an ontology-based data-fusion approach. • Our framework outperforms traditional population synthesis methods in our Singapore case study. • Results show heterogeneous synthetic population with detailed and rich socio-spatial information. Agent-based models (ABMs) of urban systems have grown in popularity and complexity due to the widespread availability of high-performance computing resources and large data storage capabilities. Credible synthetic populations are crucial for the application of ABMs to understand urban phenomena. Although several (agent) population synthesis methods have been suggested over the years, the spatial dimension of synthetic populations has not received as much attention. This study addresses this myopic treatment of synthetic populations by creating two distinct components – agents and the built environment – that are integrated to form a ‘full’ spatially-detailed synthetic population. To generate agents, we used multiple Bayesian Networks (BN) to probabilistically draw pools from the microsample, followed by a Generalized Raking (GR) adjustment to match marginal controls. Using various measures, we demonstrate that our BN + GR framework outperforms more commonly used synthesis methods in both capturing the heterogeneity in the microsample and matching marginal controls. We also highlight the importance of accounting for heterogeneity by using separate type-specific models based on an explicitly defined household typology. For built environment synthesis, we generated various spatial entities such as buildings, housing units, establishments, and jobs at distinct spatial locations by fusing data from various spatial datasets. Their spatial distributions are found to effectively approximate the ‘real’ built environment in our study area. Our proposed framework can be used to generate a ‘full’ synthetic population for use in ABMs with more spatio-demographic heterogeneity than can otherwise be estimated using traditional methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI