CN 11-5366/S     ISSN 1673-1530
“风景园林,不只是一本期刊。”

基于YOLO目标检测改进模型的城市公园人群动态感知与活力测度——西安幸福林带实证研究

Dynamic Perception and Vitality Assessment of Crowds in Urban Parks Based on an Enhanced YOLO Object Detection Model: A Case Study of Xi’an Xingfu Forest Belt

  • 摘要:
    目的 本研究旨在构建高精度人群动态监测方法,精准测度城市公园活力的时空分异格局,为公园服务效能评估与空间治理优化提供支持。
    方法 以西安幸福林带为实证案例,基于无人机航拍时序影像数据,利用改进的YOLO(you only look once)模型YOLO11m-CBAM实现行人目标的自动识别与时空定位;结合核密度估计与空间基尼系数,定量解析公园活力的时空分异特征及聚集程度。
    结果 1)公园活力呈双峰型时间分布模式,高峰出现在休息日10:00、16:00及工作日10:00、18:00,低谷均集中于12:00;2)休息日人流量显著高于工作日,活力分布更集中(休息日空间基尼系数为0.40,工作日为0.32);3)儿童游乐区为持续性高活力区;4)时序空间基尼系数揭示了活力分布在休息日下午集中而工作日较为分散的规律。
    结论 本研究所提出的“YOLO-航拍时序影像”方法实现了城市公园人群动态的高精度感知,不仅为城市公园活力评估、效能优化及空间治理提供了可迁移的技术范式,还为城市景观研究中人工智能与空间分析的融合方法提供启示。

     

    Abstract:
    Objective Urban parks play a vital role in enhancing residents’ physical and mental well-being and offering leisure opportunities. Their vitality has become a crucial indicator of urban spatial quality and public welfare. Rapid urbanization has further intensified the imbalance in the allocation of public service resources. Existing research, which primarily relies on heat maps, mobile signaling data, or ground-based camera monitoring, can reveal macroscopic trends but fail to capture the dynamic spatiotemporal characteristics of crowd distribution at the micro scale. Meanwhile, aerial photography obtained through unmanned aerial vehicle (UAV) offers high spatial resolution and flexible data acquisition capabilities, while the advancement of object detection algorithms based on deep learning presents new technological opportunities for crowd recognition in complex urban environments. This research aims to develop and validate a micro-scale vitality measurement method for urban parks based on aerial time-series imagery and an improved object detection model. The method seeks to reveal the spatiotemporal patterns of crowd distribution, identify high-frequency vitality nodes and their driving mechanisms, and provide data support and strategic insights for optimizing the spatial layout, facility allocation, and refined management of parks. Taking Xi’an Xingfu Linear Park as an example, the research focuses on analyzing vitality intensity, fluctuation, and spatial balance at a fine spatiotemporal scale.
    Methods Between March 27 and 30, 2025, continuous UAV-based aerial photography was conducted at a fixed altitude of 75 m during six standard time periods (08:00, 10:00, 12:00, 14:00, 16:00, 18:00), yielding over 2,300 high-resolution images. A manually annotated dataset of 2,000 sub-images with 12,340 pedestrian instances is constructed for model training. To address challenges of small-scale targets and complex occlusions in aerial imagery, an enhanced YOLO11m-CBAM model is developed by embedding a convolutional block attention module (CBAM) into YOLO11m. The improved model achieves notable performance gains: mAP50 increases from 77.1% to 81.3%, mAP50–95 from 45.6% to 51.7%, with precision and recall reaching 86.4% and 72.0% respectively, demonstrating enhanced robustness under medium and low occlusion conditions. Detection outputs are orthorectified to geographic coordinates to construct a structured spatiotemporal dataset. Spatial analysis employs kernel density estimation, coefficient of variation (CV), spatial Gini coefficient, and the “latitude-population” curve to characterize multidimensional vitality patterns.
    Results The temporal analysis results indicate that the overall utilization of Xingfu Forest Belt exhibits a distinct “dual-peak” pattern. On rest days, the number of visitors reaches 2,112 at 10:00 and 3,641 at 16:00, reflecting typical peaks of family and leisure activities. The daily coefficient of variation (CV = 38.67%) is relatively low, suggesting stable visiting patterns with activity concentrated in leisure hours. In contrast, on working days, vitality peaks occur at 10:00 and 18:00, corresponding to post-commuting and after-work relaxation periods, respectively. The higher daily visiting (CV = 55.34%) indicates a more uneven temporal distribution of activities. Notably, 12:00 represents the lowest point of visiting (the minimum number of visitors is only 595, and the average number is 883), implying underutilization of space during midday and suggesting potential opportunities for future facility optimization or time-specific programming. The spatial equilibrium analysis further reveals that during peak hours (14:00 and 16:00), the spatial Gini coefficient reaches 0.44 – 0.48, indicating a strong concentration of vitality in specific functional zones and a pronounced spatial polarization effect. In contrast, the Gini coefficient drops to 0.24 during off-peak periods (08:00 and 12:00), reflecting a more dispersed and evenly distributed use of space. At 18:00, the Gini coefficient remains between 0.38 and 0.41, suggesting a moderate level of aggregation in the evening. Overall, the vitality of Xingfu Forest Belt demonstrates a dynamic pattern of “daytime polarization with evening recovery”. In terms of spatial distribution, vitality hotspots are primarily concentrated along the central and northern segments of the belt, forming localized peaks. The emergence of these core areas is driven by two main factors: 1) the attraction of fixed functional facilities such as children’s play areas, fitness zones, and square-dancing spaces; and 2) the temporal aggregation generated by periodic activities, including weekend family events and morning exercise. At the macro scale, the concentration of residential and educational land uses, high accessibility to bus stops, and the scarcity of comparable recreational facilities jointly reinforce the sustained vitality of the central children’s play area. Maintaining consistently high footfall and strong spatial spillover effects across multiple time periods, this area serves as a key vitality hub within the overall spatial structure of Xingfu Forest Belt.
    Conclusion The research demonstrates that the proposed UAV-based and YOLO-based vitality measurement framework provides high spatiotemporal resolution at the micro-park scale, enabling accurate identification of vitality hotspots, temporal fluctuations, and spatial imbalances. This approach offers an operational, quantitative basis for optimizing facility layouts, designing flexible spaces, and implementing differentiated management strategies. Methodological limitations are also discussed: The approach performs reliably in spring, autumn, and winter with low to moderate vegetation coverage, but may encounter partial omissions under dense canopy or multi-layer pergola structures in summer. To enhance applicability, future improvements include multi-drone and multi-view data acquisition, infrared thermal imaging to mitigate occlusion, air-ground data fusion, inter-frame trajectory matching to distinguish stay/pass behaviors, and fine-grained activity recognition. Overall, the proposed method provides a replicable technical pathway and empirical reference for refined park governance and smart park development. The findings contribute to advancing quantitative urban vitality assessment and provide methodological insights for integrating AI and spatial analysis in urban landscape research.

     

/

返回文章
返回