Abstract:
Objective Quantitative analysis of children’s behaviors has emerged as a new requirement in the research on urban public spaces. To this end, the primary focus of research lies in the precise spatiotemporal positioning of the crowd within the space. Traditional methods often involve long-term video recording and manual notation of crowd positions in every frame using the “observation method”. While effective, these methods are time-intensive. With the development of computer vision technology, it has become possible to automate the tracking of complex crowd behaviors in public spaces, thereby introducing novel methodologies for computational analysis in urban public space. However, the detailed identification and analysis of different crowd categories, such as children and parents, remain a significant challenge. This research aims to establish a computational analysis method for multi-subject behavior based on targeted computer vision tracking. This method reveals interaction patterns among children, parents and spatial morphology, thereby supporting spatial optimization designs for specific crowd behaviors.
Methods Taking multi-subject interaction behaviors in public spaces as the research object, this research adopts a three-stage research approach: technological investigation, methodology construction, and case study validation. Initially, the technical framework for targeted pedestrian tracking is established. Video data is recorded from selected angles based on spatial conditions, ensuring adequate representation of spatial-temporal dynamics. And a pre-trained deep learning model is adopted for precise localization and trajectory annotation of pedestrians. Subsequently, computational analysis and visualization methods for revealing the interaction behaviors of different groups are explored, which involves a pedestrian identification threshold model based on human proportion characteristics that enables targeted identification and differentiation of children from adults, and a dedicated analysis module designed to visualize behavioral patterns of each identified crowd and thereby provide visual patterns for the spatial-temporal distributions of different crowds. Finally, the effectiveness of the multi-subject behavior analysis framework is validated through a case study on children’s recreational public spaces. The research selects two typical children’s recreational public spaces located in commercial areas. It analyzes three key behavioral metrics: average spatial distance distribution, stay duration distribution, and passer-by count distribution. Correlation analyses and interpretations of these metrics reveal the interaction patterns between children and parents and their relationship with the spatial morphological layout.
Results The computational analysis method for multi-subject behavior enables long-term, large-scale behavioral data collection and analysis for different crowd categories. The case study on the children’s recreational public spaces reveals that, children’s activities in radially organized spatial layouts tend to be concentrated independently in central areas, while parents often move along the periphery for supervision. No significant overlap between the activity areas of children and parents is observed, suggesting minimal need for spatial overlap consideration. In such designs, the focus should be on the orientation of children’s activity spaces, as the layout of play facilities affects the observation points of supervising parents. In linear spatial layouts, parents and children closely accompany each other, primarily engaging in stationary supervision. These layouts require the consideration of spatial overlap between parents and children, as well as additional seating or rest facilities. The placement of play facilities in linear spaces significantly influences both children’s resting positions and parents’ supervision points. The empirical findings indicate that tracking technology based on human proportion features is effective for identifying target children and adults crowds at the scale of public space. The computational analysis method based on congestion degree, static usage rate, and dynamic usage rate systematically reveals adult − child interaction dynamics, and the cross-comparative analysis using visualized heatmaps uncovers the effects of spatial features on multi-subject interaction behaviors.
Conclusion The computational analysis method for multi-subject behavior supports spatial behavioral research involving interactions of various crowd categories and is applicable to post-occupancy evaluations and design optimization in complex public spaces. It facilitates targeted spatial renovations and facility placements based on the actual spatial usage and behavioral requirements of different crowds. The research further recognizes existing technological limitations and potential future developments. While the method effectively differentiates adults and children using body aspect ratios, it cannot yet distinguish other demographic groups and their detailed semantic behaviors. Therefore, future development using human pose tracking is essential for more refined analysis. Furthermore, this research primarily explores technical methodologies based on a case study on children’s recreational spaces in commercial areas, resulting in certain sample limitations. Future research should expand the case categories, propose comprehensive optimization principles, and validate outcomes through feedback from practical projects.