
The Effectiveness of a Simplified Model Structure for Crowd Counting

  • Xingen Gao
  • Lei Chen
  • Fei Chao
  • Xiang Chang
  • Xinghang Gao
  • Huali Jiang
  • Li Liu
  • Hongyi Zhang*
  • *Corresponding author for this work
  • Xiamen University
  • Xiamen University of Technology

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)
60 Downloads (Pure)

Abstract

Crowd counting, a method for measuring crowd sizes, has seen significant advancements with deep learning techniques, which have proven highly effective in accurate estimation. However, the improvement in these methods' accuracy is frequently achieved at the cost of more intricate model architectures. This article discusses how to construct high-performance crowd counting models using only simple structures. We propose the fuss-free structure, a simple and efficient architecture that consists of a backbone network and multiscale feature fusion. It exhibits notable adaptability: replacing its components does not lead to a substantial decline in performance. The multiscale feature fusion structure is an uncomplicated design that consists of three distinct pathways, each featuring only a focus transition module (FTM). It combines the features from these pathways by directly applying the concatenation operation. With appropriately selected components, the proposed structure has been trained and evaluated on four public datasets, demonstrating an accuracy that rivals that of existing complex models. Furthermore, a comprehensive evaluation is conducted by replacing the backbones of various models, such as CCTrans and the proposed structure, with different networks, including MobileNet-v3, ConvNeXt-Tiny, and Swin-Transformer-Small. The experimental results further indicate that excellent crowd counting performance can be achieved with the simple structure we propose. Code is available at https://github.com/erdongsanshi/Fuss-Free-structure.
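
The data flow described in the abstract (three pathways, one module per pathway, fusion by direct concatenation) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not specify the internals of the FTM, so `ftm_placeholder` below is a hypothetical stand-in (a random 1x1 channel projection followed by nearest-neighbour resizing), and the feature shapes, channel counts, and `density_head` are likewise assumptions chosen only to make the example runnable. The linked repository is the reference for the actual model.

```python
import numpy as np


def ftm_placeholder(feat, out_channels, out_size, rng):
    """Hypothetical stand-in for a focus transition module (FTM).

    The real FTM is not described in the abstract. Here we only project the
    channels with a random 1x1 mixing matrix and resize to a common spatial
    size with nearest-neighbour sampling, so the pathways can be concatenated.
    """
    c, h, w = feat.shape
    weight = rng.standard_normal((out_channels, c)) / np.sqrt(c)
    projected = np.einsum("oc,chw->ohw", weight, feat)
    rows = np.arange(out_size[0]) * h // out_size[0]
    cols = np.arange(out_size[1]) * w // out_size[1]
    return projected[:, rows][:, :, cols]


def fuse(features, out_channels=8, out_size=(16, 16), seed=0):
    """Run one placeholder module per pathway, then concatenate on channels."""
    rng = np.random.default_rng(seed)
    paths = [ftm_placeholder(f, out_channels, out_size, rng) for f in features]
    return np.concatenate(paths, axis=0)


def density_head(fused, seed=1):
    """Assumed regression head: mix channels into one non-negative density map."""
    rng = np.random.default_rng(seed)
    weight = rng.standard_normal(fused.shape[0]) / np.sqrt(fused.shape[0])
    density = np.einsum("c,chw->hw", weight, fused)
    return np.maximum(density, 0.0)  # ReLU keeps the density non-negative


# Three made-up backbone feature maps at different scales: (channels, H, W).
rng = np.random.default_rng(42)
features = [
    rng.standard_normal((32, 32, 32)),
    rng.standard_normal((64, 16, 16)),
    rng.standard_normal((128, 8, 8)),
]

fused = fuse(features)
density = density_head(fused)
estimated_count = float(density.sum())  # the count is the sum of the density map
print(fused.shape, density.shape)
```

With untrained random weights the resulting count is meaningless; the sketch only shows that swapping the backbone changes the input feature shapes while the fusion step itself stays a plain concatenation.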

Original language: English
Article number: 5023411
Number of pages: 11
Journal: IEEE Transactions on Instrumentation and Measurement
Volume: 74
Status: Published - 26 Mar 2025
