Perspective-assisted prototype-based learning for semi-supervised crowd counting

Yifei Qian, Liangfei Zhang, Zhongliang Guo, Xiaopeng Hong*, Ognjen Arandjelović, Carl R. Donovan

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

To alleviate the burden of labeling data to train crowd counting models, we propose a prototype-based learning approach for semi-supervised crowd counting with an embedded understanding of perspective. Our key idea is that image patches with the same density of people are likely to exhibit coherent appearance changes under similar perspective distortion, but differ significantly under varying distortions. Motivated by this observation, we construct multiple prototypes for each density level to capture variations in perspective. For labeled data, the prototype-based learning assists the regression task by regularizing the feature space and modeling the relationships within and across different density levels. For unlabeled data, the learnt perspective-embedded prototypes enhance differentiation between samples of the same density level, allowing for a more nuanced assessment of the predictions. By incorporating regression results, we categorize unlabeled samples as reliable or unreliable and apply tailored consistency learning strategies to each group to enhance model accuracy and generalization. Since perspective information is often unavailable, we propose a novel pseudo-label assigner based on perspective self-organization. It requires no additional annotations and assigns image regions to distinct spatial density groups, which mainly reflect differences in average density among regions. Extensive experiments on four crowd counting benchmarks demonstrate the effectiveness of our approach.
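The idea of keeping several prototypes per density level and using them to judge unlabeled predictions can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the Euclidean distance, the fixed count thresholds in `bin_edges`, the toy two-dimensional features, and the agreement rule in `is_reliable`, which treats a sample as reliable when the level of its nearest prototype matches the level implied by the regressed count.

```python
import math

def nearest_prototype(feature, prototypes):
    """Return (level, index, distance) of the closest prototype.

    `prototypes` maps a density level to a list of prototype vectors,
    one per assumed perspective group (hypothetical layout).
    """
    best = None
    for level, protos in prototypes.items():
        for idx, proto in enumerate(protos):
            dist = math.dist(feature, proto)  # Euclidean distance (assumed metric)
            if best is None or dist < best[2]:
                best = (level, idx, dist)
    return best

def density_level(count, bin_edges):
    """Map a regressed patch count to a density level.

    The level is the number of thresholds in `bin_edges` that the
    count reaches; the thresholds themselves are illustrative.
    """
    return sum(1 for edge in bin_edges if count >= edge)

def is_reliable(feature, regressed_count, prototypes, bin_edges):
    """Hypothetical rule: reliable when prototype and regression levels agree."""
    proto_level, _, _ = nearest_prototype(feature, prototypes)
    return proto_level == density_level(regressed_count, bin_edges)

# Toy setup: two density levels, two prototypes (perspective groups) each.
prototypes = {
    0: [[0.0, 0.0], [0.0, 1.0]],
    1: [[5.0, 5.0], [5.0, 6.0]],
}
bin_edges = [10.0]  # counts below 10 are level 0, otherwise level 1

patch_feature = [4.8, 5.9]
agrees = is_reliable(patch_feature, 14.0, prototypes, bin_edges)
disagrees = is_reliable(patch_feature, 3.0, prototypes, bin_edges)
```

In this toy setup the same patch feature is judged reliable when the regressed count falls in the level of its nearest prototype and unreliable otherwise. The two groups could then be routed to different consistency losses, which the sketch does not attempt to model.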
Original language: English
Article number: 111073
Number of pages: 10
Journal: Pattern Recognition
Volume: 158
Early online date: 21 Oct 2024
Publication status: E-pub ahead of print - 21 Oct 2024

Keywords

  • Perspective analysis
  • Representation learning
  • Task analysis
  • Consistency regularization
