Abstract
The analysis of human crowds has widespread uses from law enforcement to urban engineering and traffic management. All of these require a crowd to first be detected, which is the problem addressed in this paper. Given an image, the algorithm we propose segments it into crowd and non-crowd regions. The main idea is to capture two key properties of crowds: (i) on a narrow scale, its basic element should look like a human (only weakly so, due to low resolution, occlusion, clothing variation etc.), while (ii) on a larger scale, a crowd inherently contains repetitive appearance elements. Our method exploits this by building a pyramid of sliding windows and quantifying how "crowd-like" each level of the pyramid is using an underlying statistical model based on quantized SIFT features. The two aforementioned crowd properties are captured by the resulting feature vector of window responses, describing the degree of crowd-like appearance around an image location as the surrounding spatial extent is increased.
Original language | English |
---|---|
Title of host publication | BMVC 2008 - Proceedings of the British Machine Vision Conference 2008 |
Publisher | British Machine Vision Association, BMVA |
DOIs | |
Publication status | Published - 2008 |
Event | 2008 19th British Machine Vision Conference, BMVC 2008 - Leeds, United Kingdom Duration: 1 Sept 2008 → 4 Sept 2008 |
Conference
Conference | 2008 19th British Machine Vision Conference, BMVC 2008 |
---|---|
Country/Territory | United Kingdom |
City | Leeds |
Period | 1/09/08 → 4/09/08 |