CLC number: TP391.4
On-line Access: 2020-08-10
Received: 2019-06-05
Revision Accepted: 2020-02-06
Crosschecked: 2020-06-11
Cited: 0
Clicked: 4940
Citations: Bibtex RefMan EndNote GB/T7714
Jie-hao Huang, Xiao-guang Di, Jun-de Wu, Ai-yue Chen. A novel convolutional neural network method for crowd counting[J]. Frontiers of Information Technology & Electronic Engineering,in press.https://doi.org/10.1631/FITEE.1900282 @article{title="A novel convolutional neural network method for crowd counting", %0 Journal Article TY - JOUR
一种新的基于卷积神经网络的人群计数方法哈尔滨工业大学控制与仿真中心,中国哈尔滨市,150080 摘要:人群密度估计是一项具有挑战性的任务,因为人群中人头大小存在大范围变化。现有方法均采用多列式结构卷积神经网络去适应这种变化,但会导致密度图上不同密度区域产生平均效应并引入额外噪声。为解决该问题,提出一种新的基于分割先验图的神经网络方法,在分割图基础上生成一个高质量且没有噪声的密度图。该网络主要包括两个部分,即头部的人群前景分割神经网络和尾部的人群回归神经网络。在数据集只提供单点人头标记的情况下,采用均匀函数生成人群头部的掩膜真值图。基于该真值图,前景分割网络输出人群分割图,可有效减少密度图中无人区域噪声。将人群分割图输入人群回归网络,后者能生成高质量人群密度图并提供准确的人数估计。在4个公开数据集(即ShanghaiTech、UCF-CC-50、WorldExpo’10和UCSD)上验证了所提方法有效性;其中,在ShanghaiTech partB和UCF-CC-50两个数据集上该方法取得了当前最好结果。 关键词组: Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article
Reference[1]Canny J, 1986. A computational approach to edge detection. IEEE Trans Patt Anal Mach Intell, 8(6):679-698. [2]Chan AB, Vasconcelos N, 2009. Bayesian Poisson regression for crowd counting. Proc IEEE 12th Int Conf on Computer Vision, p.545-551. [3]Chan AB, Liang ZSJ, Vasconcelos N, 2008. Privacy preserving crowd monitoring: counting people without people models or tracking. Proc IEEE Conf on Computer Vision and Pattern Recognition, p.1-7. [4]Dai JF, Li Y, He KM, et al., 2016. R-FCN: object detection via region-based fully convolutional networks. Proc 30th Int Conf on Neural Information Processing Systems, p.379-387. [5]Dollar P, Wojek C, Schiele B, et al., 2012. Pedestrian detection: an evaluation of the state of the art. IEEE Trans Patt Anal Mach Intell, 34(4):743-761. [6]Idrees H, Saleemi I, Seibert C, et al., 2013. Multi-source multi-scale counting in extremely dense crowd images. Proc IEEE Conf on Computer Vision and Pattern Recognition, p.2547-2554. [7]Kang K, Wang XG, 2014. Fully convolutional neural networks for crowd segmentation. https://arxiv.org/abs/1411.4464 [8]Lempitsky V, Zisserman A, 2010. Learning to count objects in images. Proc 23rd Int Conf on Neural Information Processing Systems, p.1324-1332. [9]Li HH, He XJ, Wu HF, et al., 2018. Structured inhomogeneous density map learning for crowd counting. https://arxiv.org/abs/1801.06642 [10]Li JJ, Yang H, Wu S, 2016. Crowd semantic segmentation based on spatial-temporal dynamics. Proc 13th IEEE Int Conf on Advanced Video and Signal Based Surveillance, p.102-108. [11]Li T, Chang H, Wang M, et al., 2015. Crowded scene analysis: a survey. IEEE Trans Circ Syst Video Technol, 25(3):367-386. [12]Li YH, Zhang XF, Chen DM, 2018. CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. Proc IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.1091-1100. [13]Liu J, Gao CQ, Meng DY, et al., 2018. DecideNet: counting varying density crowds through attention guided detection and density estimation. Proc IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.5197-5206. [14]Long J, Shelhamer E, Darrell T, 2015. Fully convolutional networks for semantic segmentation. Proc IEEE Conf on Computer Vision and Pattern Recognition, p.3431-3440. [15]Sam DB, Surya S, Babu RV, 2017. Switching convolutional neural network for crowd counting. Proc IEEE Conf on Computer Vision and Pattern Recognition, p.4031-4039. [16]Sam DB, Sajjan NN, Babu RV, 2018. Divide and grow: capturing huge diversity in crowd images with incrementally growing CNN. Proc IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.3618-3626. [17]Shen Z, Xu Y, Ni B, et al., 2018. Crowd counting via adversarial cross-scale consistency pursuit. IEEE/CVF Conf on Computer Vision and Pattern Recognition, p.5245-5254. [18]Sindagi VA, Patel VM, 2017. Generating high-quality crowd density maps using contextual pyramid CNNs. Proc IEEE Int Conf on Computer Vision, p.1879-1888. [19]Sindagi VA, Patel VM, 2018. A survey of recent advances in CNN-based single image crowd counting and density estimation. Patt Recogn Lett, 107:3-16. [20]Zhan BB, Monekosso DN, Remagnino P, et al., 2008. Crowd analysis: a survey. Mach Vis Appl, 19(5-6):345-357. [21]Zhang C, Li HS, Wang XG, et al., 2015. Cross-scene crowd counting via deep convolutional neural networks. Proc IEEE Conf on Computer Vision and Pattern Recognition, p.833-841. [22]Zhang C, Zhang K, Li HS, et al., 2016. Data-driven crowd understanding: a baseline for a large-scale crowd dataset. IEEE Trans Multim, 18(6):1048-1061. [23]Zhang YY, Zhou DS, Chen SQ, et al., 2016. Single-image crowd counting via multi-column convolutional neural network. Proc IEEE Conf on Computer Vision and Pattern Recognition, p.589-597. Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou
310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn Copyright © 2000 - 2024 Journal of Zhejiang University-SCIENCE |
Open peer comments: Debate/Discuss/Question/Opinion
<1>