nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2019, 03, v.36;No.129 28-36
基于深度学习的图像识别技术研究综述
基金项目(Foundation):
邮箱(Email):
DOI: 10.16191/j.cnki.hbkx.2019.03.004
摘要:

本文介绍了基于深度学习的图像识别算法,包括R-CNN、SPP-Net、Fast R-CNN、Faster R-CNN、YOLO以及SDD算法,讨论了深度学习在人脸识别、车牌识别和医学图像识别方面的应用,最后对深度学习图像识别技术的研究提出了问题与展望。

Abstract:

This paper introduces several deep learning network models which was commonly used in image processing,such as R-CNN,SPP-Net,Fast R-CNN,Faster R-CNN,YOLO and SDD.Then the article discussed the application of deep learning in face recognition,license plate recognition and medical image recognition.Finally,the paper puts forward some problems and prospects for the research of deep learning image recognition technology.

参考文献

[1] Krizhevsky A,Sutskever I,Hinton G.ImageNet Classification with Deep Convolutional Neural Networks[C]//NIPS.Curran Associates Inc.2012.

[2] Chris Harris,Mike Stephens,A Combined Corner and Edge Detector,4th Alvey Vision Conference,1988,pp147-151

[3] Ojala T,Pietikinen M,Harwood D.A comparative study of texture measures with classification based on featured distributions[J].Pattern Recognition:The Journal of the Pattern Recognition Society,1996,29(1):51-59.

[4] David G.Lowe Object Recognition from Local Scale-Invariant Features.1999

[5] David G.Lowe Distinctive Image Features from Scale-Invariant Keypoints.January 5,2004.

[6] Viola P,Jones M.Rapid Object Detection using a Boosted Cascade of Simple Features[C]//null.IEEE Computer Society,2001.

[7] Dalal and B.Triggs", Histograms of oriented gradients for human detection", 2005IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR'05),San Diego,CA,USA,2005,pp.886-893.

[8] Bay H,Ess A,Tuytelaars T,et al.Speeded-Up Robust Features(SURF)[J].Computer Vision and Image Understanding,2008,110(3):346-359.

[9] Osuna E.Training support vector machines:an application to face detection[J].Proc.IEEE Computer Society Conf.on Computer Vision and Pattern Recognition,1997,1997.

[10] Lecun Y,Bottou L,Bengio Y,et al.Gradient-based learning applied to document recognition[J].Proceedings of the IEEE,1998,86(11):2278-2324.

[11]董立岩,苑森淼,刘光远,贾书洪.基于贝叶斯分类器的图像分类[J].吉林大学学报(理学版),2007(02):249-253.

[12]余肖生,周宁,张芳芳.基于KNN的图像自动分类模型研究[J].中国图书馆学报,2007(01):74-76.

[13]潘琛,杜培军,张海荣.决策树分类法及其在遥感图像处理中的应用[J].测绘科学,2008(01):208-211+253.

[14] Szegedy C,Liu W,Jia Y,et al.Going Deeper with Convolutions[J].2014.

[15] Simonyan K,Zisserman A.Very Deep Convolutional Networks for Large-Scale Image Recognition[J].Computer Science,2014.

[16] He K,Zhang X,Ren S,et al.Deep Residual Learning for Image Recognition[J].2015.

[17] Girshick,Ross,Donahue,Jeff,Darrell,Trevor,et al.Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C].//2014IEEE Conference on Computer Vision and Pattern Recognition:2014 27th IEEE Conference on Computer Vision and Pattern Recognition(CVPR 2014),23-28June 2014,Columbus,Ohio.2014:580-587.

[18]封晶.基于区域卷积神经网络的车辆检测方法[J].科技广场,2017(03):10-14.

[19] He K,Zhang X,Ren S,et al.Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[J].IEEE Transactions on Pattern Analysis&Machine Intelligence,2014,37(9):1904-16.

[20]侯明伟.基于空间金字塔池化的卷积神经网络图像分类算法[D].武汉大学,2018.

[21] Girshick R.Fast R-CNN[J].Computer Science,2015.

[22]潘广贞,孙艳青,王凤.基于Fast RCNN模型的车辆阴影去除[J].计算机工程与设计,2018,39(03):819-823.

[23] Ren S,He K,Girshick R,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[J].2015.

[24]张汇,杜煜,宁淑荣,张永华,杨硕,杜晨.基于Faster RCNN的行人检测方法[J].传感器与微系统,2019,38(02):147-149+153.

[25] Redmon J,Divvala S,Girshick R,et al.You Only Look Once:Unified,Real-Time Object Detection[J].2015.

[26]于秀萍,吕淑平,陈志韬.基于YOLO算法的多类目标识别[J].实验室研究与探索,2019,38(03):34-36+76.

[27] Liu W,Anguelov D,Erhan D,et al.SSD:Single Shot MultiBox Detector[J].2015.

[28]姚红革,白小军,杨浩琪.基于SSD的多特征刑侦场景识别[J].西安邮电大学学报,2018,23(04):28-33.

[29] Taigman Y,Yang M,Ranzato M,et al.DeepFace:Closing the Gap to Human-Level Performance in Face Verification[C]//Conference on Computer Vision and Pattern Recognition(CVPR).IEEE Computer Society,2014.

[30] Zhou E,Cao Z,Yin Q.Naive-Deep Face Recognition:Touching the Limit of LFW Benchmark or Not?[J].Computer Science,2015.

[31] Sun Y,Liang D,Wang X,et al.DeepID3:Face Recognition with Very Deep Neural Networks[J].Computer Science,2015.

[32] Schroff F,Kalenichenko D,Philbin J.[IEEE 2015IEEE Conference on Computer Vision and Pattern Recognition(CVPR)-Boston,MA,USA(2015.6.7-2015.6.12)]2015IEEE Conference on Computer Vision and Pattern Recognition(CVPR)-FaceNet:A unified embedding for face recognition and clustering[J].2015:815-823.

[33] Liu J,Deng Y,Bai T,et al.Targeting Ultimate Accuracy:Face Recognition via Deep Embedding[J].2015.

[34]胡亚洲,周亚丽,张奇志.基于深度学习的人脸识别算法研究[J/OL].计算机应用研究:1-7[2019-05-14].陈利.基于深度学习的车牌识别系统设计[J].计算机技术与发展,2018,28(06):85-89.

[35]彭先霖,张海曦,胡琦瑶.基于多任务深度卷积神经网络的人脸/面瘫表情识别方法[J].西北大学学报(自然科学版),2019,49(02):187-192.

[36]陈利.基于深度学习的车牌识别系统设计[J].计算机技术与发展,2018,28(06):85-89.

[37]曹正凤,李少丁,王栋梁,徐建波,刘晗,卓晴.基于深度学习的端到端车牌检测识别系统[J].中国交通信息化,2018(09):88-91.

[38]李晓燃.基于深度学习的倾斜车牌矫正识别[J].软件,2018,39(10):215-219.

[39] Roth H.R.et al.(2014)A New 2.5DRepresentation for Lymph Node Detection Using Random Sets of Deep Convolutional Neural Network Observations.In:Golland P.,Hata N.,Barillot C.,Hornegger J.,Howe R.(eds)Medical Image Computing and Computer-Assisted Intervention-MICCAI 2014.MICCAI 2014.Lecture Notes in Computer Science,vol8673.Springer,Cham

[40] Gao Xinting,Lin Stephen,Wong Tien Yin.Automatic Feature Learning to Grade Nuclear Cataracts Based on Deep Learning[J].IEEE transactions on bio-medical engineering,2015,Vol.62(11),pp.2693-701

[41] Qi Dou,Hao Chen,et al.Automatic Detection of Cerebral Microbleeds From MR Images via 3DConvolutional Neural Networks[J].IEEE Trans Med Imaging.2016May;35(5):1182-1195.

[42] Esteva A,Kuprel B,Novoa R A,et al.Dermatologist-level classification of skin cancer with deep neural networks[J].Nature,2017,542(7639):115-118.

[43]张泽中,高敬阳,吕纲,赵地.基于深度学习的胃癌病理图像分类方法[J].计算机科学,2018,45(S2):263-268.

基本信息:

DOI:10.16191/j.cnki.hbkx.2019.03.004

中图分类号:TP391.41;TP18

引用信息:

[1]张琦,张荣梅,陈彬.基于深度学习的图像识别技术研究综述[J].河北省科学院学报,2019,36(03):28-36.DOI:10.16191/j.cnki.hbkx.2019.03.004.

检 索 高级检索

引用

GB/T 7714-2015 格式引文
MLA格式引文
APA格式引文