Journal of Guangxi Normal University(Natural Science Edition) ›› 2021, Vol. 39 ›› Issue (2): 32-40.doi: 10.16088/j.issn.1001-6600.2020090704

Previous Articles     Next Articles

A Dimensionality-reduction Method Based on Attention Mechanismon Image Classification

DENG Wenxuan, YANG Hang, JIN Ting*   

  1. School of Computer Science and Cyberspace Security, Hainan University, Haikou Hainan 570228, China
  • Received:2020-09-07 Revised:2020-09-30 Online:2021-03-25 Published:2021-04-15

Abstract: The convolution operators are the core building blocks of convolutional neural network, which enable the network to fuse the information of various layers of space and channels according to a certain perception field of view, and extract the characteristics of the information. However, adjacent pixels often have similar values in an image, which results in a large amount of redundant information in the output of the convolutional layer. In order to reduce redundant information and speed up model inference, many pooling layers are added to the convolutional neural network for reducing information dimensionality. Pooling has better dimensionality reduction effect on image features with the invariance of translation and rotation. And end-to-end model can be maintained compared with traditional dimensionality reduction methods. Therefore, a dimensionality reduction method is proposed based on the attention mechanism by using the pooling layer. In the process of feature extraction, the dimensionality reduction information from each layer’s are reused nonlinearly, so that the potential connections of information in different layers after dimensionality reduction can be learned. In order to obtain the characteristics of the input information, the proposed method focuses on the main texture of the target in the image, and then the low texture and background information of the target are combined. Based on the DLA-34 (deep layer aggregation) neural network, the dimensionality reduction method proposed in this paper and the others dimensionality reduction methods based on the maximum value and the average value are compared to deal with multiple sets on the CIFAR10 and CIFAR100 datasets, which proves the effectiveness of the new method.

Key words: deep learning, image classification, convolutional neural network, residual network, attentionmechanism

CLC Number: 

  • TP391.4
[1] RUSSAKOVSKY O,DENG J,SU H,et al.ImageNet large scale visual recognition challenge[J].International Journal of Computer Vision,2015,115(3):211-252.DOI: 10.1007/s11263-015-0816-y.
[2] 崔雍浩,商聪,陈锶奇,等.人工智能综述:AI的发展[J].无线电通信技术,2019,45(3):225-231.DOI: 10.3969/j.issn.1003-3114.2019.03.01.
[3] 郭丽丽,丁世飞.深度学习研究进展[J].计算机科学,2015,42(5):28-33.DOI: 10.11896/j.issn.1002-137X.2015.05.006.
[4] 毛其超,贾瑞生,左羚群,等.基于深度学习的交通监控视频车辆检测算法[J].计算机应用与软件,2020,37(9):111-117,164.DOI: 10.3969/j.issn.1000-386x.2020.09.019.
[5] 周玲,胡月,刘红,等.循证医学和深度学习在社区新型冠状病毒肺炎疫情防控管理中的应用[J].护理研究,2020,34(6):947-949.DOI: 10.12102/j.issn.1009-6493.2020.06.035.
[6] 陈德鑫,占袁圆,杨兵.深度学习技术在教育大数据挖掘领域的应用分析[J].电化教育研究,2019,40(2):68-76.DOI: 10.13811/j.cnki.eer.2019.02.009.
[7] 张晓海,操新文.基于深度学习的军事辅助决策研究[J].火力与指挥控制,2020,45(3):1-6.DOI: 10.3969/j.issn.1002-0640.2020.03.001.
[8] 王青松,赵西安,马超.特征提取和特征匹配改进方法的研究[J].测绘科学技术学报,2014,31(4):377-382.DOI: 10.3969/j.issn.1673-6338.2014.04.011.
[9] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet classification with deep convolutional neural networks[C]//Advances in neural information processing systems 25:26th Annual Conference on Neural Information Processing Systems 2012.La Jolla,CA:Neural Information Processing Systems,2012:1097-1105.
[10] 王伟男,杨朝红.基于图像处理技术的目标识别方法综述[J].电脑与信息技术,2019,27(6):9-15.DOI: 10.19414/j.cnki.1005-1228.2019.06.003.
[11] SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[EB/OL].(2014-09-04)[2020-02-07].https://arxiv.org/pdf/1409.1556.pdf.
[12] SZEGEDY C,LIU W,JIA Y Q,et al.Going deeper with convolutions[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2015:1-9.DOI: 10.1109/CVPR.2015.7298594.
[13] SZEGEDY C,VANHOUCKE V,IOFFE S,et al.Rethinking the inception architecture for computer vision[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2016:2818-2826.DOI: 10.1109/cvpr.2016.308.
[14] HE K M,ZHANG X Y,REN S Q,et al.Deep residual learning for image recognition[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2016:770-778.DOI: 10.1109/CVPR.2016.90.
[15] XIE S N,GIRSHICK R,DOLLÁR P,et al.Aggregated residual transformations for deep neural networks[C]//Proceedings of the 2017 IEEE Conference on Computer Vision And Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2017:5987-5995.DOI: 10.1109/CVPR.2017.634.
[16] HUANG G,LIU Z,Van Der MAATEN L,et al.Densely connected convolutional networks[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2017:2261-2269.DOI: 10.1109/CVPR.2017.243.
[17] ELFIKY N M,KHAN F S,van de WEIJER J,et al.Discriminative compact pyramids for object and scene recognition[J].Pattern Recognition,2012,45(4):1627-1636.DOI: 10.1016/j.patcog.2011.09.020.
[18] KE Y,SUKTHANKAR R.PCA-SIFT:a more distinctive representation for local image descriptors[C]//Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition:Volume II.Los Alamitos,CA:IEEE Computer Society,2004:506-513.DOI: 10.1109/CVPR.2004.1315206.
[19] HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2018:7132-7141.DOI: 10.1109/cvpr.2018.00745.
[20] YU F,WANG D Q,SHELHAMER E,et al.Deep layer aggregation[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2018:2403-2412.DOI: 10.1109/cvpr.2018.00255.
[21] RONNEBERGER O,FISCHER P,BROX T.U-net:Convolutional networks for biomedical image segmentation[C]//Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015.Berlin:Springer,2015:234-241.DOI: 10.1007/978-3-319-24574-4_28.
[22] LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2017:936-944.DOI: 10.1109/CVPR.2017.106.
[23] HOWARD A G,ZHU M L,CHEN B,et al.MobileNets:efficient convolutional neural networks for mobile vision applications[EB/OL].(2017-04-17)[2020-02-07].https://arxiv.org/pdf/1704.04861.pdf.
[24] HE T,ZHANG Z,ZHANG H,et al.Bag of tricks for image classification with convolutional neural networks[C]//Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2019:558-567.DOI: 10.1109/cvpr.2019.00065.
[25] ZHANG X Y,ZHOU X Y,LIN M X,et al.ShuffleNet:an extremely efficient convolutional neural network for mobile devices[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2018:6848-6856.DOI: 10.1109/cvpr.2018.00716.
[1] YANG Zhou, FAN Yixing, ZHU Xiaofei, GUO Jiafeng, WANG Yue. Survey on Modeling Factors of Neural Information Retrieval Model [J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(2): 1-12.
[2] XUE Tao, QIU Senhui, LU Hao, QIN Xingsheng. Exchange Rate Prediction Based on Empirical Mode Decomposition and Multi-branch LSTM Network [J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(2): 41-50.
[3] TANG Rongchai, WU Xiru. Real-time Detection of Passion Fruit Based on Improved YOLO-V3 Network [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(6): 32-39.
[4] ZHANG Mingyu,ZHAO Meng,CAI Fuhong,LIANG Yu,WANG Xinhong. Wave Power Prediction Based on Deep Learning [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(3): 25-32.
[5] LI Weiyong, LIU Bin, ZHANG Wei, CHEN Yunfang. An Automatic Summarization Model Based on Deep Learning for Chinese [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(2): 51-63.
[6] YAN Hao, XU Hongbo, SHEN Yinghan, CHENG Xueqi. Research on Open Chinese Event Detection [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(2): 64-71.
[7] LIU Yingxuan, WU Xiru, XUE Ganggang. Multi-target Real-time Detection for Road Traffic SignsBased on Deep Learning [J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(2): 96-106.
[8] FAN Rui, JIANG Pinqun, ZENG Shangyou, XIA Haiying, LIAO Zhixian, LI Peng. Design of Lightweight Convolution Neural Network Based on Multi-scale Parallel Fusion [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(3): 50-59.
[9] ZHANG Jinlei, LUO Yuling, FU Qiang. Predicting Financial Time Series Based on Gated Recurrent Unit Neural Network [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(2): 82-89.
[10] HUANG Liming,CHEN Weizheng,YAN Hongfei,CHEN Chong. A Stock Prediction Method Based on Recurrent Neural Network and Deep Learning [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 13-22.
[11] WU Wenya,CHEN Yufeng,XU Jin’an,ZHANG Yujie. High-level Semantic Attention-based Convolutional Neural Networks for Chinese Relation Extraction [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 32-41.
[12] YUE Tianchi, ZHANG Shaowu, YANG Liang, LIN Hongfei, YU Kai. Stance Detection Method Based on Two-Stage Attention Mechanism [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 42-49.
[13] YU Chuanming,LI Haonan,AN Lu. Analysis of Text Emotion Cause Based on Multi-task Deep Learning [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 50-61.
[14] WANG Qi,QIU Jiahui,RUAN Tong,GAO Daqi,GAO Ju. Recurrent Capsule Network for Clinical Relation Extraction [J]. Journal of Guangxi Normal University(Natural Science Edition), 2019, 37(1): 80-88.
[15] XUE Yang,ZENG Qingke,XIA Haiying,WANG Wentao. Remote Sensing Image Fusion Based on Convolutional Neural Network Super-resolution Reconstruction [J]. Journal of Guangxi Normal University(Natural Science Edition), 2018, 36(2): 33-41.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] HU Jinming, WEI Duqu. Research on Generalized Sychronization of Fractional-order PMSM[J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(6): 14 -20 .
[2] ZHU Yongjian, LUO Jian, QIN Yunbai, QIN Guofeng, TANG Chuliu. A Method for Detecting Metal Surface Defects Based on Photometric Stereo and Series Expansion Methods[J]. Journal of Guangxi Normal University(Natural Science Edition), 2020, 38(6): 21 -31 .
[3] YANG Liting, LIU Xuecong, FAN Penglai, ZHOU Qihai. Research Progress in Vocal Communication of Nonhuman Primates in China[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 1 -9 .
[4] BIN Shiyu, LIAO Fang, DU Xuesong, XU Yilan, WANG Xin, WU Xia, LIN Yong. Research Progress on Cold Tolerance of Tilapia[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 10 -16 .
[5] LIU Jing, BIAN Xun. Characteristics of the Orthoptera Mitogenome and Its Application[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 17 -28 .
[6] LI Xingkang, ZHONG Enzhu, CUI Chunyan, ZHOU Jia, LI Xiaoping, GUAN Zhenhua. Monitoring Singing Behavior of Western Black Crested Gibbon (Nomascus concolor furvogaster)[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 29 -37 .
[7] HE Xinming, XIA Wancai, BA Sang, LONG Xiaobin, LAI Jiandong, YANG Chan, WANG Fan, LI Dayong. Grooming Strategies of Resident Males with Different Number of Mates in Yunnan Snub-nosed Monkeys (Rhinopithecus bieti)[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 38 -44 .
[8] FU Wen, REN Baoping, LIN Jianzhong, LUAN Ke, WANG Pengcheng, WANG Bing, LI Dayong, ZHOU Qihai. Jiyuan Taihang Mountain Macaque Population and Conservation Status[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 45 -52 .
[9] ZHENG Jingjin, LIANG Jipeng, ZHANG Kechu, HUANG Aimian, LU Qian, LI Youbang, HUANG Zhonghao. White-headed Langurs Select Foods Based on Woody Plants' Dominances[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 53 -64 .
[10] YANG Chan, WAN Yaqiong, HUANG Xiaofu, YUAN Xudong, ZHOU Hongyan, FANG Haocun, LI Dayong, LI Jiaqi. Activity Rhythm of Muntiacus reevesi Based on Infrared Camera Technology[J]. Journal of Guangxi Normal University(Natural Science Edition), 2021, 39(1): 65 -70 .