Show simple item record

dc.contributor.authorYang, Jinfu
dc.contributor.authorYang, Fei
dc.contributor.authorWang, Guanghui
dc.contributor.authorLi, Mingai
dc.date.accessioned2019-02-04T17:31:03Z
dc.date.available2019-02-04T17:31:03Z
dc.date.issued2017
dc.identifier.citationJinfu Yang, Jinfu Yang, Fei Yang, Fei Yang, Guanghui Wang, Guanghui Wang, Mingai Li, Mingai Li, "Multi-channel and multi-scale mid-level image representation for scene classification," Journal of Electronic Imaging 26(2), 023018 (11 April 2017). https://doi.org/10.1117/1.JEI.26.2.023018en_US
dc.identifier.urihttp://hdl.handle.net/1808/27681
dc.description.abstractConvolutional neural network (CNN)-based approaches have received state-of-the-art results in scene classification. Features from the output of fully connected (FC) layers express one-dimensional semantic information but lose the detailed information of objects and the spatial information of scene categories. On the contrary, deep convolutional features have been proved to be more suitable for describing an object itself and the spatial relations among objects in an image. In addition, the feature map from each layer is max-pooled within local neighborhoods, which weakens the invariance of global consistency and is unfavorable to scenes with highly complicated variation. To cope with the above issues, an orderless multi-channel mid-level image representation on pre-trained CNN features is proposed to improve the classification performance. The mid-level image representation of two channels from the FC layer and the deep convolutional layer are integrated at multi-scale levels. A sum pooling approach is also employed to aggregate multi-scale mid-level image representation to highlight the importance of the descriptors beneficial for scene classification. Extensive experiments on SUN397 and MIT 67 indoor datasets demonstrate that the proposed method achieves promising classification performance.en_US
dc.publisherSociety of Photo-optical Instrumentation Engineers (SPIE)en_US
dc.rightsCopyright 2017 Society of Photo-Optical Instrumentation Engineers. One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper are prohibited.en_US
dc.subjectScene classificationen_US
dc.subjectComputer programmingen_US
dc.subjectPrincipal component analysisen_US
dc.subjectImage segmentationen_US
dc.subjectVisualizationen_US
dc.subjectFeature extractionen_US
dc.subjectImage classificationen_US
dc.titleMulti-channel and multi-scale mid-level image representation for scene classificationen_US
dc.typeArticleen_US
kusw.kuauthorWang, Guanghui
kusw.kudepartmentElectrical Engineering & Computer Scienceen_US
dc.identifier.doi10.1117/1.JEI.26.2.023018en_US
kusw.oaversionScholarly/refereed, publisher versionen_US
kusw.oapolicyThis item meets KU Open Access policy criteria.en_US
dc.rights.accessrightsopenAccessen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record