Multi-channel and multi-scale mid-level image representation for scene classification

Yang, Jinfu; Yang, Fei; Wang, Guanghui; Li, Mingai

Publication

Multi-channel and multi-scale mid-level image representation for scene classification

Yang, Jinfu
;
Yang, Fei
;
Wang, Guanghui
;
Li, Mingai

Abstract

Convolutional neural network (CNN)-based approaches have received state-of-the-art results in scene classification. Features from the output of fully connected (FC) layers express one-dimensional semantic information but lose the detailed information of objects and the spatial information of scene categories. On the contrary, deep convolutional features have been proved to be more suitable for describing an object itself and the spatial relations among objects in an image. In addition, the feature map from each layer is max-pooled within local neighborhoods, which weakens the invariance of global consistency and is unfavorable to scenes with highly complicated variation. To cope with the above issues, an orderless multi-channel mid-level image representation on pre-trained CNN features is proposed to improve the classification performance. The mid-level image representation of two channels from the FC layer and the deep convolutional layer are integrated at multi-scale levels. A sum pooling approach is also employed to aggregate multi-scale mid-level image representation to highlight the importance of the descriptors beneficial for scene classification. Extensive experiments on SUN397 and MIT 67 indoor datasets demonstrate that the proposed method achieves promising classification performance.

Date

2017

Publisher

Society of Photo-optical Instrumentation Engineers (SPIE)

Collections

Electrical Engineering and Computer Science Scholarly Works

Show all metadata

Files

Yang_2017_JElecImaging.pdf

Adobe PDF, 2.6 MB

Keywords

Scene classification, Computer programming, Principal component analysis, Image segmentation, Visualization, Feature extraction, Image classification

Citation

Jinfu Yang, Jinfu Yang, Fei Yang, Fei Yang, Guanghui Wang, Guanghui Wang, Mingai Li, Mingai Li, "Multi-channel and multi-scale mid-level image representation for scene classification," Journal of Electronic Imaging 26(2), 023018 (11 April 2017). https://doi.org/10.1117/1.JEI.26.2.023018

URI

https://hdl.handle.net/1808/27681

DOI

10.1117/1.JEI.26.2.023018

Multi-channel and multi-scale mid-level image representation for scene classification

Yang, Jinfu
;
Yang, Fei
;
Wang, Guanghui
;
Li, Mingai

Citations

Abstract

Description

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Collections

Files

Research Projects

Organizational Units

Journal Issue

Keywords

Citation

URI

DOI

Embedded videos

Multi-channel and multi-scale mid-level image representation for scene classification

Yang, Jinfu ; Yang, Fei ; Wang, Guanghui ; Li, Mingai

Citations

Abstract

Description

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Collections

Files

Research Projects

Organizational Units

Journal Issue

Keywords

Citation

URI

DOI

Embedded videos

Yang, Jinfu
;
Yang, Fei
;
Wang, Guanghui
;
Li, Mingai