Convolutional Neural Network in Pattern Recognition

Mo, Xi

dc.contributor.advisor	Zhong, Cuncong
dc.contributor.author	Mo, Xi
dc.date.accessioned	2023-05-23T14:55:04Z
dc.date.available	2023-05-23T14:55:04Z
dc.date.issued	2022-05-31
dc.date.submitted	2022
dc.identifier.other	http://dissertations.umi.com/ku:18346
dc.identifier.uri	https://hdl.handle.net/1808/34204
dc.description.abstract	Since convolutional neural network (CNN) was first implemented by Yann LeCun et al. in 1989, CNN and its variants have been widely implemented to numerous topics of pattern recognition, and have been considered as the most crucial techniques in the field of artificial intelligence and computer vision. This dissertation not only demonstrates the implementation aspect of CNN, but also lays emphasis on the methodology of neural network (NN) based classifier. As known to many, one general pipeline of NN-based classifier can be recognized as three stages: pre-processing, inference by models, and post-processing. To demonstrate the importance of pre-processing techniques, this dissertation presents how to model actual problems in medical pattern recognition and image processing by introducing conceptual abstraction and fuzzification. In particular, a transformer on the basis of self-attention mechanism, namely beat-rhythm transformer, greatly benefits from correct R-peak detection results and conceptual fuzzification. Recently proposed self-attention mechanism has been proven to be the top performer in the fields of computer vision and natural language processing. In spite of the pleasant accuracy and precision it has gained, it usually consumes huge computational resources to perform self-attention. Therefore, realtime global attention network is proposed to make a better trade-off between efficiency and performance for the task of image segmentation. To illustrate more on the stage of inference, we also propose models to detect polyps via Faster R-CNN - one of the most popular CNN-based 2D detectors, as well as a 3D object detection pipeline for regressing 3D bounding boxes from LiDAR points and stereo image pairs powered by CNN. The goal for post-processing stage is to refine artifacts inferred by models. For the semantic segmentation task, the dilated continuous random field is proposed to be better fitted to CNN-based models than the widely implemented fully-connected continuous random field. Proposed approaches can be further integrated into a reinforcement learning architecture for robotics.
dc.format.extent	207 pages
dc.language.iso	en
dc.publisher	University of Kansas
dc.rights	Copyright held by the author.
dc.subject	Electrical engineering
dc.subject	Computer science
dc.subject	Artifiicial Neural Network
dc.subject	Continuous Random Field
dc.subject	Convolutional Neural Network
dc.subject	Object Detection
dc.subject	Semantic Segmentation
dc.subject	Transformer
dc.title	Convolutional Neural Network in Pattern Recognition
dc.type	Dissertation
dc.contributor.cmtemember	Luo, Bo
dc.contributor.cmtemember	Kim, Taejoon
dc.contributor.cmtemember	Li, Fengjun
dc.contributor.cmtemember	Fang, Huazhen
dc.thesis.degreeDiscipline	Electrical Engineering & Computer Science
dc.thesis.degreeLevel	D.Eng.
dc.identifier.orcid	https://orcid.org/0000-0002-3016-3308	en_US
dc.rights.accessrights	openAccess

Files in this item

Name:: Mo_ku_0099D_18346_DATA_1.pdf
Size:: 26.34Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

The University of Kansas prohibits discrimination on the basis of race, color, ethnicity, religion, sex, national origin, age, ancestry, disability, status as a veteran, sexual orientation, marital status, parental status, gender identity, gender expression and genetic information in the University’s programs and activities. The following person has been designated to handle inquiries regarding the non-discrimination policies: Director of the Office of Institutional Opportunity and Access, IOA@ku.edu, 1246 W. Campus Road, Room 153A, Lawrence, KS, 66045, (785)864-6414, 711 TTY.