Review of deep learning: concepts, CNN

Abstract

In the last few years, the deep learning (DL) computing paradigm has been deemed the Gold Standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, thus achieving outstanding results on several complex cognitive tasks, matching or even beating those provided by human performance. One of the benefits of DL is the ability to learn massive amounts of data. The DL field has grown fast in the last few years and it has been extensively used to successfully address a wide range of traditional applications. More importantly, DL has outperformed well-known ML techniques in many domains, e.g., cybersecurity, natural language processing, bioinformatics, robotics and control, and medical information processing, among many others. Despite it has been contributed several works reviewing the State-of-the-Art on DL, all of them only tackled one aspect of the DL, which leads to an overall lack of knowledge about it. Therefore, in this contribution, we propose using a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of DL. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of DL and including those enhancements recently added to the field. In particular, this paper outlines the importance of DL, presents the types of DL techniques and networks. It then presents convolutional neural networks (CNNs) which the most utilized DL network type and describes the development of CNNs architectures together with their main features, e.g., starting with the AlexNet network and closing with the High-Resolution network (HR.Net). Finally, we further present the challenges and suggested solutions to help researchers understand the existing research gaps. It is followed by a list of the major DL applications. Computational tools including FPGA, GPU, and CPU are summarized along with a description of their influence on DL. The paper ends with the evolution matrix, benchmark datasets, and summary and conclusion.

Details

Title

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

Author

Alzubaidi Laith¹

; Zhang Jinglan²; Humaidi, Amjad J³; Al-Dujaili Ayad⁴; Duan Ye⁵; Al-Shamma, Omran⁶; Santamaría, J⁷; Fadhel, Mohammed A⁸; Al-Amidie Muthana⁵; Farhan Laith⁹

¹ Queensland University of Technology, School of Computer Science, Brisbane, Australia (GRID:grid.1024.7) (ISNI:0000000089150953); University of Information Technology & Communications, AlNidhal Campus, Baghdad, Iraq (GRID:grid.1024.7)
² Queensland University of Technology, School of Computer Science, Brisbane, Australia (GRID:grid.1024.7) (ISNI:0000000089150953)
³ University of Technology, Control and Systems Engineering Department, Baghdad, Iraq (GRID:grid.1024.7)
⁴ Middle Technical University, Electrical Engineering Technical College, Baghdad, Iraq (GRID:grid.1024.7)
⁵ University of Missouri, Faculty of Electrical Engineering & Computer Science, Columbia, USA (GRID:grid.134936.a) (ISNI:0000 0001 2162 3504)
⁶ University of Information Technology & Communications, AlNidhal Campus, Baghdad, Iraq (GRID:grid.134936.a)
⁷ University of Jaén, Department of Computer Science, Jaén, Spain (GRID:grid.21507.31) (ISNI:0000 0001 2096 9837)
⁸ University of Sumer, College of Computer Science and Information Technology, Thi Qar, Iraq (GRID:grid.21507.31)
⁹ Manchester Metropolitan University, School of Engineering, Manchester, UK (GRID:grid.25627.34) (ISNI:0000 0001 0790 5329)

Publication year

2021

Publication date

Mar 2021

Publisher

Springer Nature B.V.

e-ISSN

21961115

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1186/s40537-021-00444-8

ProQuest document ID

2507363662

© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

Jump to:

Abstract

Details

Full text options

Suggested sources