Abstract
Deep neural networks provide good results for computer vision tasks. This has been possible due to a renewed interest in neural networks, availability of large-scale labelled training data, virtually unlimited processing and storage on cloud platforms and high-performance clusters. A Convolutional Neural Network (CNN) is one such architecture better suited for image classification. An important factor for a better CNN performance, besides the data quality, is the choice of hyperparameters, which define the model itself. The model or hyperparameter optimisation involves selecting the best configuration of hyperparameters, but is challenging because the set of hyperparameters are different for each type of machine learning algorithm. Thus, it requires a lot of computational time and resources to determine a better performing machine learning model. Therefore, the process has a lot of research interest and currently a transition to a fully automated process is also underway. This paper provides a survey of the CNN model optimisation techniques proposed in the literature.
Original language | English |
---|---|
Title of host publication | Handbook of Research on New Investigations in Artificial Life, AI, and Machine Learning |
Publisher | IGI Global |
Chapter | 11 |
Pages | 269-298 |
Number of pages | 30 |
ISBN (Electronic) | 9781799886877 |
ISBN (Print) | 9781799886860 |
DOIs | |
Publication status | Published - Feb 2022 |
Keywords
- automated machine learning
- deep learning
- explainable artificial intelligence
- model complexity
- object recognition
- convolution
- classification
- cloud computing
- Neural Network Performance