Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks

Bibliographic Details
Main Author: Santos, Claudio Filipi Gonçalves Dos
Publication Date: 2022
Other Authors: Papa, João Paulo [UNESP]
Format: Other
Language: eng
Source: Repositório Institucional da UNESP
Download full: http://dx.doi.org/10.1145/3510413
http://hdl.handle.net/11449/249425
Summary: Several image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding results in at least one dataset by the time of their creation. A critical factor in training concerns the network's regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the past few years, showing significant improvements for different CNN models. The works are classified into three main areas: The first one is called data augmentation,where all the techniques focus on performing changes in the input data. The second, named internal changes,aims to describe procedures to modify the feature maps generated by the neural network or the kernels. The last one, called label,concerns transforming the labels of a given input. This work presents two main differences comparing to other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred here have their code available in public repositories or they have been directly implemented in some framework, such as TensorFlow or Torch.
id UNSP_ffe378942dfdb90ad6dea33c7d9ec1ff
oai_identifier_str oai:repositorio.unesp.br:11449/249425
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networksconvolutional neural networksRegularizationSeveral image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding results in at least one dataset by the time of their creation. A critical factor in training concerns the network's regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the past few years, showing significant improvements for different CNN models. The works are classified into three main areas: The first one is called data augmentation,where all the techniques focus on performing changes in the input data. The second, named internal changes,aims to describe procedures to modify the feature maps generated by the neural network or the kernels. The last one, called label,concerns transforming the labels of a given input. This work presents two main differences comparing to other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred here have their code available in public repositories or they have been directly implemented in some framework, such as TensorFlow or Torch.Federal Institute of São Carlos-UFSCar, Rod. Washington Luiz, 235, São CarlosEldorado's Institute of Technology, Av. Alan Turing, 275, CampinasSão Paulo State University-UNESP, Av. Eng. Luís Edmundo Carrijo Coube, 14-01, BauruSão Paulo State University-UNESP, Av. Eng. Luís Edmundo Carrijo Coube, 14-01, BauruUniversidade Federal de São Carlos (UFSCar)Eldorado's Institute of TechnologyUniversidade Estadual Paulista (UNESP)Santos, Claudio Filipi Gonçalves DosPapa, João Paulo [UNESP]2023-07-29T15:34:47Z2023-07-29T15:34:47Z2022-09-14info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/otherhttp://dx.doi.org/10.1145/3510413ACM Computing Surveys, v. 54, n. 10 s, 2022.1557-73410360-0300http://hdl.handle.net/11449/24942510.1145/35104132-s2.0-85143053852Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengACM Computing Surveysinfo:eu-repo/semantics/openAccess2024-04-23T16:11:11Zoai:repositorio.unesp.br:11449/249425Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462024-04-23T16:11:11Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
title Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
spellingShingle Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
Santos, Claudio Filipi Gonçalves Dos
convolutional neural networks
Regularization
title_short Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
title_full Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
title_fullStr Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
title_full_unstemmed Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
title_sort Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
author Santos, Claudio Filipi Gonçalves Dos
author_facet Santos, Claudio Filipi Gonçalves Dos
Papa, João Paulo [UNESP]
author_role author
author2 Papa, João Paulo [UNESP]
author2_role author
dc.contributor.none.fl_str_mv Universidade Federal de São Carlos (UFSCar)
Eldorado's Institute of Technology
Universidade Estadual Paulista (UNESP)
dc.contributor.author.fl_str_mv Santos, Claudio Filipi Gonçalves Dos
Papa, João Paulo [UNESP]
dc.subject.por.fl_str_mv convolutional neural networks
Regularization
topic convolutional neural networks
Regularization
description Several image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding results in at least one dataset by the time of their creation. A critical factor in training concerns the network's regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the past few years, showing significant improvements for different CNN models. The works are classified into three main areas: The first one is called data augmentation,where all the techniques focus on performing changes in the input data. The second, named internal changes,aims to describe procedures to modify the feature maps generated by the neural network or the kernels. The last one, called label,concerns transforming the labels of a given input. This work presents two main differences comparing to other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred here have their code available in public repositories or they have been directly implemented in some framework, such as TensorFlow or Torch.
publishDate 2022
dc.date.none.fl_str_mv 2022-09-14
2023-07-29T15:34:47Z
2023-07-29T15:34:47Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/other
format other
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1145/3510413
ACM Computing Surveys, v. 54, n. 10 s, 2022.
1557-7341
0360-0300
http://hdl.handle.net/11449/249425
10.1145/3510413
2-s2.0-85143053852
url http://dx.doi.org/10.1145/3510413
http://hdl.handle.net/11449/249425
identifier_str_mv ACM Computing Surveys, v. 54, n. 10 s, 2022.
1557-7341
0360-0300
10.1145/3510413
2-s2.0-85143053852
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv ACM Computing Surveys
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv repositoriounesp@unesp.br
_version_ 1834484468519272448