Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks
Main Author: | |
---|---|
Publication Date: | 2022 |
Other Authors: | |
Format: | Other |
Language: | eng |
Source: | Repositório Institucional da UNESP |
Download full: | http://dx.doi.org/10.1145/3510413 http://hdl.handle.net/11449/249425 |
Summary: | Several image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding results in at least one dataset by the time of their creation. A critical factor in training concerns the network's regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the past few years, showing significant improvements for different CNN models. The works are classified into three main areas: The first one is called data augmentation,where all the techniques focus on performing changes in the input data. The second, named internal changes,aims to describe procedures to modify the feature maps generated by the neural network or the kernels. The last one, called label,concerns transforming the labels of a given input. This work presents two main differences comparing to other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred here have their code available in public repositories or they have been directly implemented in some framework, such as TensorFlow or Torch. |
id |
UNSP_ffe378942dfdb90ad6dea33c7d9ec1ff |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/249425 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networksconvolutional neural networksRegularizationSeveral image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding results in at least one dataset by the time of their creation. A critical factor in training concerns the network's regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the past few years, showing significant improvements for different CNN models. The works are classified into three main areas: The first one is called data augmentation,where all the techniques focus on performing changes in the input data. The second, named internal changes,aims to describe procedures to modify the feature maps generated by the neural network or the kernels. The last one, called label,concerns transforming the labels of a given input. This work presents two main differences comparing to other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred here have their code available in public repositories or they have been directly implemented in some framework, such as TensorFlow or Torch.Federal Institute of São Carlos-UFSCar, Rod. Washington Luiz, 235, São CarlosEldorado's Institute of Technology, Av. Alan Turing, 275, CampinasSão Paulo State University-UNESP, Av. Eng. Luís Edmundo Carrijo Coube, 14-01, BauruSão Paulo State University-UNESP, Av. Eng. Luís Edmundo Carrijo Coube, 14-01, BauruUniversidade Federal de São Carlos (UFSCar)Eldorado's Institute of TechnologyUniversidade Estadual Paulista (UNESP)Santos, Claudio Filipi Gonçalves DosPapa, João Paulo [UNESP]2023-07-29T15:34:47Z2023-07-29T15:34:47Z2022-09-14info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/otherhttp://dx.doi.org/10.1145/3510413ACM Computing Surveys, v. 54, n. 10 s, 2022.1557-73410360-0300http://hdl.handle.net/11449/24942510.1145/35104132-s2.0-85143053852Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengACM Computing Surveysinfo:eu-repo/semantics/openAccess2024-04-23T16:11:11Zoai:repositorio.unesp.br:11449/249425Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462024-04-23T16:11:11Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks |
title |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks |
spellingShingle |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks Santos, Claudio Filipi Gonçalves Dos convolutional neural networks Regularization |
title_short |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks |
title_full |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks |
title_fullStr |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks |
title_full_unstemmed |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks |
title_sort |
Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks |
author |
Santos, Claudio Filipi Gonçalves Dos |
author_facet |
Santos, Claudio Filipi Gonçalves Dos Papa, João Paulo [UNESP] |
author_role |
author |
author2 |
Papa, João Paulo [UNESP] |
author2_role |
author |
dc.contributor.none.fl_str_mv |
Universidade Federal de São Carlos (UFSCar) Eldorado's Institute of Technology Universidade Estadual Paulista (UNESP) |
dc.contributor.author.fl_str_mv |
Santos, Claudio Filipi Gonçalves Dos Papa, João Paulo [UNESP] |
dc.subject.por.fl_str_mv |
convolutional neural networks Regularization |
topic |
convolutional neural networks Regularization |
description |
Several image processing tasks, such as image classification and object detection, have been significantly improved using Convolutional Neural Networks (CNN). Like ResNet and EfficientNet, many architectures have achieved outstanding results in at least one dataset by the time of their creation. A critical factor in training concerns the network's regularization, which prevents the structure from overfitting. This work analyzes several regularization methods developed in the past few years, showing significant improvements for different CNN models. The works are classified into three main areas: The first one is called data augmentation,where all the techniques focus on performing changes in the input data. The second, named internal changes,aims to describe procedures to modify the feature maps generated by the neural network or the kernels. The last one, called label,concerns transforming the labels of a given input. This work presents two main differences comparing to other available surveys about regularization: (i) the first concerns the papers gathered in the manuscript, which are not older than five years, and (ii) the second distinction is about reproducibility, i.e., all works referred here have their code available in public repositories or they have been directly implemented in some framework, such as TensorFlow or Torch. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-09-14 2023-07-29T15:34:47Z 2023-07-29T15:34:47Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/other |
format |
other |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1145/3510413 ACM Computing Surveys, v. 54, n. 10 s, 2022. 1557-7341 0360-0300 http://hdl.handle.net/11449/249425 10.1145/3510413 2-s2.0-85143053852 |
url |
http://dx.doi.org/10.1145/3510413 http://hdl.handle.net/11449/249425 |
identifier_str_mv |
ACM Computing Surveys, v. 54, n. 10 s, 2022. 1557-7341 0360-0300 10.1145/3510413 2-s2.0-85143053852 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
ACM Computing Surveys |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
repositoriounesp@unesp.br |
_version_ |
1834484468519272448 |