Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering

Detalhes bibliográficos
Autor(a) principal: Vieira, Bianca Puerta Rocha [UNESP]
Data de Publicação: 2024
Outros Autores: Garcia, Rogério Eduardo [UNESP]
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.5220/0012633400003690
https://hdl.handle.net/11449/299442
Resumo: The software repositories store data and metadata about the project development, including commits, which record user modifications to projects and their metadata, such as the user responsible for the commit, date, time, and others. The programmer can register a comment to inform the modification content, its purpose, requester, motivation, and useful data. Focusing on those comments, this paper proposes using comments to group the commits and construct a sentiment analysis regarding the messages. The main purpose is to analyze those messages, both by the groups and the sentiments expressed, to understand them (what sort of sentiment they express). Opinions are central to almost all human activities and are key influences on our behaviors. Beliefs, perceptions of reality, and choices made are conditioned upon sentiments. Therefore, understanding how the developers, especially programmers, feel about a task might be useful in analyzing progress and interaction among people and artifacts (source code). In this paper, we present initial analyses of data and metadata from the twenty most popular software repositories, written in five popular programming languages. We stated five research questions and answered them, pointing out further investigations.
id UNSP_071856ace0c63c58a715ee1c3ca11573
oai_identifier_str oai:repositorio.unesp.br:11449/299442
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit ClusteringKnowledge DiscoveryMining Software RepositoriesSentiment AnalysesThe software repositories store data and metadata about the project development, including commits, which record user modifications to projects and their metadata, such as the user responsible for the commit, date, time, and others. The programmer can register a comment to inform the modification content, its purpose, requester, motivation, and useful data. Focusing on those comments, this paper proposes using comments to group the commits and construct a sentiment analysis regarding the messages. The main purpose is to analyze those messages, both by the groups and the sentiments expressed, to understand them (what sort of sentiment they express). Opinions are central to almost all human activities and are key influences on our behaviors. Beliefs, perceptions of reality, and choices made are conditioned upon sentiments. Therefore, understanding how the developers, especially programmers, feel about a task might be useful in analyzing progress and interaction among people and artifacts (source code). In this paper, we present initial analyses of data and metadata from the twenty most popular software repositories, written in five popular programming languages. We stated five research questions and answered them, pointing out further investigations.Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Department of Mathematical and Computer Science São Paulo State University (UNESP), R. Roberto Símonsen, 305, Centro Educacional, SPDepartment of Mathematical and Computer Science São Paulo State University (UNESP), R. Roberto Símonsen, 305, Centro Educacional, SPUniversidade Estadual Paulista (UNESP)Vieira, Bianca Puerta Rocha [UNESP]Garcia, Rogério Eduardo [UNESP]2025-04-29T18:42:24Z2024-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject297-304http://dx.doi.org/10.5220/0012633400003690International Conference on Enterprise Information Systems, ICEIS - Proceedings, v. 1, p. 297-304.2184-4992https://hdl.handle.net/11449/29944210.5220/00126334000036902-s2.0-85193931809Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengInternational Conference on Enterprise Information Systems, ICEIS - Proceedingsinfo:eu-repo/semantics/openAccess2025-04-30T13:24:35Zoai:repositorio.unesp.br:11449/299442Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462025-04-30T13:24:35Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
title Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
spellingShingle Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
Vieira, Bianca Puerta Rocha [UNESP]
Knowledge Discovery
Mining Software Repositories
Sentiment Analyses
title_short Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
title_full Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
title_fullStr Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
title_full_unstemmed Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
title_sort Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
author Vieira, Bianca Puerta Rocha [UNESP]
author_facet Vieira, Bianca Puerta Rocha [UNESP]
Garcia, Rogério Eduardo [UNESP]
author_role author
author2 Garcia, Rogério Eduardo [UNESP]
author2_role author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (UNESP)
dc.contributor.author.fl_str_mv Vieira, Bianca Puerta Rocha [UNESP]
Garcia, Rogério Eduardo [UNESP]
dc.subject.por.fl_str_mv Knowledge Discovery
Mining Software Repositories
Sentiment Analyses
topic Knowledge Discovery
Mining Software Repositories
Sentiment Analyses
description The software repositories store data and metadata about the project development, including commits, which record user modifications to projects and their metadata, such as the user responsible for the commit, date, time, and others. The programmer can register a comment to inform the modification content, its purpose, requester, motivation, and useful data. Focusing on those comments, this paper proposes using comments to group the commits and construct a sentiment analysis regarding the messages. The main purpose is to analyze those messages, both by the groups and the sentiments expressed, to understand them (what sort of sentiment they express). Opinions are central to almost all human activities and are key influences on our behaviors. Beliefs, perceptions of reality, and choices made are conditioned upon sentiments. Therefore, understanding how the developers, especially programmers, feel about a task might be useful in analyzing progress and interaction among people and artifacts (source code). In this paper, we present initial analyses of data and metadata from the twenty most popular software repositories, written in five popular programming languages. We stated five research questions and answered them, pointing out further investigations.
publishDate 2024
dc.date.none.fl_str_mv 2024-01-01
2025-04-29T18:42:24Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.5220/0012633400003690
International Conference on Enterprise Information Systems, ICEIS - Proceedings, v. 1, p. 297-304.
2184-4992
https://hdl.handle.net/11449/299442
10.5220/0012633400003690
2-s2.0-85193931809
url http://dx.doi.org/10.5220/0012633400003690
https://hdl.handle.net/11449/299442
identifier_str_mv International Conference on Enterprise Information Systems, ICEIS - Proceedings, v. 1, p. 297-304.
2184-4992
10.5220/0012633400003690
2-s2.0-85193931809
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv International Conference on Enterprise Information Systems, ICEIS - Proceedings
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 297-304
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv repositoriounesp@unesp.br
_version_ 1834482593682161664