Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content

Bibliographic Details
Main Author: Mancini, Felipe [UNIFESP]
Publication Date: 2011
Other Authors: Sousa, Fernando Sequeira [UNIFESP], Teixeira, Fabio Oliveira [UNIFESP], Falcão, Alex Esteves Jacoud [UNIFESP], Hummel, Anderson Diniz [UNIFESP], Costa, Thiago Martini da [UNIFESP], Calado, Pavel Pereira, Araújo, Luciano Vieira de, Pisa, Ivan Torres [UNIFESP]
Format: Article
Language: eng
Source: Repositório Institucional da UNIFESP
Download full: http://dx.doi.org/10.1016/j.jbi.2010.12.002
http://repositorio.unifesp.br/handle/11600/33566
Summary: Introduction: Internet users are increasingly using the worldwide web to search for information relating to their health. This situation makes it necessary to create specialized tools capable of supporting users in their searches.Objective: To apply and compare strategies that were developed to investigate the use of the Portuguese version of Medical Subject Headings (MeSH) for constructing an automated classifier for Brazilian Portuguese-language web-based content within or outside of the field of healthcare, focusing on the lay public.Methods: 3658 Brazilian web pages were used to train the classifier and 606 Brazilian web pages were used to validate it. the strategies proposed were constructed using content-based vector methods for text classification, such that Naive Bayes was used for the task of classifying vector patterns with characteristics obtained through the proposed strategies.Results: A strategy named InDeCS was developed specifically to adapt MeSH for the problem that was put forward. This approach achieved better accuracy for this pattern classification task (0.94 sensitivity, specificity and area under the ROC curve).Conclusions: Because of the significant results achieved by InDeCS, this tool has been successfully applied to the Brazilian healthcare search portal known as Busca Saude. Furthermore, it could be shown that MeSH presents important results when used for the task of classifying web-based content focusing on the lay public. It was also possible to show from this study that MeSH was able to map out mutable non-deterministic characteristics of the web. (c) 2010 Elsevier Inc. All rights reserved.
id UFSP_dba7fbf1cbd880c9ce93df22e541db3c
oai_identifier_str oai:repositorio.unifesp.br/:11600/33566
network_acronym_str UFSP
network_name_str Repositório Institucional da UNIFESP
repository_id_str 3465
spelling Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare contentInformation storage and retrievalMedical Subject HeadingsConsumer health informationIndexingInternetIntroduction: Internet users are increasingly using the worldwide web to search for information relating to their health. This situation makes it necessary to create specialized tools capable of supporting users in their searches.Objective: To apply and compare strategies that were developed to investigate the use of the Portuguese version of Medical Subject Headings (MeSH) for constructing an automated classifier for Brazilian Portuguese-language web-based content within or outside of the field of healthcare, focusing on the lay public.Methods: 3658 Brazilian web pages were used to train the classifier and 606 Brazilian web pages were used to validate it. the strategies proposed were constructed using content-based vector methods for text classification, such that Naive Bayes was used for the task of classifying vector patterns with characteristics obtained through the proposed strategies.Results: A strategy named InDeCS was developed specifically to adapt MeSH for the problem that was put forward. This approach achieved better accuracy for this pattern classification task (0.94 sensitivity, specificity and area under the ROC curve).Conclusions: Because of the significant results achieved by InDeCS, this tool has been successfully applied to the Brazilian healthcare search portal known as Busca Saude. Furthermore, it could be shown that MeSH presents important results when used for the task of classifying web-based content focusing on the lay public. It was also possible to show from this study that MeSH was able to map out mutable non-deterministic characteristics of the web. (c) 2010 Elsevier Inc. All rights reserved.Universidade Federal de São Paulo UNIFESP, Dept Informat Saude, BR-04023062 São Paulo, BrazilInst Fed Educ Ciencia & Tecnol São Paulo, BR-07155000 Guarulhos, SP, BrazilInst Super Tecn, P-2744016 Porto Salvo, PortugalUniv São Paulo, Escola Artes Ciencias & Humanidades, BR-03828000 São Paulo, BrazilUniversidade Federal de São Paulo UNIFESP, Dept Informat Saude, BR-04023062 São Paulo, BrazilWeb of ScienceElsevier B.V.Universidade Federal de São Paulo (UNIFESP)Inst Fed Educ Ciencia & Tecnol São PauloInst Super TecnUniversidade de São Paulo (USP)Mancini, Felipe [UNIFESP]Sousa, Fernando Sequeira [UNIFESP]Teixeira, Fabio Oliveira [UNIFESP]Falcão, Alex Esteves Jacoud [UNIFESP]Hummel, Anderson Diniz [UNIFESP]Costa, Thiago Martini da [UNIFESP]Calado, Pavel PereiraAraújo, Luciano Vieira dePisa, Ivan Torres [UNIFESP]2016-01-24T14:06:19Z2016-01-24T14:06:19Z2011-04-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersion299-309application/pdfhttp://dx.doi.org/10.1016/j.jbi.2010.12.002Journal of Biomedical Informatics. San Diego: Academic Press Inc Elsevier Science, v. 44, n. 2, p. 299-309, 2011.10.1016/j.jbi.2010.12.002WOS000289030100011.pdf1532-0464http://repositorio.unifesp.br/handle/11600/33566WOS:000289030100011engJournal of Biomedical Informaticsinfo:eu-repo/semantics/openAccesshttp://www.elsevier.com/about/open-access/open-access-policies/article-posting-policyreponame:Repositório Institucional da UNIFESPinstname:Universidade Federal de São Paulo (UNIFESP)instacron:UNIFESP2024-07-31T16:23:36Zoai:repositorio.unifesp.br/:11600/33566Repositório InstitucionalPUBhttp://www.repositorio.unifesp.br/oai/requestbiblioteca.csp@unifesp.bropendoar:34652024-07-31T16:23:36Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP)false
dc.title.none.fl_str_mv Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
title Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
spellingShingle Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
Mancini, Felipe [UNIFESP]
Information storage and retrieval
Medical Subject Headings
Consumer health information
Indexing
Internet
title_short Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
title_full Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
title_fullStr Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
title_full_unstemmed Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
title_sort Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
author Mancini, Felipe [UNIFESP]
author_facet Mancini, Felipe [UNIFESP]
Sousa, Fernando Sequeira [UNIFESP]
Teixeira, Fabio Oliveira [UNIFESP]
Falcão, Alex Esteves Jacoud [UNIFESP]
Hummel, Anderson Diniz [UNIFESP]
Costa, Thiago Martini da [UNIFESP]
Calado, Pavel Pereira
Araújo, Luciano Vieira de
Pisa, Ivan Torres [UNIFESP]
author_role author
author2 Sousa, Fernando Sequeira [UNIFESP]
Teixeira, Fabio Oliveira [UNIFESP]
Falcão, Alex Esteves Jacoud [UNIFESP]
Hummel, Anderson Diniz [UNIFESP]
Costa, Thiago Martini da [UNIFESP]
Calado, Pavel Pereira
Araújo, Luciano Vieira de
Pisa, Ivan Torres [UNIFESP]
author2_role author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade Federal de São Paulo (UNIFESP)
Inst Fed Educ Ciencia & Tecnol São Paulo
Inst Super Tecn
Universidade de São Paulo (USP)
dc.contributor.author.fl_str_mv Mancini, Felipe [UNIFESP]
Sousa, Fernando Sequeira [UNIFESP]
Teixeira, Fabio Oliveira [UNIFESP]
Falcão, Alex Esteves Jacoud [UNIFESP]
Hummel, Anderson Diniz [UNIFESP]
Costa, Thiago Martini da [UNIFESP]
Calado, Pavel Pereira
Araújo, Luciano Vieira de
Pisa, Ivan Torres [UNIFESP]
dc.subject.por.fl_str_mv Information storage and retrieval
Medical Subject Headings
Consumer health information
Indexing
Internet
topic Information storage and retrieval
Medical Subject Headings
Consumer health information
Indexing
Internet
description Introduction: Internet users are increasingly using the worldwide web to search for information relating to their health. This situation makes it necessary to create specialized tools capable of supporting users in their searches.Objective: To apply and compare strategies that were developed to investigate the use of the Portuguese version of Medical Subject Headings (MeSH) for constructing an automated classifier for Brazilian Portuguese-language web-based content within or outside of the field of healthcare, focusing on the lay public.Methods: 3658 Brazilian web pages were used to train the classifier and 606 Brazilian web pages were used to validate it. the strategies proposed were constructed using content-based vector methods for text classification, such that Naive Bayes was used for the task of classifying vector patterns with characteristics obtained through the proposed strategies.Results: A strategy named InDeCS was developed specifically to adapt MeSH for the problem that was put forward. This approach achieved better accuracy for this pattern classification task (0.94 sensitivity, specificity and area under the ROC curve).Conclusions: Because of the significant results achieved by InDeCS, this tool has been successfully applied to the Brazilian healthcare search portal known as Busca Saude. Furthermore, it could be shown that MeSH presents important results when used for the task of classifying web-based content focusing on the lay public. It was also possible to show from this study that MeSH was able to map out mutable non-deterministic characteristics of the web. (c) 2010 Elsevier Inc. All rights reserved.
publishDate 2011
dc.date.none.fl_str_mv 2011-04-01
2016-01-24T14:06:19Z
2016-01-24T14:06:19Z
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1016/j.jbi.2010.12.002
Journal of Biomedical Informatics. San Diego: Academic Press Inc Elsevier Science, v. 44, n. 2, p. 299-309, 2011.
10.1016/j.jbi.2010.12.002
WOS000289030100011.pdf
1532-0464
http://repositorio.unifesp.br/handle/11600/33566
WOS:000289030100011
url http://dx.doi.org/10.1016/j.jbi.2010.12.002
http://repositorio.unifesp.br/handle/11600/33566
identifier_str_mv Journal of Biomedical Informatics. San Diego: Academic Press Inc Elsevier Science, v. 44, n. 2, p. 299-309, 2011.
10.1016/j.jbi.2010.12.002
WOS000289030100011.pdf
1532-0464
WOS:000289030100011
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Journal of Biomedical Informatics
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
http://www.elsevier.com/about/open-access/open-access-policies/article-posting-policy
eu_rights_str_mv openAccess
rights_invalid_str_mv http://www.elsevier.com/about/open-access/open-access-policies/article-posting-policy
dc.format.none.fl_str_mv 299-309
application/pdf
dc.publisher.none.fl_str_mv Elsevier B.V.
publisher.none.fl_str_mv Elsevier B.V.
dc.source.none.fl_str_mv reponame:Repositório Institucional da UNIFESP
instname:Universidade Federal de São Paulo (UNIFESP)
instacron:UNIFESP
instname_str Universidade Federal de São Paulo (UNIFESP)
instacron_str UNIFESP
institution UNIFESP
reponame_str Repositório Institucional da UNIFESP
collection Repositório Institucional da UNIFESP
repository.name.fl_str_mv Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP)
repository.mail.fl_str_mv biblioteca.csp@unifesp.br
_version_ 1841453723211005952