Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content
Main Author: | Mancini, Felipe [UNIFESP] |
---|---|
Publication Date: | 2011 |
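Other Authors: | Sousa, Fernando Sequeira [UNIFESP]; Teixeira, Fabio Oliveira [UNIFESP]; Falcão, Alex Esteves Jacoud [UNIFESP]; Hummel, Anderson Diniz [UNIFESP]; Costa, Thiago Martini da [UNIFESP]; Calado, Pavel Pereira; Araújo, Luciano Vieira de; Pisa, Ivan Torres [UNIFESP] |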
Format: | Article |
Language: | eng |
Source: | Repositório Institucional da UNIFESP |
Download full: | http://dx.doi.org/10.1016/j.jbi.2010.12.002 http://repositorio.unifesp.br/handle/11600/33566 |
Summary: | Introduction: Internet users are increasingly using the worldwide web to search for information relating to their health. This situation makes it necessary to create specialized tools capable of supporting users in their searches. Objective: To apply and compare strategies that were developed to investigate the use of the Portuguese version of Medical Subject Headings (MeSH) for constructing an automated classifier for Brazilian Portuguese-language web-based content within or outside of the field of healthcare, focusing on the lay public. Methods: 3658 Brazilian web pages were used to train the classifier and 606 Brazilian web pages were used to validate it. The strategies proposed were constructed using content-based vector methods for text classification, such that Naive Bayes was used for the task of classifying vector patterns with characteristics obtained through the proposed strategies. Results: A strategy named InDeCS was developed specifically to adapt MeSH for the problem that was put forward. This approach achieved better accuracy for this pattern classification task (0.94 sensitivity, specificity and area under the ROC curve). Conclusions: Because of the significant results achieved by InDeCS, this tool has been successfully applied to the Brazilian healthcare search portal known as Busca Saude. Furthermore, it could be shown that MeSH presents important results when used for the task of classifying web-based content focusing on the lay public. It was also possible to show from this study that MeSH was able to map out mutable non-deterministic characteristics of the web. (c) 2010 Elsevier Inc. All rights reserved. |
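
The summary describes a vocabulary-based vector representation of web pages classified with Naive Bayes and evaluated by sensitivity and specificity. The record does not give the details of InDeCS, so the following is only a minimal sketch of that general idea, not the authors' method. The term list `VOCABULARY` is a hypothetical stand-in for Portuguese MeSH terms, and the sample pages are invented purely for illustration.

```python
import math
from collections import Counter

# Hypothetical controlled vocabulary standing in for Portuguese MeSH terms.
VOCABULARY = {"diabetes", "vacina", "febre", "hipertensao", "sintoma"}


def vectorize(text):
    """Count only the tokens that belong to the controlled vocabulary."""
    return Counter(tok for tok in text.lower().split() if tok in VOCABULARY)


def train(pages):
    """Fit a multinomial Naive Bayes model from (text, label) pairs."""
    doc_counts = Counter()
    term_counts = {}
    for text, label in pages:
        doc_counts[label] += 1
        term_counts.setdefault(label, Counter()).update(vectorize(text))
    return doc_counts, term_counts


def predict(model, text):
    """Return the label with the highest log posterior (Laplace smoothing)."""
    doc_counts, term_counts = model
    total_docs = sum(doc_counts.values())
    vec = vectorize(text)
    best_label, best_score = None, -math.inf
    for label in sorted(doc_counts):
        counts = term_counts[label]
        denom = sum(counts.values()) + len(VOCABULARY)
        score = math.log(doc_counts[label] / total_docs)
        for term, n in vec.items():
            score += n * math.log((counts[term] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def sensitivity_specificity(model, pages, positive="health"):
    """Compute sensitivity and specificity for the positive class."""
    tp = fn = tn = fp = 0
    for text, label in pages:
        guess = predict(model, text)
        if label == positive:
            tp += guess == positive
            fn += guess != positive
        else:
            tn += guess != positive
            fp += guess == positive
    return tp / (tp + fn), tn / (tn + fp)


# Invented toy pages (accents dropped for simplicity); not data from the study.
TRAIN = [
    ("febre alta e vacina contra gripe", "health"),
    ("vacina previne febre amarela", "health"),
    ("diabetes e hipertensao exigem controle", "health"),
    ("resultado do jogo de futebol de ontem", "other"),
    ("promocao de passagens para o feriado", "other"),
    ("novo filme estreia no cinema", "other"),
    ("receita de bolo de chocolate", "other"),
]
VALIDATION = [
    ("vacina reduz febre em criancas", "health"),
    ("ingressos para o show de sabado", "other"),
]

model = train(TRAIN)
sens, spec = sensitivity_specificity(model, VALIDATION)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

Restricting the features to a controlled vocabulary keeps the vectors small. A page containing no vocabulary terms is classified by the class priors alone, which is one reason a real system would need a far larger term list and training set than this toy.
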
id |
UFSP_dba7fbf1cbd880c9ce93df22e541db3c |
---|---|
oai_identifier_str |
oai:repositorio.unifesp.br/:11600/33566 |
network_acronym_str |
UFSP |
network_name_str |
Repositório Institucional da UNIFESP |
repository_id_str |
3465 |
spelling |
Universidade Federal de São Paulo UNIFESP, Dept Informat Saude, BR-04023062 São Paulo, Brazil; Inst Fed Educ Ciencia & Tecnol São Paulo, BR-07155000 Guarulhos, SP, Brazil; Inst Super Tecn, P-2744016 Porto Salvo, Portugal; Univ São Paulo, Escola Artes Ciencias & Humanidades, BR-03828000 São Paulo, Brazil; Web of Science |
dc.title.none.fl_str_mv |
Use of Medical Subject Headings (MeSH) in Portuguese for categorizing web-based healthcare content |
author |
Mancini, Felipe [UNIFESP] |
author_facet |
Mancini, Felipe [UNIFESP]; Sousa, Fernando Sequeira [UNIFESP]; Teixeira, Fabio Oliveira [UNIFESP]; Falcão, Alex Esteves Jacoud [UNIFESP]; Hummel, Anderson Diniz [UNIFESP]; Costa, Thiago Martini da [UNIFESP]; Calado, Pavel Pereira; Araújo, Luciano Vieira de; Pisa, Ivan Torres [UNIFESP] |
author_role |
author |
author2 |
Sousa, Fernando Sequeira [UNIFESP]; Teixeira, Fabio Oliveira [UNIFESP]; Falcão, Alex Esteves Jacoud [UNIFESP]; Hummel, Anderson Diniz [UNIFESP]; Costa, Thiago Martini da [UNIFESP]; Calado, Pavel Pereira; Araújo, Luciano Vieira de; Pisa, Ivan Torres [UNIFESP] |
author2_role |
author author author author author author author author |
dc.contributor.none.fl_str_mv |
Universidade Federal de São Paulo (UNIFESP); Inst Fed Educ Ciencia & Tecnol São Paulo; Inst Super Tecn; Universidade de São Paulo (USP) |
dc.contributor.author.fl_str_mv |
Mancini, Felipe [UNIFESP]; Sousa, Fernando Sequeira [UNIFESP]; Teixeira, Fabio Oliveira [UNIFESP]; Falcão, Alex Esteves Jacoud [UNIFESP]; Hummel, Anderson Diniz [UNIFESP]; Costa, Thiago Martini da [UNIFESP]; Calado, Pavel Pereira; Araújo, Luciano Vieira de; Pisa, Ivan Torres [UNIFESP] |
dc.subject.por.fl_str_mv |
Information storage and retrieval; Medical Subject Headings; Consumer health information; Indexing; Internet |
topic |
Information storage and retrieval; Medical Subject Headings; Consumer health information; Indexing; Internet |
publishDate |
2011 |
dc.date.none.fl_str_mv |
2011-04-01 2016-01-24T14:06:19Z 2016-01-24T14:06:19Z |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1016/j.jbi.2010.12.002; Journal of Biomedical Informatics. San Diego: Academic Press Inc Elsevier Science, v. 44, n. 2, p. 299-309, 2011.; 10.1016/j.jbi.2010.12.002; WOS000289030100011.pdf; 1532-0464; http://repositorio.unifesp.br/handle/11600/33566; WOS:000289030100011 |
url |
http://dx.doi.org/10.1016/j.jbi.2010.12.002 http://repositorio.unifesp.br/handle/11600/33566 |
identifier_str_mv |
Journal of Biomedical Informatics. San Diego: Academic Press Inc Elsevier Science, v. 44, n. 2, p. 299-309, 2011.; 10.1016/j.jbi.2010.12.002; WOS000289030100011.pdf; 1532-0464; WOS:000289030100011 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Journal of Biomedical Informatics |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess http://www.elsevier.com/about/open-access/open-access-policies/article-posting-policy |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
http://www.elsevier.com/about/open-access/open-access-policies/article-posting-policy |
dc.format.none.fl_str_mv |
299-309 application/pdf |
dc.publisher.none.fl_str_mv |
Elsevier B.V. |
publisher.none.fl_str_mv |
Elsevier B.V. |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UNIFESP instname:Universidade Federal de São Paulo (UNIFESP) instacron:UNIFESP |
instname_str |
Universidade Federal de São Paulo (UNIFESP) |
instacron_str |
UNIFESP |
institution |
UNIFESP |
reponame_str |
Repositório Institucional da UNIFESP |
collection |
Repositório Institucional da UNIFESP |
repository.name.fl_str_mv |
Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP) |
repository.mail.fl_str_mv |
biblioteca.csp@unifesp.br |
_version_ |
1841453723211005952 |