Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
Main Author: | |
---|---|
Publication Date: | 2024 |
Other Authors: | , |
Format: | Article |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435 |
Summary: | Introduction: Logistic regression models are frequently used to estimate measures of association between an exposure, health determinant or intervention, and a binary outcome. However, when the outcome is frequent (> 10%), model estimates for relative risks and prevalence ratios might be biased. Despite the availability of several alternatives, many still rely on these models, and a consensus is yet to be reached. We aimed to compare the estimation and goodness-of-fit of logistic, log-binomial and robust Poisson regression models, in cross-sectional studies involving frequent binary outcomes.Methods: Two cross-sectional studies were conducted. Study 1 was a nationally representative study on the impact of air pollution on mental health. Study 2 was a local study on immigrants’ access to urgent healthcare services. Odds ratios (OR) were obtained through logistic regression, and prevalence ratios (PR) through log-binomial and robust Poisson regression models. Confidence intervals (CI), their ranges, and standard-errors (SE) were also computed, along with models’ relative goodness-of-fit through Akaike Information Criterion (AIC), when applicable.Results: In Study 1, the OR (95% CI) was 1.015 (0.970 - 1.063), while the PR (95% CI) obtained through the robust Poisson mode was 1.012 (0.979 - 1.045). The log-binomial regression model did not converge in this study. In Study 2, the OR (95% CI) was 1.584 (1.026 - 2.446), the PR (95% CI) for the log-binomial model was 1.217 (0.978 - 1.515), and 1.130 (1.013 - 1.261) for the robust Poisson model. The 95% CI, their ranges, and the SE of the OR were higher than those of the PR, in both studies. However, in Study 2, the AIC value was lower for the logistic regression model.Conclusion: The odds ratio overestimated PR with wider 95% CI and higher SE. The overestimation was greater as the outcome of the study became more prevalent, in line with previous studies. In Study 2, the logistic regression was the model with the best fit, illustrating the need to consider multiple criteria when selecting the most appropriate statistical model for each study. Employing logistic regression models by default might lead to misinterpretations. Robust Poisson models are viable alternatives in cross-sectional studies with frequent binary outcomes, avoiding the non-convergence of log-binomial models. |
id |
RCAP_691c381e8355fbec1a6677aa1b5d1a7a |
---|---|
oai_identifier_str |
oai:ojs.www.actamedicaportuguesa.com:article/21435 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health OutcomesRegressão Logística: Limitações na Estimação de Medidas de Associação com Desfechos de Saúde BináriosLogistic ModelsModels, StatisticalOdds RatioOutcome Assessment, Health CarePoisson DistributionAvaliação de Processos e Resultados em Cuidados de SaúdeDistribuição de PoissonModelos EstatísticosModelos LogísticosRácio de ProbabilidadesIntroduction: Logistic regression models are frequently used to estimate measures of association between an exposure, health determinant or intervention, and a binary outcome. However, when the outcome is frequent (> 10%), model estimates for relative risks and prevalence ratios might be biased. Despite the availability of several alternatives, many still rely on these models, and a consensus is yet to be reached. We aimed to compare the estimation and goodness-of-fit of logistic, log-binomial and robust Poisson regression models, in cross-sectional studies involving frequent binary outcomes.Methods: Two cross-sectional studies were conducted. Study 1 was a nationally representative study on the impact of air pollution on mental health. Study 2 was a local study on immigrants’ access to urgent healthcare services. Odds ratios (OR) were obtained through logistic regression, and prevalence ratios (PR) through log-binomial and robust Poisson regression models. Confidence intervals (CI), their ranges, and standard-errors (SE) were also computed, along with models’ relative goodness-of-fit through Akaike Information Criterion (AIC), when applicable.Results: In Study 1, the OR (95% CI) was 1.015 (0.970 - 1.063), while the PR (95% CI) obtained through the robust Poisson mode was 1.012 (0.979 - 1.045). The log-binomial regression model did not converge in this study. In Study 2, the OR (95% CI) was 1.584 (1.026 - 2.446), the PR (95% CI) for the log-binomial model was 1.217 (0.978 - 1.515), and 1.130 (1.013 - 1.261) for the robust Poisson model. The 95% CI, their ranges, and the SE of the OR were higher than those of the PR, in both studies. However, in Study 2, the AIC value was lower for the logistic regression model.Conclusion: The odds ratio overestimated PR with wider 95% CI and higher SE. The overestimation was greater as the outcome of the study became more prevalent, in line with previous studies. In Study 2, the logistic regression was the model with the best fit, illustrating the need to consider multiple criteria when selecting the most appropriate statistical model for each study. Employing logistic regression models by default might lead to misinterpretations. Robust Poisson models are viable alternatives in cross-sectional studies with frequent binary outcomes, avoiding the non-convergence of log-binomial models.Introdução: A regressão logística é frequentemente utilizada para estimar medidas de associação entre uma exposição, determinante de saúde ou intervenção e um desfecho binário. No entanto, quando o desfecho é frequente (> 10%), estas estimativas podem ser enviesadas. Apesar de existirem modelos estatísticos alternativos, muitos estudos continuam a aplicar modelos de regressão logística indiscriminadamente. O objetivo deste estudo foi comparar as estimativas e o ajuste de modelos de regressão logística, log-binomial e Poisson robustos, em estudos transversais com desfechos binários frequentes.Métodos: Realizaram-se dois estudos transversais. O Estudo 1 foi um estudo representativo a nível nacional sobre o impacto da poluição atmosférica na saúde mental. O Estudo 2 foi um estudo local sobre o acesso de imigrantes a serviços de urgência. Obtiveram-se odds ratio (OR) através de regressões logísticas e razões de prevalência (RP) através de modelos log-binomiais e Poisson robustos. Foram ainda obtidos intervalos de confiança a 95% (IC 95%), suas amplitudes, os erros-padrão (EP) das estimativas e comparados os valores Akaike Information Criteria (AIC).Resultados: No Estudo 1, a OR (IC 95%) foi de 1,015 (0,970 - 1,063) e a RP (IC 95%) obtida através do modelo de Poisson robusto foi de 1,012 (0,979 - 1,045). O modelo de regressão log-binomial não convergiu. No Estudo 2, a OR (IC 95%) foi de 1,584 (1,026 - 2,446), a RP (IC 95%) para o modelo de regressão log-binomial foi de 1,217 (0,978 - 1,515) e para o modelo de Poisson robusto foi de 1,130 (1,013 - 1,261). Os IC 95%, as suas amplitudes e os EP das OR foram superiores ao das RP, em ambos os estudos. No entanto, no Estudo 2, o valor do AIC foi inferior no modelo de regressão logística.Conclusão: As OR sobrestimaram as RP, com IC 95% mais amplos e EP superiores. A magnitude da sobrestimação foi tanto maior quanto mais prevalente o desfecho em estudo, em linha com estudos prévios. No Estudo 2, a regressão logística foi a que melhor se ajustou aos dados. Este exemplo ilustra a necessidade de avaliar vários critérios para selecionar o modelo estatístico mais apropriado. Os modelos de Poisson robustos são uma alternativa viável em estudos transversais com desfechos binários frequentes e evitam o problema de não convergência dos modelos log-binomiais.Ordem dos Médicos2024-10-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttps://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435Acta Médica Portuguesa; Vol. 37 No. 10 (2024): October; 697-705Acta Médica Portuguesa; Vol. 37 N.º 10 (2024): Outubro; 697-7051646-07580870-399Xreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAPenghttps://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15525https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15510https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15511https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15512Direitos de Autor (c) 2024 Acta Médica Portuguesainfo:eu-repo/semantics/openAccessPinheiro-Guedes, LaraMartinho, ClarisseO. Martins, Maria Rosário2024-10-06T03:00:43Zoai:ojs.www.actamedicaportuguesa.com:article/21435Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T18:55:15.402249Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes Regressão Logística: Limitações na Estimação de Medidas de Associação com Desfechos de Saúde Binários |
title |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes |
spellingShingle |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes Pinheiro-Guedes, Lara Logistic Models Models, Statistical Odds Ratio Outcome Assessment, Health Care Poisson Distribution Avaliação de Processos e Resultados em Cuidados de Saúde Distribuição de Poisson Modelos Estatísticos Modelos Logísticos Rácio de Probabilidades |
title_short |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes |
title_full |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes |
title_fullStr |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes |
title_full_unstemmed |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes |
title_sort |
Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes |
author |
Pinheiro-Guedes, Lara |
author_facet |
Pinheiro-Guedes, Lara Martinho, Clarisse O. Martins, Maria Rosário |
author_role |
author |
author2 |
Martinho, Clarisse O. Martins, Maria Rosário |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Pinheiro-Guedes, Lara Martinho, Clarisse O. Martins, Maria Rosário |
dc.subject.por.fl_str_mv |
Logistic Models Models, Statistical Odds Ratio Outcome Assessment, Health Care Poisson Distribution Avaliação de Processos e Resultados em Cuidados de Saúde Distribuição de Poisson Modelos Estatísticos Modelos Logísticos Rácio de Probabilidades |
topic |
Logistic Models Models, Statistical Odds Ratio Outcome Assessment, Health Care Poisson Distribution Avaliação de Processos e Resultados em Cuidados de Saúde Distribuição de Poisson Modelos Estatísticos Modelos Logísticos Rácio de Probabilidades |
description |
Introduction: Logistic regression models are frequently used to estimate measures of association between an exposure, health determinant or intervention, and a binary outcome. However, when the outcome is frequent (> 10%), model estimates for relative risks and prevalence ratios might be biased. Despite the availability of several alternatives, many still rely on these models, and a consensus is yet to be reached. We aimed to compare the estimation and goodness-of-fit of logistic, log-binomial and robust Poisson regression models, in cross-sectional studies involving frequent binary outcomes.Methods: Two cross-sectional studies were conducted. Study 1 was a nationally representative study on the impact of air pollution on mental health. Study 2 was a local study on immigrants’ access to urgent healthcare services. Odds ratios (OR) were obtained through logistic regression, and prevalence ratios (PR) through log-binomial and robust Poisson regression models. Confidence intervals (CI), their ranges, and standard-errors (SE) were also computed, along with models’ relative goodness-of-fit through Akaike Information Criterion (AIC), when applicable.Results: In Study 1, the OR (95% CI) was 1.015 (0.970 - 1.063), while the PR (95% CI) obtained through the robust Poisson mode was 1.012 (0.979 - 1.045). The log-binomial regression model did not converge in this study. In Study 2, the OR (95% CI) was 1.584 (1.026 - 2.446), the PR (95% CI) for the log-binomial model was 1.217 (0.978 - 1.515), and 1.130 (1.013 - 1.261) for the robust Poisson model. The 95% CI, their ranges, and the SE of the OR were higher than those of the PR, in both studies. However, in Study 2, the AIC value was lower for the logistic regression model.Conclusion: The odds ratio overestimated PR with wider 95% CI and higher SE. The overestimation was greater as the outcome of the study became more prevalent, in line with previous studies. In Study 2, the logistic regression was the model with the best fit, illustrating the need to consider multiple criteria when selecting the most appropriate statistical model for each study. Employing logistic regression models by default might lead to misinterpretations. Robust Poisson models are viable alternatives in cross-sectional studies with frequent binary outcomes, avoiding the non-convergence of log-binomial models. |
publishDate |
2024 |
dc.date.none.fl_str_mv |
2024-10-01 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435 |
url |
https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435 https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15525 https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15510 https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15511 https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15512 |
dc.rights.driver.fl_str_mv |
Direitos de Autor (c) 2024 Acta Médica Portuguesa info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Direitos de Autor (c) 2024 Acta Médica Portuguesa |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf application/pdf application/pdf application/pdf |
dc.publisher.none.fl_str_mv |
Ordem dos Médicos |
publisher.none.fl_str_mv |
Ordem dos Médicos |
dc.source.none.fl_str_mv |
Acta Médica Portuguesa; Vol. 37 No. 10 (2024): October; 697-705 Acta Médica Portuguesa; Vol. 37 N.º 10 (2024): Outubro; 697-705 1646-0758 0870-399X reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833597754219167744 |