Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes

Bibliographic Details
Main Author: Pinheiro-Guedes, Lara
Publication Date: 2024
Other Authors: Martinho, Clarisse, O. Martins, Maria Rosário
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435
Summary: Introduction: Logistic regression models are frequently used to estimate measures of association between an exposure, health determinant or intervention, and a binary outcome. However, when the outcome is frequent (> 10%), model estimates for relative risks and prevalence ratios might be biased. Despite the availability of several alternatives, many still rely on these models, and a consensus is yet to be reached. We aimed to compare the estimation and goodness-of-fit of logistic, log-binomial and robust Poisson regression models, in cross-sectional studies involving frequent binary outcomes.Methods: Two cross-sectional studies were conducted. Study 1 was a nationally representative study on the impact of air pollution on mental health. Study 2 was a local study on immigrants’ access to urgent healthcare services. Odds ratios (OR) were obtained through logistic regression, and prevalence ratios (PR) through log-binomial and robust Poisson regression models. Confidence intervals (CI), their ranges, and standard-errors (SE) were also computed, along with models’ relative goodness-of-fit through Akaike Information Criterion (AIC), when applicable.Results: In Study 1, the OR (95% CI) was 1.015 (0.970 - 1.063), while the PR (95% CI) obtained through the robust Poisson mode was 1.012 (0.979 - 1.045). The log-binomial regression model did not converge in this study. In Study 2, the OR (95% CI) was 1.584 (1.026 - 2.446), the PR (95% CI) for the log-binomial model was 1.217 (0.978 - 1.515), and 1.130 (1.013 - 1.261) for the robust Poisson model. The 95% CI, their ranges, and the SE of the OR were higher than those of the PR, in both studies. However, in Study 2, the AIC value was lower for the logistic regression model.Conclusion: The odds ratio overestimated PR with wider 95% CI and higher SE. The overestimation was greater as the outcome of the study became more prevalent, in line with previous studies. In Study 2, the logistic regression was the model with the best fit, illustrating the need to consider multiple criteria when selecting the most appropriate statistical model for each study. Employing logistic regression models by default might lead to misinterpretations. Robust Poisson models are viable alternatives in cross-sectional studies with frequent binary outcomes, avoiding the non-convergence of log-binomial models.
id RCAP_691c381e8355fbec1a6677aa1b5d1a7a
oai_identifier_str oai:ojs.www.actamedicaportuguesa.com:article/21435
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health OutcomesRegressão Logística: Limitações na Estimação de Medidas de Associação com Desfechos de Saúde BináriosLogistic ModelsModels, StatisticalOdds RatioOutcome Assessment, Health CarePoisson DistributionAvaliação de Processos e Resultados em Cuidados de SaúdeDistribuição de PoissonModelos EstatísticosModelos LogísticosRácio de ProbabilidadesIntroduction: Logistic regression models are frequently used to estimate measures of association between an exposure, health determinant or intervention, and a binary outcome. However, when the outcome is frequent (> 10%), model estimates for relative risks and prevalence ratios might be biased. Despite the availability of several alternatives, many still rely on these models, and a consensus is yet to be reached. We aimed to compare the estimation and goodness-of-fit of logistic, log-binomial and robust Poisson regression models, in cross-sectional studies involving frequent binary outcomes.Methods: Two cross-sectional studies were conducted. Study 1 was a nationally representative study on the impact of air pollution on mental health. Study 2 was a local study on immigrants’ access to urgent healthcare services. Odds ratios (OR) were obtained through logistic regression, and prevalence ratios (PR) through log-binomial and robust Poisson regression models. Confidence intervals (CI), their ranges, and standard-errors (SE) were also computed, along with models’ relative goodness-of-fit through Akaike Information Criterion (AIC), when applicable.Results: In Study 1, the OR (95% CI) was 1.015 (0.970 - 1.063), while the PR (95% CI) obtained through the robust Poisson mode was 1.012 (0.979 - 1.045). The log-binomial regression model did not converge in this study. In Study 2, the OR (95% CI) was 1.584 (1.026 - 2.446), the PR (95% CI) for the log-binomial model was 1.217 (0.978 - 1.515), and 1.130 (1.013 - 1.261) for the robust Poisson model. The 95% CI, their ranges, and the SE of the OR were higher than those of the PR, in both studies. However, in Study 2, the AIC value was lower for the logistic regression model.Conclusion: The odds ratio overestimated PR with wider 95% CI and higher SE. The overestimation was greater as the outcome of the study became more prevalent, in line with previous studies. In Study 2, the logistic regression was the model with the best fit, illustrating the need to consider multiple criteria when selecting the most appropriate statistical model for each study. Employing logistic regression models by default might lead to misinterpretations. Robust Poisson models are viable alternatives in cross-sectional studies with frequent binary outcomes, avoiding the non-convergence of log-binomial models.Introdução: A regressão logística é frequentemente utilizada para estimar medidas de associação entre uma exposição, determinante de saúde ou intervenção e um desfecho binário. No entanto, quando o desfecho é frequente (> 10%), estas estimativas podem ser enviesadas. Apesar de existirem modelos estatísticos alternativos, muitos estudos continuam a aplicar modelos de regressão logística indiscriminadamente. O objetivo deste estudo foi comparar as estimativas e o ajuste de modelos de regressão logística, log-binomial e Poisson robustos, em estudos transversais com desfechos binários frequentes.Métodos: Realizaram-se dois estudos transversais. O Estudo 1 foi um estudo representativo a nível nacional sobre o impacto da poluição atmosférica na saúde mental. O Estudo 2 foi um estudo local sobre o acesso de imigrantes a serviços de urgência. Obtiveram-se odds ratio (OR) através de regressões logísticas e razões de prevalência (RP) através de modelos log-binomiais e Poisson robustos. Foram ainda obtidos intervalos de confiança a 95% (IC 95%), suas amplitudes, os erros-padrão (EP) das estimativas e comparados os valores Akaike Information Criteria (AIC).Resultados: No Estudo 1, a OR (IC 95%) foi de 1,015 (0,970 - 1,063) e a RP (IC 95%) obtida através do modelo de Poisson robusto foi de 1,012 (0,979 - 1,045). O modelo de regressão log-binomial não convergiu. No Estudo 2, a OR (IC 95%) foi de 1,584 (1,026 - 2,446), a RP (IC 95%) para o modelo de regressão log-binomial foi de 1,217 (0,978 - 1,515) e para o modelo de Poisson robusto foi de 1,130 (1,013 - 1,261). Os IC 95%, as suas amplitudes e os EP das OR foram superiores ao das RP, em ambos os estudos. No entanto, no Estudo 2, o valor do AIC foi inferior no modelo de regressão logística.Conclusão: As OR sobrestimaram as RP, com IC 95% mais amplos e EP superiores. A magnitude da sobrestimação foi tanto maior quanto mais prevalente o desfecho em estudo, em linha com estudos prévios. No Estudo 2, a regressão logística foi a que melhor se ajustou aos dados. Este exemplo ilustra a necessidade de avaliar vários critérios para selecionar o modelo estatístico mais apropriado. Os modelos de Poisson robustos são uma alternativa viável em estudos transversais com desfechos binários frequentes e evitam o problema de não convergência dos modelos log-binomiais.Ordem dos Médicos2024-10-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttps://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435Acta Médica Portuguesa; Vol. 37 No. 10 (2024): October; 697-705Acta Médica Portuguesa; Vol. 37 N.º 10 (2024): Outubro; 697-7051646-07580870-399Xreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAPenghttps://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15525https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15510https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15511https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15512Direitos de Autor (c) 2024 Acta Médica Portuguesainfo:eu-repo/semantics/openAccessPinheiro-Guedes, LaraMartinho, ClarisseO. Martins, Maria Rosário2024-10-06T03:00:43Zoai:ojs.www.actamedicaportuguesa.com:article/21435Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T18:55:15.402249Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
Regressão Logística: Limitações na Estimação de Medidas de Associação com Desfechos de Saúde Binários
title Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
spellingShingle Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
Pinheiro-Guedes, Lara
Logistic Models
Models, Statistical
Odds Ratio
Outcome Assessment, Health Care
Poisson Distribution
Avaliação de Processos e Resultados em Cuidados de Saúde
Distribuição de Poisson
Modelos Estatísticos
Modelos Logísticos
Rácio de Probabilidades
title_short Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
title_full Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
title_fullStr Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
title_full_unstemmed Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
title_sort Logistic Regression: Limitations in the Estimation of Measures of Association with Binary Health Outcomes
author Pinheiro-Guedes, Lara
author_facet Pinheiro-Guedes, Lara
Martinho, Clarisse
O. Martins, Maria Rosário
author_role author
author2 Martinho, Clarisse
O. Martins, Maria Rosário
author2_role author
author
dc.contributor.author.fl_str_mv Pinheiro-Guedes, Lara
Martinho, Clarisse
O. Martins, Maria Rosário
dc.subject.por.fl_str_mv Logistic Models
Models, Statistical
Odds Ratio
Outcome Assessment, Health Care
Poisson Distribution
Avaliação de Processos e Resultados em Cuidados de Saúde
Distribuição de Poisson
Modelos Estatísticos
Modelos Logísticos
Rácio de Probabilidades
topic Logistic Models
Models, Statistical
Odds Ratio
Outcome Assessment, Health Care
Poisson Distribution
Avaliação de Processos e Resultados em Cuidados de Saúde
Distribuição de Poisson
Modelos Estatísticos
Modelos Logísticos
Rácio de Probabilidades
description Introduction: Logistic regression models are frequently used to estimate measures of association between an exposure, health determinant or intervention, and a binary outcome. However, when the outcome is frequent (> 10%), model estimates for relative risks and prevalence ratios might be biased. Despite the availability of several alternatives, many still rely on these models, and a consensus is yet to be reached. We aimed to compare the estimation and goodness-of-fit of logistic, log-binomial and robust Poisson regression models, in cross-sectional studies involving frequent binary outcomes.Methods: Two cross-sectional studies were conducted. Study 1 was a nationally representative study on the impact of air pollution on mental health. Study 2 was a local study on immigrants’ access to urgent healthcare services. Odds ratios (OR) were obtained through logistic regression, and prevalence ratios (PR) through log-binomial and robust Poisson regression models. Confidence intervals (CI), their ranges, and standard-errors (SE) were also computed, along with models’ relative goodness-of-fit through Akaike Information Criterion (AIC), when applicable.Results: In Study 1, the OR (95% CI) was 1.015 (0.970 - 1.063), while the PR (95% CI) obtained through the robust Poisson mode was 1.012 (0.979 - 1.045). The log-binomial regression model did not converge in this study. In Study 2, the OR (95% CI) was 1.584 (1.026 - 2.446), the PR (95% CI) for the log-binomial model was 1.217 (0.978 - 1.515), and 1.130 (1.013 - 1.261) for the robust Poisson model. The 95% CI, their ranges, and the SE of the OR were higher than those of the PR, in both studies. However, in Study 2, the AIC value was lower for the logistic regression model.Conclusion: The odds ratio overestimated PR with wider 95% CI and higher SE. The overestimation was greater as the outcome of the study became more prevalent, in line with previous studies. In Study 2, the logistic regression was the model with the best fit, illustrating the need to consider multiple criteria when selecting the most appropriate statistical model for each study. Employing logistic regression models by default might lead to misinterpretations. Robust Poisson models are viable alternatives in cross-sectional studies with frequent binary outcomes, avoiding the non-convergence of log-binomial models.
publishDate 2024
dc.date.none.fl_str_mv 2024-10-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435
url https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435
https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15525
https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15510
https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15511
https://www.actamedicaportuguesa.com/revista/index.php/amp/article/view/21435/15512
dc.rights.driver.fl_str_mv Direitos de Autor (c) 2024 Acta Médica Portuguesa
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Direitos de Autor (c) 2024 Acta Médica Portuguesa
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Ordem dos Médicos
publisher.none.fl_str_mv Ordem dos Médicos
dc.source.none.fl_str_mv Acta Médica Portuguesa; Vol. 37 No. 10 (2024): October; 697-705
Acta Médica Portuguesa; Vol. 37 N.º 10 (2024): Outubro; 697-705
1646-0758
0870-399X
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833597754219167744