AncesTrees: ancestry estimation with randomized decision trees

Bibliographic Details
Main Author: Navega, David
Publication Date: 2014
Other Authors: Coelho, Catarina, Vicente, Ricardo, Ferreira, Maria Teresa, Wasterlain, Sofia, Cunha, Eugénia
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: https://hdl.handle.net/10316/44333
https://doi.org/10.1007/s00414-014-1050-9
Summary: In forensic anthropology, ancestry estimation is essential in establishing the individual biological profile. The aim of this study is to present a new program--AncesTrees--developed for assessing ancestry based on metric analysis. AncesTrees relies on a machine learning ensemble algorithm, random forest, to classify the human skull. In the ensemble learning paradigm, several models are generated and co-jointly used to arrive at the final decision. The random forest algorithm creates ensembles of decision trees classifiers, a non-linear and non-parametric classification technique. The database used in AncesTrees is composed by 23 craniometric variables from 1,734 individuals, representative of six major ancestral groups and selected from the Howells' craniometric series. The program was tested in 128 adult crania from the following collections: the African slaves' skeletal collection of Valle da Gafaria; the Medical School Skull Collection and the Identified Skeletal Collection of 21st Century, both curated at the University of Coimbra. The first step of the test analysis was to perform ancestry estimation including all the ancestral groups of the database. The second stage of our test analysis was to conduct ancestry estimation including only the European and the African ancestral groups. In the first test analysis, 75% of the individuals of African ancestry and 79.2% of the individuals of European ancestry were correctly identified. The model involving only African and European ancestral groups had a better performance: 93.8% of all individuals were correctly classified. The obtained results show that AncesTrees can be a valuable tool in forensic anthropology.
id RCAP_aef9767aef37278f2c99f35b0e10897a
oai_identifier_str oai:estudogeral.uc.pt:10316/44333
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling AncesTrees: ancestry estimation with randomized decision treesAdultAlgorithmsEthnic GroupsFemaleForensic AnthropologyHumansMachine LearningMaleSex Determination by SkeletonCephalometryContinental Population GroupsDatabases as TopicDecision TreesIn forensic anthropology, ancestry estimation is essential in establishing the individual biological profile. The aim of this study is to present a new program--AncesTrees--developed for assessing ancestry based on metric analysis. AncesTrees relies on a machine learning ensemble algorithm, random forest, to classify the human skull. In the ensemble learning paradigm, several models are generated and co-jointly used to arrive at the final decision. The random forest algorithm creates ensembles of decision trees classifiers, a non-linear and non-parametric classification technique. The database used in AncesTrees is composed by 23 craniometric variables from 1,734 individuals, representative of six major ancestral groups and selected from the Howells' craniometric series. The program was tested in 128 adult crania from the following collections: the African slaves' skeletal collection of Valle da Gafaria; the Medical School Skull Collection and the Identified Skeletal Collection of 21st Century, both curated at the University of Coimbra. The first step of the test analysis was to perform ancestry estimation including all the ancestral groups of the database. The second stage of our test analysis was to conduct ancestry estimation including only the European and the African ancestral groups. In the first test analysis, 75% of the individuals of African ancestry and 79.2% of the individuals of European ancestry were correctly identified. The model involving only African and European ancestral groups had a better performance: 93.8% of all individuals were correctly classified. The obtained results show that AncesTrees can be a valuable tool in forensic anthropology.2014info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttps://hdl.handle.net/10316/44333https://hdl.handle.net/10316/44333https://doi.org/10.1007/s00414-014-1050-9enghttp://link.springer.com/journal/414Navega, DavidCoelho, CatarinaVicente, RicardoFerreira, Maria TeresaWasterlain, SofiaCunha, Eugéniainfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2020-05-29T09:42:17Zoai:estudogeral.uc.pt:10316/44333Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T05:09:36.107264Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv AncesTrees: ancestry estimation with randomized decision trees
title AncesTrees: ancestry estimation with randomized decision trees
spellingShingle AncesTrees: ancestry estimation with randomized decision trees
Navega, David
Adult
Algorithms
Ethnic Groups
Female
Forensic Anthropology
Humans
Machine Learning
Male
Sex Determination by Skeleton
Cephalometry
Continental Population Groups
Databases as Topic
Decision Trees
title_short AncesTrees: ancestry estimation with randomized decision trees
title_full AncesTrees: ancestry estimation with randomized decision trees
title_fullStr AncesTrees: ancestry estimation with randomized decision trees
title_full_unstemmed AncesTrees: ancestry estimation with randomized decision trees
title_sort AncesTrees: ancestry estimation with randomized decision trees
author Navega, David
author_facet Navega, David
Coelho, Catarina
Vicente, Ricardo
Ferreira, Maria Teresa
Wasterlain, Sofia
Cunha, Eugénia
author_role author
author2 Coelho, Catarina
Vicente, Ricardo
Ferreira, Maria Teresa
Wasterlain, Sofia
Cunha, Eugénia
author2_role author
author
author
author
author
dc.contributor.author.fl_str_mv Navega, David
Coelho, Catarina
Vicente, Ricardo
Ferreira, Maria Teresa
Wasterlain, Sofia
Cunha, Eugénia
dc.subject.por.fl_str_mv Adult
Algorithms
Ethnic Groups
Female
Forensic Anthropology
Humans
Machine Learning
Male
Sex Determination by Skeleton
Cephalometry
Continental Population Groups
Databases as Topic
Decision Trees
topic Adult
Algorithms
Ethnic Groups
Female
Forensic Anthropology
Humans
Machine Learning
Male
Sex Determination by Skeleton
Cephalometry
Continental Population Groups
Databases as Topic
Decision Trees
description In forensic anthropology, ancestry estimation is essential in establishing the individual biological profile. The aim of this study is to present a new program--AncesTrees--developed for assessing ancestry based on metric analysis. AncesTrees relies on a machine learning ensemble algorithm, random forest, to classify the human skull. In the ensemble learning paradigm, several models are generated and co-jointly used to arrive at the final decision. The random forest algorithm creates ensembles of decision trees classifiers, a non-linear and non-parametric classification technique. The database used in AncesTrees is composed by 23 craniometric variables from 1,734 individuals, representative of six major ancestral groups and selected from the Howells' craniometric series. The program was tested in 128 adult crania from the following collections: the African slaves' skeletal collection of Valle da Gafaria; the Medical School Skull Collection and the Identified Skeletal Collection of 21st Century, both curated at the University of Coimbra. The first step of the test analysis was to perform ancestry estimation including all the ancestral groups of the database. The second stage of our test analysis was to conduct ancestry estimation including only the European and the African ancestral groups. In the first test analysis, 75% of the individuals of African ancestry and 79.2% of the individuals of European ancestry were correctly identified. The model involving only African and European ancestral groups had a better performance: 93.8% of all individuals were correctly classified. The obtained results show that AncesTrees can be a valuable tool in forensic anthropology.
publishDate 2014
dc.date.none.fl_str_mv 2014
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10316/44333
https://hdl.handle.net/10316/44333
https://doi.org/10.1007/s00414-014-1050-9
url https://hdl.handle.net/10316/44333
https://doi.org/10.1007/s00414-014-1050-9
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv http://link.springer.com/journal/414
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833602271791808512