Automating psycholinguistic statistics computation: procura palavras

Bibliographic Details
Main Author: Machado, João F.
Publication Date: 2010
Other Authors: Almeida, J. J., Simões, Alberto, Soares, Ana Paula
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/1822/12070
Summary: This article describes psycholinguistic lexical databases available in various languages, including English, Spanish and Portuguese. These lexical databases are important for re- searchers in Psycholinguistics and other related areas, providing a pool of experimental materials and allowing for an efficient process of selection of these experimental materials. The process of gathering statistics is slow, resulting in a small pool of materials in the short-term. The need to find an alternative method to gather limited or yet unavailable statistics for a specific language led us to consider gathering statistics from other languages and to compute their triangulation. Our aim was to automatize the computation of statistics such as Fa- miliarity, Imageability, Age of Acquisition and Written Word Frequency for that specific language. We will describe the process of preparing this data and tri- angulating and comparing statistics for some languages in an at- tempt of finding a relationship between them. The results were analysed considering correlations between each statistic in each pair of languages and by computing the mean of absolute dif- ferences between each language’s values.
id RCAP_f31c3ae9d58309a8e678ee2ffc3e1eae
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/12070
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Automating psycholinguistic statistics computation: procura palavrasPsycholinguistic statisticsLexical databasesThis article describes psycholinguistic lexical databases available in various languages, including English, Spanish and Portuguese. These lexical databases are important for re- searchers in Psycholinguistics and other related areas, providing a pool of experimental materials and allowing for an efficient process of selection of these experimental materials. The process of gathering statistics is slow, resulting in a small pool of materials in the short-term. The need to find an alternative method to gather limited or yet unavailable statistics for a specific language led us to consider gathering statistics from other languages and to compute their triangulation. Our aim was to automatize the computation of statistics such as Fa- miliarity, Imageability, Age of Acquisition and Written Word Frequency for that specific language. We will describe the process of preparing this data and tri- angulating and comparing statistics for some languages in an at- tempt of finding a relationship between them. The results were analysed considering correlations between each statistic in each pair of languages and by computing the mean of absolute dif- ferences between each language’s values.Fundação para a Ciência e a Tecnologia (FCT) - "European Portuguese words” (PTDC/PSI-PCO/104679/2008)National Strategic Reference Framework (NSRF)Operational Agenda for Competitiveness Factors (COMPETE)European Regional Development Fund (ERDF).Universidade do MinhoMachado, João F.Almeida, J. J.Simões, AlbertoSoares, Ana Paula20102010-01-01T00:00:00Zconference paperinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/1822/12070engMATEO, Carmen ; CAMPILLO DÍAZ Francisco ; MENDEZ PAZÓ, Francisco, eds. lit. – “Proceedings of the FALA 2010 : VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop, Vigo, 2010.” Vigo : Centro Social Caixanova, 2010. ISBN 978-84-8158-510-0. p. 217-220.978-84-8158-510-0http://lorien.die.upm.es/~lapiz/rtth/JORNADAS/VI/menulateral.htmlinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-05-11T06:41:04Zoai:repositorium.sdum.uminho.pt:1822/12070Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T16:00:59.706160Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Automating psycholinguistic statistics computation: procura palavras
title Automating psycholinguistic statistics computation: procura palavras
spellingShingle Automating psycholinguistic statistics computation: procura palavras
Machado, João F.
Psycholinguistic statistics
Lexical databases
title_short Automating psycholinguistic statistics computation: procura palavras
title_full Automating psycholinguistic statistics computation: procura palavras
title_fullStr Automating psycholinguistic statistics computation: procura palavras
title_full_unstemmed Automating psycholinguistic statistics computation: procura palavras
title_sort Automating psycholinguistic statistics computation: procura palavras
author Machado, João F.
author_facet Machado, João F.
Almeida, J. J.
Simões, Alberto
Soares, Ana Paula
author_role author
author2 Almeida, J. J.
Simões, Alberto
Soares, Ana Paula
author2_role author
author
author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Machado, João F.
Almeida, J. J.
Simões, Alberto
Soares, Ana Paula
dc.subject.por.fl_str_mv Psycholinguistic statistics
Lexical databases
topic Psycholinguistic statistics
Lexical databases
description This article describes psycholinguistic lexical databases available in various languages, including English, Spanish and Portuguese. These lexical databases are important for re- searchers in Psycholinguistics and other related areas, providing a pool of experimental materials and allowing for an efficient process of selection of these experimental materials. The process of gathering statistics is slow, resulting in a small pool of materials in the short-term. The need to find an alternative method to gather limited or yet unavailable statistics for a specific language led us to consider gathering statistics from other languages and to compute their triangulation. Our aim was to automatize the computation of statistics such as Fa- miliarity, Imageability, Age of Acquisition and Written Word Frequency for that specific language. We will describe the process of preparing this data and tri- angulating and comparing statistics for some languages in an at- tempt of finding a relationship between them. The results were analysed considering correlations between each statistic in each pair of languages and by computing the mean of absolute dif- ferences between each language’s values.
publishDate 2010
dc.date.none.fl_str_mv 2010
2010-01-01T00:00:00Z
dc.type.driver.fl_str_mv conference paper
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1822/12070
url http://hdl.handle.net/1822/12070
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv MATEO, Carmen ; CAMPILLO DÍAZ Francisco ; MENDEZ PAZÓ, Francisco, eds. lit. – “Proceedings of the FALA 2010 : VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop, Vigo, 2010.” Vigo : Centro Social Caixanova, 2010. ISBN 978-84-8158-510-0. p. 217-220.
978-84-8158-510-0
http://lorien.die.upm.es/~lapiz/rtth/JORNADAS/VI/menulateral.html
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833595685357748224