Um estudo quantitativo sobre a variabilidade dos tempos de execução de programas em experimentos computacionais

Detalhes bibliográficos
Ano de defesa: 2015
Autor(a) principal: Nogueira, Paulo Eduardo
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Uberlândia
BR
Programa de Pós-graduação em Ciência da Computação
Ciências Exatas e da Terra
UFU
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://repositorio.ufu.br/handle/123456789/12587
https://doi.org/10.14393/ufu.di.2015.183
Resumo: In experimental research in computer science many works depend on correct measurement and analysis of the execution times of computer programs. It is observed that not all hold that repeated executions of the same program can produce significantly different execution times in statistical terms. Because of this, there was a quantitative study on the variability of programs execution times, in order to establish a protocol for comparative quantitative analysis of execution times of programs in computational experiments. Subsequently, three experiments were set up and implemented so as to allow the observation of some factors present in the computing environment, in particular related to the operating system. The factors chosen were: Runlevel, compiler Optimization, the Size of an Environment Variable, the Number of Threads and Thread Allocation Strategies. The results obtained with the RTA protocol demonstrate that these factors can influence the program execution times, leading to erroneous conclusions. In some cases, the statistical difference between treatments in an experiment reached 100% of the comparisons. Furthermore, it was observed that the distribution of execution times is not always adhered to a Gaussian distribution. It was observed also that in significance analysis with multiple treatments, the problem known as familywise error rate should be considered and should be treated because it can also lead to wrong conclusions. Thus, these observations emphasize the use of a controlled computing environment and using a statistical method for statistical analysis of multiple comparisons.