NGSPipes: from specification to automatic deployment of NGS pipelines

Bibliographic Details
Main Author: Dantas, Bruno
Publication Date: 2016
Other Authors: Fleitas, Calmenelias; Francisco, Alexandre P.; Simão, José; Vaz, Cátia
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10400.21/8071
Summary: Next generation sequencing (NGS) technologies have revolutionized the biosciences in recent years, leading to new perspectives in medical, industrial and environmental applications. Although our motivation comes from the biosciences, the following holds for many areas of science: published results are usually hard to reproduce, either because the data are not available or because the tools are not readily available, which delays the adoption of new methodologies and hinders innovation. Our focus is on tool readiness and pipeline availability. Even though most tools are freely available, pipelines are in general barely described, and their configuration is far from trivial, with many parameters to be tuned. In this paper we discuss how to effectively build and use pipelines, relying on state-of-the-art computing technologies to execute them without users needing to configure, install and manage tools, servers and complex workflow management systems. We also propose a framework showing that public pipelines can be made ready to process and analyse very high volumes of experimental data, produced for instance by high-throughput technologies, and can be executed by users without effort. The NGSPipes framework and its underlying architecture are a major step towards open science and true collaboration on tools and pipelines among computational biology researchers and practitioners, who can share and replicate results in an easier and more transparent way.
id RCAP_ddc24624a4e2617d1b924a88274e794d
oai_identifier_str oai:repositorio.ipl.pt:10400.21/8071
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
dc.title.none.fl_str_mv NGSPipes: from specification to automatic deployment of NGS pipelines
author Dantas, Bruno
author_facet Dantas, Bruno
Fleitas, Calmenelias
Francisco, Alexandre P.
Simão, José
Vaz, Cátia
author_role author
author2 Fleitas, Calmenelias
Francisco, Alexandre P.
Simão, José
Vaz, Cátia
author2_role author
author
author
author
dc.contributor.none.fl_str_mv RCIPL
dc.contributor.author.fl_str_mv Dantas, Bruno
Fleitas, Calmenelias
Francisco, Alexandre P.
Simão, José
Vaz, Cátia
dc.subject.por.fl_str_mv Biosciences
Pipelines
topic Biosciences
Pipelines
publishDate 2016
dc.date.none.fl_str_mv 2016-09
2016-09-01T00:00:00Z
2018-02-20T11:11:43Z
dc.type.driver.fl_str_mv conference object
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.21/8071
url http://hdl.handle.net/10400.21/8071
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Inforum
publisher.none.fl_str_mv Inforum
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833598484716978176