Difference between revisions of "User:Shawndouglas/Sandbox"

From LIMSWiki
Jump to navigationJump to search
(Replaced content with "<div class="nonumtoc">__TOC__</div> {{ombox | type = notice | style = width: 960px; | text = This is my primary sandbox page, where I play with features and...")
Tags: Manual revert Replaced
 
(78 intermediate revisions by the same user not shown)
Line 3: Line 3:
| type      = notice
| type      = notice
| style    = width: 960px;
| style    = width: 960px;
| text      = This is sublevel1 of my sandbox, where I play with features and test MediaWiki code. If you wish to leave a comment for me, please see [[User_talk:Shawndouglas|my discussion page]] instead.<p></p>
| text      = This is my primary sandbox page, where I play with features and test MediaWiki code. If you wish to leave a comment for me, please see [[User_talk:Shawndouglas|my discussion page]] instead.<p></p>
}}
}}


==Sandbox begins below==
==Sandbox begins below==
{{Infobox journal article
|name        =
|image        =
|alt          = <!-- Alternative text for images -->
|caption      =
|title_full  = systemPipeR: NGS workflow and report generation environment
|journal      = ''BMC Bioinformatics''
|authors      = Backman, Tyler W.H.; Girke, Thomas
|affiliations = University of California, Riverside
|contact      = Email: thomas dot girke at ucr dot edu
|editors      =
|pub_year    = 2016
|vol_iss      = '''17'''
|pages        = 388
|doi          = [http://10.1186/s12859-016-1241-0 10.1186/s12859-016-1241-0]
|issn        = 1471-2105
|license      = [http://creativecommons.org/licenses/by/4.0/ Creative Commons Attribution 4.0 International]
|website      = [https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1241-0 https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1241-0]
|download    = [https://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/s12859-016-1241-0 https://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/s12859-016-1241-0] (PDF)
}}
{{ombox
| type      = content
| style    = width: 500px;
| text      = This article should not be considered complete until this message box has been removed. This is a work in progress.
}}
==Abstract==
'''Background''': Next-generation sequencing (NGS) has revolutionized how research is carried out in many areas of biology and medicine. However, the analysis of NGS data remains a major obstacle to the efficient utilization of the technology, as it requires complex multi-step processing of big data, demanding considerable computational expertise from users. While substantial effort has been invested on the development of software dedicated to the individual analysis steps of NGS experiments, insufficient resources are currently available for integrating the individual software components within the widely used R/Bioconductor environment into automated [[Workflow|workflows]] capable of running the analysis of most types of NGS applications from start-to-finish in a time-efficient and reproducible manner.
'''Results''': To address this need, we have developed the R/Bioconductor package systemPipeR. It is an extensible environment for both building and running end-to-end analysis workflows with automated report generation for a wide range of NGS applications. Its unique features include a uniform workflow interface across different NGS applications, automated report generation, and support for running both R and command-line software on local computers and computer clusters. A flexible sample annotation infrastructure efficiently handles complex sample sets and experimental designs. To simplify the analysis of widely used NGS applications, the package provides pre-configured workflows and reporting templates for RNA-Seq, ChIP-Seq, VAR-Seq, and Ribo-Seq. Additional workflow templates will be provided in the future.
'''Conclusions''': systemPipeR accelerates the extraction of reproducible analysis results from NGS experiments. By combining the capabilities of many R/Bioconductor and command-line tools, it makes efficient use of existing software resources without limiting the user to a set of predefined methods or environments. systemPipeR is freely available for all common operating systems from Bioconductor (http://bioconductor.org/packages/devel/systemPipeR).
'''Keywords''': analysis workflow, next generation sequencing (NGS), Ribo-Seq, ChIP-Seq, RNA-Seq, VAR-Seq
==Background==
By allowing scientists to rapidly sequence and quantify DNA and RNA molecules, next-generation sequencing (NGS) technology has transformed biology into one of the most data intensive research disciplines. In the past, experiments have been performed on a gene-by-gene basis, while NGS has introduced an age where it is has become a routine to sequence entire transcriptomes, genomes, or epigenomes rather than their isolated parts of interest. It will soon be possible to conduct these experiments on large numbers of single cell samples<ref name="KaliskySingle11">{{cite journal |title=Single-cell genomics |journal=Nature Methods |author=Kalisky, T.; Quake, S.R. |volume=8 |issue=4 |pages=311–4 |year=2011 |doi=10.1038/nmeth0411-311 |pmid=21451520}}</ref><ref name="TrapnellTheDynamics14">{{cite journal |title=The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells |journal=Nature Biotechnology |author=Trapnell, C.; Cacchiarelli, D.; Grimsby, J. et al. |volume=32 |issue=4 |pages=381–86 |year=2014 |doi=10.1038/nbt.2859 |pmid=24658644 |pmc=PMC4122333}}</ref> for a wide range of time points, treatments, and genetic backgrounds to study biological systems with greater resolution and precision. Sequencing the genetic material of each individual within entire populations of organisms of the same species or genus will enable the study of adaptation processes<ref name="Lindblad-TohAHigh11">{{cite journal |title=A high-resolution map of human evolutionary constraint using 29 mammals |journal=Nature |author=Lindblad-Toh, K.; Garber, M.; Zuk, O. et al. |volume=478 |issue=7370 |pages=476–82 |year=2011 |doi=10.1038/nature10530 |pmid=21993624 |pmc=PMC3207357}}</ref>, disease progression, and micro-evolution in real time.<ref name="Kato-MaedaUseOf13">{{cite journal |title=Use of whole genome sequencing to determine the microevolution of Mycobacterium tuberculosis during an outbreak |journal=PLoS One |author=Kato-Maeda, M.; Ho, C.; Passarelli, B. et al. |volume=8 |issue=3 |pages=e58235 |year=2013 |doi=10.1371/journal.pone.0058235 |pmid=23472164 |pmc=PMC3589338}}</ref> This technological shift empowers researchers to address questions at a [[Genomics|genome-wide]] scale, for example by profiling the mRNA, miRNA, and DNA methylation states of a large set of biological samples in parallel.<ref name="HoltTheNew08">{{cite journal |title=The new paradigm of flow cell sequencing |journal=Genome Research |author=Holt, R.A.; Jones, S.J. |volume=18 |issue=6 |pages=839-46 |year=2008 |doi=10.1101/gr.073262.107 |pmid=18519653}}</ref>
==References==
{{Reflist|colwidth=30em}}
==Notes==
This presentation is faithful to the original, with only a few minor changes to presentation. In some cases important information was missing from the references, and that information was added.
<!--Place all category tags here-->
[[Category:LIMSwiki journal articles (added in 2018)‎]]
[[Category:LIMSwiki journal articles (all)‎]]
[[Category:LIMSwiki journal articles on bioinformatics]]
[[Category:LIMSwiki journal articles on software‎‎]]

Latest revision as of 17:47, 1 February 2022

Sandbox begins below