TY - JOUR
T1 - insiM
T2 - in silico Mutator Software for Bioinformatics Pipeline Validation of Clinical Next-Generation Sequencing Assays
AU - Patil, Sushant A.
AU - Mujacic, Ibro
AU - Ritterhouse, Lauren L.
AU - Segal, Jeremy P.
AU - Kadri, Sabah
N1 - Funding Information:
Supported by The University of Chicago Biological Sciences Division and University of Chicago Medicine.
Publisher Copyright:
© 2019 American Society for Investigative Pathology and the Association for Molecular Pathology
PY - 2019/1
Y1 - 2019/1
N2 - Lack of reliable reference samples containing different mutations of interest across large sets of disease-relevant loci limits the extensive validation clinical next-generation sequencing (NGS) assays and their associated bioinformatics pipelines. Herein, we have generated a publicly available, highly flexible tool, in silico Mutator (insiM), to introduce point mutations, insertions, deletions, and duplications of any size into real data sets of amplicon-based or hybrid-capture NGS assays. insiM accepts an alignment file along with target territory and produces paired-end FASTQ files containing specified mutations via modification of original sequencing reads. Mutant signal is, thus, generated within the context of existing real-world data to most closely mimic assay performance. Resulting files may then be passed through the assay's bioinformatics pipeline to assist with assay/bioinformatics validation and to identify performance gaps in detection. To establish the basic functionality of the software, a series of simulation experiments with varying mutation types, sizes, and allele frequencies were performed across the entire clinical territory of hybrid-capture and amplicon-based clinical assays developed at The University of Chicago. This work demonstrates the utility of insiM as a supplementary tool during the validation of an NGS assay's bioinformatics pipeline.
AB - Lack of reliable reference samples containing different mutations of interest across large sets of disease-relevant loci limits the extensive validation clinical next-generation sequencing (NGS) assays and their associated bioinformatics pipelines. Herein, we have generated a publicly available, highly flexible tool, in silico Mutator (insiM), to introduce point mutations, insertions, deletions, and duplications of any size into real data sets of amplicon-based or hybrid-capture NGS assays. insiM accepts an alignment file along with target territory and produces paired-end FASTQ files containing specified mutations via modification of original sequencing reads. Mutant signal is, thus, generated within the context of existing real-world data to most closely mimic assay performance. Resulting files may then be passed through the assay's bioinformatics pipeline to assist with assay/bioinformatics validation and to identify performance gaps in detection. To establish the basic functionality of the software, a series of simulation experiments with varying mutation types, sizes, and allele frequencies were performed across the entire clinical territory of hybrid-capture and amplicon-based clinical assays developed at The University of Chicago. This work demonstrates the utility of insiM as a supplementary tool during the validation of an NGS assay's bioinformatics pipeline.
UR - http://www.scopus.com/inward/record.url?scp=85058231247&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85058231247&partnerID=8YFLogxK
U2 - 10.1016/j.jmoldx.2018.08.001
DO - 10.1016/j.jmoldx.2018.08.001
M3 - Article
C2 - 30273779
AN - SCOPUS:85058231247
SN - 1525-1578
VL - 21
SP - 19
EP - 26
JO - Journal of Molecular Diagnostics
JF - Journal of Molecular Diagnostics
IS - 1
ER -