TY - JOUR
T1 - Recommendations for the Use of in Silico Approaches for Next-Generation Sequencing Bioinformatic Pipeline Validation
T2 - A Joint Report of the Association for Molecular Pathology, Association for Pathology Informatics, and College of American Pathologists
AU - Duncavage, Eric J.
AU - Coleman, Joshua F.
AU - de Baca, Monica E.
AU - Kadri, Sabah
AU - Leon, Annette
AU - Routbort, Mark
AU - Roy, Somak
AU - Suarez, Carlos J.
AU - Vanderbilt, Chad
AU - Zook, Justin M.
N1 - Publisher Copyright:
© 2023 Association for Molecular Pathology and American Society for Investigative Pathology
PY - 2023/1
Y1 - 2023/1
N2 - In silico approaches for next-generation sequencing (NGS) data modeling have utility in the clinical laboratory as a tool for clinical assay validation. In silico NGS data can take a variety of forms, including pure simulated data or manipulated data files in which variants are inserted into existing data files. In silico data enable simulation of a range of variants that may be difficult to obtain from a single physical sample. Such data allow laboratories to more accurately test the performance of clinical bioinformatics pipelines without sequencing additional cases. For example, clinical laboratories may use in silico data to simulate low variant allele fraction variants to test the analytical sensitivity of variant calling software or simulate a range of insertion/deletion sizes to determine the performance of insertion/deletion calling software. In this article, the Working Group reviews the different types of in silico data with their strengths and limitations, methods to generate in silico data, and how data can be used in the clinical molecular diagnostic laboratory. Survey data indicate how in silico NGS data are currently being used. Finally, potential applications for which in silico data may become useful in the future are presented.
AB - In silico approaches for next-generation sequencing (NGS) data modeling have utility in the clinical laboratory as a tool for clinical assay validation. In silico NGS data can take a variety of forms, including pure simulated data or manipulated data files in which variants are inserted into existing data files. In silico data enable simulation of a range of variants that may be difficult to obtain from a single physical sample. Such data allow laboratories to more accurately test the performance of clinical bioinformatics pipelines without sequencing additional cases. For example, clinical laboratories may use in silico data to simulate low variant allele fraction variants to test the analytical sensitivity of variant calling software or simulate a range of insertion/deletion sizes to determine the performance of insertion/deletion calling software. In this article, the Working Group reviews the different types of in silico data with their strengths and limitations, methods to generate in silico data, and how data can be used in the clinical molecular diagnostic laboratory. Survey data indicate how in silico NGS data are currently being used. Finally, potential applications for which in silico data may become useful in the future are presented.
UR - http://www.scopus.com/inward/record.url?scp=85143868240&partnerID=8YFLogxK
U2 - 10.1016/j.jmoldx.2022.09.007
DO - 10.1016/j.jmoldx.2022.09.007
M3 - Review article
C2 - 36244574
AN - SCOPUS:85143868240
SN - 1525-1578
VL - 25
SP - 3
EP - 16
JO - Journal of Molecular Diagnostics
JF - Journal of Molecular Diagnostics
IS - 1
ER -