-
PepSIRF + QIIME 2: software tools for automated, reproducible analysis of highly-multiplexed serology data
Authors:
Annabelle M. Brown,
Evan Bolyen,
Isaiah Raspet,
John A. Altin,
Jason T. Ladner
Abstract:
PepSIRF is a command-line, module-based open-source software package that facilitates the analysis of data from highly-multiplexed serology assays (e.g., PepSeq or PhIP-Seq). It has nine separate modules in its current release (v1.5.0): demux, info, subjoin, norm, bin, zscore, enrich, link, and deconv. These modules can be used together to conduct analyses ranging from demultiplexing raw high-thro…
▽ More
PepSIRF is a command-line, module-based open-source software package that facilitates the analysis of data from highly-multiplexed serology assays (e.g., PepSeq or PhIP-Seq). It has nine separate modules in its current release (v1.5.0): demux, info, subjoin, norm, bin, zscore, enrich, link, and deconv. These modules can be used together to conduct analyses ranging from demultiplexing raw high-throughput sequencing data to the identification of enriched peptides. QIIME 2 is an open-source, community-developed and plugin-based bioinformatics platform that focuses on data and analytical transparency. QIIME 2's features include integrated and automatic tracking of data provenance, a semantic type system, and built-in support for many types of user interfaces. Here, we describe three new QIIME 2 plugins that allow users to conduct PepSIRF analyses within the QIIME 2 environment and extend the core functionality of PepSIRF in two key ways: 1) enabling generation of interactive visualizations and 2) enabling automation of analysis pipelines that include multiple PepSIRF modules.
△ Less
Submitted 23 July, 2022;
originally announced July 2022.
-
Unlocking capacities of viral genomics for the COVID-19 pandemic response
Authors:
Sergey Knyazev,
Karishma Chhugani,
Varuni Sarwal,
Ram Ayyala,
Harman Singh,
Smruthi Karthikeyan,
Dhrithi Deshpande,
Zoia Comarova,
Angela Lu,
Yuri Porozov,
Ai** Wu,
Malak Abedalthagafi,
Shivashankar Nagaraj,
Adam Smith,
Pavel Skums,
Jason Ladner,
Tommy Tsan-Yuk Lam,
Nicholas Wu,
Alex Zelikovsky,
Rob Knight,
Keith Crandall,
Serghei Mangul
Abstract:
More than any other infectious disease epidemic, the COVID-19 pandemic has been characterized by the generation of large volumes of viral genomic data at an incredible pace due to recent advances in high-throughput sequencing technologies, the rapid global spread of SARS-CoV-2, and its persistent threat to public health. However, distinguishing the most epidemiologically relevant information encod…
▽ More
More than any other infectious disease epidemic, the COVID-19 pandemic has been characterized by the generation of large volumes of viral genomic data at an incredible pace due to recent advances in high-throughput sequencing technologies, the rapid global spread of SARS-CoV-2, and its persistent threat to public health. However, distinguishing the most epidemiologically relevant information encoded in these vast amounts of data requires substantial effort across the research and public health communities. Studies of SARS-CoV-2 genomes have been critical in tracking the spread of variants and understanding its epidemic dynamics, and may prove crucial for controlling future epidemics and alleviating significant public health burdens. Together, genomic data and bioinformatics methods enable broad-scale investigations of the spread of SARS-CoV-2 at the local, national, and global scales and allow researchers the ability to efficiently track the emergence of novel variants, reconstruct epidemic dynamics, and provide important insights into drug and vaccine development and disease control. Here, we discuss the tremendous opportunities that genomics offers to unlock the effective use of SARS-CoV-2 genomic data for efficient public health surveillance and guiding timely responses to COVID-19.
△ Less
Submitted 4 June, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
PepSIRF: a flexible and comprehensive tool for the analysis of data from highly-multiplexed DNA-barcoded peptide assays
Authors:
Zane W. Fink,
Vidal Martinez,
John Altin,
Jason T. Ladner
Abstract:
By coupling peptides with DNA tags (i.e., 'barcodes'), it is now possible to harness high-throughput sequencing (HTS) technologies to enable highly multiplexed peptide-based assays, which have a variety of potential applications including broad characterization of the epitopes recognized by antibodies. While the processing of HTS data, in general, is already well supported, there are very few soft…
▽ More
By coupling peptides with DNA tags (i.e., 'barcodes'), it is now possible to harness high-throughput sequencing (HTS) technologies to enable highly multiplexed peptide-based assays, which have a variety of potential applications including broad characterization of the epitopes recognized by antibodies. While the processing of HTS data, in general, is already well supported, there are very few software tools that have been developed for working with data generated in these highly-multiplexed peptide assays. In order to fill this gap, we present PepSIRF (Peptide-based Serological Immune Response Framework), which is a flexible and comprehensive software package designed specifically for the analysis of HTS data from highly-multiplexed peptide-based assays.
△ Less
Submitted 9 July, 2020;
originally announced July 2020.