Packaging, containerization, and virtualization of computational omics methods: Advances, challenges, and opportunities
Authors:
Mohammed Alser,
Sharon Waymost,
Ram Ayyala,
Brendan Lawlor,
Richard J. Abdill,
Neha Rajkumar,
Nathan LaPierre,
Jaqueline Brito,
Andre M. Ribeiro-dos-Santos,
Can Firtina,
Nour Almadhoun,
Varuni Sarwal,
Eleazar Eskin,
Qiyang Hu,
Derek Strong,
Byoung-Do Kim,
Malak S. Abedalthagafi,
Onur Mutlu,
Serghei Mangul
Abstract:
Omics software tools have reshaped the landscape of modern biology and become an essential component of biomedical research. The increasing dependence of biomedical scientists on these powerful tools creates a need for easier installation and greater usability. Packaging, virtualization, and containerization are different approaches to satisfy this need by wrapping omics tools in additional software that makes the omics tools easier to install and use. Here, we systematically review practices across prominent packaging, virtualization, and containerization platforms. We outline the challenges, advantages, and limitations of each approach and some of the most widely used platforms from the perspectives of users, software developers, and system administrators. We also propose principles to make packaging, virtualization, and containerization of omics software more sustainable and robust to increase the reproducibility of biomedical and life science research.
Submitted 30 March, 2022;
originally announced March 2022.
Unlocking capacities of viral genomics for the COVID-19 pandemic response
Authors:
Sergey Knyazev,
Karishma Chhugani,
Varuni Sarwal,
Ram Ayyala,
Harman Singh,
Smruthi Karthikeyan,
Dhrithi Deshpande,
Zoia Comarova,
Angela Lu,
Yuri Porozov,
Aiping Wu,
Malak Abedalthagafi,
Shivashankar Nagaraj,
Adam Smith,
Pavel Skums,
Jason Ladner,
Tommy Tsan-Yuk Lam,
Nicholas Wu,
Alex Zelikovsky,
Rob Knight,
Keith Crandall,
Serghei Mangul
Abstract:
More than any other infectious disease epidemic, the COVID-19 pandemic has been characterized by the generation of large volumes of viral genomic data at an incredible pace due to recent advances in high-throughput sequencing technologies, the rapid global spread of SARS-CoV-2, and its persistent threat to public health. However, distinguishing the most epidemiologically relevant information encoded in these vast amounts of data requires substantial effort across the research and public health communities. Studies of SARS-CoV-2 genomes have been critical in tracking the spread of variants and understanding its epidemic dynamics, and may prove crucial for controlling future epidemics and alleviating significant public health burdens. Together, genomic data and bioinformatics methods enable broad-scale investigations of the spread of SARS-CoV-2 at the local, national, and global scales and allow researchers the ability to efficiently track the emergence of novel variants, reconstruct epidemic dynamics, and provide important insights into drug and vaccine development and disease control. Here, we discuss the tremendous opportunities that genomics offers to unlock the effective use of SARS-CoV-2 genomic data for efficient public health surveillance and guiding timely responses to COVID-19.
Submitted 4 June, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.