Skip to main content

Showing 1–16 of 16 results for author: Mangul, S

.
  1. arXiv:2401.02965  [pdf

    cs.DL

    Perceptual and technical barriers in sharing and formatting metadata accompanying omics studies

    Authors: Yu-Ning Huang, Michael I. Love, Cynthia Flaire Ronkowski, Dhrithi Deshpande, Lynn M. Schriml, Annie Wong-Beringer, Barend Mons, Russell Corbett-Detig, Christopher I Hunter, Jason H. Moore, Lana X. Garmire, T. B. K. Reddy, Winston A. Hide, Atul J. Butte, Mark D. Robinson, Serghei Mangul

    Abstract: Metadata, often termed "data about data," is crucial for organizing, understanding, and managing vast omics datasets. It aids in efficient data discovery, integration, and interpretation, enabling users to access, comprehend, and utilize data effectively. Its significance spans the domains of scientific research, facilitating data reproducibility, reusability, and secondary analysis. However, nume… ▽ More

    Submitted 22 November, 2023; originally announced January 2024.

  2. arXiv:2311.02029  [pdf

    q-bio.GN cs.AR q-bio.QM

    MetaTrinity: Enabling Fast Metagenomic Classification via Seed Counting and Edit Distance Approximation

    Authors: Arvid E. Gollwitzer, Mohammed Alser, Joel Bergtholdt, Joel Lindegger, Maximilian-David Rumpf, Can Firtina, Serghei Mangul, Onur Mutlu

    Abstract: Metagenomics, the study of genome sequences of diverse organisms cohabiting in a shared environment, has experienced significant advancements across various medical and biological fields. Metagenomic analysis is crucial, for instance, in clinical applications such as infectious disease screening and the diagnosis and early detection of diseases such as cancer. A key task in metagenomics is to dete… ▽ More

    Submitted 16 February, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  3. arXiv:2310.16908  [pdf

    q-bio.GN cs.AR q-bio.QM

    SequenceLab: A Comprehensive Benchmark of Computational Methods for Comparing Genomic Sequences

    Authors: Maximilian-David Rumpf, Mohammed Alser, Arvid E. Gollwitzer, Joel Lindegger, Nour Almadhoun, Can Firtina, Serghei Mangul, Onur Mutlu

    Abstract: Computational complexity is a key limitation of genomic analyses. Thus, over the last 30 years, researchers have proposed numerous fast heuristic methods that provide computational relief. Comparing genomic sequences is one of the most fundamental computational steps in most genomic analyses. Due to its high computational complexity, optimized exact and heuristic algorithms are still being develop… ▽ More

    Submitted 21 January, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

  4. arXiv:2309.16994  [pdf

    q-bio.GN

    A rigorous benchmarking of methods for SARS-CoV-2 lineage abundance estimation in wastewater

    Authors: Viorel Munteanu, Victor Gordeev, Michael Saldana, Eva Aßmann, Justin Maine Su, Nicolae Drabcinski, Oksana Zlenko, Maryna Kit, Felicia Iordachi, Khooshbu Kantibhai Patel, Abdullah Al Nahid, Likhitha Chittampalli, Yidian Xu, Pavel Skums, Shelesh Agrawal, Martin Hölzer, Adam Smith, Alex Zelikovsky, Serghei Mangul

    Abstract: In light of the continuous transmission and evolution of SARS-CoV-2 coupled with a significant decline in clinical testing, there is a pressing need for scalable, cost-effective, long-term, passive surveillance tools to effectively monitor viral variants circulating in the population. Wastewater genomic surveillance of SARS-CoV-2 has arrived as an alternative to clinical genomic surveillance, allo… ▽ More

    Submitted 21 January, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: For correspondence: [email protected]

  5. arXiv:2309.13326  [pdf

    q-bio.GN

    SARS-CoV-2 Wastewater Genomic Surveillance: Approaches, Challenges, and Opportunities

    Authors: Viorel Munteanu, Michael Saldana, Dumitru Ciorba, Viorel Bostan, Justin Maine Su, Nadiia Kasianchuk, Nitesh Kumar Sharma, Sergey Knyazev, Victor Gordeev, Eva Aßmann, Andrei Lobiuc, Mihai Covasa, Keith A. Crandall, Wenhao O. Ouyang, Nicholas C. Wu, Christopher Mason, Braden T Tierney, Alexander G Lucaci, Alex Zelikovsky, Fatemeh Mohebbi, Pavel Skums, Cynthia Gibas, Jessica Schlueter, Piotr Rzymski, Helena Solo-Gabriele , et al. (3 additional authors not shown)

    Abstract: During the SARS-CoV-2 pandemic, wastewater-based genomic surveillance (WWGS) emerged as an efficient viral surveillance tool that takes into account asymptomatic cases and can identify known and novel mutations and offers the opportunity to assign known virus lineages based on the detected mutations profiles. WWGS can also hint towards novel or cryptic lineages, but it is difficult to clearly iden… ▽ More

    Submitted 30 January, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: V Munteanu and M Saldana contributed equally to this work. M Hölzer, A Smith and S Mangul jointly supervised this work. For correspondence: [email protected]

  6. arXiv:2308.09558  [pdf

    q-bio.GN

    Genomic reproducibility in the bioinformatics era

    Authors: Pelin Icer Baykal, Paweł P. Łabaj, Florian Markowetz, Lynn M. Schriml, Daniel J. Stekhoven, Serghei Mangul, Niko Beerenwinkel

    Abstract: In biomedical research, validation of a new scientific discovery is tied to the reproducibility of its experimental results. However, in genomics, the definition and implementation of reproducibility still remain imprecise. Here, we argue that genomic reproducibility, defined as the ability of bioinformatics tools to maintain consistent genomics results across technical replicates, is key to gener… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 10 pages, 2 figures, 2 tables

    MSC Class: J.3

  7. arXiv:2203.16261  [pdf

    q-bio.GN cs.DC cs.SE stat.AP

    Packaging, containerization, and virtualization of computational omics methods: Advances, challenges, and opportunities

    Authors: Mohammed Alser, Sharon Waymost, Ram Ayyala, Brendan Lawlor, Richard J. Abdill, Neha Rajkumar, Nathan LaPierre, Jaqueline Brito, Andre M. Ribeiro-dos-Santos, Can Firtina, Nour Almadhoun, Varuni Sarwal, Eleazar Eskin, Qiyang Hu, Derek Strong, Byoung-Do, Kim, Malak S. Abedalthagafi, Onur Mutlu, Serghei Mangul

    Abstract: Omics software tools have reshaped the landscape of modern biology and become an essential component of biomedical research. The increasing dependence of biomedical scientists on these powerful tools creates a need for easier installation and greater usability. Packaging, virtualization, and containerization are different approaches to satisfy this need by wrap** omics tools in additional softwa… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

  8. arXiv:2104.14005  [pdf

    q-bio.GN q-bio.PE

    Unlocking capacities of viral genomics for the COVID-19 pandemic response

    Authors: Sergey Knyazev, Karishma Chhugani, Varuni Sarwal, Ram Ayyala, Harman Singh, Smruthi Karthikeyan, Dhrithi Deshpande, Zoia Comarova, Angela Lu, Yuri Porozov, Ai** Wu, Malak Abedalthagafi, Shivashankar Nagaraj, Adam Smith, Pavel Skums, Jason Ladner, Tommy Tsan-Yuk Lam, Nicholas Wu, Alex Zelikovsky, Rob Knight, Keith Crandall, Serghei Mangul

    Abstract: More than any other infectious disease epidemic, the COVID-19 pandemic has been characterized by the generation of large volumes of viral genomic data at an incredible pace due to recent advances in high-throughput sequencing technologies, the rapid global spread of SARS-CoV-2, and its persistent threat to public health. However, distinguishing the most epidemiologically relevant information encod… ▽ More

    Submitted 4 June, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

  9. Pathogenesis, Symptomatology, and Transmission of SARS-CoV-2 through Analysis of Viral Genomics and Structure

    Authors: Halie M. Rando, Adam L. MacLean, Alexandra J. Lee, Ronan Lordan, Sandipan Ray, Vikas Bansal, Ashwin N. Skelly, Elizabeth Sell, John J. Dziak, Lamonica Shinholster, Lucy D'Agostino McGowan, Marouen Ben Guebila, Nils Wellhausen, Sergey Knyazev, Simina M. Boca, Stephen Capone, Yanjun Qi, YoSon Park, Yuchen Sun, David Mai, Joel D. Boerckel, Christian Brueffer, James Brian Byrd, Jeremy P. Kamil, **hui Wang , et al. (9 additional authors not shown)

    Abstract: The novel coronavirus SARS-CoV-2, which emerged in late 2019, has since spread around the world and infected hundreds of millions of people with coronavirus disease 2019 (COVID-19). While this viral species was unknown prior to January 2020, its similarity to other coronaviruses that infect humans has allowed for rapid insight into the mechanisms that it uses to infect human hosts, as well as the… ▽ More

    Submitted 3 December, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

  10. arXiv:2010.10402  [pdf

    q-bio.GN

    Diversity in immunogenomics: the value and the challenge

    Authors: Kerui Peng, Yana Safonova, Mikhail Shugay, Alice Popejoy, Oscar Rodriguez, Felix Breden, Petter Brodin, Amanda M. Burkhardt, Carlos Bustamante, Van-Mai Cao-Lormeau, Martin M. Corcoran, Darragh Duffy, Macarena Fuentes Guajardo, Ricardo Fujita, Victor Greiff, Vanessa D. Jonsson, Xiao Liu, Lluis Quintana-Murci, Maura Rossetti, Jianming Xie, Gur Yaari, Wei Zhang, Malak S. Abedalthagafi, Khalid O. Adekoya, Rahaman A. Ahmed , et al. (10 additional authors not shown)

    Abstract: With the advent of high-throughput sequencing technologies, the fields of immunogenomics and adaptive immune receptor repertoire research are facing both opportunities and challenges. Adaptive immune receptor repertoire sequencing (AIRR-seq) has become an increasingly important tool to characterize T and B cell responses in settings of interest. However, the majority of AIRR-seq studies conducted… ▽ More

    Submitted 1 March, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 22 pages,1 table

  11. arXiv:2010.02391  [pdf

    q-bio.GN

    RNA-seq data science: From raw data to effective interpretation

    Authors: Dhrithi Deshpande, Karishma Chhugani, Yutong Chang, Aaron Karlsberg, Caitlin Loeffler, **yang Zhang, Agata Muszynska, Jeremy Rotman, Laura Tao, Brunilda Balliu, Elizabeth Tseng, Eleazar Eskin, Fangqing Zhao, Pejman Mohammadi, Pawel P Labaj, Serghei Mangul

    Abstract: RNA-sequencing (RNA-seq) has become an exemplar technology in modern biology and clinical applications over the past decade. It has gained immense popularity in the recent years driven by continuous efforts of the bioinformatics community to develop accurate and scalable computational tools. RNA-seq is a method of analyzing the RNA content of a sample using the modern sequencing platforms. It gene… ▽ More

    Submitted 16 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

  12. arXiv:2003.00110  [pdf

    q-bio.GN q-bio.QM

    Technology dictates algorithms: Recent developments in read alignment

    Authors: Mohammed Alser, Jeremy Rotman, Kodi Taraszka, Huwenbo Shi, Pelin Icer Baykal, Harry Taegyun Yang, Victor Xue, Sergey Knyazev, Benjamin D. Singer, Brunilda Balliu, David Koslicki, Pavel Skums, Alex Zelikovsky, Can Alkan, Onur Mutlu, Serghei Mangul

    Abstract: Massively parallel sequencing techniques have revolutionized biological and medical sciences by providing unprecedented insight into the genomes of humans, animals, and microbes. Modern sequencing platforms generate enormous amounts of genomic data in the form of nucleotide sequences or reads. Aligning reads onto reference genomes enables the identification of individual-specific genetic variants… ▽ More

    Submitted 9 July, 2020; v1 submitted 28 February, 2020; originally announced March 2020.

    Journal ref: Genome Biol . Aug 26;22(1):249, 2021

  13. arXiv:2002.12268  [pdf

    physics.pop-ph

    Refining the conference experience for junior scientists in the wake of climate change

    Authors: Ruth Johnson, Andrada Fiscutean, Serghei Mangul

    Abstract: With the ever-increasing carbon footprint associated with conferences, scientists can learn to refine their conference experiences when they do need to travel. We offer insight on how to optimize the conference experience through attending speaker sessions, giving presentations, and networking.

    Submitted 17 June, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  14. arXiv:2001.05127  [pdf

    q-bio.OT

    Recommendations to enhance rigor and reproducibility in biomedical research

    Authors: Jaqueline J. Brito, Jun Li, Jason H. Moore, Casey S. Greene, Nicole A. Nogoy, Lana X. Garmire, Serghei Mangul

    Abstract: Computational methods have reshaped the landscape of modern biology. While the biomedical community is increasingly dependent on computational tools, the mechanisms ensuring open data, open software, and reproducibility are variably enforced by academic institutions, funders, and publishers. Publications may present academic software for which essential materials are or become unavailable, such as… ▽ More

    Submitted 27 July, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

  15. arXiv:1911.11304  [pdf

    q-bio.QM q-bio.GN

    Metagenomics for clinical diagnostics: technologies and informatics

    Authors: Caitlin Loeffler, Keylie M. Gibson, Lana Martin, Liz Chang, Jeremy Rotman, Ian V. Toma, Christopher E. Mason, Eleazar Eskin, Joseph P. Zackular, Keith A. Crandall, David Koslicki, Serghei Mangul

    Abstract: The human-associated microbiome is closely tied to human health and is of substantial clinical interest. Metagenomics-based tools are emerging for clinical diagnostics, tracking the spread of diseases, and surveillance of potential pathogens. In some cases, these tools are overcoming limitations of traditional clinical approaches. Metagenomics has limitations barring the tools from clinical valida… ▽ More

    Submitted 7 August, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 75 pages, 7 figures, 2 tables, 4 supplementary table, review paper

  16. arXiv:1909.12469  [pdf

    cs.DC

    Telescope: an interactive tool for managing large scale analysis from mobile devices

    Authors: Jaqueline J. Brito, Thiago Mosqueiro, Jeremy Rotman, Victor Xue, Douglas J. Chapski, Juan De la Hoz, Paulo Matias, Lana Martin, Alex Zelikovsky, Matteo Pellegrinni, Serghei Mangul

    Abstract: In today's world of big data, computational analysis has become a key driver of biomedical research. Recent exponential growth in the volume of available omics data has reshaped the landscape of contemporary biology, creating demand for a continuous feedback loop that seamlessly integrates experimental biology techniques and bioinformatics tools. High-performance computational facilities are capab… ▽ More

    Submitted 5 December, 2019; v1 submitted 26 September, 2019; originally announced September 2019.