
Showing 1–3 of 3 results for author: Saul, R

Searching in archive cs.
  1. arXiv:2405.03991  [pdf, other]

    cs.CR cs.LG

    Assemblage: Automatic Binary Dataset Construction for Machine Learning

    Authors: Chang Liu, Rebecca Saul, Yihao Sun, Edward Raff, Maya Fuchs, Townsend Southard Pantano, James Holt, Kristopher Micinski

    Abstract: Binary code is pervasive, and binary analysis is a key task in reverse engineering, malware classification, and vulnerability discovery. Unfortunately, while there exist large corpuses of malicious binaries, obtaining high-quality corpuses of benign binaries for modern systems has proven challenging (e.g., due to licensing issues). Consequently, machine learning based pipelines for binary analysis…

    Submitted 7 May, 2024; originally announced May 2024.

  2. arXiv:2211.13250  [pdf, other]

    cs.LG cs.AI

    Lempel-Ziv Networks

    Authors: Rebecca Saul, Mohammad Mahmudul Alam, John Hurwitz, Edward Raff, Tim Oates, James Holt

    Abstract: Sequence processing has long been a central area of machine learning research. Recurrent neural nets have been successful in processing sequences for a number of tasks; however, they are known to be both ineffective and computationally expensive when applied to very long sequences. Compression-based methods have demonstrated more robustness when processing such sequences -- in particular, an appro…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: I Can't Believe It's Not Better Workshop at NeurIPS 2022

  3. arXiv:2011.04137  [pdf]

    cs.IR

    Automated data extraction of bar chart raster images

    Authors: Alex Carderas, Ye Yuan, Itamar Livnat, Ryan Yanagihara, Rosita Saul, Gabrielle Montes De Oca, Kai Zheng, Andrew W. Browne

    Abstract: Objective: To develop software utilizing optical character recognition toward the automatic extraction of data from bar charts for meta-analysis. Methods: We utilized a multistep data extraction approach that included figure extraction, text detection, and image disassembly. PubMed Central papers that were processed in this manner included clinical trials regarding macular degeneration, a disease…

    Submitted 8 November, 2020; originally announced November 2020.