Skip to main content

Showing 1–10 of 10 results for author: Kaplan, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00332  [pdf, other

    cs.CL cs.AI cs.LG

    A Careful Examination of Large Language Model Performance on Grade School Arithmetic

    Authors: Hugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, Will Song, Tiffany Zhao, Pranav Raja, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue

    Abstract: Large language models (LLMs) have achieved impressive success on many benchmarks for mathematical reasoning. However, there is growing concern that some of this performance actually reflects dataset contamination, where data closely resembling benchmark questions leaks into the training data, instead of true reasoning ability. To investigate this claim rigorously, we commission Grade School Math 1… ▽ More

    Submitted 3 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2403.03218  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer , et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in develo** biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are develo** evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  3. arXiv:2304.14999  [pdf, other

    cs.CL cs.AI

    Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs

    Authors: George Pu, Anirudh Jain, Jihan Yin, Russell Kaplan

    Abstract: As foundation models continue to exponentially scale in size, efficient methods of adaptation become increasingly critical. Parameter-efficient fine-tuning (PEFT), a recent class of techniques that require only modifying a small percentage of the model parameters, is currently the most popular method for adapting large language models (LLMs). Several PEFT techniques have recently been proposed wit… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Short paper, ICLR '23 Workshop on Understanding Foundation Models

  4. arXiv:1901.05959  [pdf

    cs.ET cs.AR

    BioSEAL: In-Memory Biological Sequence Alignment Accelerator for Large-Scale Genomic Data

    Authors: Roman Kaplan, Leonid Yavits, Ran Ginosar

    Abstract: Genome sequences contain hundreds of millions of DNA base pairs. Finding the degree of similarity between two genomes requires executing a compute-intensive dynamic programming algorithm, such as Smith-Waterman. Traditional von Neumann architectures have limited parallelism and cannot provide an efficient solution for large-scale genomic data. Approximate heuristic methods (e.g. BLAST) are commonl… ▽ More

    Submitted 17 January, 2019; originally announced January 2019.

  5. arXiv:1901.04976  [pdf

    cs.DC

    AIDA: Associative DNN Inference Accelerator

    Authors: Leonid Yavits, Roman Kaplan, Ran Ginosar

    Abstract: We propose AIDA, an inference engine for accelerating fully-connected (FC) layers of Deep Neural Network (DNN). AIDA is an associative in-memory processor, where the bulk of data never leaves the confines of the memory arrays, and processing is performed in-situ. AIDA area and energy efficiency strongly benefit from sparsity and lower arithmetic precision. We show that AIDA outperforms the state o… ▽ More

    Submitted 20 December, 2018; originally announced January 2019.

  6. arXiv:1809.01127  [pdf

    q-bio.GN cs.ET

    RASSA: Resistive Pre-Alignment Accelerator for Approximate DNA Long Read Map**

    Authors: Roman Kaplan, Leonid Yavits, Ran Ginosar

    Abstract: DNA read map** is a computationally expensive bioinformatics task, required for genome assembly and consensus polishing. It requires to find the best-fitting location for each DNA read on a long reference sequence. A novel resistive approximate similarity search accelerator, RASSA, exploits charge distribution and parallel in-memory processing to reflect a mismatch count between DNA sequences. R… ▽ More

    Submitted 28 January, 2019; v1 submitted 2 September, 2018; originally announced September 2018.

  7. arXiv:1807.09937  [pdf, other

    cs.CV cs.LG

    HiDDeN: Hiding Data With Deep Networks

    Authors: Jiren Zhu, Russell Kaplan, Justin Johnson, Li Fei-Fei

    Abstract: Recent work has shown that deep neural networks are highly sensitive to tiny perturbations of input images, giving rise to adversarial examples. Though this property is usually considered a weakness of learned models, we explore whether it can be beneficial. We find that neural networks can learn to use invisible perturbations to encode a rich amount of useful information. In fact, one can exploit… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

  8. arXiv:1805.09612  [pdf

    cs.AR

    PRINS: Resistive CAM Processing in Storage

    Authors: Leonid Yavits, Roman Kaplan, Ran Ginosar

    Abstract: Near-data in-storage processing research has been gaining momentum in recent years. Typical processing-in-storage architecture places a single or several processing cores inside the storage and allows data processing without transferring it to the host CPU. Since this approach replicates von Neumann architecture inside storage, it is exposed to the problems faced by von Neumann architecture, espec… ▽ More

    Submitted 17 March, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

  9. arXiv:1704.05539  [pdf, other

    cs.AI

    Beating Atari with Natural Language Guided Reinforcement Learning

    Authors: Russell Kaplan, Christopher Sauer, Alexander Sosa

    Abstract: We introduce the first deep reinforcement learning agent that learns to beat Atari games with the aid of natural language instructions. The agent uses a multimodal embedding between environment observations and natural language to self-monitor progress through a list of English instructions, granting itself reward for completing instructions in addition to increasing the game score. Our agent sign… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.

  10. A Resistive CAM Processing-in-Storage Architecture for DNA Sequence Alignment

    Authors: Roman Kaplan, Leonid Yavits, Ran Ginosar, Uri Weiser

    Abstract: A novel processing-in-storage (PRinS) architecture based on Resistive CAM (ReCAM) is described and proposed for Smith-Waterman (S-W) sequence alignment. The ReCAM massively-parallel compare operation finds matching base-pairs in a fixed number of cycles, regardless of sequence length. The ReCAM PRinS S-W algorithm is simulated and compared to FPGA, Xeon Phi and GPU-based implementations, showing a… ▽ More

    Submitted 11 June, 2017; v1 submitted 17 January, 2017; originally announced January 2017.

    Journal ref: IEEE Micro, vol. 37, no. 4, pp. 20-28, 2017