Skip to main content

Showing 1–10 of 10 results for author: Nori, A V

.
  1. arXiv:2406.18786  [pdf, other

    cs.AR

    Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution

    Authors: Rahul Bera, Adithya Ranganathan, Joydeep Rakshit, Sujit Mahto, Anant V. Nori, Jayesh Gaur, Ataberk Olgun, Konstantinos Kanellopoulos, Mohammad Sadrosadati, Sreenivas Subramoney, Onur Mutlu

    Abstract: Load instructions often limit instruction-level parallelism (ILP) in modern processors due to data and resource dependences they cause. Prior techniques like Load Value Prediction (LVP) and Memory Renaming (MRN) mitigate load data dependence by predicting the data value of a load instruction. However, they fail to mitigate load resource dependence as the predicted load instruction gets executed no… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: To appear in the proceedings of 51st International Symposium on Computer Architecture (ISCA)

  2. arXiv:2312.00501  [pdf, other

    stat.ME

    Cautionary Tales on Synthetic Controls in Survival Analyses

    Authors: Alicia Curth, Hoifung Poon, Aditya V. Nori, Javier González

    Abstract: Synthetic control (SC) methods have gained rapid popularity in economics recently, where they have been applied in the context of inferring the effects of treatments on standard continuous outcomes assuming linear input-output relations. In medical applications, conversely, survival outcomes are often of primary interest, a setup in which both commonly assumed data-generating processes (DGPs) and… ▽ More

    Submitted 16 February, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: To appear in the 3rd Conference on Causal Learning and Reasoning (CLeaR 2024)

  3. arXiv:2311.03033  [pdf, ps, other

    cs.LG cs.AI

    Beyond Words: A Mathematical Framework for Interpreting Large Language Models

    Authors: Javier González, Aditya V. Nori

    Abstract: Large language models (LLMs) are powerful AI tools that can generate and comprehend natural language text and other complex information. However, the field lacks a mathematical framework to systematically describe, compare and improve LLMs. We propose Hex a framework that clarifies key terms and concepts in LLM research, such as hallucinations, alignment, self-verification and chain-of-thought rea… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 4 figures, 18 pages

  4. arXiv:2310.14573  [pdf, other

    cs.CL

    Exploring the Boundaries of GPT-4 in Radiology

    Authors: Qianchu Liu, Stephanie Hyland, Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Maria Teodora Wetscherek, Robert Tinn, Harshita Sharma, Fernando Pérez-García, Anton Schwaighofer, Pranav Rajpurkar, Sameer Tajdin Khanna, Hoifung Poon, Naoto Usuyama, Anja Thieme, Aditya V. Nori, Matthew P. Lungren, Ozan Oktay, Javier Alvarez-Valle

    Abstract: The recent success of general-domain large language models (LLMs) has significantly changed the natural language processing paradigm towards a unified foundation model across domains and applications. In this paper, we focus on assessing the performance of GPT-4, the most capable LLM so far, on the text-based applications for radiology reports, comparing against state-of-the-art (SOTA) radiology-s… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main

  5. arXiv:2205.14778  [pdf, other

    cs.AR cs.LG

    TransforMAP: Transformer for Memory Access Prediction

    Authors: Pengmiao Zhang, Ajitesh Srivastava, Anant V. Nori, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Data Prefetching is a technique that can hide memory latency by fetching data before it is needed by a program. Prefetching relies on accurate memory access prediction, to which task machine learning based methods are increasingly applied. Unlike previous approaches that learn from deltas or offsets and perform one access prediction, we develop TransforMAP, based on the powerful Transformer model,… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

  6. Fine-Grained Address Segmentation for Attention-Based Variable-Degree Prefetching

    Authors: Pengmiao Zhang, Ajitesh Srivastava, Anant V. Nori, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Machine learning algorithms have shown potential to improve prefetching performance by accurately predicting future memory accesses. Existing approaches are based on the modeling of text prediction, considering prefetching as a classification problem for sequence prediction. However, the vast and sparse memory address space leads to large vocabulary, which makes this modeling impractical. The numb… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

  7. Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning

    Authors: Rahul Bera, Konstantinos Kanellopoulos, Anant V. Nori, Taha Shahroodi, Sreenivas Subramoney, Onur Mutlu

    Abstract: Past research has proposed numerous hardware prefetching techniques, most of which rely on exploiting one specific type of program context information (e.g., program counter, cacheline address) to predict future memory accesses. These techniques either completely neglect a prefetcher's undesirable effects (e.g., memory bandwidth usage) on the overall system, or incorporate system-level feedback as… ▽ More

    Submitted 6 April, 2023; v1 submitted 24 September, 2021; originally announced September 2021.

    ACM Class: C.1.2

  8. arXiv:2011.11695  [pdf, other

    cs.AR

    Proximu$: Efficiently Scaling DNN Inference in Multi-core CPUs through Near-Cache Compute

    Authors: Anant V. Nori, Rahul Bera, Shankar Balachandran, Joydeep Rakshit, Om J. Omer, Avishaii Abuhatzera, Belliappa Kuttanna, Sreenivas Subramoney

    Abstract: Deep Neural Network (DNN) inference is emerging as the fundamental bedrock for a multitude of utilities and services. CPUs continue to scale up their raw compute capabilities for DNN inference along with mature high performance libraries to extract optimal performance. While general purpose CPUs offer unique attractive advantages for DNN inference at both datacenter and edge, they have primarily e… ▽ More

    Submitted 2 December, 2020; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: 18 pages, 21 figures

  9. DSPatch: Dual Spatial Pattern Prefetcher

    Authors: Rahul Bera, Anant V. Nori, Onur Mutlu, Sreenivas Subramoney

    Abstract: High main memory latency continues to limit performance of modern high-performance out-of-order cores. While DRAM latency has remained nearly the same over many generations, DRAM bandwidth has grown significantly due to higher frequencies, newer architectures (DDR4, LPDDR4, GDDR5) and 3D-stacked memory packaging (HBM). Current state-of-the-art prefetchers do not do well in extracting higher perfor… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: This work is to appear in MICRO 2019

  10. arXiv:1902.05983  [pdf, other

    cs.LG cs.PL cs.SE stat.ML

    Robustness of Neural Networks: A Probabilistic and Practical Approach

    Authors: Ravi Mangal, Aditya V. Nori, Alessandro Orso

    Abstract: Neural networks are becoming increasingly prevalent in software, and it is therefore important to be able to verify their behavior. Because verifying the correctness of neural networks is extremely challenging, it is common to focus on the verification of other properties of these systems. One important property, in particular, is robustness. Most existing definitions of robustness, however, focus… ▽ More

    Submitted 15 February, 2019; originally announced February 2019.

    Comments: Accepted for publication at ICSE-NIER 2019