Showing 1–38 of 38 results for author: Nair, P

  1. arXiv:2406.17542  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    CDQuant: Accurate Post-training Weight Quantization of Large Pre-trained Models using Greedy Coordinate Descent

    Authors: Pranav Ajit Nair, Arun Sai Suggala

    Abstract: Large language models (LLMs) have recently demonstrated remarkable performance across diverse language tasks. But their deployment is often constrained by their substantial computational and storage requirements. Quantization has emerged as a key technique for addressing this challenge, enabling the compression of large models with minimal impact on performance. The recent GPTQ algorithm, a post-t…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2404.14632  [pdf, other]

    cs.AR cs.DC

    Workload-Aware Hardware Accelerator Mining for Distributed Deep Learning Training

    Authors: Muhammad Adnan, Amar Phanishayee, Janardhan Kulkarni, Prashant J. Nair, Divya Mahajan

    Abstract: In this paper, we present a novel technique to search for hardware architectures of accelerators optimized for end-to-end training of deep neural networks (DNNs). Our approach addresses both single-device and distributed pipeline and tensor model parallel scenarios, the latter being addressed for the first time. The search optimized accelerators for training relevant metrics such as throughput/TDP und…

    Submitted 22 April, 2024; originally announced April 2024.

  3. arXiv:2404.04270  [pdf, other]

    cs.IR cs.LG

    Accelerating Recommender Model Training by Dynamically Skipping Stale Embeddings

    Authors: Yassaman Ebrahimzadeh Maboud, Muhammad Adnan, Divya Mahajan, Prashant J. Nair

    Abstract: Training recommendation models poses significant challenges regarding resource utilization and performance. Prior research has proposed an approach that categorizes embeddings into popular and non-popular classes to reduce the training time for recommendation models. We observe that, even among the popular embeddings, certain embeddings undergo rapid training and exhibit minimal subsequent variatio…

    Submitted 21 March, 2024; originally announced April 2024.

  4. arXiv:2403.09054  [pdf, other]

    cs.LG cs.AI cs.AR cs.CL

    Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference

    Authors: Muhammad Adnan, Akhil Arunkumar, Gaurav Jain, Prashant J. Nair, Ilya Soloveychik, Purushotham Kamath

    Abstract: Transformers have emerged as the underpinning architecture for Large Language Models (LLMs). In generative language models, the inference process involves two primary phases: prompt processing and token generation. Token generation, which constitutes the majority of the computational workload, primarily entails vector-matrix multiplications and interactions with the Key-Value (KV) Cache. This phas…

    Submitted 5 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    MSC Class: 68U35 ACM Class: I.2.7; C.0

    Journal ref: Proceedings of the 7th Annual Conference on Machine Learning and Systems (MLSys), 2024

  5. arXiv:2402.08644  [pdf, other]

    cs.AI cs.CL

    Tandem Transformers for Inference Efficient LLMs

    Authors: Aishwarya P S, Pranav Ajit Nair, Yashas Samaga, Toby Boyd, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

    Abstract: The autoregressive nature of conventional large language models (LLMs) inherently limits inference speed, as tokens are generated sequentially. While speculative and parallel decoding techniques attempt to mitigate this, they face limitations: either relying on less accurate smaller models for generation or failing to fully leverage the base LLM's representations. We introduce a novel architectu…

    Submitted 26 March, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2312.10550  [pdf, other]

    cs.LG stat.ML

    Amortized Reparametrization: Efficient and Scalable Variational Inference for Latent SDEs

    Authors: Kevin Course, Prasanth B. Nair

    Abstract: We consider the problem of inferring latent stochastic differential equations (SDEs) with a time and memory cost that scales independently with the amount of data, the total length of the time series, and the stiffness of the approximate differential equations. This is in stark contrast to typical methods for inferring latent differential equations which, despite their constant memory cost, have a…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: In Advances in Neural Information Processing Systems. 2023

  7. arXiv:2311.17143  [pdf, other]

    astro-ph.IM astro-ph.HE cs.LG stat.ML

    Predicting the Age of Astronomical Transients from Real-Time Multivariate Time Series

    Authors: Hali Huang, Daniel Muthukrishna, Prajna Nair, Zimi Zhang, Michael Fausnaugh, Torsha Majumder, Ryan J. Foley, George R. Ricker

    Abstract: Astronomical transients, such as supernovae and other rare stellar explosions, have been instrumental in some of the most significant discoveries in astronomy. New astronomical sky surveys will soon record unprecedented numbers of transients as sparsely and irregularly sampled multivariate time series. To improve our understanding of the physical mechanisms of transients and their progenitor syste…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures. Accepted at the NeurIPS 2023 Machine Learning and the Physical Sciences workshop

  8. arXiv:2311.05782  [pdf, other]

    cs.DC

    MPGemmFI: A Fault Injection Technique for Mixed Precision GEMM in ML Applications

    Authors: Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant Nair, Kevin Barker, Ang Li

    Abstract: Emerging deep learning workloads urgently need fast general matrix multiplication (GEMM). To meet such demand, one of the critical features of machine-learning-specific accelerators such as NVIDIA Tensor Cores, AMD Matrix Cores, and Google TPUs is the support of mixed-precision enabled GEMM. For DNN models, lower-precision FP data formats and computation offer acceptable correctness but significan…

    Submitted 9 November, 2023; originally announced November 2023.

  9. arXiv:2308.14902  [pdf, other]

    cs.IR cs.LG

    Ad-Rec: Advanced Feature Interactions to Address Covariate-Shifts in Recommendation Networks

    Authors: Muhammad Adnan, Yassaman Ebrahimzadeh Maboud, Divya Mahajan, Prashant J. Nair

    Abstract: Recommendation models are vital in delivering personalized user experiences by leveraging the correlation between multiple input features. However, deep learning-based recommendation models often face challenges due to evolving user behaviour and item features, leading to covariate shifts. Effective cross-feature learning is crucial to handle data distribution drift and adapting to changing user b…

    Submitted 28 August, 2023; originally announced August 2023.

  10. arXiv:2307.02623  [pdf, other]

    cs.LG cs.DC

    FLuID: Mitigating Stragglers in Federated Learning using Invariant Dropout

    Authors: Irene Wang, Prashant J. Nair, Divya Mahajan

    Abstract: Federated Learning (FL) allows machine learning models to train locally on individual mobile devices, synchronizing model updates via a shared server. This approach safeguards user privacy; however, it also generates a heterogeneous training environment due to the varying performance capabilities across devices. As a result, straggler devices with lower performance often dictate the overall traini…

    Submitted 26 September, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS), 2023

  11. arXiv:2305.16820  [pdf, other]

    cs.CL cs.AI

    Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization

    Authors: Pranav Ajit Nair, Sukomal Pal, Pradeepika Verma

    Abstract: Domain generalization is hitherto an underexplored area applied in abstractive summarization. Moreover, most existing works on domain generalization have sophisticated training algorithms. In this paper, we propose a lightweight, weight averaging based, Domain Aligned Prefix Averaging approach to domain generalization for abstractive summarization. Given a number of source domains, our method firs…

    Submitted 29 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 13 pages, Accepted to ACL 2023 Findings

  12. arXiv:2305.15108  [pdf, other]

    cs.CL

    The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing

    Authors: Debayan Banerjee, Pranav Ajit Nair, Ricardo Usbeck, Chris Biemann

    Abstract: In this work, we analyse the role of output vocabulary for text-to-text (T2T) models on the task of SPARQL semantic parsing. We perform experiments within the context of knowledge graph question answering (KGQA), where the task is to convert questions in natural language to the SPARQL query language. We observe that the query vocabulary is distinct from human vocabulary. Language Models (LMs)…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted as a short paper to ACL 2023 findings

  13. arXiv:2303.13284  [pdf, other]

    cs.CL cs.DB cs.IR

    GETT-QA: Graph Embedding based T2T Transformer for Knowledge Graph Question Answering

    Authors: Debayan Banerjee, Pranav Ajit Nair, Ricardo Usbeck, Chris Biemann

    Abstract: In this work, we present an end-to-end Knowledge Graph Question Answering (KGQA) system named GETT-QA. GETT-QA uses T5, a popular text-to-text pre-trained language model. The model takes a question in natural language as input and produces a simpler form of the intended SPARQL query. In the simpler form, the model does not directly produce entity and relation IDs. Instead, it produces correspondin…

    Submitted 28 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 16 pages single column format accepted at ESWC 2023 research track

  14. Biologically inspired ChaosNet architecture for Hypothetical Protein Classification

    Authors: Sneha K H, Adhithya Sudeesh, Pramod P Nair, Prashanth Suravajhala

    Abstract: ChaosNet is a type of artificial neural network framework developed for classification problems and is influenced by the chaotic property of the human brain. Each neuron of the ChaosNet architecture is the one-dimensional chaotic map called the Generalized Luroth Series (GLS). The addition of GLS as neurons in ChaosNet makes the computations straightforward while utilizing the advantageous element…

    Submitted 5 February, 2023; originally announced February 2023.

  15. Physics-informed Neural Networks approach to solve the Blasius function

    Authors: Greeshma Krishna, Malavika S Nair, Pramod P Nair, Anil Lal S

    Abstract: Deep learning techniques with neural networks have been used effectively in computational fluid dynamics (CFD) to obtain solutions to nonlinear differential equations. This paper presents a physics-informed neural network (PINN) approach to solve the Blasius function. This method eliminates the process of changing the non-linear differential equation to an initial value problem. Also, it tackles t…

    Submitted 5 February, 2023; v1 submitted 30 December, 2022; originally announced January 2023.

  16. Scalable and Secure Row-Swap: Efficient and Safe Row Hammer Mitigation in Memory Systems

    Authors: Jeonghyun Woo, Gururaj Saileshwar, Prashant J. Nair

    Abstract: As Dynamic Random Access Memories (DRAM) scale, they are becoming increasingly susceptible to Row Hammer. By rapidly activating rows of DRAM cells (aggressor rows), attackers can exploit inter-cell interference through Row Hammer to flip bits in neighboring rows (victim rows). A recent work, called Randomized Row-Swap (RRS), proposed proactively swapping aggressor rows with randomly selected rows…

    Submitted 23 December, 2022; originally announced December 2022.

    Journal ref: The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2022)

  17. The Dirty Secret of SSDs: Embodied Carbon

    Authors: Swamit Tannu, Prashant J. Nair

    Abstract: Scalable Solid-State Drives (SSDs) have ushered in a transformative era in data storage and accessibility, spanning both data centers and portable devices. However, the strides made in scaling this technology can bear significant environmental consequences. On a global scale, a notable portion of semiconductor manufacturing relies on electricity derived from coal and natural gas sources. A strikin…

    Submitted 28 September, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Journal ref: Energy Informatics Review (Volume 3 Issue 3, October 2023)

  18. arXiv:2205.04785  [pdf, other]

    cs.DM math.CO

    Upper Bounds to Genome Rearrangement Problem using Prefix Transpositions

    Authors: Pramod P Nair

    Abstract: A Genome rearrangement problem studies large-scale mutations on a set of DNAs in living organisms. Various rearrangements like reversals, transpositions, translocations, fissions, fusions, and combinations and different variations have been studied extensively by computational biologists and computer scientists over the past four decades. From a mathematical point of view, a genome is represented…

    Submitted 10 May, 2022; originally announced May 2022.

  19. Modern Baselines for SPARQL Semantic Parsing

    Authors: Debayan Banerjee, Pranav Ajit Nair, Jivat Neet Kaur, Ricardo Usbeck, Chris Biemann

    Abstract: In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entity and relations have been provided, and the remaining task is to arrange them in the right order along with SPARQL vocabulary, and input tokens to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not…

    Submitted 14 September, 2023; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: 5 pages, short paper, SIGIR 2022

  20. arXiv:2204.05436  [pdf, other]

    cs.AR cs.AI cs.LG

    Heterogeneous Acceleration Pipeline for Recommendation System Training

    Authors: Muhammad Adnan, Yassaman Ebrahimzadeh Maboud, Divya Mahajan, Prashant J. Nair

    Abstract: Recommendation models rely on deep learning networks and large embedding tables, resulting in computationally and memory-intensive processes. These models are typically trained using hybrid CPU-GPU or GPU-only configurations. The hybrid mode combines the GPU's neural network acceleration with the CPUs' memory storage and supply for embedding tables but may incur significant CPU-to-GPU transfer tim…

    Submitted 28 April, 2024; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted at The International Symposium on Computer Architecture (ISCA), 2024

  21. arXiv:2203.13892  [pdf, other]

    quant-ph cs.ET

    TQSim: A Case for Reuse-Focused Tree-Based Quantum Circuit Simulation

    Authors: Meng Wang, Rui Huang, Swamit Tannu, Prashant Nair

    Abstract: Quantum computers can speed up computationally hard problems. However, to realize their full potential, we must mitigate qubit errors (from noise) by developing noise-aware algorithms, compilers, and architectures. Thus, simulating quantum programs on classical computers with different noise models is a de-facto tool that is used by researchers and practitioners. Unfortunately, noisy quantum simul…

    Submitted 25 March, 2022; originally announced March 2022.

  22. arXiv:2104.10348  [pdf, other]

    math.OC cs.CV eess.IV

    Fixed-Point and Objective Convergence of Plug-and-Play Algorithms

    Authors: Pravin Nair, Ruturaj G. Gavaskar, Kunal N. Chaudhury

    Abstract: A standard model for image reconstruction involves the minimization of a data-fidelity term along with a regularizer, where the optimization is performed using proximal algorithms such as ISTA and ADMM. In plug-and-play (PnP) regularization, the proximal operator (associated with the regularizer) in ISTA and ADMM is replaced by a powerful image denoiser. Although PnP regularization works surprisin…

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: Published in IEEE Transactions on Computational Imaging

    Journal ref: in IEEE Transactions on Computational Imaging, vol. 7, pp. 337-348, 2021

  23. arXiv:2104.05096  [pdf, other]

    cs.LG

    Weak Form Generalized Hamiltonian Learning

    Authors: Kevin L. Course, Trefor W. Evans, Prasanth B. Nair

    Abstract: We present a method for learning generalized Hamiltonian decompositions of ordinary differential equations given a set of noisy time series measurements. Our method simultaneously learns a continuous time model and a scalar energy function for a general dynamical system. Learning predictive models in this form allows one to place strong, high-level, physics inspired priors onto the form of the lea…

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: 34th Conference on Neural Information Processing Systems, 18 pages

    Journal ref: Advances in Neural Information Processing Systems. Vol. 33 (2020), pp. 18716-18726

  24. arXiv:2103.08074  [pdf, other]

    cs.CV

    Pushing the Limits of Capsule Networks

    Authors: Prem Nair, Rohan Doshi, Stefan Keselj

    Abstract: Convolutional neural networks use pooling and other downscaling operations to maintain translational invariance for detection of features, but in their architecture they do not explicitly maintain a representation of the locations of the features relative to each other. This means they do not represent two instances of the same object in different orientations the same way, like humans do, and so…

    Submitted 14 March, 2021; originally announced March 2021.

  25. arXiv:2103.00686  [pdf, other]

    cs.IR cs.AI cs.AR cs.LG

    Accelerating Recommendation System Training by Leveraging Popular Choices

    Authors: Muhammad Adnan, Yassaman Ebrahimzadeh Maboud, Divya Mahajan, Prashant J. Nair

    Abstract: Recommender models are commonly used to suggest relevant items to a user for e-commerce and online advertisement-based applications. These models use massive embedding tables to store numerical representation of items' and users' categorical variables (memory intensive) and employ neural networks (compute intensive) to generate final recommendations. Training these large-scale recommendation model…

    Submitted 28 September, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

    ACM Class: I.2.6; C.5.0

    Journal ref: Proceedings of the VLDB Endowment, 2022

  26. arXiv:2007.03437  [pdf, other]

    cs.LG cs.AI stat.ML

    Group Equivariant Deep Reinforcement Learning

    Authors: Arnab Kumar Mondal, Pratheeksha Nair, Kaleem Siddiqi

    Abstract: In Reinforcement Learning (RL), Convolutional Neural Networks (CNNs) have been successfully applied as function approximators in Deep Q-Learning algorithms, which seek to learn action-value functions and policies in various environments. However, to date, there has been little work on the learning of symmetry-transformation equivariant representations of the input environment state. In this paper,…

    Submitted 30 June, 2020; originally announced July 2020.

    Comments: Presented at the ICML 2020 Workshop on Inductive Biases, Invariances and Generalization in RL

  27. arXiv:2006.03015  [pdf, other]

    stat.ML cs.LG

    Quadruply Stochastic Gaussian Processes

    Authors: Trefor W. Evans, Prasanth B. Nair

    Abstract: We introduce a stochastic variational inference procedure for training scalable Gaussian process (GP) models whose per-iteration complexity is independent of both the number of training points, $n$, and the number of basis functions used in the kernel approximation, $m$. Our central contributions include an unbiased stochastic estimator of the evidence lower bound (ELBO) for a Gaussian likelihood, as…

    Submitted 4 June, 2020; originally announced June 2020.

  28. arXiv:1911.11834  [pdf, other]

    cs.CV

    Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation

    Authors: Zeyu Wang, Klint Qinami, Ioannis Christos Karakozis, Kyle Genova, Prem Nair, Kenji Hata, Olga Russakovsky

    Abstract: Computer vision models learn to perform a task by capturing relevant statistics from training data. It has been shown that models learn spurious age, gender, and race correlations when trained for seemingly unrelated tasks like activity recognition or image captioning. Various mitigation techniques have been presented to prevent models from utilizing or learning such biases. However, there has bee…

    Submitted 2 April, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: To appear in CVPR 2020

  29. arXiv:1909.00553  [pdf, ps, other]

    cs.AR cs.DC cs.PF eess.SY

    Touché: Towards Ideal and Efficient Cache Compression By Mitigating Tag Area Overheads

    Authors: Seokin Hong, Bulent Abali, Alper Buyuktosunoglu, Michael B. Healy, Prashant J. Nair

    Abstract: Compression is seen as a simple technique to increase the effective cache capacity. Unfortunately, compression techniques either incur tag area overheads or restrict data placement to only include neighboring compressed cache blocks to mitigate tag area overheads. Ideally, we should be able to place arbitrary compressed cache blocks without any placement restrictions and tag area overheads. This…

    Submitted 2 September, 2019; originally announced September 2019.

    Comments: Keywords: Compression, Caches, Tag Array, Data Array, Hashing

    Journal ref: Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, October 2019, Pages 453-465

  30. Fast High-Dimensional Kernel Filtering

    Authors: Pravin Nair, Kunal N. Chaudhury

    Abstract: The bilateral and nonlocal means filters are instances of kernel-based filters that are popularly used in image processing. It was recently shown that fast and accurate bilateral filtering of grayscale images can be performed using a low-rank approximation of the kernel matrix. More specifically, based on the eigendecomposition of the kernel matrix, the overall filtering was approximated using spa…

    Submitted 18 January, 2019; originally announced January 2019.

    Journal ref: IEEE Signal Processing Letters 2019

  31. arXiv:1812.08882  [pdf, other]

    cs.CV

    An Optical Flow-Based Approach for Minimally-Divergent Velocimetry Data Interpolation

    Authors: Berkay Kanberoglu, Dhritiman Das, Priya Nair, Pavan Turaga, David Frakes

    Abstract: Three-dimensional (3D) biomedical image sets are often acquired with in-plane pixel spacings that are far less than the out-of-plane spacings between images. The resultant anisotropy, which can be detrimental in many applications, can be decreased using image interpolation. Optical flow and/or other registration-based interpolators have proven useful in such interpolation roles in the past. When a…

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: 24 pages, 10 figures, International Journal of Biomedical Imaging, accepted manuscript

  32. Fast High-Dimensional Bilateral and Nonlocal Means Filtering

    Authors: Pravin Nair, Kunal N. Chaudhury

    Abstract: Existing fast algorithms for bilateral and nonlocal means filtering mostly work with grayscale images. They cannot easily be extended to high-dimensional data such as color and hyperspectral images, patch-based data, flow-fields, etc. In this paper, we propose a fast algorithm for high-dimensional bilateral and nonlocal means filtering. Unlike existing approaches, where the focus is on approximati…

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: This work is accepted in IEEE Transactions on Image Processing

  33. arXiv:1809.04279  [pdf, other]

    stat.ML cs.LG

    Discretely Relaxing Continuous Variables for tractable Variational Inference

    Authors: Trefor W. Evans, Prasanth B. Nair

    Abstract: We explore a new research direction in Bayesian variational inference with discrete latent variable priors where we exploit Kronecker matrix algebra for efficient and exact computations of the evidence lower bound (ELBO). The proposed "DIRECT" approach has several advantages over its predecessors; (i) it can exactly compute ELBO gradients (i.e. unbiased, zero-variance gradient estimates), eliminat…

    Submitted 9 January, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: Appears in the proceedings of the Advances in Neural Information Processing Systems (NeurIPS), 2018. Full code is available at https://github.com/treforevans/direct

  34. arXiv:1808.03351  [pdf, other]

    stat.ML cs.LG

    Exploiting Structure for Fast Kernel Learning

    Authors: Trefor W. Evans, Prasanth B. Nair

    Abstract: We propose two methods for exact Gaussian process (GP) inference and learning on massive image, video, spatial-temporal, or multi-output datasets with missing values (or "gaps") in the observed responses. The first method ignores the gaps using sparse selection matrices and a highly effective low-rank preconditioner is introduced to accelerate computations. The second method introduces a novel app…

    Submitted 9 August, 2018; originally announced August 2018.

    Comments: Appears in the proceedings of the SIAM International Conference on Data Mining (SDM), 2018

  35. arXiv:1807.02125  [pdf, other]

    stat.ML cs.LG

    Scalable Gaussian Processes with Grid-Structured Eigenfunctions (GP-GRIEF)

    Authors: Trefor W. Evans, Prasanth B. Nair

    Abstract: We introduce a kernel approximation strategy that enables computation of the Gaussian process log marginal likelihood and all hyperparameter derivatives in $\mathcal{O}(p)$ time. Our GRIEF kernel consists of $p$ eigenfunctions found using a Nystrom approximation from a dense Cartesian product grid of inducing points. By exploiting algebraic properties of Kronecker and Khatri-Rao tensor products, c…

    Submitted 1 August, 2018; v1 submitted 5 July, 2018; originally announced July 2018.

    Comments: Appears in the proceedings of the International Conference on Machine Learning (ICML), 2018

  36. arXiv:1805.03184  [pdf, other]

    cs.AR

    LISA: Increasing Internal Connectivity in DRAM for Fast Data Movement and Low Latency

    Authors: Kevin K. Chang, Prashant J. Nair, Saugata Ghose, Donghyuk Lee, Moinuddin K. Qureshi, Onur Mutlu

    Abstract: This paper summarizes the idea of Low-Cost Interlinked Subarrays (LISA), which was published in HPCA 2016, and examines the work's significance and future potential. Contemporary systems perform bulk data movement inefficiently, by transferring data from DRAM to the processor, and then back to DRAM, across a narrow off-chip channel. The use of this narrow channel results in high latency a…

    Submitted 8 May, 2018; originally announced May 2018.

  37. arXiv:1710.01330  [pdf, other]

    cs.RO cs.CV

    Robotic Pick-and-Place of Novel Objects in Clutter with Multi-Affordance Grasping and Cross-Domain Image Matching

    Authors: Andy Zeng, Shuran Song, Kuan-Ting Yu, Elliott Donlon, Francois R. Hogan, Maria Bauza, Daolin Ma, Orion Taylor, Melody Liu, Eudald Romo, Nima Fazeli, Ferran Alet, Nikhil Chavan Dafle, Rachel Holladay, Isabella Morona, Prem Qu Nair, Druck Green, Ian Taylor, Weber Liu, Thomas Funkhouser, Alberto Rodriguez

    Abstract: This paper presents a robotic pick-and-place system that is capable of grasping and recognizing both known and novel objects in cluttered environments. The key new feature of the system is that it handles a wide range of object categories without needing any task-specific training data for novel objects. To achieve this, it first uses a category-agnostic affordance prediction algorithm to select a…

    Submitted 30 May, 2020; v1 submitted 3 October, 2017; originally announced October 2017.

    Comments: Project webpage: http://arc.cs.princeton.edu Summary video: https://youtu.be/6fG7zwGfIkI

  38. arXiv:1704.03991  [pdf, ps, other]

    cs.AR cs.ET

    Architectural Techniques to Enable Reliable and Scalable Memory Systems

    Authors: Prashant J. Nair

    Abstract: High capacity and scalable memory systems play a vital role in enabling our desktops, smartphones, and pervasive technologies like Internet of Things (IoT). Unfortunately, memory systems are becoming increasingly prone to faults. This is because we rely on technology scaling to improve memory density, and at small feature sizes, memory cells tend to break easily. Today, memory reliability is seen…

    Submitted 13 April, 2017; originally announced April 2017.

    Comments: PhD thesis, Georgia Institute of Technology (May 2017)