Skip to main content

Showing 1–5 of 5 results for author: Mukherji, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06079  [pdf, other

    cs.CV

    Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks

    Authors: Victor Boutin, Rishav Mukherji, Aditya Agrawal, Sabine Muzellec, Thomas Fel, Thomas Serre, Rufin VanRullen

    Abstract: Humans can effortlessly draw new categories from a single exemplar, a feat that has long posed a challenge for generative models. However, this gap has started to close with recent advances in diffusion models. This one-shot drawing task requires powerful inductive biases that have not been systematically investigated. Here, we study how different inductive biases shape the latent space of Latent… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2405.00433  [pdf, other

    cs.LG cs.AI cs.NE

    Weight Sparsity Complements Activity Sparsity in Neuromorphic Language Models

    Authors: Rishav Mukherji, Mark Schöne, Khaleelulla Khan Nazeer, Christian Mayr, David Kappel, Anand Subramoney

    Abstract: Activity and parameter sparsity are two standard methods of making neural networks computationally more efficient. Event-based architectures such as spiking neural networks (SNNs) naturally exhibit activity sparsity, and many methods exist to sparsify their connectivity by pruning weights. While the effect of weight pruning on feed-forward SNNs has been previously studied for computer vision tasks… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.07625

  3. arXiv:2312.09084  [pdf, other

    cs.NE cs.CL cs.ET cs.LG

    Language Modeling on a SpiNNaker 2 Neuromorphic Chip

    Authors: Khaleelulla Khan Nazeer, Mark Schöne, Rishav Mukherji, Bernhard Vogginger, Christian Mayr, David Kappel, Anand Subramoney

    Abstract: As large language models continue to scale in size rapidly, so too does the computational power required to run them. Event-based networks on neuromorphic devices offer a potential way to reduce energy consumption for inference significantly. However, to date, most event-based networks that can run on neuromorphic hardware, including spiking neural networks (SNNs), have not achieved task performan… ▽ More

    Submitted 24 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  4. arXiv:2311.07625  [pdf, ps, other

    cs.LG

    Activity Sparsity Complements Weight Sparsity for Efficient RNN Inference

    Authors: Rishav Mukherji, Mark Schöne, Khaleelulla Khan Nazeer, Christian Mayr, Anand Subramoney

    Abstract: Artificial neural networks open up unprecedented machine learning capabilities at the cost of ever growing computational requirements. Sparsifying the parameters, often achieved through weight pruning, has been identified as a powerful technique to compress the number of model parameters and reduce the computational operations of neural networks. Yet, sparse activations, while omnipresent in both… ▽ More

    Submitted 7 December, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to the First MLNCP Workshop @ NeurIPS 2023

  5. arXiv:2301.11722  [pdf, other

    cs.AI cs.HC

    Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?

    Authors: Victor Boutin, Thomas Fel, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre

    Abstract: An important milestone for AI is the development of algorithms that can produce drawings that are indistinguishable from those of humans. Here, we adapt the 'diversity vs. recognizability' scoring framework from Boutin et al, 2022 and find that one-shot diffusion models have indeed started to close the gap between humans and machines. However, using a finer-grained measure of the originality of in… ▽ More

    Submitted 31 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.