Skip to main content

Showing 1–5 of 5 results for author: Jahromi, S S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.14379  [pdf, other

    cs.CV cs.LG quant-ph

    Tensor network compressibility of convolutional models

    Authors: Sukhbinder Singh, Saeed S. Jahromi, Roman Orus

    Abstract: Convolutional neural networks (CNNs) represent one of the most widely used neural network architectures, showcasing state-of-the-art performance in computer vision tasks. Although larger CNNs generally exhibit higher accuracy, their size can be effectively reduced by "tensorization" while maintaining accuracy. Tensorization consists of replacing the convolution kernels with compact decompositions… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 20 pages, 21 images

  2. arXiv:2401.14109  [pdf, other

    cs.CL cs.AI cs.LG quant-ph

    CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks

    Authors: Andrei Tomut, Saeed S. Jahromi, Abhijoy Sarkar, Uygar Kurt, Sukhbinder Singh, Faysal Ishtiaq, Cesar Muñoz, Prabdeep Singh Bajaj, Ali Elborady, Gianni del Bimbo, Mehrazin Alizadeh, David Montero, Pablo Martin-Ramiro, Muhammad Ibrahim, Oussama Tahiri Alaoui, John Malcolm, Samuel Mugel, Roman Orus

    Abstract: Large Language Models (LLMs) such as ChatGPT and LlaMA are advancing rapidly in generative Artificial Intelligence (AI), but their immense size poses significant challenges, such as huge training and inference costs, substantial energy demands, and limitations for on-site deployment. Traditional compression methods such as pruning, distillation, and low-rank approximation focus on reducing the eff… ▽ More

    Submitted 13 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures, 2 tables, and supplementary information of 2 pages and 1 figure. Revised version with new benchmarks for LlaMA2-7B

  3. arXiv:2309.15642  [pdf, other

    quant-ph cond-mat.str-el cs.CE cs.LG

    Efficient tensor network simulation of IBM's largest quantum processors

    Authors: Siddhartha Patra, Saeed S. Jahromi, Sukhbinder Singh, Roman Orus

    Abstract: We show how quantum-inspired 2d tensor networks can be used to efficiently and accurately simulate the largest quantum processors from IBM, namely Eagle (127 qubits), Osprey (433 qubits) and Condor (1121 qubits). We simulate the dynamics of a complex quantum many-body system -- specifically, the kicked Ising experiment considered recently by IBM in Nature 618, p. 500-505 (2023) -- using graph-base… ▽ More

    Submitted 2 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: 7 pages, 8 figures, revised version

    Journal ref: Phys. Rev. Research 6, 013326 (2024)

  4. arXiv:2212.14076  [pdf, other

    q-fin.PR cs.CE cs.LG quant-ph

    Quantum-Inspired Tensor Neural Networks for Option Pricing

    Authors: Raj G. Patel, Chia-Wei Hsing, Serkan Sahin, Samuel Palmer, Saeed S. Jahromi, Shivam Sharma, Tomas Dominguez, Kris Tziritas, Christophe Michel, Vincent Porte, Mustafa Abid, Stephane Aubert, Pierre Castellani, Samuel Mugel, Roman Orus

    Abstract: Recent advances in deep learning have enabled us to address the curse of dimensionality (COD) by solving problems in higher dimensions. A subset of such approaches of addressing the COD has led us to solving high-dimensional PDEs. This has resulted in opening doors to solving a variety of real-world problems ranging from mathematical finance to stochastic control for industrial applications. Altho… ▽ More

    Submitted 10 March, 2024; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 11 pages, 8 figures, minor changes. arXiv admin note: substantial text overlap with arXiv:2208.02235

  5. arXiv:2208.02235  [pdf, other

    cs.LG cond-mat.str-el cs.AI physics.comp-ph quant-ph

    Quantum-Inspired Tensor Neural Networks for Partial Differential Equations

    Authors: Raj Patel, Chia-Wei Hsing, Serkan Sahin, Saeed S. Jahromi, Samuel Palmer, Shivam Sharma, Christophe Michel, Vincent Porte, Mustafa Abid, Stephane Aubert, Pierre Castellani, Chi-Guhn Lee, Samuel Mugel, Roman Orus

    Abstract: Partial Differential Equations (PDEs) are used to model a variety of dynamical systems in science and engineering. Recent advances in deep learning have enabled us to solve them in a higher dimension by addressing the curse of dimensionality in new ways. However, deep learning methods are constrained by training time and memory. To tackle these shortcomings, we implement Tensor Neural Networks (TN… ▽ More

    Submitted 10 August, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

    Comments: 14 pages, 11 figures, minimal changes