Skip to main content

Showing 1–3 of 3 results for author: Ishtiaq, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.14109  [pdf, other

    cs.CL cs.AI cs.LG quant-ph

    CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks

    Authors: Andrei Tomut, Saeed S. Jahromi, Abhijoy Sarkar, Uygar Kurt, Sukhbinder Singh, Faysal Ishtiaq, Cesar Muñoz, Prabdeep Singh Bajaj, Ali Elborady, Gianni del Bimbo, Mehrazin Alizadeh, David Montero, Pablo Martin-Ramiro, Muhammad Ibrahim, Oussama Tahiri Alaoui, John Malcolm, Samuel Mugel, Roman Orus

    Abstract: Large Language Models (LLMs) such as ChatGPT and LlaMA are advancing rapidly in generative Artificial Intelligence (AI), but their immense size poses significant challenges, such as huge training and inference costs, substantial energy demands, and limitations for on-site deployment. Traditional compression methods such as pruning, distillation, and low-rank approximation focus on reducing the eff… ▽ More

    Submitted 13 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures, 2 tables, and supplementary information of 2 pages and 1 figure. Revised version with new benchmarks for LlaMA2-7B

  2. arXiv:2212.03223  [pdf, other

    quant-ph cond-mat.str-el cs.CE cs.LG

    Financial Risk Management on a Neutral Atom Quantum Processor

    Authors: Lucas Leclerc, Luis Ortiz-Guitierrez, Sebastian Grijalva, Boris Albrecht, Julia R. K. Cline, Vincent E. Elfving, Adrien Signoles, Loïc Henriet, Gianni Del Bimbo, Usman Ayub Sheikh, Maitree Shah, Luc Andrea, Faysal Ishtiaq, Andoni Duarte, Samuel Mugel, Irene Caceres, Michel Kurek, Roman Orus, Achraf Seddik, Oumaima Hammammi, Hacene Isselnane, Didier M'tamon

    Abstract: Machine Learning models capable of handling the large datasets collected in the financial world can often become black boxes expensive to run. The quantum computing paradigm suggests new optimization techniques, that combined with classical algorithms, may deliver competitive, faster and more interpretable models. In this work we propose a quantum-enhanced machine learning solution for the predict… ▽ More

    Submitted 3 April, 2024; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 17 pages, 11 figures, 2 tables, revised version

    Journal ref: Phys. Rev. Research 5, 043117 (2023)

  3. arXiv:2011.06102  [pdf, other

    cs.AI

    Improving Multimodal Accuracy Through Modality Pre-training and Attention

    Authors: Aya Abdelsalam Ismail, Mahmudul Hasan, Faisal Ishtiaq

    Abstract: Training a multimodal network is challenging and it requires complex architectures to achieve reasonable performance. We show that one reason for this phenomena is the difference between the convergence rate of various modalities. We address this by pre-training modality-specific sub-networks in multimodal architectures independently before end-to-end training of the entire network. Furthermore, w… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.