Skip to main content

Showing 1–2 of 2 results for author: Gkountouras, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.08162  [pdf, other

    cs.CL cs.AI cs.LG

    INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

    Authors: Yuji Chai, John Gkountouras, Glenn G. Ko, David Brooks, Gu-Yeon Wei

    Abstract: We introduce a method that dramatically reduces fine-tuning VRAM requirements and rectifies quantization errors in quantized Large Language Models. First, we develop an extremely memory-efficient fine-tuning (EMEF) method for quantized models using Low-Rank Adaptation (LoRA), and drawing upon it, we construct an error-correcting algorithm designed to minimize errors induced by the quantization pro… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  2. arXiv:2304.06391  [pdf, other

    cs.CV cs.AI cs.LG

    VISION DIFFMASK: Faithful Interpretation of Vision Transformers with Differentiable Patch Masking

    Authors: Angelos Nalmpantis, Apostolos Panagiotopoulos, John Gkountouras, Konstantinos Papakostas, Wilker Aziz

    Abstract: The lack of interpretability of the Vision Transformer may hinder its use in critical real-world applications despite its effectiveness. To overcome this issue, we propose a post-hoc interpretability method called VISION DIFFMASK, which uses the activations of the model's hidden layers to predict the relevant parts of the input that contribute to its final predictions. Our approach uses a gating m… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted in the XAI4CV Workshop at CVPR 2023