Skip to main content

Showing 1–13 of 13 results for author: Munoz, J P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.15182  [pdf, other

    cs.LG cs.AI

    FLoRA: Enhancing Vision-Language Models with Parameter-Efficient Federated Learning

    Authors: Duy Phuong Nguyen, J. Pablo Munoz, Ali Jannesari

    Abstract: In the rapidly evolving field of artificial intelligence, multimodal models, e.g., integrating vision and language into visual-language models (VLMs), have become pivotal for many applications, ranging from image captioning to multimodal search engines. Among these models, the Contrastive Language-Image Pre-training (CLIP) model has demonstrated remarkable performance in understanding and generati… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 10 pages, 11 figures

  2. arXiv:2404.10934  [pdf, other

    cs.LG cs.AI cs.CL

    Shears: Unstructured Sparsity with Neural Low-rank Adapter Search

    Authors: J. Pablo Muñoz, **jie Yuan, Nilesh Jain

    Abstract: Recently, several approaches successfully demonstrated that weight-sharing Neural Architecture Search (NAS) can effectively explore a search space of elastic low-rank adapters (LoRA), allowing the parameter-efficient fine-tuning (PEFT) and compression of large language models. In this paper, we introduce a novel approach called Shears, demonstrating how the integration of cost-effective sparsity a… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (Industry Track)

  3. arXiv:2402.02018  [pdf, other

    cs.LG

    The Landscape and Challenges of HPC Research and LLMs

    Authors: Le Chen, Nesreen K. Ahmed, Akash Dutta, Arijit Bhattacharjee, Sixing Yu, Quazi Ishtiaque Mahmud, Waqwoya Abebe, Hung Phan, Aishwarya Sarkar, Branden Butler, Niranjan Hasabnis, Gal Oren, Vy A. Vo, Juan Pablo Munoz, Theodore L. Willke, Tim Mattson, Ali Jannesari

    Abstract: Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breach… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  4. arXiv:2310.00247  [pdf, other

    cs.LG cs.DC

    Bridging the Gap Between Foundation Models and Heterogeneous Federated Learning

    Authors: Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Federated learning (FL) offers privacy-preserving decentralized machine learning, optimizing models at edge clients without sharing private data. Simultaneously, foundation models (FMs) have gained traction in the artificial intelligence (AI) community due to their exceptional performance across various tasks. However, integrating FMs into FL presents challenges, primarily due to their substantial… ▽ More

    Submitted 4 October, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

  5. arXiv:2305.11414  [pdf, other

    cs.LG cs.AI cs.CR

    Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models

    Authors: Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Foundation Models (FMs), such as LLaMA, BERT, GPT, ViT, and CLIP, have demonstrated remarkable success in a wide range of applications, driven by their ability to leverage vast amounts of data for pre-training. However, optimizing FMs often requires access to sensitive data, raising privacy concerns and limiting their applicability in many domains. In this paper, we propose the Federated Foundatio… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  6. arXiv:2211.05716  [pdf, other

    cs.LG cs.CV

    Resource-Aware Heterogeneous Federated Learning using Neural Architecture Search

    Authors: Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Federated Learning (FL) is extensively used to train AI/ML models in distributed and privacy-preserving settings. Participant edge devices in FL systems typically contain non-independent and identically distributed (Non-IID) private data and unevenly distributed computational resources. Preserving user data privacy while optimizing AI/ML models in a heterogeneous federated network requires us to a… ▽ More

    Submitted 30 April, 2024; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted at the 30th International European Conference on Parallel and Distributed Computing (Euro-Par 2024)

  7. arXiv:2209.07413  [pdf, other

    cs.LG cs.CV cs.NE

    EZNAS: Evolving Zero Cost Proxies For Neural Architecture Scoring

    Authors: Yash Akhauri, J. Pablo Munoz, Nilesh Jain, Ravi Iyer

    Abstract: Neural Architecture Search (NAS) has significantly improved productivity in the design and deployment of neural networks (NN). As NAS typically evaluates multiple models by training them partially or completely, the improved productivity comes at the cost of significant carbon footprint. To alleviate this expensive training routine, zero-shot/cost proxies analyze an NN at initialization to generat… ▽ More

    Submitted 21 December, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

  8. arXiv:2208.07978  [pdf, other

    cs.DC cs.CR cs.LG

    Enhancing Heterogeneous Federated Learning with Knowledge Extraction and Multi-Model Fusion

    Authors: Duy Phuong Nguyen, Sixing Yu, J. Pablo Muñoz, Ali Jannesari

    Abstract: Concerned with user data privacy, this paper presents a new federated learning (FL) method that trains machine learning models on edge devices without accessing sensitive data. Traditional FL methods, although privacy-protective, fail to manage model heterogeneity and incur high communication costs due to their reliance on aggregation methods. To address this limitation, we propose a resource-awar… ▽ More

    Submitted 30 September, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: Accept at the 4th workshop on Artificial Intelligence and Machine Learning for Scientific Applications (AI4S), SC 23

  9. arXiv:2205.10358  [pdf, other

    cs.LG cs.NE

    A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities

    Authors: Daniel Cummings, Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Juan Pablo Munoz, Sairam Sundaresan

    Abstract: Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extract specialized hardware-aware sub-network configurations from a task-specific super-network. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still under-explored. Popula… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  10. arXiv:2202.12954  [pdf, other

    cs.AI

    A Hardware-Aware System for Accelerating Deep Neural Network Optimization

    Authors: Anthony Sarah, Daniel Cummings, Sharath Nittur Sridhar, Sairam Sundaresan, Maciej Szankin, Tristan Webb, J. Pablo Munoz

    Abstract: Recent advances in Neural Architecture Search (NAS) which extract specialized hardware-aware configurations (a.k.a. "sub-networks") from a hardware-agnostic "super-network" have become increasingly popular. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still largely… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  11. arXiv:2112.10878  [pdf, other

    cs.LG

    Enabling NAS with Automated Super-Network Generation

    Authors: J. Pablo Muñoz, Nikolay Lyalyushkin, Yash Akhauri, Anastasia Senina, Alexander Kozlov, Nilesh Jain

    Abstract: Recent Neural Architecture Search (NAS) solutions have produced impressive results training super-networks and then deriving subnetworks, a.k.a. child models that outperform expert-crafted models from a pre-defined search space. Efficient and robust subnetworks can be selected for resource-constrained edge devices, allowing them to perform well in the wild. However, constructing super-networks for… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI2022 - Practical Deep Learning in the Wild

    ACM Class: I.2; D.0; I.2.2

  12. arXiv:2106.09180  [pdf, other

    cs.LG cs.AR cs.NE

    RHNAS: Realizable Hardware and Neural Architecture Search

    Authors: Yash Akhauri, Adithya Niranjan, J. Pablo Muñoz, Suvadeep Banerjee, Abhijit Davare, Pasquale Cocchini, Anton A. Sorokin, Ravi Iyer, Nilesh Jain

    Abstract: The rapidly evolving field of Artificial Intelligence necessitates automated approaches to co-design neural network architecture and neural accelerators to maximize system efficiency and address productivity challenges. To enable joint optimization of this vast space, there has been growing interest in differentiable NN-HW co-design. Fully differentiable co-design has reduced the resource requirem… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 15 pages

  13. arXiv:2002.04698  [pdf, other

    cs.CV cs.RO eess.IV

    Improving Place Recognition Using Dynamic Object Detection

    Authors: Juan Pablo Munoz, Scott Dexter

    Abstract: We present a novel approach to place recognition well-suited to environments with many dynamic objects--objects that may or may not be present in an agent's subsequent visits. By incorporating an object-detecting preprocessing step, our approach yields high-quality place representations that incorporate object information. Not only does this result in significantly improved place recognition in dy… ▽ More

    Submitted 12 June, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    ACM Class: I.4.9; I.2; J.7