Skip to main content

Showing 1–17 of 17 results for author: Stamoulis, D

.
  1. arXiv:2406.06799  [pdf, other

    cs.DC cs.CL

    LLM-dCache: Improving Tool-Augmented LLMs with GPT-Driven Localized Data Caching

    Authors: Simranjit Singh, Michael Fore, Andreas Karatzas, Chaehong Lee, Yanan Jian, Longfei Shangguan, Fuxun Yu, Iraklis Anagnostopoulos, Dimitrios Stamoulis

    Abstract: As Large Language Models (LLMs) broaden their capabilities to manage thousands of API calls, they are confronted with complex data operations across vast datasets with significant overhead to the underlying system. In this work, we introduce LLM-dCache to optimize data accesses by treating cache operations as callable API functions exposed to the tool-augmented agent. We grant LLMs the autonomy to… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2405.19563  [pdf, other

    cs.CL

    Unlearning Climate Misinformation in Large Language Models

    Authors: Michael Fore, Simranjit Singh, Chaehong Lee, Amritanshu Pandey, Antonios Anastasopoulos, Dimitrios Stamoulis

    Abstract: Misinformation regarding climate change is a key roadblock in addressing one of the most serious threats to humanity. This paper investigates factual accuracy in large language models (LLMs) regarding climate information. Using true/false labeled Q&A data for fine-tuning and evaluating LLMs on climate-related claims, we compare open-source models, assessing their ability to generate truthful respo… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.18831  [pdf, other

    cs.CV cs.LG

    Evaluating Zero-Shot GPT-4V Performance on 3D Visual Question Answering Benchmarks

    Authors: Simranjit Singh, Georgios Pavlakos, Dimitrios Stamoulis

    Abstract: As interest in "reformulating" the 3D Visual Question Answering (VQA) problem in the context of foundation models grows, it is imperative to assess how these new paradigms influence existing closed-vocabulary datasets. In this case study, we evaluate the zero-shot performance of foundational models (GPT-4 Vision and GPT-4) on well-established 3D VQA benchmarks, namely 3D-VQA and ScanQA. We provide… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted at 1st Workshop on Multimodalities for 3D Scenes CVPR 2024

  4. arXiv:2405.17438  [pdf, other

    cs.PL cs.AI cs.LG

    An LLM-Tool Compiler for Fused Parallel Function Calling

    Authors: Simranjit Singh, Andreas Karatzas, Michael Fore, Iraklis Anagnostopoulos, Dimitrios Stamoulis

    Abstract: State-of-the-art sequential reasoning in Large Language Models (LLMs) has expanded the capabilities of Copilots beyond conversational tasks to complex function calling, managing thousands of API calls. However, the tendency of compositional prompting to segment tasks into multiple steps, each requiring a round-trip to the GPT APIs, leads to increased system latency and costs. Although recent advan… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2405.00709  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Tool-Augmented Agents in Remote Sensing Platforms

    Authors: Simranjit Singh, Michael Fore, Dimitrios Stamoulis

    Abstract: Tool-augmented Large Language Models (LLMs) have shown impressive capabilities in remote sensing (RS) applications. However, existing benchmarks assume question-answering input templates over predefined image-text data pairs. These standalone instructions neglect the intricacies of realistic user-grounded tasks. Consider a geospatial analyst: they zoom in a map area, they draw a region over which… ▽ More

    Submitted 23 April, 2024; originally announced May 2024.

    Comments: ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop

  6. arXiv:2404.15804  [pdf, other

    cs.LG cs.AI

    GeckOpt: LLM System Efficiency via Intent-Based Tool Selection

    Authors: Michael Fore, Simranjit Singh, Dimitrios Stamoulis

    Abstract: In this preliminary study, we investigate a GPT-driven intent-based reasoning approach to streamline tool selection for large language models (LLMs) aimed at system efficiency. By identifying the intent behind user prompts at runtime, we narrow down the API toolset required for task execution, reducing token consumption by up to 24.6\%. Early results on a real-world, massively parallel Copilot pla… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: GLSVLSI 2024

  7. arXiv:2404.15500  [pdf, other

    cs.AI cs.CL cs.LG

    GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots

    Authors: Simranjit Singh, Michael Fore, Dimitrios Stamoulis

    Abstract: Geospatial Copilots unlock unprecedented potential for performing Earth Observation (EO) applications through natural language instructions. However, existing agents rely on overly simplified single tasks and template-based prompts, creating a disconnect with real-world scenarios. In this work, we present GeoLLM-Engine, an environment for tool-augmented agents with intricate tasks routinely execut… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Earthvision 2024, CVPR Workshop

  8. arXiv:2311.12345  [pdf, other

    cs.CV cs.AI cs.LG

    Stable Diffusion For Aerial Object Detection

    Authors: Yanan Jian, Fuxun Yu, Simranjit Singh, Dimitrios Stamoulis

    Abstract: Aerial object detection is a challenging task, in which one major obstacle lies in the limitations of large-scale data collection and the long-tail distribution of certain classes. Synthetic data offers a promising solution, especially with recent advances in diffusion-based methods like stable diffusion (SD). However, the direct application of diffusion methods to aerial domains poses unique chal… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023 Synthetic Data Generation with Generative AI workshop

  9. arXiv:2011.10912  [pdf, ps, other

    cs.AR cs.LG

    Third ArchEdge Workshop: Exploring the Design Space of Efficient Deep Neural Networks

    Authors: Fuxun Yu, Dimitrios Stamoulis, Di Wang, Dimitrios Lymberopoulos, Xiang Chen

    Abstract: This paper gives an overview of our ongoing work on the design space exploration of efficient deep neural networks (DNNs). Specifically, we cover two aspects: (1) static architecture design efficiency and (2) dynamic model execution efficiency. For static architecture design, different from existing end-to-end hardware modeling assumptions, we conduct full-stack profiling at the GPU core level to… ▽ More

    Submitted 21 November, 2020; originally announced November 2020.

    Comments: Presented in Third ArchEdge Workshop, Co-located with SEC'2020

  10. arXiv:2011.03897  [pdf, other

    cs.AR cs.PF

    Towards Latency-aware DNN Optimization with GPU Runtime Analysis and Tail Effect Elimination

    Authors: Fuxun Yu, Zirui Xu, Tong Shen, Dimitrios Stamoulis, Longfei Shangguan, Di Wang, Rishi Madhok, Chunshui Zhao, Xin Li, Nikolaos Karianakis, Dimitrios Lymberopoulos, Ang Li, ChenChen Liu, Yiran Chen, Xiang Chen

    Abstract: Despite the superb performance of State-Of-The-Art (SOTA) DNNs, the increasing computational cost makes them very challenging to meet real-time latency and accuracy requirements. Although DNN runtime latency is dictated by model property (e.g., architecture, operations), hardware property (e.g., utilization, throughput), and more importantly, the effective map** between these two, many existing… ▽ More

    Submitted 30 November, 2020; v1 submitted 7 November, 2020; originally announced November 2020.

  11. arXiv:1907.00959  [pdf, other

    cs.LG cs.CV stat.ML

    Single-Path Mobile AutoML: Efficient ConvNet Design and NAS Hyperparameter Optimization

    Authors: Dimitrios Stamoulis, Ruizhou Ding, Di Wang, Dimitrios Lymberopoulos, Bodhi Priyantha, Jie Liu, Diana Marculescu

    Abstract: Can we reduce the search cost of Neural Architecture Search (NAS) from days down to only few hours? NAS methods automate the design of Convolutional Networks (ConvNets) under hardware constraints and they have emerged as key components of AutoML frameworks. However, the NAS problem remains challenging due to the combinatorially large design space and the significant search time (at least 200 GPU-h… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: Detailed extension (journal) of the Single-Path NAS ECMLPKDD'19 paper (arXiv:1904.02877)

  12. arXiv:1905.04159  [pdf, other

    cs.LG cs.CV stat.ML

    Single-Path NAS: Device-Aware Efficient ConvNet Design

    Authors: Dimitrios Stamoulis, Ruizhou Ding, Di Wang, Dimitrios Lymberopoulos, Bodhi Priyantha, Jie Liu, Diana Marculescu

    Abstract: Can we automatically design a Convolutional Network (ConvNet) with the highest image classification accuracy under the latency constraint of a mobile device? Neural Architecture Search (NAS) for ConvNet design is a challenging problem due to the combinatorially large design space and search time (at least 200 GPU-hours). To alleviate this complexity, we propose Single-Path NAS, a novel differentia… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

    Comments: ODML-CDNNR 2019 (ICML'19 workshop) oral presentation (extended abstract, required non-archival version). Full paper: arXiv:1904.02877

  13. arXiv:1904.02877  [pdf, other

    cs.LG cs.CV stat.ML

    Single-Path NAS: Designing Hardware-Efficient ConvNets in less than 4 Hours

    Authors: Dimitrios Stamoulis, Ruizhou Ding, Di Wang, Dimitrios Lymberopoulos, Bodhi Priyantha, Jie Liu, Diana Marculescu

    Abstract: Can we automatically design a Convolutional Network (ConvNet) with the highest image classification accuracy under the runtime constraint of a mobile device? Neural architecture search (NAS) has revolutionized the design of hardware-efficient ConvNets by automating this process. However, the NAS problem remains challenging due to the combinatorially large design space, causing a significant search… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

  14. arXiv:1809.05476  [pdf, other

    cs.LG stat.ML

    Hardware-Aware Machine Learning: Modeling and Optimization

    Authors: Diana Marculescu, Dimitrios Stamoulis, Ermao Cai

    Abstract: Recent breakthroughs in Deep Learning (DL) applications have made DL models a key component in almost every modern computing system. The increased popularity of DL applications deployed on a wide-spectrum of platforms have resulted in a plethora of design challenges related to the constraints introduced by the hardware itself. What is the latency or energy cost for an inference made by a Deep Neur… ▽ More

    Submitted 14 September, 2018; originally announced September 2018.

    Comments: ICCAD'18 Invited Paper

  15. arXiv:1808.01550  [pdf, other

    cs.LG stat.ML

    Designing Adaptive Neural Networks for Energy-Constrained Image Classification

    Authors: Dimitrios Stamoulis, Ting-Wu Chin, Anand Krishnan Prakash, Haocheng Fang, Sribhuvan Sajja, Mitchell Bognar, Diana Marculescu

    Abstract: As convolutional neural networks (CNNs) enable state-of-the-art computer vision applications, their high energy consumption has emerged as a key impediment to their deployment on embedded and mobile devices. Towards efficient image classification under hardware constraints, prior work has proposed adaptive CNNs, i.e., systems of networks with different accuracy and computation characteristics, whe… ▽ More

    Submitted 6 August, 2018; v1 submitted 4 August, 2018; originally announced August 2018.

    Comments: This conference paper will appear in the proceedings of ICCAD 2018

  16. arXiv:1712.02446  [pdf, other

    cs.LG

    HyperPower: Power- and Memory-Constrained Hyper-Parameter Optimization for Neural Networks

    Authors: Dimitrios Stamoulis, Ermao Cai, Da-Cheng Juan, Diana Marculescu

    Abstract: While selecting the hyper-parameters of Neural Networks (NNs) has been so far treated as an art, the emergence of more complex, deeper architectures poses increasingly more challenges to designers and Machine Learning (ML) practitioners, especially when power and memory constraints need to be considered. In this work, we propose HyperPower, a framework that enables efficient Bayesian optimization… ▽ More

    Submitted 6 December, 2017; originally announced December 2017.

    Comments: This conference paper will appear in the proceedings of DATE 2018

  17. arXiv:1710.05420  [pdf, other

    cs.LG cs.PF stat.ML

    NeuralPower: Predict and Deploy Energy-Efficient Convolutional Neural Networks

    Authors: Ermao Cai, Da-Cheng Juan, Dimitrios Stamoulis, Diana Marculescu

    Abstract: "How much energy is consumed for an inference made by a convolutional neural network (CNN)?" With the increased popularity of CNNs deployed on the wide-spectrum of platforms (from mobile devices to workstations), the answer to this question has drawn significant attention. From lengthening battery life of mobile devices to reducing the energy bill of a datacenter, it is important to understand the… ▽ More

    Submitted 15 October, 2017; originally announced October 2017.

    Comments: Accepted as a conference paper at ACML 2017