Skip to main content

Showing 1–21 of 21 results for author: Khan, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.19949  [pdf, other

    cs.CV

    FairCLIP: Harnessing Fairness in Vision-Language Learning

    Authors: Yan Luo, Min Shi, Muhammad Osama Khan, Muhammad Muneeb Afzal, Hao Huang, Shuaihang Yuan, Yu Tian, Luo Song, Ava Kouhana, Tobias Elze, Yi Fang, Mengyu Wang

    Abstract: Fairness is a critical concern in deep learning, especially in healthcare, where these models influence diagnoses and treatment decisions. Although fairness has been investigated in the vision-only domain, the fairness of medical vision-language (VL) models remains unexplored due to the scarcity of medical VL datasets for studying fairness. To bridge this research gap, we introduce the first fair… ▽ More

    Submitted 5 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  2. arXiv:2312.08656  [pdf, other

    cs.LG cs.AI cs.DC

    MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training

    Authors: Hongwu Peng, Xi Xie, Kaustubh Shivdikar, MD Amit Hasan, Jiahui Zhao, Shaoyi Huang, Omer Khan, David Kaeli, Caiwen Ding

    Abstract: In the acceleration of deep neural network training, the GPU has become the mainstream platform. GPUs face substantial challenges on GNNs, such as workload imbalance and memory access irregularities, leading to underutilized hardware. Existing solutions such as PyG, DGL with cuSPARSE, and GNNAdvisor frameworks partially address these challenges but memory traffic is still significant. We argue t… ▽ More

    Submitted 18 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ASPLOS 2024 accepted publication

    ACM Class: I.2; C.5

  3. arXiv:2311.09762  [pdf, other

    cs.CL cs.AI cs.LG

    Graph Elicitation for Guiding Multi-Step Reasoning in Large Language Models

    Authors: **young Park, Ameen Patel, Omar Zia Khan, Hyunwoo J. Kim, Joo-Kyung Kim

    Abstract: Chain-of-Thought (CoT) prompting along with sub-question generation and answering has enhanced multi-step reasoning capabilities of Large Language Models (LLMs). However, prompting the LLMs to directly generate sub-questions is suboptimal since they sometimes generate redundant or irrelevant questions. To deal with them, we propose a GE-Reasoning method, which directs LLMs to generate proper sub-q… ▽ More

    Submitted 22 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Preprint

  4. arXiv:2311.06573  [pdf, other

    quant-ph cs.ET cs.LO cs.SE

    A Generalized Space-Efficient Algorithm for Quantum Bit String Comparators

    Authors: Khuram Shahzad, Omar Usman Khan

    Abstract: Quantum Bit String Comparators (QBSC) operate on two sequences of n-qubits, enabling the determination of their relationships, such as equality, greater than, or less than. This is analogous to the way conditional statements are used in programming languages. Consequently, QBSCs play a crucial role in various algorithms that can be executed or adapted for quantum computers. The development of effi… ▽ More

    Submitted 14 November, 2023; v1 submitted 11 November, 2023; originally announced November 2023.

  5. arXiv:2310.20081  [pdf, other

    cs.CL cs.AI cs.IR

    Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models

    Authors: Chris Richardson, Yao Zhang, Kellen Gillespie, Sudipta Kar, Arshdeep Singh, Zeynab Raeesy, Omar Zia Khan, Abhinav Sethy

    Abstract: Personalization, the ability to tailor a system to individual users, is an essential factor in user experience with natural language processing (NLP) systems. With the emergence of Large Language Models (LLMs), a key question is how to leverage these models to better personalize user experiences. To personalize a language model's output, a straightforward approach is to incorporate past user data… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 4 pages, International Workshop on Personalized Generative AI (@CIKM 2023)

    ACM Class: I.2.7; H.3.3

  6. arXiv:2310.04551  [pdf, other

    cs.CV

    MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation

    Authors: Muhammad Osama Khan, Junbang Liang, Chun-Kai Wang, Shan Yang, Yu Lou

    Abstract: Pre-training has been an important ingredient in develo** strong monocular depth estimation models in recent years. For instance, self-supervised learning (SSL) is particularly effective by alleviating the need for large datasets with dense ground-truth depth maps. However, despite these improvements, our study reveals that the later layers of the SOTA SSL method are actually suboptimal. By exam… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  7. arXiv:2310.02492  [pdf, other

    cs.CV

    FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling

    Authors: Yan Luo, Muhammad Osama Khan, Yu Tian, Min Shi, Zehao Dou, Tobias Elze, Yi Fang, Mengyu Wang

    Abstract: Equity in AI for healthcare is crucial due to its direct impact on human well-being. Despite advancements in 2D medical imaging fairness, the fairness of 3D models remains underexplored, hindered by the small sizes of 3D fairness datasets. Since 3D imaging surpasses 2D imaging in SOTA clinical care, it is critical to understand the fairness of these 3D models. To address this research gap, we cond… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

  8. arXiv:2308.11825  [pdf, other

    cs.AR cs.LG

    Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks

    Authors: Xi Xie, Hongwu Peng, Amit Hasan, Shaoyi Huang, Jiahui Zhao, Haowen Fang, Wei Zhang, Tong Geng, Omer Khan, Caiwen Ding

    Abstract: Graph Convolutional Networks (GCNs) are pivotal in extracting latent information from graph data across various domains, yet their acceleration on mainstream GPUs is challenged by workload imbalance and memory access irregularity. To address these challenges, we present Accel-GCN, a GPU accelerator architecture for GCNs. The design of Accel-GCN encompasses: (i) a lightweight degree sorting stage t… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: ICCAD 2023 accepted publication

    ACM Class: I.2; B.6; C.3

  9. arXiv:2307.10915  [pdf, other

    cs.CV

    Revisiting Fine-Tuning Strategies for Self-supervised Medical Imaging Analysis

    Authors: Muhammad Osama Khan, Yi Fang

    Abstract: Despite the rapid progress in self-supervised learning (SSL), end-to-end fine-tuning still remains the dominant fine-tuning strategy for medical imaging analysis. However, it remains unclear whether this approach is truly optimal for effectively utilizing the pre-trained knowledge, especially considering the diverse categories of SSL that capture different types of features. In this paper, we pres… ▽ More

    Submitted 16 November, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 18 pages

  10. arXiv:2305.14218  [pdf, other

    cs.CV cs.AI

    DUBLIN -- Document Understanding By Language-Image Network

    Authors: Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Mohammed Khan, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary

    Abstract: Visual document understanding is a complex task that involves analyzing both the text and the visual elements in document images. Existing models often rely on manual feature engineering or domain-specific pipelines, which limit their generalization ability across different document types and languages. In this paper, we propose DUBLIN, which is pretrained on web pages using three novel objectives… ▽ More

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    ACM Class: F.2.2; I.2.7

  11. arXiv:2302.10978  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Learning to Retrieve Engaging Follow-Up Queries

    Authors: Christopher Richardson, Sudipta Kar, Anjishnu Kumar, Anand Ramachandran, Omar Zia Khan, Zeynab Raeesy, Abhinav Sethy

    Abstract: Open domain conversational agents can answer a broad range of targeted queries. However, the sequential nature of interaction with these systems makes knowledge exploration a lengthy task which burdens the user with asking a chain of well phrased questions. In this paper, we present a retrieval based system and associated dataset for predicting the next questions that the user might have. Such a s… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: EACL 2023

  12. arXiv:2210.04114  [pdf, other

    cs.LG

    Towards Real-Time Temporal Graph Learning

    Authors: Deniz Gurevin, Mohsin Shan, Tong Geng, Weiwen Jiang, Caiwen Ding, Omer Khan

    Abstract: In recent years, graph representation learning has gained significant popularity, which aims to generate node embeddings that capture features of graphs. One of the methods to achieve this is employing a technique called random walks that captures node sequences in a graph and then learns embeddings for each node using a natural language processing technique called Word2Vec. These embeddings are t… ▽ More

    Submitted 11 October, 2022; v1 submitted 8 October, 2022; originally announced October 2022.

  13. arXiv:2209.04766  [pdf, other

    cs.LG

    Towards Sparsification of Graph Neural Networks

    Authors: Hongwu Peng, Deniz Gurevin, Shaoyi Huang, Tong Geng, Weiwen Jiang, Omer Khan, Caiwen Ding

    Abstract: As real-world graphs expand in size, larger GNN models with billions of parameters are deployed. High parameter count in such models makes training and inference on graphs expensive and challenging. To reduce the computational and memory costs of GNNs, optimization methods such as pruning the redundant nodes and edges in input graphs have been commonly adopted. However, model compression, which di… ▽ More

    Submitted 24 February, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: ICCD 2022 Paper

    ACM Class: I.2; C.4

  14. arXiv:2201.01834  [pdf, other

    cs.CR cs.AR

    Secure Remote Attestation with Strong Key Insulation Guarantees

    Authors: Deniz Gurevin, Chenglu **, Phuong Ha Nguyen, Omer Khan, Marten van Dijk

    Abstract: Recent years have witnessed a trend of secure processor design in both academia and industry. Secure processors with hardware-enforced isolation can be a solid foundation of cloud computation in the future. However, due to recent side-channel attacks, the commercial secure processors failed to deliver the promises of a secure isolated execution environment. Sensitive information inside the secure… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

  15. arXiv:1906.09850  [pdf

    cs.HC

    Multisensory cues facilitate coordination of step** movements with a virtual reality avatar

    Authors: Omar Khan, Imran Ahmed, Joshua Cottingham, Musa Rahhal, Theodoros N Arvanitis, Mark Elliott

    Abstract: The effectiveness of simple sensory cues for retraining gait have been demonstrated, yet the feasibility of humanoid avatars for entrainment have yet to be investigated. Here, we describe the development of a novel method of visually cued training, in the form of a virtual partner, and investigate its ability to provide movement guidance in the form of step**. Real step** movements were mapped… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: 28 pages, 8 figures, submitted to PLOS ONE

  16. arXiv:1904.12729  [pdf

    cs.CR cs.AR

    IRONHIDE: A Secure Multicore that Efficiently Mitigates Microarchitecture State Attacks for Interactive Applications

    Authors: Hamza Omar, Omer Khan

    Abstract: Microprocessors enable aggressive hardware virtualization by means of which multiple processes temporally execute on the system. These security-critical and ordinary processes interact with each other to assure application progress. However, temporal sharing of hardware resources exposes the processor to various microarchitecture state attacks. State-of-the-art secure processors, such as MI6 adopt… ▽ More

    Submitted 27 January, 2020; v1 submitted 29 April, 2019; originally announced April 2019.

  17. arXiv:1904.08689  [pdf, other

    cs.MM cs.IR

    Exquisitor: Interactive Learning at Large

    Authors: Björn Þór Jónsson, Omar Shahbaz Khan, Hanna Ragnarsdóttir, Þórhildur Þorleiksdóttir, Jan Zahálka, Stevan Rudinac, Gylfi Þór Guðmundsson, Laurent Amsaleg, Marcel Worring

    Abstract: Increasing scale is a dominant trend in today's multimedia collections, which especially impacts interactive applications. To facilitate interactive exploration of large multimedia collections, new approaches are needed that are capable of learning on the fly new analytic categories based on the visual and textual content. To facilitate general use on standard desktops, laptops, and mobile devices… ▽ More

    Submitted 17 July, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

  18. arXiv:1904.08222  [pdf, other

    cs.NI

    Experimental Clock Calibration\\on a Crystal-Free Mote-on-a-Chip

    Authors: Ioana Suciu, Filip Maksimovic, David Burnett, Osama Khan, Brad Wheeler, Arvind Sundararajan, Thomas Watteyne, Xavier Vilajosana, Kris Pister

    Abstract: The elimination of the off-chip frequency reference, typically a crystal oscillator, would bring important benefits in terms of size, price and energy efficiency to IEEE802.15.4 compliant radios and systems-on-chip. The stability of on-chip oscillators is orders of magnitude worse than that of a crystal. It is known that as the temperature changes, they can drift more than 50 ppm/°C. This paper pr… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Comments: CNERT: Computer and Networking Experimental Research using Testbeds, in conjunction with IEEE INFOCOM 2019, April 29 - May 2, 2019, Paris, France

  19. arXiv:1807.09193  [pdf, other

    cs.GR

    GRAINS: Generative Recursive Autoencoders for INdoor Scenes

    Authors: Manyi Li, Akshay Gadi Patil, Kai Xu, Siddhartha Chaudhuri, Owais Khan, Ariel Shamir, Changhe Tu, Baoquan Chen, Daniel Cohen-Or, Hao Zhang

    Abstract: We present a generative neural network which enables us to generate plausible 3D indoor scenes in large quantities and varieties, easily and highly efficiently. Our key observation is that indoor scene structures are inherently hierarchical. Hence, our network is not convolutional; it is a recursive neural network or RvNN. Using a dataset of annotated scene hierarchies, we train a variational recu… ▽ More

    Submitted 8 May, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: 21 pages, 26 figures

  20. arXiv:1707.02589  [pdf, other

    cs.DC

    Exploiting the Tradeoff between Program Accuracy and Soft-error Resiliency Overhead for Machine Learning Workloads

    Authors: Qingchuan Shi, Hamza Omar, Omer Khan

    Abstract: To protect multicores from soft-error perturbations, resiliency schemes have been developed with high coverage but high power and performance overheads. Emerging safety-critical machine learning applications are increasingly being deployed on these platforms. Moreover, these systems are exposed to harsh environments, such as unmanned aerial vehicles (UAVs) and self-driving cars. Due to the unique… ▽ More

    Submitted 9 July, 2017; originally announced July 2017.

    Comments: Presented in 2017 IEEE Workshop on Silicon Errors in Logic - System Effects

  21. arXiv:1706.03852  [pdf, other

    cs.CR

    Revisiting Definitional Foundations of Oblivious RAM for Secure Processor Implementations

    Authors: Syed Kamran Haider, Omer Khan, Marten van Dijk

    Abstract: Oblivious RAM (ORAM) is a renowned technique to hide the access patterns of an application to an untrusted memory. According to the standard ORAM definition presented by Goldreich and Ostrovsky, two ORAM access sequences must be computationally indistinguishable if the lengths of these sequences are identically distributed. An artifact of this definition is that it does not apply to modern ORAM im… ▽ More

    Submitted 21 October, 2017; v1 submitted 12 June, 2017; originally announced June 2017.