Skip to main content

Showing 1–50 of 94 results for author: Shah, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17368  [pdf, other

    cs.CV

    Fusing uncalibrated IMUs and handheld smartphone video to reconstruct knee kinematics

    Authors: J. D. Peiffer, Kunal Shah, Shawana Anarwala, Kayan Abdou, R. James Cotton

    Abstract: Video and wearable sensor data provide complementary information about human movement. Video provides a holistic understanding of the entire body in the world while wearable sensors provide high-resolution measurements of specific body segments. A robust method to fuse these modalities and obtain biomechanically accurate kinematics would have substantial utility for clinical assessment and monitor… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to International Conference on Biomedical Robotics and Biomechatronics 2024

  2. arXiv:2405.07518  [pdf, other

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain , et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2404.18893  [pdf, other

    cs.DS cs.LG stat.ML

    Learning general Gaussian mixtures with efficient score matching

    Authors: Sitan Chen, Vasilis Kontonis, Kulin Shah

    Abstract: We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions. We make no separation assumptions on the underlying mixture components: we only require that the covariance matrices have bounded condition number and that the means and covariances lie in a ball of bounded radius. We give an algorithm that draws $d^{\mathrm{poly}(k/\varepsilon)}$ samples from the target mixture, runs in… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 57 pages

  4. arXiv:2403.06107  [pdf, other

    cs.CV

    Textureless Object Recognition: An Edge-based Approach

    Authors: Frincy Clement, Kirtan Shah, Dhara Pancholi, Gabriel Lugo Bustillo, Dr. Irene Cheng

    Abstract: Textureless object recognition has become a significant task in Computer Vision with the advent of Robotics and its applications in manufacturing sector. It has been challenging to obtain good accuracy in real time because of its lack of discriminative features and reflectance properties which makes the techniques for textured object recognition insufficient for textureless objects. A lot of work… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:1910.14255

  5. arXiv:2403.00961  [pdf, other

    physics.ed-ph cs.LG physics.data-an

    Data Science Education in Undergraduate Physics: Lessons Learned from a Community of Practice

    Authors: Karan Shah, Julie Butler, Alexis Knaub, Anıl Zenginoğlu, William Ratcliff, Mohammad Soltanieh-ha

    Abstract: It is becoming increasingly important that physics educators equip their students with the skills to work with data effectively. However, many educators may lack the necessary training and expertise in data science to teach these skills. To address this gap, we created the Data Science Education Community of Practice (DSECOP), bringing together graduate students and physics educators from differen… ▽ More

    Submitted 16 June, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 21 pages, 4 figures, 2 tables. The associated GItHub repository can be found at https://github.com/GDS-Education-Community-of-Practice/DSECOP

  6. arXiv:2402.14454  [pdf, other

    cs.CV

    CCPA: Long-term Person Re-Identification via Contrastive Clothing and Pose Augmentation

    Authors: Vuong D. Nguyen, Shishir K. Shah

    Abstract: Long-term Person Re-Identification (LRe-ID) aims at matching an individual across cameras after a long period of time, presenting variations in clothing, pose, and viewpoint. In this work, we propose CCPA: Contrastive Clothing and Pose Augmentation framework for LRe-ID. Beyond appearance, CCPA captures body shape information which is cloth-invariant using a Relation Graph Attention Network. Traini… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  7. arXiv:2402.09710  [pdf

    cs.CR cs.LG cs.NI

    Preserving Data Privacy for ML-driven Applications in Open Radio Access Networks

    Authors: Pranshav Gajjar, Azuka Chie**a, Vijay K. Shah

    Abstract: Deep learning offers a promising solution to improve spectrum access techniques by utilizing data-driven approaches to manage and share limited spectrum resources for emerging applications. For several of these applications, the sensitive wireless data (such as spectrograms) are stored in a shared database or multistakeholder cloud environment and are therefore prone to privacy leaks. This paper a… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  8. arXiv:2402.07258  [pdf, other

    cs.CV

    Data Quality Aware Approaches for Addressing Model Drift of Semantic Segmentation Models

    Authors: Samiha Mirza, Vuong D. Nguyen, Pranav Mantini, Shishir K. Shah

    Abstract: In the midst of the rapid integration of artificial intelligence (AI) into real world applications, one pressing challenge we confront is the phenomenon of model drift, wherein the performance of AI models gradually degrades over time, compromising their effectiveness in real-world, dynamic environments. Once identified, we need techniques for handling this drift to preserve the model performance… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  9. arXiv:2402.06846  [pdf, other

    cs.CR eess.SY

    System-level Analysis of Adversarial Attacks and Defenses on Intelligence in O-RAN based Cellular Networks

    Authors: Azuka Chie**a, Brian Kim, Kaushik Chowhdury, Vijay K. Shah

    Abstract: While the open architecture, open interfaces, and integration of intelligence within Open Radio Access Network technology hold the promise of transforming 5G and 6G networks, they also introduce cybersecurity vulnerabilities that hinder its widespread adoption. In this paper, we conduct a thorough system-level investigation of cyber threats, with a specific focus on machine learning (ML) intellige… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: his paper has been accepted for publication in ACM WiSec 2024

  10. arXiv:2402.04447  [pdf, other

    cs.NI eess.SP

    Context-Aware Spectrum Coexistence of Terrestrial Beyond 5G Networks in Satellite Bands

    Authors: Ta Seen Reaz Niloy, Zoheb Hasan, Rob Smith, Vikram R. Anapana, Vijay K. Shah

    Abstract: Spectrum sharing between terrestrial 5G and incumbent networks in the satellite bands presents a promising avenue to satisfy the ever-increasing bandwidth demand of the next-generation wireless networks. However, protecting incumbent operations from harmful interference poses a fundamental challenge in accommodating terrestrial broadband cellular networks in the satellite bands. State-of-the-art s… ▽ More

    Submitted 14 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  11. arXiv:2402.03716  [pdf, other

    cs.CV

    Attention-based Shape and Gait Representations Learning for Video-based Cloth-Changing Person Re-Identification

    Authors: Vuong D. Nguyen, Samiha Mirza, Pranav Mantini, Shishir K. Shah

    Abstract: Current state-of-the-art Video-based Person Re-Identification (Re-ID) primarily relies on appearance features extracted by deep learning models. These methods are not applicable for long-term analysis in real-world scenarios where persons have changed clothes, making appearance information unreliable. In this work, we deal with the practical problem of Video-based Cloth-Changing Person Re-ID (VCCR… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  12. arXiv:2401.15900  [pdf, other

    cs.CV

    MV2MAE: Multi-View Video Masked Autoencoders

    Authors: Ketul Shah, Robert Crandall, Jie Xu, Peng Zhou, Marian George, Mayank Bansal, Rama Chellappa

    Abstract: Videos captured from multiple viewpoints can help in perceiving the 3D structure of the world and benefit computer vision tasks such as action recognition, tracking, etc. In this paper, we present a method for self-supervised learning from synchronized multi-view videos. We use a cross-view reconstruction task to inject geometry information in the model. Our approach is based on the masked autoenc… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  13. arXiv:2312.02914  [pdf, other

    cs.CV cs.LG

    Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training

    Authors: Arun Reddy, William Paul, Corban Rivera, Ketul Shah, Celso M. de Melo, Rama Chellappa

    Abstract: In this work, we tackle the problem of unsupervised domain adaptation (UDA) for video action recognition. Our approach, which we call UNITE, uses an image teacher model to adapt a video student model to the target domain. UNITE first employs self-supervised pre-training to promote discriminative feature learning on target domain videos using a teacher-guided masked distillation objective. We then… ▽ More

    Submitted 20 April, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted at CVPR 2024. 13 pages, 4 figures. Approved for public release: distribution unlimited

  14. arXiv:2311.12161  [pdf, other

    cs.CV

    ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing

    Authors: Ayush Kumar Shah, Bryan Manrique Amador, Abhisek Dey, Ming Creekmore, Blake Ocampo, Scott Denmark, Richard Zanibbi

    Abstract: Most molecular diagram parsers recover chemical structure from raster images (e.g., PNGs). However, many PDFs include commands giving explicit locations and shapes for characters, lines, and polygons. We present a new parser that uses these born-digital PDF primitives as input. The parsing model is fast and accurate, and does not require GPUs, Optical Character Recognition (OCR), or vectorization.… ▽ More

    Submitted 31 May, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 20 pages without references, 12 figures, 4 Tables, submitted to International Conference on Document Analysis and Recognition (ICDAR) - Journal Track

  15. arXiv:2311.09753  [pdf, other

    cs.CV

    DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics

    Authors: Aniket Roy, Maiterya Suin, Anshul Shah, Ketul Shah, Jiang Liu, Rama Chellappa

    Abstract: Diffusion models have advanced generative AI significantly in terms of editing and creating naturalistic images. However, efficiently improving generated image quality is still of paramount interest. In this context, we propose a generic "naturalness" preserving loss function, viz., kurtosis concentration (KC) loss, which can be readily applied to any standard diffusion model pipeline to elevate t… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  16. arXiv:2311.06430  [pdf, other

    cs.RO

    GOAT: GO to Any Thing

    Authors: Matthew Chang, Theophile Gervet, Mukul Khanna, Sriram Yenamandra, Dhruv Shah, So Yeon Min, Kavit Shah, Chris Paxton, Saurabh Gupta, Dhruv Batra, Roozbeh Mottaghi, Jitendra Malik, Devendra Singh Chaplot

    Abstract: In deployment scenarios such as homes and warehouses, mobile robots are expected to autonomously navigate for extended periods, seamlessly executing tasks articulated in terms that are intuitively understandable by human operators. We present GO To Any Thing (GOAT), a universal navigation system capable of tackling these requirements with three key features: a) Multimodal: it can tackle goals spec… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  17. arXiv:2310.12143  [pdf, other

    cs.LG cs.CL stat.ML

    Simple Mechanisms for Representing, Indexing and Manipulating Concepts

    Authors: Yuanzhi Li, Raghu Meka, Rina Panigrahy, Kulin Shah

    Abstract: Deep networks typically learn concepts via classifiers, which involves setting up a model and training it via gradient descent to fit the concept-labeled data. We will argue instead that learning a concept could be done by looking at its moment statistics matrix to generate a concrete representation or signature of that concept. These signatures can be used to discover structure across the set of… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 19 pages

  18. arXiv:2310.07886  [pdf, other

    cs.CV

    A Survey of Feature Types and Their Contributions for Camera Tampering Detection

    Authors: Pranav Mantini, Shishir K. Shah

    Abstract: Camera tamper detection is the ability to detect unauthorized and unintentional alterations in surveillance cameras by analyzing the video. Camera tampering can occur due to natural events or it can be caused intentionally to disrupt surveillance. We cast tampering detection as a change detection problem, and perform a review of the existing literature with emphasis on feature types. We formulate… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  19. arXiv:2309.03844  [pdf, other

    cs.NI eess.SP

    Experimental Study of Adversarial Attacks on ML-based xApps in O-RAN

    Authors: Naveen Naik Sapavath, Brian Kim, Kaushik Chowdhury, Vijay K Shah

    Abstract: Open Radio Access Network (O-RAN) is considered as a major step in the evolution of next-generation cellular networks given its support for open interfaces and utilization of artificial intelligence (AI) into the deployment, operation, and maintenance of RAN. However, due to the openness of the O-RAN architecture, such AI models are inherently vulnerable to various adversarial machine learning (ML… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted for Globecom 2023

  20. arXiv:2307.16321  [pdf, other

    cs.CV

    Self-Supervised Learning of Gait-Based Biomarkers

    Authors: R. James Cotton, J. D. Peiffer, Kunal Shah, Allison DeLillo, Anthony Cimorelli, Shawana Anarwala, Kayan Abdou, Tasos Karakostas

    Abstract: Markerless motion capture (MMC) is revolutionizing gait analysis in clinical settings by making it more accessible, raising the question of how to extract the most clinically meaningful information from gait data. In multiple fields ranging from image processing to natural language processing, self-supervised learning (SSL) from large amounts of unannotated data produces very effective representat… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Accepted to Ambient Inteligence for Healthcare workshop at MICCAI 2023

    Journal ref: Ambient Inteligence for Healthcare workshop at MICCAI 2023

  21. arXiv:2307.12473  [pdf, other

    cs.IT

    Adaptive RRI Selection Algorithms for Improved Cooperative Awareness in Decentralized NR-V2X

    Authors: Avik Dayal, Vijay K. Shah, Harpreet S. Dhillon, Jeffrey H. Reed

    Abstract: Decentralized vehicle-to-everything (V2X) networks (i.e., C-V2X Mode-4 and NR-V2X Mode-2) utilize sensing-based semi-persistent scheduling (SPS) where vehicles sense and reserve suitable radio resources for Basic Safety Message (BSM) transmissions at prespecified periodic intervals termed as Resource Reservation Interval (RRI). Vehicles rely on these received periodic BSMs to localize nearby (tran… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  22. arXiv:2307.11317  [pdf, other

    cs.LG cs.AI cs.CV

    XLDA: Linear Discriminant Analysis for Scaling Continual Learning to Extreme Classification at the Edge

    Authors: Karan Shah, Vishruth Veerendranath, Anushka Hebbar, Raghavendra Bhat

    Abstract: Streaming Linear Discriminant Analysis (LDA) while proven in Class-incremental Learning deployments at the edge with limited classes (upto 1000), has not been proven for deployment in extreme classification scenarios. In this paper, we present: (a) XLDA, a framework for Class-IL in edge deployment where LDA classifier is proven to be equivalent to FC layer including in extreme classification scena… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Submitted at ICML 2023: PAC-Bayes Interactive Learning Workshop

  23. arXiv:2307.01178  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Learning Mixtures of Gaussians Using the DDPM Objective

    Authors: Kulin Shah, Sitan Chen, Adam Klivans

    Abstract: Recent works have shown that diffusion models can learn essentially any distribution provided one can perform score estimation. Yet it remains poorly understood under what settings score estimation is possible, let alone when practical gradient-based algorithms for this task can provably succeed. In this work, we give the first provably efficient results along these lines for one of the most fun… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 48 pages

  24. arXiv:2306.15242  [pdf, other

    cs.CV

    SPDER: Semiperiodic Dam**-Enabled Object Representation

    Authors: Kathan Shah, Chawin Sitawarin

    Abstract: We present a neural network architecture designed to naturally learn a positional embedding and overcome the spectral bias towards lower frequencies faced by conventional implicit neural representation networks. Our proposed architecture, SPDER, is a simple MLP that uses an activation function composed of a sinusoidal multiplied by a sublinear function, called the dam** function. The sinusoidal… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  25. arXiv:2306.05376  [pdf, other

    cs.CV cs.LG

    Anomaly Detection in Satellite Videos using Diffusion Models

    Authors: Akash Awasthi, Son Ly, Jaer Nizam, Samira Zare, Videet Mehta, Safwan Ahmed, Keshav Shah, Ramakrishna Nemani, Saurabh Prasad, Hien Van Nguyen

    Abstract: The definition of anomaly detection is the identification of an unexpected event. Real-time detection of extreme events such as wildfires, cyclones, or floods using satellite data has become crucial for disaster management. Although several earth-observing satellites provide information about disasters, satellites in the geostationary orbit provide data at intervals as frequent as every minute, ef… ▽ More

    Submitted 25 May, 2023; originally announced June 2023.

  26. arXiv:2305.19256  [pdf, other

    cs.LG cs.AI cs.CV cs.IT

    Ambient Diffusion: Learning Clean Distributions from Corrupted Data

    Authors: Giannis Daras, Kulin Shah, Yuval Dagan, Aravind Gollakota, Alexandros G. Dimakis, Adam Klivans

    Abstract: We present the first diffusion-based framework that can learn an unknown distribution using only highly-corrupted samples. This problem arises in scientific applications where access to uncorrupted samples is impossible or expensive to acquire. Another benefit of our approach is the ability to train generative models that are less likely to memorize individual training samples since they never obs… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 24 pages, 11 figures

  27. arXiv:2305.14410  [pdf, other

    cs.CV cs.AI cs.CL

    Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach

    Authors: Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Kumar Mondal, Dinesh Khandelwal, Dinesh Garg, Parag Singla

    Abstract: We are interested in image manipulation via natural language text -- a task that is useful for multiple AI applications but requires complex reasoning over multi-modal spaces. We extend recently proposed Neuro Symbolic Concept Learning (NSCL), which has been quite effective for the task of Visual Question Answering (VQA), for the task of image manipulation. Our system referred to as NeuroSIM can p… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (long paper, main conference)

  28. arXiv:2304.00387  [pdf, other

    cs.CV

    HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions

    Authors: Anshul Shah, Aniket Roy, Ketul Shah, Shlok Kumar Mishra, David Jacobs, Anoop Cherian, Rama Chellappa

    Abstract: Supervised learning of skeleton sequence encoders for action recognition has received significant attention in recent times. However, learning such encoders without labels continues to be a challenging problem. While prior works have shown promising results by applying contrastive learning to pose sequences, the quality of the learned representations is often observed to be closely tied to data au… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: To be presented at CVPR 2023

  29. arXiv:2303.10654  [pdf, other

    cs.CV

    Markerless Motion Capture and Biomechanical Analysis Pipeline

    Authors: R. James Cotton, Allison DeLillo, Anthony Cimorelli, Kunal Shah, J. D. Peiffer, Shawana Anarwala, Kayan Abdou, Tasos Karakostas

    Abstract: Markerless motion capture using computer vision and human pose estimation (HPE) has the potential to expand access to precise movement analysis. This could greatly benefit rehabilitation by enabling more accurate tracking of outcomes and providing more sensitive tools for research. There are numerous steps between obtaining videos to extracting accurate biomechanical results and limited research t… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

  30. arXiv:2303.10280  [pdf, other

    cs.CV

    Synthetic-to-Real Domain Adaptation for Action Recognition: A Dataset and Baseline Performances

    Authors: Arun V. Reddy, Ketul Shah, William Paul, Rohita Mocharla, Judy Hoffman, Kapil D. Katyal, Dinesh Manocha, Celso M. de Melo, Rama Chellappa

    Abstract: Human action recognition is a challenging problem, particularly when there is high variability in factors such as subject appearance, backgrounds and viewpoint. While deep neural networks (DNNs) have been shown to perform well on action recognition tasks, they typically require large amounts of high-quality labeled data to achieve robust performance across a variety of conditions. Synthetic data h… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: ICRA 2023. The first two authors contributed equally. Dataset available at: https://github.com/reddyav1/RoCoG-v2

  31. Keep It Simple: CNN Model Complexity Studies for Interference Classification Tasks

    Authors: Taiwo Oyedare, Vijay K. Shah, Daniel J. Jakubisin, Jeffrey H. Reed

    Abstract: The growing number of devices using the wireless spectrum makes it important to find ways to minimize interference and optimize the use of the spectrum. Deep learning models, such as convolutional neural networks (CNNs), have been widely utilized to identify, classify, or mitigate interference due to their ability to learn from the data directly. However, there have been limited research on the co… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 6 pages, 7 figures, 3 tables

  32. arXiv:2303.02413  [pdf, other

    cs.CV

    Improved Trajectory Reconstruction for Markerless Pose Estimation

    Authors: R. James Cotton, Anthony Cimorelli, Kunal Shah, Shawana Anarwala, Scott Uhlrich, Tasos Karakostas

    Abstract: Markerless pose estimation allows reconstructing human movement from multiple synchronized and calibrated views, and has the potential to make movement analysis easy and quick, including gait analysis. This could enable much more frequent and quantitative characterization of gait impairments, allowing better monitoring of outcomes and responses to interventions. However, the impact of different ke… ▽ More

    Submitted 8 March, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  33. arXiv:2212.05404  [pdf, other

    cs.CV

    Cap2Aug: Caption guided Image to Image data Augmentation

    Authors: Aniket Roy, Anshul Shah, Ketul Shah, Anirban Roy, Rama Chellappa

    Abstract: Visual recognition in a low-data regime is challenging and often prone to overfitting. To mitigate this issue, several data augmentation strategies have been proposed. However, standard transformations, e.g., rotation, crop**, and flip** provide limited semantic variations. To this end, we propose Cap2Aug, an image-to-image diffusion model-based data augmentation strategy using image captions… ▽ More

    Submitted 6 November, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

  34. arXiv:2212.04563  [pdf, other

    cs.CR

    An SLR on Edge Computing Security and possible threat protection

    Authors: Harsiddh Kalariya, Kavish Shah, Vini Patel

    Abstract: Mobile and Internet of Things devices are generating enormous amounts of multi-modal data due to their exponential growth and accessibility. As a result, these data sources must be directly analyzed in real time at the network edge rather than relying on the cloud. Significant processing power at the network's edge has made it possible to gather data and make decisions prior to data being sent to… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

  35. arXiv:2205.13178  [pdf, other

    cs.NI cs.DC

    Prototy** Next-Generation O-RAN Research Testbeds with SDRs

    Authors: Pratheek S. Upadhyaya, Aly S. Abdalla, Vuk Marojevic, Jeffrey H. Reed, Vijay K. Shah

    Abstract: Open RAN (O-RAN) defines an emerging cellular radio access network (RAN) architecture for future 6G wireless networks, emphasizing openness and intelligence which are considered the foundations of future 6G wireless networks. While the inherent complexity and flexibility of the RAN give rise to many new research problems, progress in develo** solutions is hampered due to the lack of end-to-end,… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: This manuscript has been submitted to IEEE Vehicular Technology Magazine for possible publication

  36. arXiv:2203.11258  [pdf, other

    cs.CL

    Efficient Classification of Long Documents Using Transformers

    Authors: Hyunji Hayley Park, Yogarshi Vyas, Kashif Shah

    Abstract: Several methods have been proposed for classifying long textual documents using Transformers. However, there is a lack of consensus on a benchmark to enable a fair comparison among different approaches. In this paper, we provide a comprehensive evaluation of the relative efficacy measured against various baselines and diverse datasets -- both in terms of accuracy as well as time and space overhead… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022; 8 pages

  37. arXiv:2203.04227  [pdf, other

    cs.IT cs.LG

    A Practical AoI Scheduler in IoT Networks with Relays

    Authors: Biplav Choudhury, Prasenjit Karmakar, Vijay K. Shah, Jeffrey H. Reed

    Abstract: Internet of Things (IoT) networks have become ubiquitous as autonomous computing, communication and collaboration among devices become popular for accomplishing various tasks. The use of relays in IoT networks further makes it convenient to deploy IoT networks as relays provide a host of benefits, like increasing the communication range and minimizing power consumption. Existing literature on trad… ▽ More

    Submitted 25 April, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2107.05181

  38. arXiv:2201.04180  [pdf, other

    cs.RO cs.AI cs.LG math.OC

    Learning Robust Policies for Generalized Debris Capture with an Automated Tether-Net System

    Authors: Chen Zeng, Grant Hecht, Prajit KrisshnaKumar, Raj K. Shah, Souma Chowdhury, Eleonora M. Botta

    Abstract: Tether-net launched from a chaser spacecraft provides a promising method to capture and dispose of large space debris in orbit. This tether-net system is subject to several sources of uncertainty in sensing and actuation that affect the performance of its net launch and closing control. Earlier reliability-based optimization approaches to design control actions however remain challenging and compu… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: This paper has been presented at the 2022 AIAA SciTech Forum and Exposition, and accepted for publication in the corresponding AIAA proceedings

    MSC Class: 68T05 (Primary) 70E60; 93E35 (Secondary) ACM Class: I.2.9; J.2

  39. arXiv:2112.08988  [pdf, other

    eess.SP cs.LG cs.NI

    Interference Suppression Using Deep Learning: Current Approaches and Open Challenges

    Authors: Taiwo Oyedare, Vijay K Shah, Daniel J Jakubisin, Jeff H Reed

    Abstract: In light of the finite nature of the wireless spectrum and the increasing demand for spectrum use arising from recent technological breakthroughs in wireless communication, the problem of interference continues to persist. Despite recent advancements in resolving interference issues, interference still presents a difficult challenge to effective usage of the spectrum. This is partly due to the ris… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 26 pages, 10 figures, journal article

  40. arXiv:2111.13754  [pdf, other

    cs.NI eess.SY

    Toward Next Generation Open Radio Access Network--What O-RAN Can and Cannot Do!

    Authors: Aly S. Abdalla, Pratheek S. Upadhyaya, Vijay K. Shah, Vuk Marojevic

    Abstract: The open radio access network (O-RAN) describes an industry-driven open architecture and interfaces for building next generation RANs with artificial intelligence (AI) controllers. We circulated a survey among researchers, developers, and practitioners to gather their perspectives on O-RAN as a framework for 6G wireless research and development (R&D). The majority responded in favor of O-RAN and i… ▽ More

    Submitted 25 March, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: This article has been accepted for publication in the IEEE Network Magazine

  41. arXiv:2111.07089  [pdf, other

    cs.LG eess.SP

    Evaluating Contrastive Learning on Wearable Timeseries for Downstream Clinical Outcomes

    Authors: Kevalee Shah, Dimitris Spathis, Chi Ian Tang, Cecilia Mascolo

    Abstract: Vast quantities of person-generated health data (wearables) are collected but the process of annotating to feed to machine learning models is impractical. This paper discusses ways in which self-supervised approaches that use contrastive losses, such as SimCLR and BYOL, previously applied to the vision domain, can be applied to high-dimensional health signals for downstream classification tasks of… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  42. arXiv:2111.05457  [pdf, other

    cs.IT

    Optimizing Number, Placement, and Backhaul Connectivity of Multi-UAV Networks

    Authors: Javad Sabzehali, Vijay K. Shah, Qiang Fan, Biplav Choudhury, Lingjia Liu, Jeffrey H. Reed

    Abstract: Multi-Unmanned Aerial Vehicle (UAV) Networks is a promising solution to providing wireless coverage to ground users in challenging rural areas (such as Internet of Things (IoT) devices in farmlands), where the traditional cellular networks are sparse or unavailable. A key challenge in such networks is the 3D placement of all UAV base stations such that the formed Multi-UAV Network (i) utilizes a m… ▽ More

    Submitted 16 June, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: To appear in IEEE Internet of Things Journal

  43. arXiv:2110.11602  [pdf, other

    cs.DS cs.IR cs.OS

    An O(1) algorithm for implementing the LFU cache eviction scheme

    Authors: Dhruv Matani, Ketan Shah, Anirban Mitra

    Abstract: Cache eviction algorithms are used widely in operating systems, databases and other systems that use caches to speed up execution by caching data that is used by the application. There are many policies such as MRU (Most Recently Used), MFU (Most Frequently Used), LRU (Least Recently Used) and LFU (Least Frequently Used) which each have their advantages and drawbacks and are hence used in specific… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  44. arXiv:2110.09308  [pdf, other

    cs.NI eess.SP eess.SY

    Power Systems Performance under 5G Radio Access Network in a Co-Simulation Environment

    Authors: Rahul Iyer, Biplav Choudhury, Vijay K. Shah, Ali Mehrizi-Sani

    Abstract: Communication can improve control of important system parameters by allowing different grid components to communicate their states with each other. This information exchange requires a reliable and fast communication infrastructure. 5G communication can be a viable means to achieve this objective. This paper investigates the performance of several smart grid applications under a 5G radio access ne… ▽ More

    Submitted 16 August, 2021; originally announced October 2021.

  45. arXiv:2107.10956  [pdf, other

    cs.RO math.OC

    Reciprocal Multi-Robot Collision Avoidance with Asymmetric State Uncertainty

    Authors: Kunal Shah, Guillermo Angeris, Mac Schwager

    Abstract: We present a general decentralized formulation for a large class of collision avoidance methods and show that all collision avoidance methods of this form are guaranteed to be collision free. This class includes several existing algorithms in the literature as special cases. We then present a particular instance of this collision avoidance method, CARP (Collision Avoidance by Reciprocal Projection… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:1905.12875

  46. arXiv:2107.05181  [pdf, other

    cs.NI cs.AI eess.SP

    AoI-minimizing Scheduling in UAV-relayed IoT Networks

    Authors: Biplav Choudhury, Vijay K. Shah, Aidin Ferdowsi, Jeffrey H. Reed, Y. Thomas Hou

    Abstract: Due to flexibility, autonomy and low operational cost, unmanned aerial vehicles (UAVs), as fixed aerial base stations, are increasingly being used as \textit{relays} to collect time-sensitive information (i.e., status updates) from IoT devices and deliver it to the nearby terrestrial base station (TBS), where the information gets processed. In order to ensure timely delivery of information to the… ▽ More

    Submitted 24 September, 2021; v1 submitted 11 July, 2021; originally announced July 2021.

  47. arXiv:2107.04140  [pdf, other

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in… ▽ More

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  48. arXiv:2107.03090  [pdf, other

    cs.LG cs.AI

    RISAN: Robust Instance Specific Abstention Network

    Authors: Bhavya Kalra, Kulin Shah, Naresh Manwani

    Abstract: In this paper, we propose deep architectures for learning instance specific abstain (reject option) binary classifiers. The proposed approach uses double sigmoid loss function as described by Kulin Shah and Naresh Manwani in ("Online Active Learning of Reject Option Classifiers", AAAI, 2020), as a performance measure. We show that the double sigmoid loss is classification calibrated. We also show… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

  49. arXiv:2106.14574  [pdf, other

    cs.CL

    Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics

    Authors: Paula Czarnowska, Yogarshi Vyas, Kashif Shah

    Abstract: Measuring bias is key for better understanding and addressing unfairness in NLP/ML models. This is often done via fairness metrics which quantify the differences in a model's behaviour across a range of demographic groups. In this work, we shed more light on the differences and similarities between the fairness metrics used in NLP. First, we unify a broad range of existing metrics under three gene… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted for publication in Transaction of the Association for Computational Linguistics (TACL), 2021. The arXiv version is a pre-MIT Press publication version

  50. arXiv:2106.10535  [pdf, other

    cs.LG cs.AI

    Learning and Generalization in Overparameterized Normalizing Flows

    Authors: Kulin Shah, Amit Deshpande, Navin Goyal

    Abstract: In supervised learning, it is known that overparameterized neural networks with one hidden layer provably and efficiently learn and generalize, when trained using stochastic gradient descent with a sufficiently small learning rate and suitable initialization. In contrast, the benefit of overparameterization in unsupervised learning is not well understood. Normalizing flows (NFs) constitute an impo… ▽ More

    Submitted 23 March, 2022; v1 submitted 19 June, 2021; originally announced June 2021.

    Comments: 75 pages, Accepted in AISTATS 2022