Skip to main content

Showing 1–20 of 20 results for author: Suprem, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2301.07981  [pdf, other

    cs.LG cs.CL cs.CY cs.SI

    Continuously Reliable Detection of New-Normal Misinformation: Semantic Masking and Contrastive Smoothing in High-Density Latent Regions

    Authors: Abhijit Suprem, Joao Eduardo Ferreira, Calton Pu

    Abstract: Toxic misinformation campaigns have caused significant societal harm, e.g., affecting elections and COVID-19 information awareness. Unfortunately, despite successes of (gold standard) retrospective studies of misinformation that confirmed their harmful effects after the fact, they arrive too late for timely intervention and reduction of such harm. By design, misinformation evades retrospective cla… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  2. arXiv:2211.12508  [pdf

    cs.CL cs.LG cs.SI

    Time-Aware Datasets are Adaptive Knowledgebases for the New Normal

    Authors: Abhijit Suprem, Sanjyot Vaidya, Joao Eduardo Ferreira, Calton Pu

    Abstract: Recent advances in text classification and knowledge capture in language models have relied on availability of large-scale text datasets. However, language models are trained on static snapshots of knowledge and are limited when that knowledge evolves. This is especially critical for misinformation detection, where new types of misinformation continuously appear, replacing old campaigns. We propos… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  3. arXiv:2211.09322  [pdf, other

    cs.CV

    Targeted Attention for Generalized- and Zero-Shot Learning

    Authors: Abhijit Suprem

    Abstract: The Zero-Shot Learning (ZSL) task attempts to learn concepts without any labeled data. Unlike traditional classification/detection tasks, the evaluation environment is provided unseen classes never encountered during training. As such, it remains both challenging, and promising on a variety of fronts, including unsupervised concept learning, domain adaptation, and dataset drift detection. Recently… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  4. arXiv:2211.09098  [pdf, other

    cs.CV cs.LG eess.SY

    ATEAM: Knowledge Integration from Federated Datasets for Vehicle Feature Extraction using Annotation Team of Experts

    Authors: Abhijit Suprem, Purva Singh, Suma Cherkadi, Sanjyot Vaidya, Joao Eduardo Ferreira, Calton Pu

    Abstract: The vehicle recognition area, including vehicle make-model recognition (VMMR), re-id, tracking, and parts-detection, has made significant progress in recent years, driven by several large-scale datasets for each task. These datasets are often non-overlap**, with different label schemas for each task: VMMR focuses on make and model, while re-id focuses on vehicle ID. It is promising to combine th… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: ATEAM for Vehicle Classification and Re-ID

  5. arXiv:2211.06783  [pdf

    cs.LG eess.SY

    EdnaML: A Declarative API and Framework for Reproducible Deep Learning

    Authors: Abhijit Suprem, Sanjyot Vaidya, Avinash Venugopal, Joao Eduardo Ferreira, Calton Pu

    Abstract: Machine Learning has become the bedrock of recent advances in text, image, video, and audio processing and generation. Most production systems deal with several models during deployment and training, each with a variety of tuned hyperparameters. Furthermore, data collection and processing aspects of ML pipelines are receiving increasing interest due to their importance in creating sustainable high… ▽ More

    Submitted 12 November, 2022; originally announced November 2022.

  6. arXiv:2205.10011  [pdf, other

    cs.CV cs.LG

    Constructive Interpretability with CoLabel: Corroborative Integration, Complementary Features, and Collaborative Learning

    Authors: Abhijit Suprem, Sanjyot Vaidya, Suma Cherkadi, Purva Singh, Joao Eduardo Ferreira, Calton Pu

    Abstract: Machine learning models with explainable predictions are increasingly sought after, especially for real-world, mission-critical applications that require bias detection and risk mitigation. Inherent interpretability, where a model is designed from the ground-up for interpretability, provides intuitive insights and transparent explanations on model prediction and performance. In this paper, we pres… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  7. arXiv:2205.09817  [pdf, other

    cs.LG cs.CL

    MiDAS: Multi-integrated Domain Adaptive Supervision for Fake News Detection

    Authors: Abhijit Suprem, Calton Pu

    Abstract: COVID-19 related misinformation and fake news, coined an 'infodemic', has dramatically increased over the past few years. This misinformation exhibits concept drift, where the distribution of fake news changes over time, reducing effectiveness of previously trained models for fake news detection. Given a set of fake news models trained on multiple domains, we propose an adaptive decision module to… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: We use Lipschitz smoothness and probabilistic Lipschitzness to build a theoretical foundation for effective multi-domain adaptation using randomized perturbations on unseen data

  8. arXiv:2205.07154  [pdf, other

    cs.LG cs.CL

    Evaluating Generalizability of Fine-Tuned Models for Fake News Detection

    Authors: Abhijit Suprem, Calton Pu

    Abstract: The Covid-19 pandemic has caused a dramatic and parallel rise in dangerous misinformation, denoted an `infodemic' by the CDC and WHO. Misinformation tied to the Covid-19 infodemic changes continuously; this can lead to performance degradation of fine-tuned models due to concept drift. Degredation can be mitigated if models generalize well-enough to capture some cyclical aspects of drifted data. In… ▽ More

    Submitted 23 May, 2022; v1 submitted 14 May, 2022; originally announced May 2022.

  9. arXiv:2011.05416  [pdf, other

    cs.SI cs.LG

    Challenges and Opportunities in Rapid Epidemic Information Propagation with Live Knowledge Aggregation from Social Media

    Authors: Calton Pu, Abhijit Suprem, Rodrigo Alves Lima

    Abstract: A rapidly evolving situation such as the COVID-19 pandemic is a significant challenge for AI/ML models because of its unpredictability. %The most reliable indicator of the pandemic spreading has been the number of test positive cases. However, the tests are both incomplete (due to untested asymptomatic cases) and late (due the lag from the initial contact event, worsening symptoms, and test result… ▽ More

    Submitted 8 November, 2020; originally announced November 2020.

  10. arXiv:2010.04084  [pdf, other

    cs.SI

    EDNA-Covid: A Large-Scale Covid-19 Tweets Dataset Collected with the EDNA Streaming Toolkit

    Authors: Abhijit Suprem, Calton Pu

    Abstract: The Covid-19 pandemic has fundamentally altered many facets of our lives. With nationwide lockdowns and stay-at-home advisories, conversations about the pandemic have naturally moved to social networks, e.g. Twitter. This affords an unprecedented insight into the evolution of social discourse in the presence of a long-running destabilizing factor such as a pandemic with the high-volume, high-veloc… ▽ More

    Submitted 21 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

  11. arXiv:2009.05440  [pdf, other

    cs.CV cs.LG eess.SY

    ODIN: Automated Drift Detection and Recovery in Video Analytics

    Authors: Abhijit Suprem, Joy Arulraj, Calton Pu, Joao Ferreira

    Abstract: Recent advances in computer vision have led to a resurgence of interest in visual data analytics. Researchers are develo** systems for effectively and efficiently analyzing visual data at scale. A significant challenge that these systems encounter lies in the drift in real-world visual data. For instance, a model for self-driving vehicles that is not trained on images containing snow does not wo… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

    Journal ref: PVLDB, 13(11):2453-2465, 2020

  12. arXiv:2002.02256  [pdf, other

    cs.CV

    Looking GLAMORous: Vehicle Re-Id in Heterogeneous Cameras Networks with Global and Local Attention

    Authors: Abhijit Suprem, Calton Pu

    Abstract: Vehicle re-identification (re-id) is a fundamental problem for modern surveillance camera networks. Existing approaches for vehicle re-id utilize global features and local features for re-id by combining multiple subnetworks and losses. In this paper, we propose GLAMOR, or Global and Local Attention MOdules for Re-id. GLAMOR performs global and local feature extraction simultaneously in a unified… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

  13. arXiv:2001.08895  [pdf, other

    cs.CV cs.IR cs.LG

    Small, Accurate, and Fast Vehicle Re-ID on the Edge: the SAFR Approach

    Authors: Abhijit Suprem, Calton Pu, Joao Eduardo Ferreira

    Abstract: We propose a Small, Accurate, and Fast Re-ID (SAFR) design for flexible vehicle re-id under a variety of compute environments such as cloud, mobile, edge, or embedded devices by only changing the re-id model backbone. Through best-fit design choices, feature extraction, training tricks, global attention, and local attention, we create a reid model design that optimizes multi-dimensionally along mo… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  14. arXiv:2001.08700  [pdf, other

    cs.IR cs.CL cs.LG

    EventMapper: Detecting Real-World Physical Events Using Corroborative and Probabilistic Sources

    Authors: Abhijit Suprem, Calton Pu

    Abstract: The ubiquity of social media makes it a rich source for physical event detection, such as disasters, and as a potential resource for crisis management resource allocation. There have been some recent works on leveraging social media sources for retrospective, after-the-fact event detection of large events such as earthquakes or hurricanes. Similarly, there is a long history of using traditional ph… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

  15. arXiv:1912.04423  [pdf, other

    cs.CV cs.LG

    Robust, Extensible, and Fast: Teamed Classifiers for Vehicle Tracking and Vehicle Re-ID in Multi-Camera Networks

    Authors: Abhijit Suprem, Rodrigo Alves Lima, Bruno Padilha, Joao Eduardo Ferreira, Calton Pu

    Abstract: As camera networks have become more ubiquitous over the past decade, the research interest in video management has shifted to analytics on multi-camera networks. This includes performing tasks such as object detection, attribute identification, and vehicle/person tracking across different cameras without overlap. Current frameworks for management are designed for multi-camera networks in a closed… ▽ More

    Submitted 7 January, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Journal ref: 2019 IEEE Conference on Cognitive Machine Intelligence

  16. arXiv:1911.09281  [pdf, other

    cs.LG cs.SI eess.SY stat.ML

    Event Detection in Noisy Streaming Data with Combination of Corroborative and Probabilistic Sources

    Authors: Abhijit Suprem, Calton Pu

    Abstract: Global physical event detection has traditionally relied on dense coverage of physical sensors around the world; while this is an expensive undertaking, there have not been alternatives until recently. The ubiquity of social networks and human sensors in the field provides a tremendous amount of real-time, live data about true physical events from around the world. However, while such human sensor… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

    Journal ref: IEEE Collaboration in Computing 2019

  17. arXiv:1911.05494  [pdf

    cs.SI cs.CY cs.LG stat.ML

    Concept Drift Adaptive Physical Event Detection for Social Media Streams

    Authors: Abhijit Suprem, Aibek Musaev, Calton Pu

    Abstract: Event detection has long been the domain of physical sensors operating in a static dataset assumption. The prevalence of social media and web access has led to the emergence of social, or human sensors who report on events globally. This warrants development of event detectors that can take advantage of the truly dense and high spatial and temporal resolution data provided by more than 3 billion s… ▽ More

    Submitted 17 September, 2019; originally announced November 2019.

    Journal ref: Services Congress 2019

  18. arXiv:1910.01064  [pdf, other

    cs.LG stat.ML

    Concept Drift Detection and Adaptation with Weak Supervision on Streaming Unlabeled Data

    Authors: Abhijit Suprem

    Abstract: Concept drift in learning and classification occurs when the statistical properties of either the data features or target change over time; evidence of drift has appeared in search data, medical research, malware, web data, and video. Drift adaptation has not yet been addressed in high dimensional, noisy, low-context data such as streaming text, video, or images due to the unique challenges these… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

  19. arXiv:1909.07596  [pdf, other

    cs.SI cs.LG eess.SP

    ASSED -- A Framework for Identifying Physical Events through Adaptive Social Sensor Data Filtering

    Authors: Abhijit Suprem, Calton Pu

    Abstract: Physical event detection has long been the domain of static event processors operating on numeric sensor data. This works well for large scale strong-signal events such as hurricanes, and important classes of events such as earthquakes. However, for a variety of domains there is insufficient sensor coverage, e.g., landslides, wildfires, and flooding. Social networks have provided massive volume of… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Journal ref: ACM DEBS 2019

  20. arXiv:1803.05401  [pdf, other

    cs.CV cs.IR

    Approximate Query Matching for Image Retrieval

    Authors: Abhijit Suprem, Polo Chau

    Abstract: Traditional image recognition involves identifying the key object in a portrait-type image with a single object focus (ILSVRC, AlexNet, and VGG). More recent approaches consider dense image recognition - segmenting an image with appropriate bounding boxes and performing image recognition within these bounding boxes (Semantic segmentation). The Visual Genome dataset [5] is an attempt to bridge thes… ▽ More

    Submitted 14 March, 2018; originally announced March 2018.