Showing 1–50 of 54 results for author: Sudhanshu

Searching in archive cs.
  1. arXiv:2406.10886  [pdf, other]

    cs.CL cs.LG

    Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

    Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

    Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi…

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2404.05243  [pdf, other]

    cs.CL cs.AI

    Product Description and QA Assisted Self-Supervised Opinion Summarization

    Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

    Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s…

    Submitted 8 April, 2024; originally announced April 2024.

  3. Loss Regularizing Robotic Terrain Classification

    Authors: Shakti Deo Kumar, Sudhanshu Tripathi, Krishna Ujjwal, Sarvada Sakshi Jha, Suddhasil De

    Abstract: Locomotion mechanics of legged robots are suitable when pacing through difficult terrains. Recognising terrains for such robots are important to fully yoke the versatility of their movements. Consequently, robotic terrain classification becomes significant to classify terrains in real time with high accuracy. The conventional classifiers suffer from overfitting problem, low accuracy problem, high…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Preliminary draft of the work published in IEEE conference 2023

  4. arXiv:2402.15473  [pdf, other]

    cs.CL cs.LG

    Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

    Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten…

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 19 pages, 6 figures, 21 tables

  5. arXiv:2402.11683  [pdf, other]

    cs.CL

    One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

    Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat…

    Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  6. arXiv:2402.01843  [pdf, other]

    cs.DC

    Towards a Scalable In Situ Fast Fourier Transform

    Authors: Sudhanshu Kulkarni, Burlen Loring, E. Wes Bethel

    Abstract: The Fast Fourier Transform (FFT) is a numerical operation that transforms a function into a form comprised of its constituent frequencies and is an integral part of scientific computation and data analysis. The objective of our work is to enable use of the FFT as part of a scientific in situ processing chain to facilitate the analysis of data in the spectral regime. We describe the implementation…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 5 pages, 2 figures. Submitted to ISAV workshop in SC23 conference

  7. arXiv:2401.06783  [pdf, other]

    cs.CL cs.AI cs.LG cs.SI

    MultiSiam: A Multiple Input Siamese Network For Social Media Text Classification And Duplicate Text Detection

    Authors: Sudhanshu Bhoi, Swapnil Markhedkar, Shruti Phadke, Prashant Agrawal

    Abstract: Social media accounts post increasingly similar content, creating a chaotic experience across platforms, which makes accessing desired information difficult. These posts can be organized by categorizing and grouping duplicates across social handles and accounts. There can be more than one duplicate of a post, however, a conventional Siamese neural network only considers a pair of inputs for duplic…

    Submitted 6 January, 2024; originally announced January 2024.

  8. arXiv:2312.14973  [pdf, other]

    cs.GR

    Interactive Visualization of Time-Varying Flow Fields Using Particle Tracing Neural Networks

    Authors: Mengjiao Han, Jixian Li, Sudhanshu Sane, Shubham Gupta, Bei Wang, Steve Petruzza, Chris R. Johnson

    Abstract: In this paper, we present a comprehensive evaluation to establish a robust and efficient framework for Lagrangian-based particle tracing using deep neural networks (DNNs). Han et al. (2021) first proposed a DNN-based approach to learn Lagrangian representations and demonstrated accurate particle tracing for an analytic 2D flow field. In this paper, we extend and build upon this prior work in signi…

    Submitted 15 May, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by Pacific Vis 2024

  9. arXiv:2312.03691  [pdf, other]

    cs.LG cs.SI

    On the Role of Edge Dependency in Graph Generative Models

    Authors: Sudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos, Charalampos Tsourakakis

    Abstract: In this work, we introduce a novel evaluation framework for generative models of graphs, emphasizing the importance of model-generated graph overlap (Chanpuriya et al., 2021) to ensure both accuracy and edge-diversity. We delineate a hierarchy of graph generative models categorized into three levels of complexity: edge independent, node independent, and fully dependent models. This hierarchy encap…

    Submitted 6 December, 2023; originally announced December 2023.

  10. arXiv:2311.11250  [pdf, other]

    cs.AI

    A Comprehensive Review on Sentiment Analysis: Tasks, Approaches and Applications

    Authors: Sudhanshu Kumar, Partha Pratim Roy, Debi Prosad Dogra, Byung-Gyu Kim

    Abstract: Sentiment analysis (SA) is an emerging field in text mining. It is the process of computationally identifying and categorizing opinions expressed in a piece of text over different social media platforms. Social media plays an essential role in knowing the customer mindset towards a product, services, and the latest market trends. Most organizations depend on the customer's response and feedback to…

    Submitted 19 November, 2023; originally announced November 2023.

  11. arXiv:2310.19961  [pdf, other]

    cs.LG cs.AI

    ExPT: Synthetic Pretraining for Few-Shot Experimental Design

    Authors: Tung Nguyen, Sudhanshu Agrawal, Aditya Grover

    Abstract: Experimental design is a fundamental problem in many science and engineering fields. In this problem, sample efficiency is crucial due to the time, money, and safety costs of real-world design evaluations. Existing approaches either rely on active data collection or access to large, labeled datasets of past experiments, making them impractical in many real-world scenarios. In this work, we address…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 2023 Conference on Neural Information Processing Systems (NeurIPS)

  12. arXiv:2310.00594  [pdf]

    cs.CR

    Performance evaluation of Machine learning algorithms for Intrusion Detection System

    Authors: Sudhanshu Sekhar Tripathy, Bichitrananda Behera

    Abstract: The escalation of hazards to safety and hijacking of digital networks are among the strongest perilous difficulties that must be addressed in the present day. Numerous safety procedures were set up to track and recognize any illicit activity on the network's infrastructure. IDS are the best way to resist and recognize intrusions on internet connections and digital technologies. To classify network…

    Submitted 1 October, 2023; originally announced October 2023.

  13. arXiv:2308.06448  [pdf, other]

    cs.LG cs.SI

    Latent Random Steps as Relaxations of Max-Cut, Min-Cut, and More

    Authors: Sudhanshu Chanpuriya, Cameron Musco

    Abstract: Algorithms for node clustering typically focus on finding homophilous structure in graphs. That is, they find sets of similar nodes with many edges within, rather than across, the clusters. However, graphs often also exhibit heterophilous structure, as exemplified by (nearly) bipartite and tripartite graphs, where most edges occur across the clusters. Grappling with such structure is typically lef…

    Submitted 11 August, 2023; originally announced August 2023.

  14. arXiv:2308.03277  [pdf, other]

    cs.CL

    From Ambiguity to Explicitness: NLP-Assisted 5G Specification Abstraction for Formal Analysis

    Authors: Shiyu Yuan, Jingda Yang, Sudhanshu Arya, Carlo Lipizzi, Ying Wang

    Abstract: Formal method-based analysis of the 5G Wireless Communication Protocol is crucial for identifying logical vulnerabilities and facilitating an all-encompassing security assessment, especially in the design phase. Natural Language Processing (NLP) assisted techniques and most of the tools are not widely adopted by the industry and research community. Traditional formal verification through a mathema…

    Submitted 6 August, 2023; originally announced August 2023.

  15. arXiv:2307.11247  [pdf, other]

    cs.CR

    Formal-Guided Fuzz Testing: Targeting Security Assurance from Specification to Implementation for 5G and Beyond

    Authors: Jingda Yang, Sudhanshu Arya, Ying Wang

    Abstract: Softwarization and virtualization in 5G and beyond necessitate thorough testing to ensure the security of critical infrastructure and networks, requiring the identification of vulnerabilities and unintended emergent behaviors from protocol designs to their software stack implementation. To provide an efficient and comprehensive solution, we propose a novel and first-of-its-kind approach that conne…

    Submitted 20 July, 2023; originally announced July 2023.

  16. arXiv:2305.02451  [pdf, other]

    cs.IT eess.SP

    Ground-to-UAV Integrated Network: Low Latency Communication over Interference Channel

    Authors: Sudhanshu Arya, Ying Wang

    Abstract: We present a novel and first-of-its-kind information-theoretic framework for the key design consideration and implementation of a ground-to-UAV (G2U) communication network to minimize end-to-end transmission delay in the presence of interference. The proposed framework is useful as it describes the minimum transmission latency for an uplink ground-to-UAV communication must satisfy while achieving…

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: 12 pages, 11 Figures

  17. arXiv:2302.06227  [pdf, other]

    eess.AS cs.SD

    Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

    Authors: Sudhanshu Srivastava, Ishika Gupta, Anusha Prakash, Jom Kuriakose, Hema A. Murthy

    Abstract: Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles along with fast training and synthesis while being computationally less intense. HTS performs well even in low-resource scenarios. The primary drawback is that the voice quality is poor compared to that of E2E systems. A hybrid approach combining HMM-based feature generation and neural-network-based HiFi-GAN…

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 5 pages, 5 figures

  18. arXiv:2302.04075  [pdf, other]

    cs.CV

    Best Practices in Active Learning for Semantic Segmentation

    Authors: Sudhanshu Mittal, Joshua Niemeijer, Jörg P. Schäfer, Thomas Brox

    Abstract: Active learning is particularly of interest for semantic segmentation, where annotations are costly. Previous academic studies focused on datasets that are already very diverse and where the model is trained in a supervised manner with a large annotation budget. In contrast, data collected in many driving scenarios is highly redundant, and most medical applications are subject to very constrained…

    Submitted 15 March, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  19. arXiv:2211.12914  [pdf, other]

    cs.CV cs.LG

    Open-vocabulary Attribute Detection

    Authors: María A. Bravo, Sudhanshu Mittal, Simon Ging, Thomas Brox

    Abstract: Vision-language modeling has enabled open-vocabulary tasks where predictions can be queried using any text prompt in a zero-shot manner. Existing open-vocabulary tasks focus on object classes, whereas research on object attributes is limited due to the lack of a reliable attribute-focused evaluation benchmark. This paper introduces the Open-Vocabulary Attribute Detection (OVAD) task and the corres…

    Submitted 8 March, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted at CVPR 2023. https://ovad-benchmark.github.io

  20. arXiv:2211.01338  [pdf, other]

    eess.AS cs.CL cs.MM cs.SD eess.IV

    Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

    Authors: Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, et al. (2 additional authors not shown)

    Abstract: Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video. This task becomes challenging when the source and target languages…

    Submitted 1 November, 2022; originally announced November 2022.

  21. arXiv:2210.14380  [pdf, other]

    cs.CL

    Progressive Sentiment Analysis for Code-Switched Text Data

    Authors: Sudhanshu Ranjan, Dheeraj Mekala, Jingbo Shang

    Abstract: Multilingual transformer language models have recently attracted much attention from researchers and are used in cross-lingual transfer learning for many NLP tasks such as text classification and named entity recognition. However, similar methods for transfer learning from monolingual text to code-switched text have not been extensively explored mainly due to the following challenges: (1) Code-swi…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: To appear in Findings of EMNLP 2022

  22. arXiv:2210.00032  [pdf, other]

    cs.LG cs.SI

    Direct Embedding of Temporal Network Edges via Time-Decayed Line Graphs

    Authors: Sudhanshu Chanpuriya, Ryan A. Rossi, Sungchul Kim, Tong Yu, Jane Hoffswell, Nedim Lipka, Shunan Guo, Cameron Musco

    Abstract: Temporal networks model a variety of important phenomena involving timed interactions between entities. Existing methods for machine learning on temporal networks generally exhibit at least one of two limitations. First, time is assumed to be discretized, so if the time data is continuous, the user must determine the discretization and discard precise time information. Second, edge representations…

    Submitted 30 September, 2022; originally announced October 2022.

  23. arXiv:2207.11318  [pdf, other]

    stat.ME cs.GR

    Fiber Uncertainty Visualization for Bivariate Data With Parametric and Nonparametric Noise Models

    Authors: Tushar M. Athawale, Chris R. Johnson, Sudhanshu Sane, David Pugmire

    Abstract: Visualization and analysis of multivariate data and their uncertainty are top research challenges in data visualization. Constructing fiber surfaces is a popular technique for multivariate data visualization that generalizes the idea of level-set visualization for univariate data to multivariate data. In this paper, we present a statistical framework to quantify positional probabilities of fibers…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: 9 pages paper + 2 page references, 10 figures, IEEE VIS 2022 paper to be published as a special issue of IEEE Transactions on Visualization and Computer Graphics (TVCG)

  24. arXiv:2205.06160  [pdf, other]

    cs.CV cs.LG

    Localized Vision-Language Matching for Open-vocabulary Object Detection

    Authors: Maria A. Bravo, Sudhanshu Mittal, Thomas Brox

    Abstract: In this work, we propose an open-vocabulary object detection method that, based on image-caption pairs, learns to detect novel object classes along with a given set of known classes. It is a two-stage training approach that first uses a location-guided image-caption matching technique to learn class labels for both novel and known classes in a weakly-supervised manner and second specializes the mo…

    Submitted 28 July, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted at DAGM German Conference on Pattern Recognition (GCPR 2022)

  25. arXiv:2202.04139  [pdf, other]

    cs.LG cs.SI

    Simplified Graph Convolution with Heterophily

    Authors: Sudhanshu Chanpuriya, Cameron Musco

    Abstract: Recent work has shown that a simple, fast method called Simple Graph Convolution (SGC) (Wu et al., 2019), which eschews deep learning, is competitive with deep methods like graph convolutional networks (GCNs) (Kipf & Welling, 2017) in common graph machine learning benchmarks. The use of graph data in SGC implicitly assumes the common but not universal graph characteristic of homophily, wherein nod…

    Submitted 3 June, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  26. arXiv:2201.08440  [pdf, other]

    cs.DC cs.GR cs.PF

    A Guide to Particle Advection Performance

    Authors: Abhishek Yenpure, Sudhanshu Sane, Roba Binyahib, David Pugmire, Christoph Garth, Hank Childs

    Abstract: The performance of particle advection-based flow visualization techniques is complex, since computational work can vary based on many factors, including number of particles, duration, and mesh type. Further, while many approaches have been introduced to optimize performance, the efficacy of a given approach can be similarly complex. In this work, we seek to establish a guide for particle advection…

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 19 pages, survey paper submitted to TVCG (Transactions in Visualization and Computer Graphics)

  27. arXiv:2111.03030  [pdf, other]

    cs.LG cs.SI

    Exact Representation of Sparse Networks with Symmetric Nonnegative Embeddings

    Authors: Sudhanshu Chanpuriya, Ryan A. Rossi, Anup Rao, Tung Mai, Nedim Lipka, Zhao Song, Cameron Musco

    Abstract: Many models for undirected graphs are based on factorizing the graph's adjacency matrix; these models find a vector representation of each node such that the predicted probability of a link between two nodes increases with the similarity (dot product) of their associated vectors. Recent work has shown that these models are unable to capture key structures in real-world graphs, particularly heterop…

    Submitted 30 September, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

  28. arXiv:2111.00048  [pdf, other]

    cs.LG cs.SI

    On the Power of Edge Independent Graph Models

    Authors: Sudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos, Charalampos E. Tsourakakis

    Abstract: Why do many modern neural-network-based graph generative models fail to reproduce typical real-world network characteristics, such as high triangle density? In this work we study the limitations of edge independent random graph models, in which each edge is added to the graph independently with some probability. Such models include both the classic Erdös-Rényi and stochastic block models, as well…

    Submitted 29 October, 2021; originally announced November 2021.

  29. Exploratory Lagrangian-Based Particle Tracing Using Deep Learning

    Authors: Mengjiao Han, Sudhanshu Sane, Chris R. Johnson

    Abstract: Time-varying vector fields produced by computational fluid dynamics simulations are often prohibitively large and pose challenges for accurate interactive analysis and exploration. To address these challenges, reduced Lagrangian representations have been increasingly researched as a means to improve scientific time-varying vector field exploration capabilities. This paper presents a novel deep neu…

    Submitted 6 January, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: The paper has been accepted to publish by Journal of Flow Visualization and Image Processing

  30. arXiv:2108.03066  [pdf, other]

    cs.GR

    Uncertainty Visualization of the Marching Squares and Marching Cubes Topology Cases

    Authors: Tushar M. Athawale, Sudhanshu Sane, Chris R. Johnson

    Abstract: Marching squares (MS) and marching cubes (MC) are widely used algorithms for level-set visualization of scientific data. In this paper, we address the challenge of uncertainty visualization of the topology cases of the MS and MC algorithms for uncertain scalar field data sampled on a uniform grid. The visualization of the MS and MC topology cases for uncertain data is challenging due to their expo…

    Submitted 6 August, 2021; originally announced August 2021.

  31. Recommending best course of treatment based on similarities of prognostic markers

    Authors: Sudhanshu, Narinder Singh Punn, Sanjay Kumar Sonbhadra, Sonali Agarwal

    Abstract: With the advancement in the technology sector spanning over every field, a huge influx of information is inevitable. Among all the opportunities that the advancements in the technology have brought, one of them is to propose efficient solutions for data retrieval. This means that from an enormous pile of data, the retrieval methods should allow the users to fetch the relevant and recent data over…

    Submitted 19 July, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

  32. arXiv:2106.13232  [pdf, other]

    cs.NI eess.SP

    A Novel Compact Dual-Band Antenna Design for WLAN Applications

    Authors: Peshal B. Nayak, Ramu Endluri, Sudhanshu Verma, Preetam Kumar

    Abstract: A novel and compact dual band planar antenna for 2.4/5.2/5.8-GHz wireless local area network (WLAN) applications is proposed and studied in this paper. The antenna comprises of a T-shaped and a F-shaped element to generate two resonant modes for dual band operation. The two elements can independently control the operating frequencies of the two excited resonant modes. The T-element which is fed dir…

    Submitted 12 May, 2021; originally announced June 2021.

  33. arXiv:2106.12884  [pdf, other]

    cs.NI eess.SP

    A Novel Compact Tri-Band Antenna Design for WiMAX, WLAN and Bluetooth Applications

    Authors: Peshal Nayak, Sudhanshu Verma, Preetam Kumar

    Abstract: A novel and compact tri-band planar antenna for 2.4/5.2/5.8-GHz wireless local area network (WLAN), 2.3/3.5/5.5GHz Worldwide Interoperability for Microwave Access (WiMAX) and Bluetooth applications is proposed and studied in this paper. The antenna comprises of a L-shaped element which is coupled with a ground shorted parasitic resonator to generate three resonant modes for tri-band operation. The…

    Submitted 13 May, 2021; originally announced June 2021.

  34. arXiv:2102.09517  [pdf, other]

    cs.CV cs.LG

    Essentials for Class Incremental Learning

    Authors: Sudhanshu Mittal, Silvio Galesso, Thomas Brox

    Abstract: Contemporary neural networks are limited in their ability to learn from evolving streams of training data. When trained sequentially on new or evolving tasks, their accuracy drops sharply, making them unsuitable for many real-world applications. In this work, we shed light on the causes of this well-known yet unsolved phenomenon - often referred to as catastrophic forgetting - in a class-increment…

    Submitted 18 February, 2021; originally announced February 2021.

  35. arXiv:2102.08532  [pdf, other]

    cs.LG cs.SI

    DeepWalking Backwards: From Embeddings Back to Graphs

    Authors: Sudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos, Charalampos E. Tsourakakis

    Abstract: Low-dimensional node embeddings play a key role in analyzing graph datasets. However, little work studies exactly what information is encoded by popular embedding methods, and how this information correlates with performance in downstream machine learning tasks. We tackle this question by studying whether embeddings can be inverted to (approximately) recover the graph used to generate them. Focusi…

    Submitted 16 February, 2021; originally announced February 2021.

  36. arXiv:2102.00297  [pdf, other]

    cs.CV

    Deep Learning--Based Scene Simplification for Bionic Vision

    Authors: Nicole Han, Sudhanshu Srivastava, Aiwen Xu, Devi Klein, Michael Beyeler

    Abstract: Retinal degenerative diseases cause profound visual impairment in more than 10 million people worldwide, and retinal prostheses are being developed to restore vision to these individuals. Analogous to cochlear implants, these devices electrically stimulate surviving retinal cells to evoke visual percepts (phosphenes). However, the quality of current prosthetic vision is still rudimentary. Rather t…

    Submitted 30 January, 2021; originally announced February 2021.

    Comments: 10 pages, 8 figures, 3 tables

  37. arXiv:2101.11155  [pdf, other]

    cs.CL cs.AI cs.LG cs.SI

    Exploring multi-task multi-lingual learning of transformer models for hate speech and offensive speech identification in social media

    Authors: Sudhanshu Mishra, Shivangi Prasad, Shubhanshu Mishra

    Abstract: Hate Speech has become a major content moderation issue for online social media platforms. Given the volume and velocity of online content production, it is impossible to manually moderate hate speech related content on any platform. In this paper we utilize a multi-task and multi-lingual approach based on recently proposed Transformer Neural Networks to solve three sub-tasks for hate speech. Thes…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: "To be published in SN Computer Science at https://doi.org/10.1007/s42979-021-00455-5" "30 pages, 6 figures" "Code available at https://github.com/socialmediaie/MTML_HateSpeech"

    MSC Class: 68T50 68T50 (Primary); 68T07 (Secondary) ACM Class: I.2.7

  38. arXiv:2007.15619  [pdf, other]

    cs.CY cs.CL cs.LG

    AI-based Monitoring and Response System for Hospital Preparedness towards COVID-19 in Southeast Asia

    Authors: Tushar Goswamy, Naishadh Parmar, Ayush Gupta, Raunak Shah, Vatsalya Tandon, Varun Goyal, Sanyog Gupta, Karishma Laud, Shivam Gupta, Sudhanshu Mishra, Ashutosh Modi

    Abstract: This research paper proposes a COVID-19 monitoring and response system to identify the surge in the volume of patients at hospitals and shortage of critical equipment like ventilators in South-east Asian countries, to understand the burden on health facilities. This can help authorities in these regions with resource planning measures to redirect resources to the regions identified by the model. D…

    Submitted 5 September, 2022; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: 5 pages, 5 figures. Accepted to the ICML 2020 Workshop on Healthcare Systems, Population Health, and the Role of Health-Tech

  39. arXiv:2006.05592  [pdf, other]

    cs.LG cs.DS cs.SI stat.ML

    Node Embeddings and Exact Low-Rank Representations of Complex Networks

    Authors: Sudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos, Charalampos E. Tsourakakis

    Abstract: Low-dimensional embeddings, from classical spectral embeddings to modern neural-net-inspired methods, are a cornerstone in the modeling and analysis of complex networks. Recent work by Seshadhri et al. (PNAS 2020) suggests that such embeddings cannot capture local structure arising in complex networks. In particular, they show that any network generated from a natural low-dimensional model cannot…

    Submitted 16 October, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  40. arXiv:2006.00094  [pdf, other]

    cs.LG cs.SI stat.ML

    InfiniteWalk: Deep Network Embeddings as Laplacian Embeddings with a Nonlinearity

    Authors: Sudhanshu Chanpuriya, Cameron Musco

    Abstract: The skip-gram model for learning word embeddings (Mikolov et al. 2013) has been widely popular, and DeepWalk (Perozzi et al. 2014), among other methods, has extended the model to learning node representations from networks. Recent work of Qiu et al. (2018) provides a closed-form expression for the DeepWalk objective, obviating the need for sampling for small datasets and improving accuracy. In the…

    Submitted 17 August, 2020; v1 submitted 29 May, 2020; originally announced June 2020.

  41. arXiv:2004.02003  [pdf, other]

    cs.CE cs.DC physics.comp-ph

    Scalable In Situ Lagrangian Flow Map Extraction: Demonstrating the Viability of a Communication-Free Model

    Authors: Sudhanshu Sane, Abhishek Yenpure, Roxana Bujack, Matthew Larsen, Kenneth Moreland, Christoph Garth, Hank Childs

    Abstract: We introduce and evaluate a new algorithm for the in situ extraction of Lagrangian flow maps, which we call Boundary Termination Optimization (BTO). Our approach is a communication-free model, requiring no message passing or synchronization between processes, improving scalability, thereby reducing overall execution time and alleviating the encumbrance placed on simulation codes from in situ proce…

    Submitted 4 April, 2020; originally announced April 2020.

  42. arXiv:1912.05361  [pdf, other]

    cs.CV

    Parting with Illusions about Deep Active Learning

    Authors: Sudhanshu Mittal, Maxim Tatarchenko, Özgün Çiçek, Thomas Brox

    Abstract: Active learning aims to reduce the high labeling cost involved in training machine learning models on large datasets by efficiently labeling only the most informative samples. Recently, deep active learning has shown success on various tasks. However, the conventional evaluation scheme used for deep active learning is below par. Current methods disregard some apparent parallel work in the closely…

    Submitted 11 December, 2019; originally announced December 2019.

  43. arXiv:1909.13561  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    Imagine That! Leveraging Emergent Affordances for 3D Tool Synthesis

    Authors: Yizhe Wu, Sudhanshu Kasewa, Oliver Groth, Sasha Salter, Li Sun, Oiwi Parker Jones, Ingmar Posner

    Abstract: In this paper we explore the richness of information captured by the latent space of a vision-based generative model. The model combines unsupervised generative learning with a task-based performance predictor to learn and to exploit task-relevant object affordances given visual observations from a reaching task, involving a scenario and a stick-like tool. While the learned embedding of the genera…

    Submitted 7 October, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: 12 pages, 6 figures

    ACM Class: I.2.10; I.2.6

  44. arXiv:1908.05724  [pdf, other]

    cs.CV

    Semi-Supervised Semantic Segmentation with High- and Low-level Consistency

    Authors: Sudhanshu Mittal, Maxim Tatarchenko, Thomas Brox

    Abstract: The ability to understand visual information from limited labeled data is an important aspect of machine learning. While image-level classification has been extensively studied in a semi-supervised setting, dense pixel-level classification with limited data has only drawn attention recently. In this work, we propose an approach for semi-supervised semantic segmentation that learns from limited pix…

    Submitted 15 August, 2019; originally announced August 2019.

  45. Multi-Species Cuckoo Search Algorithm for Global Optimization

    Authors: Xin-She Yang, Suash Deb, Sudhanshu K Mishra

    Abstract: Many optimization problems in science and engineering are highly nonlinear, and thus require sophisticated optimization techniques to solve. Traditional techniques such as gradient-based algorithms are mostly local search methods, and often struggle to cope with such challenging optimization problems. Recent trends tend to use nature-inspired optimization algorithms. This work extends the standard…

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: 15 pages, 1 figure

    MSC Class: 90C26; 78M32

    Journal ref: Cognitive Computation, vol. 10, number 6, 1085-1095 (2018)

  46. arXiv:1901.08759  [pdf, other]

    cs.CL cs.CV eess.IV

    Misleading Metadata Detection on YouTube

    Authors: Priyank Palod, Ayush Patwari, Sudhanshu Bahety, Saurabh Bagchi, Pawan Goyal

    Abstract: YouTube is the leading social media platform for sharing videos. As a result, it is plagued with misleading content that includes staged videos presented as real footages from an incident, videos with misrepresented context and videos where audio/video content is morphed. We tackle the problem of detecting such misleading videos as a supervised classification task. We develop UCNet - a deep networ…

    Submitted 25 January, 2019; originally announced January 2019.

    Comments: Accepted at European Conference on Information Retrieval (ECIR) 2019. 7 pages

  47. arXiv:1811.10804  [pdf, other]

    cs.IR cs.SI

    Movie Recommendation System using Sentiment Analysis from Microblogging Data

    Authors: Sudhanshu Kumar, Shirsendu Sukanta Halder, Kanjar De, Partha Pratim Roy

    Abstract: Recommendation systems are important intelligent systems that play a vital role in providing selective information to users. Traditional approaches in recommendation systems include collaborative filtering and content-based filtering. However, these approaches have certain limitations like the necessity of prior user history and habits for performing the task of recommendation. In order to reduce…

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: 19 pages, 7 tables, 5 figures

  48. arXiv:1811.03516  [pdf, other]

    cs.LG stat.ML

    Learning from Demonstration in the Wild

    Authors: Feryal Behbahani, Kyriacos Shiarlis, Xi Chen, Vitaly Kurin, Sudhanshu Kasewa, Ciprian Stirbu, João Gomes, Supratik Paul, Frans A. Oliehoek, João Messias, Shimon Whiteson

    Abstract: Learning from demonstration (LfD) is useful in settings where hand-coding behaviour or a reward function is impractical. It has succeeded in a wide range of problems but typically relies on manually generated demonstrations or specially deployed sensors and has not generally been able to leverage the copious demonstrations available in the wild: those that capture behaviours that were occurring an…

    Submitted 25 March, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2019; extended version with appendix

  49. An Efficient Fault Tolerant Workflow Scheduling Approach using Replication Heuristics and Checkpointing in the Cloud

    Authors: S. Jaya Nirmala, Amrith Rajagopal Setlur, Har Simrat Singh, Sudhanshu Khoriya

    Abstract: Scientific workflows have been predominantly used for complex and large scale data analysis and scientific computation/automation and the need for robust workflow scheduling techniques has grown considerably. But, most of the existing workflow scheduling algorithms do not provide the required reliability and robustness. In this paper, a new fault tolerant workflow scheduling algorithm that learns…

    Submitted 31 October, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: 35 pages, 9 figures

    Journal ref: Journal of Parallel and Distributed Computing 2020

  50. arXiv:1810.00668  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection

    Authors: Sudhanshu Kasewa, Pontus Stenetorp, Sebastian Riedel

    Abstract: Grammatical error correction, like other machine learning tasks, greatly benefits from large quantities of high quality training data, which is typically expensive to produce. While writing a program to automatically generate realistic grammatical errors would be difficult, one could learn the distribution of naturally occurring errors and attempt to introduce them into other datasets. Initial work…

    Submitted 26 September, 2018; originally announced October 2018.

    Comments: Accepted as a short paper at EMNLP 2018