Skip to main content

Showing 1–25 of 25 results for author: Ashima

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04779  [pdf, other

    cs.LG cs.AI cs.NI

    Mobile Network Configuration Recommendation using Deep Generative Graph Neural Network

    Authors: Shirwan Piroti, Ashima Chawla, Tahar Zanouda

    Abstract: There are vast number of configurable parameters in a Radio Access Telecom Network. A significant amount of these parameters is configured by Radio Node or cell based on their deployment setting. Traditional methods rely on domain knowledge for individual parameter configuration, often leading to sub-optimal results. To improve this, a framework using a Deep Generative Graph Neural Network (GNN) i… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 4 pages, 4 figures

  2. arXiv:2404.02456  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    PhonologyBench: Evaluating Phonological Skills of Large Language Models

    Authors: Ashima Suvarna, Harshita Khandelwal, Nanyun Peng

    Abstract: Phonology, the study of speech's structure and pronunciation rules, is a critical yet often overlooked component in Large Language Model (LLM) research. LLMs are widely used in various downstream applications that leverage phonology such as educational tools and poetry generation. Moreover, LLMs can potentially learn imperfect associations between orthographic and phonological forms from the train… ▽ More

    Submitted 5 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 17 pages, 7 figures, 6 tables

  3. arXiv:2404.01030  [pdf, ps, other

    cs.CV cs.AI cs.CY

    Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation

    Authors: Yixin Wan, Arjun Subramonian, Anaelia Ovalle, Zongyu Lin, Ashima Suvarna, Christina Chance, Hritik Bansal, Rebecca Pattichis, Kai-Wei Chang

    Abstract: The recent advancement of large and powerful models with Text-to-Image (T2I) generation abilities -- such as OpenAI's DALLE-3 and Google's Gemini -- enables users to generate high-quality images from textual prompts. However, it has become increasingly evident that even simple prompts could cause T2I models to exhibit conspicuous social bias in generated images. Such bias might lead to both alloca… ▽ More

    Submitted 1 May, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  4. arXiv:2404.00530  [pdf, other

    cs.CL cs.AI cs.LG

    Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization

    Authors: Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, Aditya Grover

    Abstract: A common technique for aligning large language models (LLMs) relies on acquiring human preferences by comparing multiple generations conditioned on a fixed context. This only leverages the pairwise comparisons when the generations are placed in an identical context. However, such conditional rankings often fail to capture the complex and multidimensional aspects of human preferences. In this work,… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 25 pages, 14 figures, 5 tables

  5. arXiv:2403.02586  [pdf, other

    cs.CL

    Improving Event Definition Following For Zero-Shot Event Detection

    Authors: Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng

    Abstract: Existing approaches on zero-shot event detection usually train models on datasets annotated with known event types, and prompt them with unseen event definitions. These approaches yield sporadic successes, yet generally fall short of expectations. In this work, we aim to improve zero-shot event detection by training models to better follow event definitions. We hypothesize that a diverse set of ev… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2310.05765  [pdf, other

    cs.CE cs.RO

    Examining the simulation-to-reality gap of a wheel loader digging in deformable terrain

    Authors: Koji Aoshima, Martin Servin

    Abstract: We investigate how well a physics-based simulator can replicate a real wheel loader performing bucket filling in a pile of soil. The comparison is made using field test time series of the vehicle motion and actuation forces, loaded mass, and total work. The vehicle was modeled as a rigid multibody system with frictional contacts, driveline, and linear actuators. For the soil, we tested discrete el… ▽ More

    Submitted 27 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 22 pages, 15 figures

  7. arXiv:2309.12016  [pdf, other

    cs.RO

    World Modeling for Autonomous Wheel Loaders

    Authors: Koji Aoshima, Arvid Fälldin, Eddie Wadbro, Martin Servin

    Abstract: This paper presents a method for learning world models for wheel loaders performing automatic loading actions on a pile of soil. Data-driven models were learned to output the resulting pile state, loaded mass, time, and work for a single loading cycle given inputs that include a heightmap of the initial pile shape and action parameters for an automatic bucket-filling controller. Long-horizon plann… ▽ More

    Submitted 28 May, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 23 pages, 19 figures

  8. arXiv:2211.14573  [pdf, other

    cs.CV cs.LG

    Deep Curvilinear Editing: Commutative and Nonlinear Image Manipulation for Pretrained Deep Generative Model

    Authors: Takehiro Aoshima, Takashi Matsubara

    Abstract: Semantic editing of images is the fundamental goal of computer vision. Although deep learning methods, such as generative adversarial networks (GANs), are capable of producing high-quality images, they often do not have an inherent way of editing generated images semantically. Recent studies have investigated a way of manipulating the latent variable to determine the images to be generated. Howeve… ▽ More

    Submitted 29 August, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: 15 pages. The last update made no changes except for adding the following link to the CVF repository: https://openaccess.thecvf.com/content/CVPR2023/html/Aoshima_Deep_Curvilinear_Editing_Commutative_and_Nonlinear_Image_Manipulation_for_Pretrained_CVPR_2023_paper.html. Here, you can find our code to reproduce our results

    Journal ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (CVPR2023)

  9. arXiv:2207.12646  [pdf, other

    cs.CV

    Learning Hierarchy Aware Features for Reducing Mistake Severity

    Authors: Ashima Garg, Depanshu Sani, Saket Anand

    Abstract: Label hierarchies are often available apriori as part of biological taxonomy or language datasets WordNet. Several works exploit these to learn hierarchy aware features in order to improve the classifier to make semantically meaningful mistakes while maintaining or reducing the overall error. In this paper, we propose a novel approach for learning Hierarchy Aware Features (HAF) that leverages clas… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: 21 pages, 7 figures, Accepted in ECCV 2022

  10. arXiv:2206.07331  [pdf

    cs.MM

    ETMA: Efficient Transformer Based Multilevel Attention framework for Multimodal Fake News Detection

    Authors: Ashima Yadav, Shivani Gaba, Haneef Khan, Ishan Budhiraja, Akansha Singh, Krishan Kant Singh

    Abstract: In this new digital era, social media has created a severe impact on the lives of people. In recent times, fake news content on social media has become one of the major challenging problems for society. The dissemination of fabricated and false news articles includes multimodal data in the form of text and images. The previous methods have mainly focused on unimodal analysis. Moreover, for multimo… ▽ More

    Submitted 13 March, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in IEEE Transactions on Computational Social Systems

  11. arXiv:2111.00164  [pdf, other

    cs.CV

    HIERMATCH: Leveraging Label Hierarchies for Improving Semi-Supervised Learning

    Authors: Ashima Garg, Shaurya Bagga, Yashvardhan Singh, Saket Anand

    Abstract: Semi-supervised learning approaches have emerged as an active area of research to combat the challenge of obtaining large amounts of annotated data. Towards the goal of improving the performance of semi-supervised learning methods, we propose a novel framework, HIERMATCH, a semi-supervised approach that leverages hierarchical information to reduce labeling costs and performs as well as a vanilla s… ▽ More

    Submitted 21 December, 2021; v1 submitted 29 October, 2021; originally announced November 2021.

    Comments: 11 pages, 1 figure, Accepted in WACV 2022

  12. arXiv:2107.14615  [pdf, other

    cs.CE

    Simulation-Based Optimization of High-Performance Wheel Loading

    Authors: Koji Aoshima, Martin Servin, Eddie Wadbro

    Abstract: Having smart and autonomous earthmoving in mind, we explore high-performance wheel loading in a simulated environment. This paper introduces a wheel loader simulator that combines contacting 3D multibody dynamics with a hybrid continuum-particle terrain model, supporting realistic digging forces and soil displacements at real-time performance. A total of 270,000 simulations are run with different… ▽ More

    Submitted 29 September, 2021; v1 submitted 30 July, 2021; originally announced July 2021.

    Comments: 8 pages, 9 figures, 3 tables

  13. arXiv:2107.07842  [pdf, other

    cs.IR cs.AI

    A Survey of Knowledge Graph Embedding and Their Applications

    Authors: Shivani Choudhary, Tarun Luthra, Ashima Mittal, Rajat Singh

    Abstract: Knowledge Graph embedding provides a versatile technique for representing knowledge. These techniques can be used in a variety of applications such as completion of knowledge graph to predict missing information, recommender systems, question answering, query expansion, etc. The information embedded in Knowledge graph though being structured is challenging to consume in a real-world application. K… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 11 pages, 9 figures

  14. arXiv:2104.01131  [pdf

    cs.CL cs.SI

    Mining Trends of COVID-19 Vaccine Beliefs on Twitter with Lexical Embeddings

    Authors: Harshita Chopra, Aniket Vashishtha, Ridam Pal, Ashima, Ananya Tyagi, Tavpritesh Sethi

    Abstract: Social media plays a pivotal role in disseminating news globally and acts as a platform for people to express their opinions on various topics. A wide variety of views accompanies COVID-19 vaccination drives across the globe, often colored by emotions, which change along with rising cases, approval of vaccines, and multiple factors discussed online. This study aims at analyzing the temporal evolut… ▽ More

    Submitted 20 July, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

  15. A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

    Authors: Ashima Yadav, Dinesh Kumar Vishwakarma

    Abstract: Multimodal sentiment analysis has attracted increasing attention with broad application prospects. The existing methods focuses on single modality, which fails to capture the social media content for multiple modalities. Moreover, in multi-modal learning, most of the works have focused on simply combining the two modalities, without exploring the complicated correlations between them. This resulte… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: 11 pages, 7 figures

    Journal ref: ACM Transactions on Multimedia Computing, Communications, and Applications, 2022

  16. arXiv:2011.10358  [pdf

    cs.CL cs.IR

    A Deep Language-independent Network to analyze the impact of COVID-19 on the World via Sentiment Analysis

    Authors: Ashima Yadav, Dinesh Kumar Vishwakarma

    Abstract: Towards the end of 2019, Wuhan experienced an outbreak of novel coronavirus, which soon spread all over the world, resulting in a deadly pandemic that infected millions of people around the globe. The government and public health agencies followed many strategies to counter the fatal virus. However, the virus severely affected the social and economic lives of the people. In this paper, we extract… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  17. arXiv:1910.05339  [pdf, other

    cs.DC cs.SE eess.SY

    DeCaf: Diagnosing and Triaging Performance Issues in Large-Scale Cloud Services

    Authors: Chetan Bansal, Sundararajan Renganathan, Ashima Asudani, Olivier Midy, Mathru Janakiraman

    Abstract: Large scale cloud services use Key Performance Indicators (KPIs) for tracking and monitoring performance. They usually have Service Level Objectives (SLOs) baked into the customer agreements which are tied to these KPIs. Dependency failures, code bugs, infrastructure failures, and other problems can cause performance regressions. It is critical to minimize the time and manual effort in diagnosing… ▽ More

    Submitted 2 February, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: To be published in the proceedings of ICSE-SEIP '20, Seoul, Republic of Korea

  18. arXiv:1808.03812  [pdf, ps, other

    cs.RO

    Swarm Robots Inspired by Friendship Formation Process

    Authors: Takeshi Kano, Naoki matsui, Eiichi Naito, Takenobu Aoshima, Akio Ishiguro

    Abstract: Swarm robotic systems are systems in which multiple robots having simple functionality perform tasks through their cooperation, and are advantageous in that they can exhibit non-trivial macroscopic functions such as adaptability, fault tolerance, and scalability. We previously proposed a simple model of swarm formation inspired by friendship formation process in human society, and demonstrated via… ▽ More

    Submitted 11 August, 2018; originally announced August 2018.

    Comments: 9 pages, 8 figures

  19. arXiv:1807.10615  [pdf, other

    cs.CL cs.AI

    Judging a Book by its Description : Analyzing Gender Stereotypes in the Man Bookers Prize Winning Fiction

    Authors: Nishtha Madaan, Sameep Mehta, Shravika Mittal, Ashima Suvarna

    Abstract: The presence of gender stereotypes in many aspects of society is a well-known phenomenon. In this paper, we focus on studying and quantifying such stereotypes and bias in the Man Bookers Prize winning fiction. We consider 275 books shortlisted for Man Bookers Prize between 1969 and 2017. The gender bias is analyzed by semantic modeling of book descriptions on Goodreads. This reveals the pervasiven… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1710.04117

  20. arXiv:1802.03938  [pdf, ps, other

    stat.ML cs.LG

    Revisiting the Vector Space Model: Sparse Weighted Nearest-Neighbor Method for Extreme Multi-Label Classification

    Authors: Tatsuhiro Aoshima, Kei Kobayashi, Mihoko Minami

    Abstract: Machine learning has played an important role in information retrieval (IR) in recent times. In search engines, for example, query keywords are accepted and documents are returned in order of relevance to the given query; this can be cast as a multi-label ranking problem in machine learning. Generally, the number of candidate documents is extremely large (from several thousand to several million);… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

  21. arXiv:1706.09597  [pdf, other

    cs.AI eess.SY

    Path Integral Networks: End-to-End Differentiable Optimal Control

    Authors: Masashi Okada, Luca Rigazio, Takenobu Aoshima

    Abstract: In this paper, we introduce Path Integral Networks (PI-Net), a recurrent network representation of the Path Integral optimal control algorithm. The network includes both system dynamics and cost models, used for optimal control based planning. PI-Net is fully differentiable, learning both dynamics and cost models end-to-end by back-propagation and stochastic gradient descent. Because of this, PI-N… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

  22. arXiv:1702.08019  [pdf, ps, other

    stat.ML cs.LG

    Support vector machine and its bias correction in high-dimension, low-sample-size settings

    Authors: Yugo Nakayama, Kazuyoshi Yata, Makoto Aoshima

    Abstract: In this paper, we consider asymptotic properties of the support vector machine (SVM) in high-dimension, low-sample-size (HDLSS) settings. We show that the hard-margin linear SVM holds a consistency property in which misclassification rates tend to zero as the dimension goes to infinity under certain severe conditions. We show that the SVM is very biased in HDLSS settings and its performance is aff… ▽ More

    Submitted 26 February, 2017; originally announced February 2017.

    Comments: 23 pages, 3 figures

    MSC Class: 62H30; 62G20

  23. arXiv:1510.07795  [pdf

    cs.NI

    Improvised Broadcast Algorithm for Wireless Networks

    Authors: Ashima Goel, Debasis Das

    Abstract: Broadcasting problem is an important issue in the wireless networks, especially in dynamic wireless networks. In dynamic wireless networks the node density and mobility is high, due to several problems which arise during broadcasting. Two major problems faced are namely, Broadcast Storm Problem and Disconnected network problem. In a highly dense network, if information is being flooded in a loop,… ▽ More

    Submitted 27 October, 2015; originally announced October 2015.

    Comments: 4 pages

    Journal ref: International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO) - 2015

  24. arXiv:1408.4899  [pdf

    cs.SE

    Software Cloning in Extreme Programming Environment

    Authors: Ginika Mahajan, Ashima

    Abstract: Software systems are evolving by adding new functions and modifying existing functions over time. Through the evolution, the structure of software is becoming more complex and so the understandability and maintainability of software systems is deteriorating day by day. These are not only important but one of the most expensive activities in software development. Refactoring has often been applied… ▽ More

    Submitted 21 August, 2014; originally announced August 2014.

    Comments: 14 pages

    Journal ref: International Journal of Research in Engineering & Applied Sciences, VOl 2, Issue 2,2012

  25. arXiv:1110.3014  [pdf, ps, other

    cs.DM

    On the Existence of Hamiltonian Paths for History Based Pivot Rules on Acyclic Unique Sink Orientations of Hypercubes

    Authors: Yoshikazu Aoshima, David Avis, Theresa Deering, Yoshitake Matsumoto, Sonoko Moriyama

    Abstract: An acyclic USO on a hypercube is formed by directing its edges in such as way that the digraph is acyclic and each face of the hypercube has a unique sink and a unique source. A path to the global sink of an acyclic USO can be modeled as pivoting in a unit hypercube of the same dimension with an abstract objective function, and vice versa. In such a way, Zadeh's 'least entered rule' and other hist… ▽ More

    Submitted 24 May, 2012; v1 submitted 13 October, 2011; originally announced October 2011.