Skip to main content

Showing 1–50 of 223 results for author: Goel, A

.
  1. arXiv:2407.02420  [pdf

    astro-ph.EP astro-ph.IM physics.geo-ph

    Geophysical Observations of the 24 September 2023 OSIRIS-REx Sample Return Capsule Re-Entry

    Authors: Elizabeth A. Silber, Daniel C. Bowman, Chris G. Carr, David P. Eisenberg, Brian R. Elbing, Benjamin Fernando, Milton A. Garcés, Robert Haaser, Siddharth Krishnamoorthy, Charles A. Langston, Yasuhiro Nishikawa, Jeremy Webster, Jacob F. Anderson, Stephen Arrowsmith, Sonia Bazargan, Luke Beardslee, Brant Beck, Jordan W. Bishop, Philip Blom, Grant Bracht, David L. Chichester, Anthony Christe, Kenneth Cummins, James Cutts, Lisa Danielson , et al. (57 additional authors not shown)

    Abstract: Sample Return Capsules (SRCs) entering Earth's atmosphere at hypervelocity from interplanetary space are a valuable resource for studying meteor phenomena. The 24 September 2023 arrival of the OSIRIS-REx (Origins, Spectral Interpretation, Resource Identification, and Security-Regolith Explorer) SRC provided an unprecedented chance for geophysical observations of a well-characterized source with kn… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 87 pages, 14 figures

  2. arXiv:2407.01757  [pdf, other

    astro-ph.EP astro-ph.IM cs.MA physics.ao-ph physics.geo-ph

    Distributed Instruments for Planetary Surface Science: Scientific Opportunities and Technology Feasibility

    Authors: Federico Rossi, Robert C. Anderson, Saptarshi Bandyopadhyay, Erik Brandon, Ashish Goel, Joshua Vander Hook, Michael Mischna, Michaela Villarreal, Mark Wronkiewicz

    Abstract: In this paper, we assess the scientific promise and technology feasibility of distributed instruments for planetary science. A distributed instrument is an instrument designed to collect spatially and temporally correlated data from multiple networked, geographically distributed point sensors. Distributed instruments are ubiquitous in Earth science, where they are routinely employed for weather an… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.08931  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Exploring Multilingual Unseen Speaker Emotion Recognition: Leveraging Co-Attention Cues in Multitask Learning

    Authors: Arnav Goel, Medha Hira, Anubha Gupta

    Abstract: Advent of modern deep learning techniques has given rise to advancements in the field of Speech Emotion Recognition (SER). However, most systems prevalent in the field fail to generalize to speakers not seen during training. This study focuses on handling challenges of multilingual SER, specifically on unseen speakers. We introduce CAMuLeNet, a novel architecture leveraging co-attention based fusi… ▽ More

    Submitted 19 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 5 pages, Accepted to INTERSPEECH 2024. The first two authors contributed equally

  4. arXiv:2406.00022  [pdf, other

    cs.CL cs.SD eess.AS

    Multilingual Prosody Transfer: Comparing Supervised & Transfer Learning

    Authors: Arnav Goel, Medha Hira, Anubha Gupta

    Abstract: The field of prosody transfer in speech synthesis systems is rapidly advancing. This research is focused on evaluating learning methods for adapting pre-trained monolingual text-to-speech (TTS) models to multilingual conditions, i.e., Supervised Fine-Tuning (SFT) and Transfer Learning (TL). This comparison utilizes three distinct metrics: Mean Opinion Score (MOS), Recognition Accuracy (RA), and Me… ▽ More

    Submitted 18 June, 2024; v1 submitted 23 May, 2024; originally announced June 2024.

    Comments: 7 pages, Accepted to ICLR 2024 - Tiny Track

  5. arXiv:2406.00021  [pdf, other

    cs.CL cs.SD eess.AS

    CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning

    Authors: Medha Hira, Arnav Goel, Anubha Gupta

    Abstract: This paper presents CrossVoice, a novel cascade-based Speech-to-Speech Translation (S2ST) system employing advanced ASR, MT, and TTS technologies with cross-lingual prosody preservation through transfer learning. We conducted comprehensive experiments comparing CrossVoice with direct-S2ST systems, showing improved BLEU scores on tasks such as Fisher Es-En, VoxPopuli Fr-En and prosody preservation… ▽ More

    Submitted 18 June, 2024; v1 submitted 23 May, 2024; originally announced June 2024.

    Comments: 8 pages, Accepted at ICLR 2024 - Tiny Track

  6. arXiv:2405.20917  [pdf, other

    cs.CL cs.LG cs.LO

    Learning to Estimate System Specifications in Linear Temporal Logic using Transformers and Mamba

    Authors: İlker Işık, Ebru Aydin Gol, Ramazan Gokberk Cinbis

    Abstract: Temporal logic is a framework for representing and reasoning about propositions that evolve over time. It is commonly used for specifying requirements in various domains, including hardware and software systems, as well as robotics. Specification mining or formula generation involves extracting temporal logic formulae from system traces and has numerous applications, such as detecting bugs and imp… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 20 pages, 15 figures

  7. arXiv:2405.19631  [pdf, other

    cs.AI

    Leveraging Open-Source Large Language Models for encoding Social Determinants of Health using an Intelligent Router

    Authors: Akul Goel, Surya Narayanan Hari, Belinda Waltman, Matt Thomson

    Abstract: Social Determinants of Health (SDOH) play a significant role in patient health outcomes. The Center of Disease Control (CDC) introduced a subset of ICD-10 codes called Z-codes in an attempt to officially recognize and measure SDOH in the health care system. However, these codes are rarely annotated in a patient's Electronic Health Record (EHR), and instead, in many cases, need to be inferred from… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  8. arXiv:2405.16355  [pdf, other

    cs.HC cs.AI

    Navigating AI Fallibility: Examining People's Reactions and Perceptions of AI after Encountering Personality Misrepresentations

    Authors: Qiaosi Wang, Chidimma L. Anyi, Vedant Das Swain, Ashok K. Goel

    Abstract: Many hyper-personalized AI systems profile people's characteristics (e.g., personality traits) to provide personalized recommendations. These systems are increasingly used to facilitate interactions among people, such as providing teammate recommendations. Despite improved accuracy, such systems are not immune to errors when making inferences about people's most personal traits. These errors manif… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 37 pages, 11 figures

    ACM Class: I.2.0

  9. arXiv:2405.11775  [pdf, other

    cs.CL cs.LG

    Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques

    Authors: Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

    Abstract: Ordinal Classification (OC) is a widely encountered challenge in Natural Language Processing (NLP), with applications in various domains such as sentiment analysis, rating prediction, and more. Previous approaches to tackle OC have primarily focused on modifying existing or creating novel loss functions that \textbf{explicitly} account for the ordinal nature of labels. However, with the advent of… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Findings of ACL 2024

  10. arXiv:2405.11070  [pdf, other

    cs.AI cs.CL cs.LG

    Jill Watson: A Virtual Teaching Assistant powered by ChatGPT

    Authors: Karan Taneja, Pratyusha Maiti, Sandeep Kakar, Pranav Guruprasad, Sanjeev Rao, Ashok K. Goel

    Abstract: Conversational AI agents often require extensive datasets for training that are not publicly released, are limited to social chit-chat or handling a specific domain, and may not be easily extended to accommodate the latest advances in AI technologies. This paper introduces Jill Watson, a conversational Virtual Teaching Assistant (VTA) leveraging the capabilities of ChatGPT. Jill Watson based on Ch… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  11. arXiv:2405.05572  [pdf, other

    cs.CL cs.AI

    From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences

    Authors: Prashant Kodali, Anmol Goel, Likhith Asapu, Vamshi Krishna Bonagiri, Anirudh Govil, Monojit Choudhury, Manish Shrivastava, Ponnurangam Kumaraguru

    Abstract: Current computational approaches for analysing or generating code-mixed sentences do not explicitly model "naturalness" or "acceptability" of code-mixed sentences, but rely on training corpora to reflect distribution of acceptable code-mixed sentences. Modelling human judgement for the acceptability of code-mixed text can help in distinguishing natural code-mixed text and enable quality-controlled… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  12. arXiv:2405.03162  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Advancing Multimodal Medical Capabilities of Gemini

    Authors: Lin Yang, Shawn Xu, Andrew Sellergren, Timo Kohlberger, Yuchen Zhou, Ira Ktena, Atilla Kiraly, Faruk Ahmed, Farhad Hormozdiari, Tiam Jaroensri, Eric Wang, Ellery Wulczyn, Fayaz Jamil, Theo Guidroz, Chuck Lau, Siyuan Qiao, Yun Liu, Akshay Goel, Kendall Park, Arnav Agharwal, Nick George, Yang Wang, Ryutaro Tanno, David G. T. Barrett, Wei-Hung Weng , et al. (22 additional authors not shown)

    Abstract: Many clinical tasks require an understanding of specialized data, such as medical images and genomics, which is not typically found in general-purpose large multimodal models. Building upon Gemini's multimodal models, we develop several models within the new Med-Gemini family that inherit core capabilities of Gemini and are optimized for medical use via fine-tuning with 2D and 3D radiology, histop… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  13. arXiv:2404.07616  [pdf, other

    cs.CL cs.SD eess.AS

    Audio Dialogues: Dialogues dataset for audio and music understanding

    Authors: Arushi Goel, Zhifeng Kong, Rafael Valle, Bryan Catanzaro

    Abstract: Existing datasets for audio understanding primarily focus on single-turn interactions (i.e. audio captioning, audio question answering) for describing audio in natural language, thus limiting understanding audio via interactive dialogue. To address this gap, we introduce Audio Dialogues: a multi-turn dialogue dataset containing 163.8k samples for general audio sounds and music. In addition to dial… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Demo website: https://audiodialogues.github.io/

  14. Optimal Policy Synthesis from A Sequence of Goal Sets with An Application to Electric Distribution System Restoration

    Authors: İlker Işık, Onur Yigit Arpali, Ebru Aydin Gol

    Abstract: Motivated by the post-disaster distribution system restoration problem, in this paper, we study the problem of synthesizing the optimal policy for a Markov Decision Process (MDP) from a sequence of goal sets. For each goal set, our aim is to both maximize the probability to reach and minimize the expected time to reach the goal set. The order of the goal sets represents their priority. In particul… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 7th ADHS 2021 Conference Paper

    Journal ref: IFAC-PapersOnLine Volume 54, Issue 5, 2021, Pages 271-276

  15. Field Teams Coordination for Earthquake-Damaged Distribution System Energization

    Authors: İlker Işık, Ebru Aydin Gol

    Abstract: The re-energization of electrical distribution systems in a post-disaster scenario is of grave importance as most modern infrastructure systems rely heavily on the presence of electricity. This paper introduces a method to coordinate the field teams for the optimal energization of an electrical distribution system after an earthquake-induced blackout. The proposed method utilizes a Markov Decision… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted manuscript, published in Reliability Engineering & System Safety

    Journal ref: Reliability Engineering & System Safety Volume 245, May 2024, 110050

  16. arXiv:2403.18333  [pdf, other

    hep-th cond-mat.str-el gr-qc

    Quantum gravity of the Heisenberg algebra

    Authors: Ahmed Almheiri, Akash Goel, Xu-Yao Hu

    Abstract: We consider a simplified model of double scaled SYK (DSSYK) in which the Hamiltonian is the position operator of the Harmonic oscillator. This model captures the high temperature limit of DSSYK but could also be defined as a quantum theory in its own right. We study properties of the emergent geometry including its dynamics in response to inserting matter particles. In particular, we find that the… ▽ More

    Submitted 16 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: 30 pages + appendices; v2: typos corrected, references added

  17. arXiv:2403.03029  [pdf, other

    cs.CL

    Socratic Reasoning Improves Positive Text Rewriting

    Authors: Anmol Goel, Nico Daheim, Iryna Gurevych

    Abstract: Reframing a negative into a positive thought is at the crux of several cognitive approaches to mental health and psychotherapy that could be made more accessible by large language model-based solutions. Such reframing is typically non-trivial and requires multiple rationalization steps to uncover the underlying issue of a negative thought and transform it to be more positive. However, this rationa… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  18. arXiv:2403.00826  [pdf, other

    cs.CL cs.CR cs.LG

    LLMGuard: Guarding Against Unsafe LLM Behavior

    Authors: Shubh Goyal, Medha Hira, Shubham Mishra, Sukriti Goyal, Arnav Goel, Niharika Dadu, Kirushikesh DB, Sameep Mehta, Nishtha Madaan

    Abstract: Although the rise of Large Language Models (LLMs) in enterprise settings brings new opportunities and capabilities, it also brings challenges, such as the risk of generating inappropriate, biased, or misleading content that violates regulations and can have legal concerns. To alleviate this, we present "LLMGuard", a tool that monitors user interactions with an LLM application and flags content aga… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

    Comments: accepted in demonstration track of AAAI-24

  19. arXiv:2402.10567  [pdf, other

    cs.CL cs.AI

    InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?

    Authors: Yogesh Tripathi, Raghav Donakanti, Sahil Girhepuje, Ishan Kavathekar, Bhaskara Hanuma Vedula, Gokul S Krishnan, Shreya Goyal, Anmol Goel, Balaraman Ravindran, Ponnurangam Kumaraguru

    Abstract: Recent advancements in language technology and Artificial Intelligence have resulted in numerous Language Models being proposed to perform various tasks in the legal domain ranging from predicting judgments to generating summaries. Despite their immense potential, these models have been proven to learn and exhibit societal biases and make unfair predictions. In this study, we explore the ability o… ▽ More

    Submitted 17 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  20. arXiv:2402.03717  [pdf, ps, other

    math.OC

    Retrospective Cost-based Extremum Seeking Control with Vanishing Perturbation for Online Output Minimization

    Authors: Juan A. Paredes, Jhon Manuel Portella, Dennis S. Bernstein, Ankit Goel

    Abstract: Extremum seeking control (ESC) constitutes a powerful technique for online optimization with theoretical guarantees for convergence to the neighborhood of the optimizer under well-understood conditions. However, ESC requires a nonconstant perturbation signal to provide persistent excitation to the target system to yield convergent results, which usually results in steady state oscillations. While… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  21. arXiv:2402.03709  [pdf, ps, other

    math.OC

    Adaptive Backstep** Control of a Bicopter in Pure Feedback Form with Dynamic Extension

    Authors: Jhon Manuel Portella Delgado, Mohammad Mirtaba, Ankit Goel

    Abstract: This paper presents a model-based, adaptive, nonlinear controller for the bicopter stabilization and trajectory-tracking problem. The nonlinear controller is designed using the backstep** technique. Due to the non-invertibility of the input map, the bicopter system is first dynamically extended. However, the resulting dynamically extended system is in the pure feedback form with the uncertainty… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.03554

  22. arXiv:2402.01831  [pdf, other

    cs.SD cs.LG eess.AS

    Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

    Authors: Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei **, Rafael Valle, Bryan Catanzaro

    Abstract: Augmenting large language models (LLMs) to understand audio -- including non-speech sounds and non-verbal speech -- is critically important for diverse real-world applications of LLMs. In this paper, we propose Audio Flamingo, a novel audio language model with 1) strong audio understanding abilities, 2) the ability to quickly adapt to unseen tasks via in-context learning and retrieval, and 3) stro… ▽ More

    Submitted 28 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  23. arXiv:2401.16920  [pdf, other

    q-fin.PM cs.LG q-fin.ST

    Sparse Portfolio Selection via Topological Data Analysis based Clustering

    Authors: Anubha Goel, Damir Filipović, Puneet Pasricha

    Abstract: This paper uses topological data analysis (TDA) tools and introduces a data-driven clustering-based stock selection strategy tailored for sparse portfolio construction. Our asset selection strategy exploits the topological features of stock price movements to select a subset of topologically similar (different) assets for a sparse index tracking (Markowitz) portfolio. We introduce new distance mea… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  24. arXiv:2401.13092  [pdf, ps, other

    math.OC

    Retrospective Cost Attitude Filtering with Noisy Measurements and Unknown Gyro Bias

    Authors: Parham Oveissi, Ankit Goel

    Abstract: Attitude filtering is a critical technology with applications in diverse domains such as aerospace engineering, robotics, computer vision, and augmented reality. Although attitude filtering is a particular case of the state estimation problem, attitude filtering is uniquely challenging due to the special geometric structure of the attitude parameterization. This paper presents a novel data-driven… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  25. Rank, Pack, or Approve: Voting Methods in Participatory Budgeting

    Authors: Lodewijk Gelauff, Ashish Goel

    Abstract: Participatory budgeting is a popular method to engage residents in budgeting decisions by local governments. The Stanford Participatory Budgeting platform is an online platform that has been used to engage residents in more than 150 budgeting processes. We present a data set with anonymized budget opinions from these processes with K-approval, K-ranking or knapsack primary ballots. For a subset of… ▽ More

    Submitted 25 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted for publication at ICWSM. Data set is available through: https://doi.org/10.25740/db709zg9088

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media, 18 (2024) 448-461

  26. arXiv:2401.05467  [pdf, other

    cs.LG cs.AI

    Active Label Correction for Building LLM-based Modular AI Systems

    Authors: Karan Taneja, Ashok Goel

    Abstract: Large Language Models (LLMs) have been used to build modular AI systems such as HuggingGPT, Microsoft Bing Chat, and more. To improve such systems after deployment using the data collected from human interactions, each module can be replaced by a fine-tuned model but the annotations received from LLMs are low quality. We propose that active label correction can be used to improve the data quality… ▽ More

    Submitted 17 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  27. arXiv:2312.06871  [pdf, other

    cs.AI cs.LG cs.MA

    Using Analytics on Student Created Data to Content Validate Pedagogical Tools

    Authors: John Kos, Kenneth Eaton, Sareen Zhang, Rahul Dass, Stephen Buckley, Sungeun An, Ashok Goel

    Abstract: Conceptual and simulation models can function as useful pedagogical tools, however it is important to categorize different outcomes when evaluating them in order to more meaningfully interpret results. VERA is a ecology-based conceptual modeling software that enables users to simulate interactions between biotics and abiotics in an ecosystem, allowing users to form and then verify hypothesis throu… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 16 pages, preprint

  28. arXiv:2312.04994  [pdf, ps, other

    physics.flu-dyn

    Numerical determination of iron dust laminar flame speeds with the counterflow twin-flame technique

    Authors: C. E. A. G. van Gool, T. Hazenberg, J. A. van Oijen, L. P. H. de Goey

    Abstract: Iron dust counter-flow flames have been studied with the low-Mach-number combustion approximation. The model considers full coupling between the two phases, including particle/droplet drag. The dispersed phase flow strain relations are derived under the assumption of low Reynolds number conditions. The importance of solving a particle flow strain model is demonstrated by comparing three different… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 20 pages, 11 figures

  29. arXiv:2312.02296  [pdf, other

    cs.CL cs.AI cs.LG

    LLMs Accelerate Annotation for Medical Information Extraction

    Authors: Akshay Goel, Almog Gueta, Omry Gilon, Chang Liu, Sofia Erell, Lan Huong Nguyen, Xiaohong Hao, Bolous Jaber, Shashir Reddy, Rupesh Kartha, Jean Steiner, Itay Laish, Amir Feder

    Abstract: The unstructured nature of clinical notes within electronic health records often conceals vital patient-related information, making it challenging to access or interpret. To uncover this hidden information, specialized Natural Language Processing (NLP) models are required. However, training these models necessitates large amounts of labeled data, a process that is both time-consuming and costly wh… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Published in proceedings of the Machine Learning for Health (ML4H) Symposium 2023

  30. arXiv:2311.17405  [pdf, other

    cs.RO

    Learning and Autonomy for Extraterrestrial Terrain Sampling: An Experience Report from OWLAT Deployment

    Authors: Pranay Thangeda, Ashish Goel, Erica Tevere, Yifan Zhu, Erik Kramer, Adriana Daca, Hari Nayar, Kris Hauser, Melkior Ornik

    Abstract: Extraterrestrial autonomous lander missions increasingly demand adaptive capabilities to handle the unpredictable and diverse nature of the terrain. This paper discusses the deployment of a Deep Meta-Learning with Controlled Deployment Gaps (CoDeGa) trained model for terrain scoo** tasks in Ocean Worlds Lander Autonomy Testbed (OWLAT) at NASA Jet Propulsion Laboratory. The CoDeGa-powered scoopin… ▽ More

    Submitted 4 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Updated references to include recent work on autonomy for ocean worlds

  31. arXiv:2311.07060  [pdf, ps, other

    math.AC

    Arithmetic of semisubtractive semidomains

    Authors: Hannah Fox, Agastya Goel, Sophia Liao

    Abstract: A subset $S$ of an integral domain is called a semidomain if the pairs $(S,+)$ and $(S\setminus\{0\}, \cdot)$ are commutative and cancellative semigroups with identities. The multiplication of $S$ extends to the group of differences $\mathscr{G}(S)$, turning $\mathscr{G}(S)$ into an integral domain. In this paper, we study the arithmetic of semisubtractive semidomains (i.e., semidomains $S$ for wh… ▽ More

    Submitted 28 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: 15 pages

    MSC Class: Primary: 16Y60; 11C08; Secondary: 20M13; 13F05

  32. arXiv:2311.05779  [pdf, other

    cs.RO cs.CV

    Language-guided Robot Gras**: CLIP-based Referring Grasp Synthesis in Clutter

    Authors: Georgios Tziafas, Yucheng Xu, Arushi Goel, Mohammadreza Kasaei, Zhibin Li, Hamidreza Kasaei

    Abstract: Robots operating in human-centric environments require the integration of visual grounding and gras** capabilities to effectively manipulate objects based on user instructions. This work focuses on the task of referring grasp synthesis, which predicts a grasp pose for an object referred through natural language in cluttered scenes. Existing approaches often employ multi-stage pipelines that firs… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Poster CoRL 2023. Dataset and code available here: https://github.com/gtziafas/OCID-VLG

  33. arXiv:2310.13619  [pdf, other

    cs.CL cs.CV

    Semi-supervised multimodal coreference resolution in image narrations

    Authors: Arushi Goel, Basura Fernando, Frank Keller, Hakan Bilen

    Abstract: In this paper, we study multimodal coreference resolution, specifically where a longer descriptive text, i.e., a narration is paired with an image. This poses significant challenges due to fine-grained image-text alignment, inherent ambiguity present in narrative language, and unavailability of large annotated training sets. To tackle these challenges, we present a data efficient semi-supervised a… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Long paper at EMNLP'23-Main

  34. arXiv:2310.11643  [pdf, other

    cs.CY

    Opinion Change or Differential Turnout: Changing Opinions on the Austin Police Department in a Budget Feedback Process

    Authors: Lodewijk L. Gelauff, Ashish Goel

    Abstract: In 2020 the tragic murder of George Floyd at the hands of law enforcement ignited and intensified nationwide protests, demanding changes in police funding and allocation. This happened during a budgeting feedback exercise where residents of Austin, Texas were invited to share opinions on the budgets of various city service areas, including the Police Department, on an online platform designed by o… ▽ More

    Submitted 16 January, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: This preprint is an extended version of a previously published conference paper: https://dl.acm.org/doi/10.1145/3551624.3555295

  35. arXiv:2310.09578  [pdf, other

    cs.CE q-fin.PM

    Sparse Index Tracking via Topological Learning

    Authors: Anubha Goel, Puneet Pasricha, Juho Kanniainen

    Abstract: In this research, we introduce a novel methodology for the index tracking problem with sparse portfolios by leveraging topological data analysis (TDA). Utilizing persistence homology to measure the riskiness of assets, we introduce a topological method for data-driven learning of the parameters for regularization terms. Specifically, the Vietoris-Rips filtration method is utilized to capture the i… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  36. arXiv:2309.13450  [pdf

    cs.SE

    Conducting A/B Experiments with a Scalable Architecture

    Authors: Andrew Hornback, Sungeun An, Scott Bunin, Stephen Buckley, John Kos, Ashok Goel

    Abstract: A/B experiments are commonly used in research to compare the effects of changing one or more variables in two different experimental groups - a control group and a treatment group. While the benefits of using A/B experiments are widely known and accepted, there is less agreement on a principled approach to creating software infrastructure systems to assist in rapidly conducting such experiments. W… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  37. arXiv:2308.00813  [pdf

    cs.HC cs.AI

    Designing a Communication Bridge between Communities: Participatory Design for a Question-Answering AI Agent

    Authors: Jeonghyun Lee, Vrinda Nandan, Harshvardhan Sikka, Spencer Rugaber, Ashok Goel

    Abstract: How do we design an AI system that is intended to act as a communication bridge between two user communities with different mental models and vocabularies? Skillsync is an interactive environment that engages employers (companies) and training providers (colleges) in a sustained dialogue to help them achieve the goal of building a training proposal that successfully meets the needs of the employer… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  38. arXiv:2307.15275  [pdf, ps, other

    math.OC

    Computing Invariant Zeros of a Linear System Using State-Space Realization

    Authors: Jhon Manuel Portella Delgado, Ankit Goel

    Abstract: It is well known that zeros and poles of a single-input, single-output system in the transfer function form are the roots of the transfer function's numerator and the denominator polynomial, respectively. However, in the state-space form, where the poles are a subset of the eigenvalue of the dynamics matrix and thus can be computed by solving an eigenvalue problem, the computation of zeros is a no… ▽ More

    Submitted 5 February, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

  39. arXiv:2307.05538  [pdf, other

    cs.CL

    Advancements in Scientific Controllable Text Generation Methods

    Authors: Arnav Goel, Medha Hira, Avinash Anand, Siddhesh Bangar, Dr. Rajiv Ratn Shah

    Abstract: The previous work on controllable text generation is organized using a new schema we provide in this study. Seven components make up the schema, and each one is crucial to the creation process. To accomplish controlled generation for scientific literature, we describe the various modulation strategies utilised to modulate each of the seven components. We also offer a theoretical study and qualitat… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

  40. arXiv:2306.17674  [pdf, other

    cs.CL

    X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents

    Authors: Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam

    Abstract: Task-oriented dialogue research has mainly focused on a few popular languages like English and Chinese, due to the high dataset creation cost for a new language. To reduce the cost, we apply manual editing to automatically translated data. We create a new multilingual benchmark, X-RiSAWOZ, by translating the Chinese RiSAWOZ to 4 languages: English, French, Hindi, Korean; and a code-mixed English-H… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 Findings

  41. arXiv:2306.10243  [pdf, ps, other

    math.PR math-ph

    Central limit theorem for the complex eigenvalues of Gaussian random matrices

    Authors: Advay Goel, Patrick Lopatto, Xiaoyu Xie

    Abstract: We establish a central limit theorem for the eigenvalue counting function of a matrix of real Gaussian random variables.

    Submitted 8 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 15 pages. To appear in Electronic Communications in Probability

  42. arXiv:2306.09224  [pdf, other

    cs.CV

    Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

    Authors: Thomas Mensink, Jasper Uijlings, Lluis Castrejon, Arushi Goel, Felipe Cadar, Howard Zhou, Fei Sha, André Araujo, Vittorio Ferrari

    Abstract: We propose Encyclopedic-VQA, a large scale visual question answering (VQA) dataset featuring visual questions about detailed properties of fine-grained categories and instances. It contains 221k unique question+answer pairs each matched with (up to) 5 images, resulting in a total of 1M VQA samples. Moreover, our dataset comes with a controlled knowledge base derived from Wikipedia, marking the evi… ▽ More

    Submitted 24 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: ICCV'23

  43. arXiv:2305.11296  [pdf, other

    cs.GT

    A Mechanism for Participatory Budgeting With Funding Constraints and Project Interactions

    Authors: Mohak Goyal, Sahasrajit Sarmasarkar, Ashish Goel

    Abstract: Participatory budgeting (PB) has been widely adopted and has attracted significant research efforts; however, there is a lack of mechanisms for PB which elicit project interactions, such as substitution and complementarity, from voters. Also, the outcomes of PB in practice are subject to various minimum/maximum funding constraints on 'types' of projects. We propose a novel preference elicitation s… ▽ More

    Submitted 14 July, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  44. arXiv:2305.05015  [pdf, other

    astro-ph.HE astro-ph.SR

    A Low-Mass Helium Star Progenitor Model for the Type Ibn SN 2020nxt

    Authors: Qinan Wang, Anika Goel, Luc Dessart, Ori D. Fox, Melissa Shahbandeh, Sofia Rest, Armin Rest, Jose H. Groh, Andrew Allan, Claes Fransson, Nathan Smith, Griffin Hosseinzadeh, Alexei V. Filippenko, Jennifer Andrews, K. Azalee Bostroem, Thomas G. Brink, Peter Brown, Jamison Burke, Roger Chevalier, Geoffrey C. Clayton, Mi Dai, Kyle W. Davis, Ryan J. Foley, Sebastian Gomez, Chelsea Harris , et al. (33 additional authors not shown)

    Abstract: A growing number of supernovae (SNe) are now known to exhibit evidence for significant interaction with a dense, pre-existing, circumstellar medium (CSM). SNe Ibn comprise one such class that can be characterised by both rapidly evolving light curves and persistent narrow He I lines. The origin of such a dense CSM in these systems remains a pressing question, specifically concerning the progenitor… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 17 pages, 13 figures, 1 table, submitted to MNRAS

  45. arXiv:2305.03554  [pdf, ps, other

    math.DS

    Adaptive Nonlinear Control of a Bicopter with Unknown Dynamics

    Authors: Jhon Manuel Portella Delgado, Ankit Goel

    Abstract: This paper presents an adaptive, model-based, nonlinear controller for the bicopter trajectory-tracking problem. The nonlinear controller is constructed by dynamically extending the bicopter model, stabilizing the extended dynamics using input-output linearization, augmenting the controller with a finite-time convergent parameter estimator, and designing a linear tracking controller. Unlike contro… ▽ More

    Submitted 7 February, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  46. arXiv:2304.10634  [pdf, other

    eess.SY cs.RO

    Experimental Flight Testing of an Adaptive Autopilot with Parameter Drift Mitigation

    Authors: Yin Yong Chee, Parham Oveissi, Siyuan Shao, Joonghyun Lee, Juan A. Paredes, Dennis S. Bernstein, Ankit Goel

    Abstract: This paper modifies an adaptive multicopter autopilot to mitigate instabilities caused by adaptive parameter drift and presents simulation and experimental results to validate the modified autopilot. The modified adaptive controller is obtained by including a static nonlinearity in the adaptive loop, updated by the retrospective cost adaptive control algorithm. It is shown in simulation and physic… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 6 pages, 16 figures, submitted to IROS 2023

  47. arXiv:2304.02730  [pdf, ps, other

    cs.CR

    Fair Ordering via Streaming Social Choice Theory

    Authors: Geoffrey Ramseyer, Ashish Goel

    Abstract: Prior work studies the question of ``fairly'' ordering transactions in a replicated state machine. Each of $n$ replicas receives transactions in a possibly different order, and the system must aggregate the observed orderings into a single order. We argue that this problem is best viewed through the lens of social choice theory, in which (in the preference aggregation problem) rankings on candidat… ▽ More

    Submitted 27 February, 2024; v1 submitted 5 April, 2023; originally announced April 2023.

  48. arXiv:2303.07476  [pdf, other

    cs.SE cs.AI

    Challenges and Practices of Deep Learning Model Reengineering: A Case Study on Computer Vision

    Authors: Wenxin Jiang, Vishnu Banna, Naveen Vivek, Abhinav Goel, Nicholas Synovic, George K. Thiruvathukal, James C. Davis

    Abstract: Many engineering organizations are reimplementing and extending deep neural networks from the research community. We describe this process as deep learning model reengineering. Deep learning model reengineering - reusing, reproducing, adapting, and enhancing state-of-the-art deep learning approaches - is challenging for reasons including under-documented reference models, changing requirements, an… ▽ More

    Submitted 25 August, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Under submission to EMSE

  49. arXiv:2303.07247  [pdf

    cs.CL cs.CY

    Are Models Trained on Indian Legal Data Fair?

    Authors: Sahil Girhepuje, Anmol Goel, Gokul S Krishnan, Shreya Goyal, Satyendra Pandey, Ponnurangam Kumaraguru, Balaraman Ravindran

    Abstract: Recent advances and applications of language technology and artificial intelligence have enabled much success across multiple domains like law, medical and mental health. AI-based Language Models, like Judgement Prediction, have recently been proposed for the legal sector. However, these models are strife with encoded social biases picked up from the training data. While bias and fairness have bee… ▽ More

    Submitted 14 May, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Presented at the Symposium on AI and Law (SAIL) 2023

  50. arXiv:2303.05323  [pdf, other

    cs.CV

    Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE

    Authors: Yucheng Xu, Li Nanbo, Arushi Goel, Zijian Guo, Zonghai Yao, Hamidreza Kasaei, Mohammadreze Kasaei, Zhibin Li

    Abstract: Videos depict the change of complex dynamical systems over time in the form of discrete image sequences. Generating controllable videos by learning the dynamical system is an important yet underexplored topic in the computer vision community. This paper presents a novel framework, TiV-ODE, to generate highly controllable videos from a static image and a text caption. Specifically, our framework le… ▽ More

    Submitted 4 April, 2023; v1 submitted 9 March, 2023; originally announced March 2023.