Skip to main content

Showing 1–50 of 60 results for author: Ríos, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19538  [pdf, other

    cs.CL

    Context Matters: An Empirical Study of the Impact of Contextual Information in Temporal Question Answering Systems

    Authors: Dan Schumacher, Fatemeh Haji, Tara Grey, Niharika Bandlamudi, Nupoor Karnik, Gagana Uday Kumar, Jason Cho-Yu Chiang, Paul Rad, Nishant Vishwamitra, Anthony Rios

    Abstract: Large language models (LLMs) often struggle with temporal reasoning, crucial for tasks like historical event analysis and time-sensitive information retrieval. Despite advancements, state-of-the-art models falter in handling temporal information, especially when faced with irrelevant or noisy contexts. This paper addresses this gap by empirically examining the robustness of temporal question-answe… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.17574  [pdf, other

    cs.CL

    Beyond Text-to-SQL for IoT Defense: A Comprehensive Framework for Querying and Classifying IoT Threats

    Authors: Ryan Pavlich, Nima Ebadi, Richard Tarbell, Billy Linares, Adrian Tan, Rachael Humphreys, Jayanta Kumar Das, Rambod Ghandiparsi, Hannah Haley, Jerris George, Rocky Slavin, Kim-Kwang Raymond Choo, Glenn Dietrich, Anthony Rios

    Abstract: Recognizing the promise of natural language interfaces to databases, prior studies have emphasized the development of text-to-SQL systems. While substantial progress has been made in this field, existing research has concentrated on generating SQL statements from text queries. The broader challenge, however, lies in inferring new information about the returned data. Our research makes two major co… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.14545  [pdf, other

    cs.CL

    Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems

    Authors: Đorđe Klisura, Anthony Rios

    Abstract: Relational databases are integral to modern information systems, serving as the foundation for storing, querying, and managing data efficiently and effectively. Advancements in large language modeling have led to the emergence of text-to-SQL technologies, significantly enhancing the querying and extracting of information from these databases and raising concerns about privacy and security. Our res… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.14500  [pdf, other

    cs.CL

    Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary

    Authors: Xingmeng Zhao, Tongnian Wang, Anthony Rios

    Abstract: Radiology report summarization (RRS) is crucial for patient care, requiring concise "Impressions" from detailed "Findings." This paper introduces a novel prompting strategy to enhance RRS by first generating a layperson summary. This approach normalizes key observations and simplifies complex information using non-expert communication techniques inspired by doctor-patient interactions. Combined wi… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2404.01961  [pdf, other

    cs.CL

    Team UTSA-NLP at SemEval 2024 Task 5: Prompt Ensembling for Argument Reasoning in Civil Procedures with GPT4

    Authors: Dan Schumacher, Anthony Rios

    Abstract: In this paper, we present our system for the SemEval Task 5, The Legal Argument Reasoning Task in Civil Procedure Challenge. Legal argument reasoning is an essential skill that all law students must master. Moreover, it is important to develop natural language processing solutions that can reason about a question given terse domain-specific contextual information. Our system explores a prompt-base… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to SemEval@NAACL 2024

  6. arXiv:2403.17363  [pdf, other

    cs.CL

    Extracting Biomedical Entities from Noisy Audio Transcripts

    Authors: Nima Ebadi, Kellen Morgan, Adrian Tan, Billy Linares, Sheri Osborn, Emma Majors, Jeremy Davis, Anthony Rios

    Abstract: Automatic Speech Recognition (ASR) technology is fundamental in transcribing spoken language into text, with considerable applications in the clinical realm, including streamlining medical transcription and integrating with Electronic Health Record (EHR) systems. Nevertheless, challenges persist, especially when transcriptions contain noise, leading to significant drops in performance when Natural… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  7. Help Supporters: Exploring the Design Space of Assistive Technologies to Support Face-to-Face Help Between Blind and Sighted Strangers

    Authors: Yuanyang Teng, Connor Courtien, David Angel Rios, Yves M. Tseng, Jacqueline Gibson, Maryam Aziz, Avery Reyna, Rajan Vaish, Brian A. Smith

    Abstract: Blind and low-vision (BLV) people face many challenges when venturing into public environments, often wishing it were easier to get help from people nearby. Ironically, while many sighted individuals are willing to help, such interactions are infrequent. Asking for help is socially awkward for BLV people, and sighted people lack experience in hel** BLV people. Through a mixed-ability research-th… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: To Appear In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) Association for Computing Machinery, New York, NY, USA. 24 pages

  8. German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset

    Authors: Laura Mascarell, Ribin Chalumattu, Annette Rios

    Abstract: The advent of Large Language Models (LLMs) has led to remarkable progress on a wide range of natural language processing tasks. Despite the advances, these large-sized models still suffer from hallucinating information in their output, which poses a major issue in automatic text summarization, as we must guarantee that the generated summary is consistent with the content of the source document. Pr… ▽ More

    Submitted 14 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 figures, 7 tables, conference: Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Turin, Italy, May 20-25, 2024

    ACM Class: I.2.7

  9. arXiv:2401.09407  [pdf, other

    cs.CL cs.LG

    Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text

    Authors: Mazal Bethany, Brandon Wherry, Emet Bethany, Nishant Vishwamitra, Anthony Rios, Peyman Najafirad

    Abstract: With the recent proliferation of Large Language Models (LLMs), there has been an increasing demand for tools to detect machine-generated text. The effective detection of machine-generated text face two pertinent problems: First, they are severely limited in generalizing against real-world scenarios, where machine-generated text is produced by a variety of generators, including but not limited to G… ▽ More

    Submitted 2 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  10. arXiv:2310.16681  [pdf, other

    cs.CL

    BabyStories: Can Reinforcement Learning Teach Baby Language Models to Write Better Stories?

    Authors: Xingmeng Zhao, Tongnian Wang, Sheri Osborn, Anthony Rios

    Abstract: Language models have seen significant growth in the size of their corpus, leading to notable performance improvements. Yet, there has been limited progress in develo** models that handle smaller, more human-like datasets. As part of the BabyLM shared task, this study explores the impact of reinforcement learning from human feedback (RLHF) on language models pretrained from scratch with a limited… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to BabyLM workshop at CoNLL

  11. arXiv:2309.03918  [pdf, other

    cs.AI cs.CY cs.LG

    A recommender for the management of chronic pain in patients undergoing spinal cord stimulation

    Authors: Tigran Tchrakian, Mykhaylo Zayats, Alessandra Pascale, Dat Huynh, Pritish Parida, Carla Agurto Rios, Sergiy Zhuk, Jeffrey L. Rogers, ENVISION Studies Physician Author Group, Boston Scientific Research Scientists Consortium

    Abstract: Spinal cord stimulation (SCS) is a therapeutic approach used for the management of chronic pain. It involves the delivery of electrical impulses to the spinal cord via an implanted device, which when given suitable stimulus parameters can mask or block pain signals. Selection of optimal stimulation parameters usually happens in the clinic under the care of a provider whereas at-home SCS optimizati… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  12. Fitted avatars: automatic skeleton adjustment for self-avatars in virtual reality

    Authors: Jose Luis Ponton, Víctor Ceballos, Lesly Acosta, Alejandro Ríos, Eva Monclús, Nuria Pelechano

    Abstract: In the era of the metaverse, self-avatars are gaining popularity, as they can enhance presence and provide embodiment when a user is immersed in Virtual Reality. They are also very important in collaborative Virtual Reality to improve communication through gestures. Whether we are using a complex motion capture solution or a few trackers with inverse kinematics (IK), it is essential to have a good… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: Published in Virtual Reality Springer

    Journal ref: Springer Virtual Reality (2023) 1-20

  13. arXiv:2305.18618  [pdf

    cs.CL cs.AI

    Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard

    Authors: Vagelis Plevris, George Papazafeiropoulos, Alejandro Jiménez Rios

    Abstract: A comparison between three chatbots which are based on large language models, namely ChatGPT-3.5, ChatGPT-4 and Google Bard is presented, focusing on their ability to give correct answers to mathematics and logic problems. In particular, we check their ability to Understand the problem at hand; Apply appropriate algorithms or methods for its solution; and Generate a coherent response and a correct… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    MSC Class: 68T50 ACM Class: I.2.0; I.2.7

  14. arXiv:2305.15591  [pdf, other

    cs.LG

    Lightweight Learner for Shared Knowledge Lifelong Learning

    Authors: Yunhao Ge, Yuecheng Li, Di Wu, Ao Xu, Adam M. Jones, Amanda Sofie Rios, Iordanis Fostiropoulos, Shixian Wen, Po-Hsuan Huang, Zachary William Murdock, Gozde Sahin, Shuo Ni, Kiran Lekkala, Sumedh Anand Sontakke, Laurent Itti

    Abstract: In Lifelong Learning (LL), agents continually learn as they encounter new conditions and tasks. Most current LL is limited to a single agent that learns tasks sequentially. Dedicated LL machinery is then deployed to mitigate the forgetting of old tasks as new tasks are learned. This is inherently slow. We propose a new Shared Knowledge Lifelong Learning (SKILL) challenge, which deploys a decentral… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Transactions on Machine Learning Research (TMLR) paper

  15. arXiv:2303.12898  [pdf, other

    cs.CL

    Towards Understanding the Generalization of Medical Text-to-SQL Models and Datasets

    Authors: Richard Tarbell, Kim-Kwang Raymond Choo, Glenn Dietrich, Anthony Rios

    Abstract: Electronic medical records (EMRs) are stored in relational databases. It can be challenging to access the required information if the user is unfamiliar with the database schema or general database fundamentals. Hence, researchers have explored text-to-SQL generation methods that provide healthcare professionals direct access to EMR data without needing a database expert. However, currently availa… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  16. arXiv:2301.06178  [pdf, other

    cs.CY cs.CL

    Bike Frames: Understanding the Implicit Portrayal of Cyclists in the News

    Authors: Xingmeng Zhao, Xavier Walton, Suhana Shrestha, Anthony Rios

    Abstract: Increasing the number of cyclists, whether for general transport or recreation, can provide health improvements and reduce the environmental impact of vehicular transportation. However, the public's perception of cycling may be driven by the ideologies and reporting standards of news agencies. For instance, people may identify cyclists on the road as "dangerous" if news agencies overly report cycl… ▽ More

    Submitted 15 January, 2023; originally announced January 2023.

  17. arXiv:2301.01212  [pdf, ps, other

    q-fin.RM cs.LG cs.SI

    Assessment of creditworthiness models privacy-preserving training with synthetic data

    Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

    Abstract: Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Journal ref: Hybrid Artificial Intelligent Systems. HAIS 2022. Lecture Notes in Computer Science(), vol 13469

  18. arXiv:2212.12801  [pdf, other

    cs.CL

    Linguistic Elements of Engaging Customer Service Discourse on Social Media

    Authors: Sonam Singh, Anthony Rios

    Abstract: Customers are rapidly turning to social media for customer support. While brand agents on these platforms are motivated and well-intentioned to help and engage with customers, their efforts are often ignored if their initial response to the customer does not match a specific tone, style, or topic the customer is aiming to receive. The length of a conversation can reflect the effort and quality of… ▽ More

    Submitted 24 December, 2022; originally announced December 2022.

    Comments: Accepted to NLP+CSS at EMNLP 2022

  19. arXiv:2212.12800  [pdf, other

    cs.CL

    A Marker-based Neural Network System for Extracting Social Determinants of Health

    Authors: Xingmeng Zhao, Anthony Rios

    Abstract: Objective. The impact of social determinants of health (SDoH) on patients' healthcare quality and the disparity is well-known. Many SDoH items are not coded in structured forms in electronic health records. These items are often captured in free-text clinical notes, but there are limited methods for automatically extracting them. We explore a multi-stage pipeline involving named entity recognition… ▽ More

    Submitted 24 December, 2022; originally announced December 2022.

  20. arXiv:2212.12799  [pdf, other

    cs.CL

    A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition Models

    Authors: Xingmeng Zhao, Ali Niazi, Anthony Rios

    Abstract: Chemical named entity recognition (NER) models are used in many downstream tasks, from adverse drug reaction identification to pharmacoepidemiology. However, it is unknown whether these models work the same for everyone. Performance disparities can potentially cause harm rather than the intended good. This paper assesses gender-related performance disparities in chemical NER systems. We develop a… ▽ More

    Submitted 13 March, 2024; v1 submitted 24 December, 2022; originally announced December 2022.

  21. arXiv:2212.00089  [pdf, other

    cs.AR cs.ET

    Ferroelectric FET based Context-Switching FPGA Enabling Dynamic Reconfiguration for Adaptive Deep Learning Machines

    Authors: Yixin Xu, Zijian Zhao, Yi Xiao, Tongguang Yu, Halid Mulaosmanovic, Dominik Kleimaier, Stefan Duenkel, Sven Beyer, Xiao Gong, Rajiv Joshi, X. Sharon Hu, Shixian Wen, Amanda Sofie Rios, Kiran Lekkala, Laurent Itti, Eric Homan, Sumitha George, Vijaykrishnan Narayanan, Kai Ni

    Abstract: Field Programmable Gate Array (FPGA) is widely used in acceleration of deep learning applications because of its reconfigurability, flexibility, and fast time-to-market. However, conventional FPGA suffers from the tradeoff between chip area and reconfiguration latency, making efficient FPGA accelerations that require switching between multiple configurations still elusive. In this paper, we perfor… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: 54 pages, 15 figures

  22. arXiv:2211.15464  [pdf, other

    cs.CL cs.AI

    Considerations for meaningful sign language machine translation based on glosses

    Authors: Mathias Müller, Zifan Jiang, Amit Moryossef, Annette Rios, Sarah Ebling

    Abstract: Automatic sign language processing is gaining popularity in Natural Language Processing (NLP) research (Yin et al., 2021). In machine translation (MT) in particular, sign language translation based on glosses is a prominent approach. In this paper, we review recent works on neural gloss translation. We find that limitations of glosses in general and limitations of specific datasets are not discuss… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  23. arXiv:2209.07353  [pdf, other

    cs.CL

    Measuring Geographic Performance Disparities of Offensive Language Classifiers

    Authors: Brandon Lwowski, Paul Rad, Anthony Rios

    Abstract: Text classifiers are applied at scale in the form of one-size-fits-all solutions. Nevertheless, many studies show that classifiers are biased regarding different languages and dialects. When measuring and discovering these biases, some gaps present themselves and should be addressed. First, ``Does language, dialect, and topical content vary across geographical regions?'' and secondly ``If there ar… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted by 29th International Conference on Computational Linguistics (COLING 2022)

  24. arXiv:2209.00470  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods

    Authors: Bram van Es, Leon C. Reteig, Sander C. Tan, Marijn Schraagen, Myrthe M. Hemker, Sebastiaan R. S. Arends, Miguel A. R. Rios, Saskia Haitjema

    Abstract: As structured data are often insufficient, labels need to be extracted from free text in electronic health records when develo** models for clinical information retrieval and decision support systems. One of the most important contextual properties in clinical text is negation, which indicates the absence of findings. We aimed to improve large scale extraction of labels by comparing three method… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 24, 8, journal

    MSC Class: 68T50; 68P20 ACM Class: I.2.7; J.3; H.3.3

  25. arXiv:2204.06122  [pdf, other

    cs.SI cs.LG

    On the dynamics of credit history and social interaction features, and their impact on creditworthiness assessment performance

    Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

    Abstract: For more than a half-century, credit risk management has used credit scoring models in each of its well-defined stages to manage credit risk. Application scoring is used to decide whether to grant a credit or not, while behavioral scoring is used mainly for portfolio management and to take preventive actions in case of default signals. In both cases, network data has recently been shown to be valu… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  26. arXiv:2203.14920  [pdf, other

    cs.CL

    UTSA NLP at SemEval-2022 Task 4: An Exploration of Simple Ensembles of Transformers, Convolutional, and Recurrent Neural Networks

    Authors: Xingmeng Zhao, Anthony Rios

    Abstract: The act of appearing kind or helpful via the use of but having a feeling of superiority condescending and patronizing language can have have serious mental health implications to those that experience it. Thus, detecting this condescending and patronizing language online can be useful for online moderation systems. Thus, in this manuscript, we describe the system developed by Team UTSA SemEval-202… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Submitted to SemEval 2022

  27. arXiv:2203.08694  [pdf, other

    cs.CL cs.SI

    Turning Stocks into Memes: A Dataset for Understanding How Social Communities Can Drive Wall Street

    Authors: Richard Alvarez, Paras Bhatt, Xingmeng Zhao, Anthony Rios

    Abstract: Who actually expresses an intent to buy GameStop shares on Reddit? What convinces people to buy stocks? Are people convinced to support a coordinated plan to adversely impact Wall Street investors? Existing literature on understanding intent has mainly relied on surveys and self reporting; however there are limitations to these methodologies. Hence, in this paper, we develop an annotated dataset o… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to ICWSM 2022

  28. arXiv:2202.06946  [pdf

    cs.HC

    Prototy** a Virtual Agent for Pre-school English Teaching

    Authors: Eduardo Benitez Sandoval, Diego Vazquez Rojas, Clarissa A. Parada Cereceres, Alvaro Anzueto Rios, Amit Barde, Mark Billinghurst

    Abstract: This paper describes a case study and the insights gained from prototy** an Intelligent Virtual Agent (IVA) for English vocabulary building for Spanish-speaking preschool children. After an initial exploration to evaluate the feasibility of develo** an IVA, we followed a Human-Centered Design (HCD) approach to create a prototype. We report on the multidisciplinary process used that incorporate… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted in the IEEE Virtual Reality Conference 2022, Christchurch, New Zealand

    ACM Class: I.3.8; K.3.1

  29. arXiv:2201.08098  [pdf, other

    cs.CV

    What can we learn from misclassified ImageNet images?

    Authors: Shixian Wen, Amanda Sofie Rios, Kiran Lekkala, Laurent Itti

    Abstract: Understanding the patterns of misclassified ImageNet images is particularly important, as it could guide us to design deep neural networks (DNN) that generalize better. However, the richness of ImageNet imposes difficulties for researchers to visually find any useful patterns of misclassification. Here, to help find these patterns, we propose "Superclassing ImageNet dataset". It is a subset of Ima… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  30. On the combination of graph data for assessing thin-file borrowers' creditworthiness

    Authors: Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña

    Abstract: The thin-file borrowers are customers for whom a creditworthiness assessment is uncertain due to their lack of credit history; many researchers have used borrowers' relationships and interactions networks in the form of graphs as an alternative data source to address this. Incorporating network data is traditionally made by hand-crafted feature engineering, and lately, the graph neural network has… ▽ More

    Submitted 16 September, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Journal ref: Expert Systems with Applications, 2022, 118809

  31. arXiv:2111.08174  [pdf, other

    cs.CV cs.LG

    ShapeY: Measuring Shape Recognition Capacity Using Nearest Neighbor Matching

    Authors: Jong Woo Nam, Amanda S. Rios, Bartlett W. Mel

    Abstract: Object recognition in humans depends primarily on shape cues. We have developed a new approach to measuring the shape recognition performance of a vision system based on nearest neighbor view matching within the system's embedding space. Our performance benchmark, ShapeY, allows for precise control of task difficulty, by enforcing that view matching span a specified degree of 3D viewpoint change a… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 6 pages, 5 figures, Accepted to NeurIPS: ImageNet Past, Present, and Future

  32. arXiv:2107.08030  [pdf, other

    stat.ME cs.AI

    A New Robust Multivariate Mode Estimator for Eye-tracking Calibration

    Authors: Adrien Brilhault, Sergio Neuenschwander, Ricardo Araujo Rios

    Abstract: We propose in this work a new method for estimating the main mode of multivariate distributions, with application to eye-tracking calibrations. When performing eye-tracking experiments with poorly cooperative subjects, such as infants or monkeys, the calibration data generally suffer from high contamination. Outliers are typically organized in clusters, corresponding to the time intervals when sub… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  33. arXiv:2106.06811  [pdf, other

    cs.SI cs.CL cs.LG

    Case Study on Detecting COVID-19 Health-Related Misinformation in Social Media

    Authors: Mir Mehedi A. Pritom, Rosana Montanez Rodriguez, Asad Ali Khan, Sebastian A. Nugroho, Esra'a Alrashydah, Beatrice N. Ruiz, Anthony Rios

    Abstract: COVID-19 pandemic has generated what public health officials called an infodemic of misinformation. As social distancing and stay-at-home orders came into effect, many turned to social media for socializing. This increase in social media usage has made it a prime vehicle for the spreading of misinformation. This paper presents a mechanism to detect COVID-19 health-related misinformation in social… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

    Comments: 10 pages

  34. arXiv:2106.01170  [pdf, other

    cs.CL

    Detecting Bot-Generated Text by Characterizing Linguistic Accommodation in Human-Bot Interactions

    Authors: Paras Bhatt, Anthony Rios

    Abstract: Language generation models' democratization benefits many domains, from answering health-related questions to enhancing education by providing AI-driven tutoring services. However, language generation models' democratization also makes it easier to generate human-like text at-scale for nefarious activities, from spreading misinformation to targeting specific groups with hate speech. Thus, it is es… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: 13 pages, to be published in Findings of ACL-IJCNLP 2021

  35. arXiv:2104.10166  [pdf, other

    cs.CL

    Evaluating the Immediate Applicability of Pose Estimation for Sign Language Recognition

    Authors: Amit Moryossef, Ioannis Tsochantaridis, Joe Dinn, Necati Cihan Camgöz, Richard Bowden, Tao Jiang, Annette Rios, Mathias Müller, Sarah Ebling

    Abstract: Signed languages are visual languages produced by the movement of the hands, face, and body. In this paper, we evaluate representations based on skeleton poses, as these are explainable, person-independent, privacy-preserving, low-dimensional representations. Basically, skeletal representations generalize over an individual's appearance and background, allowing us to focus on the recognition of mo… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

  36. arXiv:2104.08726  [pdf, other

    cs.CL

    AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages

    Authors: Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Meza-Ruiz, Gustavo A. Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Ngoc Thang Vu, Katharina Kann

    Abstract: Pretrained multilingual models are able to perform cross-lingual transfer in a zero-shot setting, even for languages unseen during pretraining. However, prior work evaluating performance on unseen languages has largely been limited to low-level, syntactic tasks, and it remains unclear if zero-shot learning of high-level, semantic tasks is possible for unseen languages. To explore this question, we… ▽ More

    Submitted 16 March, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted to ACL 2022

  37. arXiv:2104.03945  [pdf, other

    cs.CL

    On Biasing Transformer Attention Towards Monotonicity

    Authors: Annette Rios, Chantal Amrhein, Noëmi Aepli, Rico Sennrich

    Abstract: Many sequence-to-sequence tasks in natural language processing are roughly monotonic in the alignment between source and target sequence, and previous work has facilitated or enforced learning of monotonic attention behavior via specialized attention functions or pretraining. In this work, we introduce a monotonicity loss function that is compatible with standard attention mechanisms and test it o… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: To be published in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2021)

  38. Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

    Authors: Julia Kreutzer, Isaac Caswell, Lisa Wang, Ahsan Wahab, Daan van Esch, Nasanbayar Ulzii-Orshikh, Allahsera Tapo, Nishant Subramani, Artem Sokolov, Claytone Sikasote, Monang Setyawan, Supheakmungkol Sarin, Sokhar Samb, Benoît Sagot, Clara Rivera, Annette Rios, Isabel Papadimitriou, Salomey Osei, Pedro Ortiz Suarez, Iroro Orife, Kelechi Ogueji, Andre Niyongabo Rubungo, Toan Q. Nguyen, Mathias Müller, André Müller , et al. (27 additional authors not shown)

    Abstract: With the success of large-scale pre-training and multilingual modeling in Natural Language Processing (NLP), recent years have seen a proliferation of large, web-mined text datasets covering hundreds of languages. We manually audit the quality of 205 language-specific corpora released with five major public datasets (CCAligned, ParaCrawl, WikiMatrix, OSCAR, mC4). Lower-resource corpora have system… ▽ More

    Submitted 21 February, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted at TACL; pre-MIT Press publication version

    Journal ref: Transactions of the Association for Computational Linguistics (2022) 10: 50-72

  39. arXiv:2101.08674  [pdf, other

    cs.CV

    DAF:re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset For Anime Character Recognition

    Authors: Edwin Arkel Rios, Wen-Huang Cheng, Bo-Cheng Lai

    Abstract: In this work we tackle the challenging problem of anime character recognition. Anime, referring to animation produced within Japan and work derived or inspired from it. For this purpose we present DAF:re (DanbooruAnimeFaces:revamped), a large-scale, crowd-sourced, long-tailed dataset with almost 500 K images spread across more than 3000 classes. Additionally, we conduct experiments on DAF:re and s… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 5 pages, 3 figures, 4 tables

    ACM Class: I.2; I.4

  40. arXiv:2011.13429  [pdf

    cs.LG cs.CV

    Explaining Deep Learning Models for Structured Data using Layer-Wise Relevance Propagation

    Authors: hsan Ullah, Andre Rios, Vaibhav Gala, Susan Mckeever

    Abstract: Trust and credibility in machine learning models is bolstered by the ability of a model to explain itsdecisions. While explainability of deep learning models is a well-known challenge, a further chal-lenge is clarity of the explanation itself, which must be interpreted by downstream users. Layer-wiseRelevance Propagation (LRP), an established explainability technique developed for deep models inco… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: 13 pages, 5 figures, 6 tables

  41. arXiv:2011.04783  [pdf, other

    cs.LG cs.AI

    Lifelong Learning Without a Task Oracle

    Authors: Amanda Rios, Laurent Itti

    Abstract: Supervised deep neural networks are known to undergo a sharp decline in the accuracy of older tasks when new tasks are learned, termed "catastrophic forgetting". Many state-of-the-art solutions to continual learning rely on biasing and/or partitioning a model to accommodate successive tasks incrementally. However, these methods largely depend on the availability of a task-oracle to confer task ide… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Proceedings of the IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI 2020)

  42. arXiv:2011.01703  [pdf, other

    cs.CL

    Subword Segmentation and a Single Bridge Language Affect Zero-Shot Neural Machine Translation

    Authors: Annette Rios, Mathias Müller, Rico Sennrich

    Abstract: Zero-shot neural machine translation is an attractive goal because of the high cost of obtaining data and building translation systems for new translation directions. However, previous papers have reported mixed success in zero-shot translation. It is hard to predict in which settings it will be effective, and what limits performance compared to a fully supervised system. In this paper, we investi… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted at WMT 2020

  43. Beneficial Perturbation Network for designing general adaptive artificial intelligence systems

    Authors: Shixian Wen, Amanda Rios, Yunhao Ge, Laurent Itti

    Abstract: The human brain is the gold standard of adaptive learning. It not only can learn and benefit from experience, but also can adapt to new situations. In contrast, deep neural networks only learn one sophisticated but fixed map** from inputs to outputs. This limits their applicability to more dynamic situations, where input to output map** may change with different contexts. A salient example is… ▽ More

    Submitted 1 February, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    Comments: Accepted at IEEE Transactions on Neural Networks and Learning Systems Keyword: Adaptive artificial intelligence system , Switch modes , Beneficial perturbations , Continual learning , Adversarial examples

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems 2021

  44. arXiv:2009.12724  [pdf, other

    cs.LG cs.CR stat.ML

    Beneficial Perturbations Network for Defending Adversarial Examples

    Authors: Shixian Wen, Amanda Rios, Laurent Itti

    Abstract: Deep neural networks can be fooled by adversarial attacks: adding carefully computed small adversarial perturbations to clean inputs can cause misclassification on state-of-the-art machine learning models. The reason is that neural networks fail to accommodate the distribution drift of the input data caused by adversarial perturbations. Here, we present a new solution - Beneficial Perturbation Net… ▽ More

    Submitted 13 September, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    Comments: The paper is under consideration at Pattern Recognition Letters

  45. Pure Pattern Calculus à la de Bruijn

    Authors: Alexis Martín, Alejandro Ríos, Andrés Viso

    Abstract: It is well-known in the field of programming languages that dealing with variable names and binders may lead to conflicts such as undesired captures when implementing interpreters or compilers. This situation has been overcome by resorting to de Bruijn indices for calculi where binders capture only one variable name, like the $λ$-calculus. The advantage of this approach relies on the fact that so-… ▽ More

    Submitted 28 June, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

  46. The Bang Calculus Revisited

    Authors: Antonio Bucciarelli, Delia Kesner, Alejandro Ríos, Andrés Viso

    Abstract: Call-by-Push-Value (CBPV) is a programming paradigm subsuming both Callby-Name (CBN) and Call-by-Value (CBV) semantics. The essence of this paradigm is captured by the Bang Calculus, a (concise) term language connecting CBPV and Linear Logic. This paper presents a revisited version of the Bang Calculus, called $λ!$, enjoying some important properties missing in the original formulation. Indeed,… ▽ More

    Submitted 5 May, 2023; v1 submitted 10 February, 2020; originally announced February 2020.

  47. arXiv:1911.03109  [pdf, other

    cs.CL

    Domain Robustness in Neural Machine Translation

    Authors: Mathias Müller, Annette Rios, Rico Sennrich

    Abstract: Translating text that diverges from the training domain is a key challenge for machine translation. Domain robustness---the generalization of models to unseen test domains---is low for both statistical (SMT) and neural machine translation (NMT). In this paper, we study the performance of SMT and NMT models on out-of-domain test sets. We find that in unknown domains, SMT and NMT suffer from very di… ▽ More

    Submitted 24 September, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: V2: AMTA camera-ready

  48. arXiv:1811.02668  [pdf

    cs.CV cs.LG stat.ML

    Automated Diagnosis of Lymphoma with Digital Pathology Images Using Deep Learning

    Authors: Hanadi El Achi, Tatiana Belousova, Lei Chen, Amer Wahed, Iris Wang, Zhihong Hu, Zeyad Kanaan, Adan Rios, Andy N. D. Nguyen

    Abstract: Recent studies have shown promising results in using Deep Learning to detect malignancy in whole slide imaging. However, they were limited to just predicting positive or negative finding for a specific neoplasm. We attempted to use Deep Learning with a convolutional neural network algorithm to build a lymphoma diagnostic model for four diagnostic categories: benign lymph node, diffuse large B cell… ▽ More

    Submitted 30 October, 2018; originally announced November 2018.

    Comments: 13 pages, 2 figures, 2 tables

  49. arXiv:1811.01146  [pdf, other

    cs.LG cs.AI stat.ML

    Closed-Loop Memory GAN for Continual Learning

    Authors: Amanda Rios, Laurent Itti

    Abstract: Sequential learning of tasks using gradient descent leads to an unremitting decline in the accuracy of tasks for which training data is no longer available, termed catastrophic forgetting. Generative models have been explored as a means to approximate the distribution of old tasks and bypass storage of real data. Here we propose a cumulative closed-loop memory replay GAN (CloGAN) provided with ext… ▽ More

    Submitted 28 September, 2020; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-2019). https://doi.org/10.24963/ijcai.2019/462

  50. arXiv:1810.13247  [pdf

    cs.LG q-bio.QM stat.ML

    Application of Deep Learning on Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

    Authors: Mei Lin, Vanya Jaitly, Iris Wang, Zhihong Hu, Lei Chen, Md. Amer Wahed, Zeyad Kanaan, Adan Rios, Andy N. D. Nguyen

    Abstract: We explore how Deep Learning (DL) can be utilized to predict prognosis of acute myeloid leukemia (AML). Out of TCGA (The Cancer Genome Atlas) database, 94 AML cases are used in this study. Input data include age, 10 common cytogenetic and 23 most common mutation results; output is the prognosis (diagnosis to death, DTD). In our DL network, autoencoders are stacked to form a hierarchical DL model f… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

    Comments: 11 pages, 1 table, 1 figure. arXiv admin note: substantial text overlap with arXiv:1801.01019