Showing 1–50 of 59 results for author: Rosé, C

  1. arXiv:2406.19545  [pdf, other]

    cs.CL cs.AI

    Leveraging Machine-Generated Rationales to Facilitate Social Meaning Detection in Conversations

    Authors: Ritam Dutt, Zhen Wu, Kelly Shi, Divyanshu Sheth, Prakhar Gupta, Carolyn Penstein Rose

    Abstract: We present a generalizable classification approach that leverages Large Language Models (LLMs) to facilitate the detection of implicitly encoded social meaning in conversations. We design a multi-faceted prompt to extract a textual explanation of the reasoning that connects visible cues to underlying social meanings. These extracted explanations or rationales serve as augmentations to the conversa…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: To appear at The Proceedings of the Association for Computational Linguistics, 2024

  2. arXiv:2404.18262  [pdf, other]

    cs.AI

    Generating Situated Reflection Triggers about Alternative Solution Paths: A Case Study of Generative AI for Computer-Supported Collaborative Learning

    Authors: Atharva Naik, Jessica Ruhan Yin, Anusha Kamath, Qianou Ma, Sherry Tongshuang Wu, Charles Murray, Christopher Bogart, Majd Sakr, Carolyn P. Rose

    Abstract: An advantage of Large Language Models (LLMs) is their contextualization capability - providing different responses based on student inputs like solution strategy or prior discussion, to potentially better engage students than standard feedback. We present a design and evaluation of a proof-of-concept LLM application to offer students dynamic and contextualized feedback. Specifically, we augment an…

    Submitted 28 April, 2024; originally announced April 2024.

  3. arXiv:2404.00566  [pdf, other]

    cs.SE cs.CL

    CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks

    Authors: Yiqing Xie, Alex Xie, Divyanshu Sheth, Pengfei Liu, Daniel Fried, Carolyn Rose

    Abstract: To facilitate evaluation of code generation systems across diverse scenarios, we present CodeBenchGen, a framework to create scalable execution-based benchmarks that only requires light guidance from humans. Specifically, we leverage a large language model (LLM) to convert an arbitrary piece of code into an evaluation example, including test cases for execution-based evaluation. We illustrate the…

    Submitted 7 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  4. arXiv:2402.15034  [pdf, ps, other]

    math.CO cs.CG

    Rectilinear Crossing Number of Graphs Excluding Single-Crossing Graphs as Minors

    Authors: Vida Dujmović, Camille La Rose

    Abstract: The crossing number of a graph $G$ is the minimum number of crossings in a drawing of $G$ in the plane. A rectilinear drawing of a graph $G$ represents vertices of $G$ by a set of points in the plane and represents each edge of $G$ by a straight-line segment connecting its two endpoints. The rectilinear crossing number of $G$ is the minimum number of crossings in a rectilinear drawing of $G$. By…

    Submitted 22 February, 2024; originally announced February 2024.

  5. arXiv:2311.09581  [pdf, other]

    cs.CL

    DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation

    Authors: Yiqing Xie, Sheng Zhang, Hao Cheng, Pengfei Liu, Zelalem Gero, Cliff Wong, Tristan Naumann, Hoifung Poon, Carolyn Rose

    Abstract: Medical text generation aims to assist with administrative work and highlight salient information to support decision-making. To reflect the specific requirements of medical text, in this paper, we propose a set of metrics to evaluate the completeness, conciseness, and attribution of the generated text at a fine-grained level. The metrics can be computed by various types of evaluators including in…

    Submitted 18 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  6. arXiv:2311.00317  [pdf, other]

    cs.CL cs.LG cs.SE

    Data Augmentation for Code Translation with Comparable Corpora and Multiple References

    Authors: Yiqing Xie, Atharva Naik, Daniel Fried, Carolyn Rose

    Abstract: One major challenge of translating code between programming languages is that parallel training data is often limited. To overcome this challenge, we present two data augmentation techniques, one that builds comparable corpora (i.e., code pairs with similar functionality), and another that augments existing parallel data with multiple reference translations. Specifically, we build and analyze mult…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 Findings

  7. arXiv:2307.03823  [pdf, other]

    cs.CL

    Linguistic representations for fewer-shot relation extraction across domains

    Authors: Sireesh Gururaja, Ritam Dutt, Tinglong Liao, Carolyn Rose

    Abstract: Recent work has demonstrated the positive impact of incorporating linguistic representations as additional context and scaffolding on the in-domain performance of several NLP tasks. We extend this work by exploring the impact of linguistic representations on cross-domain performance in a few-shot transfer setting. An important question is whether linguistic representations enhance generalizability…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: ACL 2023

  8. arXiv:2305.02840  [pdf]

    cs.CY

    Making Sense of Machine Learning: Integrating Youth's Conceptual, Creative, and Critical Understandings of AI

    Authors: Luis Morales-Navarro, Yasmin B. Kafai, Francisco Castro, William Payne, Kayla DesPortes, Daniella DiPaola, Randi Williams, Safinah Ali, Cynthia Breazeal, Clifford Lee, Elisabeth Soep, Duri Long, Brian Magerko, Jaemarie Solyst, Amy Ogan, Cansu Tatar, Shiyan Jiang, Jie Chao, Carolyn P. Rosé, Sepehr Vakil

    Abstract: Understanding how youth make sense of machine learning and how learning about machine learning can be supported in and out of school is more relevant than ever before as young people interact with machine learning powered applications everyday; while connecting with friends, listening to music, playing games, or attending school. In this symposium, we present different perspectives on understandin…

    Submitted 4 May, 2023; originally announced May 2023.

    ACM Class: K.3.2; H.5.3

    Journal ref: Proceedings of the 17th International Conference of the Learning Sciences - ICLS 2023

  9. arXiv:2209.14089  [pdf, other]

    cond-mat.stat-mech cs.LG

    Combining Reinforcement Learning and Tensor Networks, with an Application to Dynamical Large Deviations

    Authors: Edward Gillman, Dominic C. Rose, Juan P. Garrahan

    Abstract: We present a framework to integrate tensor network (TN) methods with reinforcement learning (RL) for solving dynamical optimisation tasks. We consider the RL actor-critic method, a model-free approach for solving RL problems, and introduce TNs as the approximators for its policy and value functions. Our "actor-critic with tensor networks" (ACTeN) method is especially well suited to problems with l…

    Submitted 5 April, 2024; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: [v1]: Combined main text of 6 pages, 3 figures and supplemental materials of 7 pages, 1 figure. [v2]: Accepted version, Phys. Rev. Lett. Combined main text of 8 pages, 4 figures and supplemental materials of 5 pages

  10. arXiv:2209.11116  [pdf, other]

    cond-mat.stat-mech cond-mat.dis-nn cs.LG

    Training neural network ensembles via trajectory sampling

    Authors: Jamie F. Mair, Dominic C. Rose, Juan P. Garrahan

    Abstract: In machine learning, there is renewed interest in neural network ensembles (NNEs), whereby predictions are obtained as an aggregate from a diverse set of smaller models, rather than from a single larger model. Here, we show how to define and train a NNE using techniques from the study of rare trajectories in stochastic systems. We define an NNE in terms of the trajectory of the model parameters un…

    Submitted 10 May, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 12 pages, 5 figures, 1 appendix

  11. arXiv:2209.00568  [pdf, other]

    cs.CL cs.AI cs.LG

    Multi-Scale Contrastive Knowledge Co-Distillation for Event Temporal Relation Extraction

    Authors: Hao-Ren Yao, Luke Breitfeller, Aakanksha Naik, Chunxiao Zhou, Carolyn Rose

    Abstract: Event Temporal Relation Extraction (ETRE) is a crucial yet challenging problem. Event pairs are situated within a discourse at different distances, which we refer to as proximity bands. The temporal ordering communicated about event pairs situated at more remote (i.e., ``long'') or less remote (i.e., ``short'') proximity bands is encoded differently. SOTA ETRE models have tended to perform well on…

    Submitted 20 March, 2024; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: update

  12. arXiv:2207.06215  [pdf, other]

    eess.IV cs.CV q-bio.CB

    YOLO2U-Net: Detection-Guided 3D Instance Segmentation for Microscopy

    Authors: Amirkoushyar Ziabari, Derek C. Rose, Abbas Shirinifard, David Solecki

    Abstract: Microscopy imaging techniques are instrumental for characterization and analysis of biological structures. As these techniques typically render 3D visualization of cells by stacking 2D projections, issues such as out-of-plane excitation and low resolution in the $z$-axis may pose challenges (even for human experts) to detect individual cells in 3D volumes as these non-overlapping cells may appear…

    Submitted 13 July, 2022; originally announced July 2022.

  13. arXiv:2202.13018  [pdf, other]

    cs.CV

    HCIL: Hierarchical Class Incremental Learning for Longline Fishing Visual Monitoring

    Authors: Jie Mei, Suzanne Romain, Craig Rose, Kelsey Magrane, Jenq-Neng Hwang

    Abstract: The goal of electronic monitoring of longline fishing is to visually monitor the fish catching activities on fishing vessels based on cameras, either for regulatory compliance or catch counting. The previous hierarchical classification method demonstrates efficient fish species identification of catches from longline fishing, where fishes are under severe deformation and self-occlusion during the…

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: Preprint for ICIP 2022

  14. arXiv:2201.09373  [pdf, other]

    cs.CV

    Unsupervised Severely Deformed Mesh Reconstruction (DMR) from a Single-View Image

    Authors: Jie Mei, Jingxi Yu, Suzanne Romain, Craig Rose, Kelsey Magrane, Graeme LeeSon, Jenq-Neng Hwang

    Abstract: Much progress has been made in the supervised learning of 3D reconstruction of rigid objects from multi-view images or a video. However, it is more challenging to reconstruct severely deformed objects from a single-view RGB image in an unsupervised manner. Although training-based methods, such as specific category-level training, have been shown to successfully reconstruct rigid objects and slight…

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: Under Review

  15. arXiv:2111.01340  [pdf, other]

    cs.CL

    Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

    Authors: Aakanksha Naik, Jill Lehman, Carolyn Rose

    Abstract: Natural language understanding (NLU) has made massive progress driven by large benchmarks, but benchmarks often leave a long tail of infrequent phenomena underrepresented. We reflect on the question: have transfer learning methods sufficiently addressed the poor performance of benchmark-trained models on the long tail? We conceptualize the long tail using macro-level dimensions (e.g., underreprese…

    Submitted 3 June, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: To appear in TACL 2022. This is a pre-MIT Press publication version

  16. arXiv:2108.08965  [pdf, other]

    cs.CV cs.CL

    Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling

    Authors: Xiaopeng Lu, Zhen Fan, Yansen Wang, Jean Oh, Carolyn P. Rose

    Abstract: As an important task in multimodal context understanding, Text-VQA (Visual Question Answering) aims at question answering through reading text information in images. It differentiates from the original VQA task as Text-VQA requires large amounts of scene-text relationship understanding, in addition to the cross-modal grounding capability. In this paper, we propose Localize, Group, and Select (LOGO…

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: 9 pages

  17. arXiv:2107.00218  [pdf]

    cs.SE

    Comparing Example-Based Collaborative Reflection to Problem Solving Practice for Learning during Team-Based Software Engineering Projects

    Authors: Sreecharan Sankaranarayanan, Siddharth Reddy Kandimalla, Christopher Bogart, R. Charles Murray, Haokang An, Michael Hilton, Majd Sakr, Carolyn Rosé

    Abstract: Contributing to the literature on aptitude-treatment interactions between worked examples and problem-solving, this paper addresses differential learning from the two approaches when students are positioned as domain experts learning new concepts. Our evaluation is situated in a team project that is part of an advanced software engineering course. In this course, students who possess foundational…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 4 pages, 1 image, 1 table, 14th Computer Supported Collaborative Learning (CSCL) Proceedings at the Annual Meeting of the International Society of the Learning Sciences (ISLS)

    Journal ref: 14th Computer-Supported Collaborative Learning Proceedings at the Annual Meeting of the International Society of the Learning Sciences 2021, pp. 213-216

  18. arXiv:2106.06555  [pdf, other]

    cs.LG

    Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network

    Authors: Justin Lovelace, Denis Newman-Griffis, Shikhar Vashishth, Jill Fain Lehman, Carolyn Penstein Rosé

    Abstract: Knowledge Graph (KG) completion research usually focuses on densely connected benchmark datasets that are not representative of real KGs. We curate two KG datasets that include biomedical and encyclopedic knowledge and use an existing commonsense KG dataset to explore KG completion in the more realistic setting where dense connectivity is not guaranteed. We develop a deep convolutional network tha…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

  19. arXiv:2105.07314  [pdf, other]

    cs.CL

    STAGE: Tool for Automated Extraction of Semantic Time Cues to Enrich Neural Temporal Ordering Models

    Authors: Luke Breitfeller, Aakanksha Naik, Carolyn Rose

    Abstract: Despite achieving state-of-the-art accuracy on temporal ordering of events, neural models showcase significant gaps in performance. Our work seeks to fill one of these gaps by leveraging an under-explored dimension of textual semantics: rich semantic information provided by explicit textual time cues. We develop STAGE, a system that consists of a novel temporal framework and a parser that can auto…

    Submitted 15 May, 2021; originally announced May 2021.

  20. arXiv:2105.04321  [pdf, other]

    cond-mat.stat-mech cs.LG physics.chem-ph

    Reinforcement learning of rare diffusive dynamics

    Authors: Avishek Das, Dominic C. Rose, Juan P. Garrahan, David T. Limmer

    Abstract: We present a method to probe rare molecular dynamics trajectories directly using reinforcement learning. We consider trajectories that are conditioned to transition between regions of configuration space in finite time, like those relevant in the study of reactive events, as well as trajectories exhibiting rare fluctuations of time-integrated quantities in the long time limit, like those relevant…

    Submitted 11 August, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 23 pages, 8 figures

    Journal ref: J. Chem. Phys. 155, 134105 (2021)

  21. arXiv:2104.10215  [pdf, other]

    cs.CL cs.AI cs.LG

    Evaluating the Impact of a Hierarchical Discourse Representation on Entity Coreference Resolution Performance

    Authors: Sopan Khosla, James Fiacco, Carolyn Rose

    Abstract: Recent work on entity coreference resolution (CR) follows current trends in Deep Learning applied to embeddings and relatively simple task-related features. SOTA models do not make use of hierarchical representations of discourse structure. In this work, we leverage automatically constructed discourse parse trees within a neural approach and demonstrate a significant improvement on two benchmark e…

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Also contains the Appendix. Accepted to NAACL 2021 as a short paper

  22. arXiv:2104.07874  [pdf, other]

    cs.CL cs.AI

    Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research

    Authors: Denis Newman-Griffis, Jill Fain Lehman, Carolyn Rosé, Harry Hochheiser

    Abstract: Natural language processing (NLP) research combines the study of universal principles, through basic science, with applied science targeting specific use cases and settings. However, the process of exchange between basic NLP and applications is often assumed to emerge naturally, resulting in many innovations going unapplied and many important questions left unstudied. We describe a new paradigm of…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to NAACL-HLT 2021

  23. Hugo: A Cluster Scheduler that Efficiently Learns to Select Complementary Data-Parallel Jobs

    Authors: Lauritz Thamsen, Ilya Verbitskiy, Sasho Nedelkoski, Vinh Thuy Tran, Vinicius Meyer, Miguel G. Xavier, Odej Kao, Cesar A. F. De Rose

    Abstract: Distributed data processing systems like MapReduce, Spark, and Flink are popular tools for analysis of large datasets with cluster resources. Yet, users often overprovision resources for their data processing jobs, while the resource usage of these jobs also typically fluctuates considerably. Therefore, multiple jobs usually get scheduled onto the same shared resources to increase the resource uti…

    Submitted 14 February, 2021; originally announced February 2021.

  24. arXiv:2102.04639  [pdf, other]

    cs.CV

    Absolute 3D Pose Estimation and Length Measurement of Severely Deformed Fish from Monocular Videos in Longline Fishing

    Authors: Jie Mei, Jenq-Neng Hwang, Suzanne Romain, Craig Rose, Braden Moore, Kelsey Magrane

    Abstract: Monocular absolute 3D fish pose estimation allows for efficient fish length measurement in the longline fisheries, where fishes are under severe deformation during the catching process. This task is challenging since it requires locating absolute 3D fish keypoints based on a short monocular video clip. Unlike related works, which either require expensive 3D ground-truth data and/or multiple-view i…

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP2021

  25. arXiv:2102.03520  [pdf, other]

    cs.CV

    Video-based Hierarchical Species Classification for Longline Fishing Monitoring

    Authors: Jie Mei, Jenq-Neng Hwang, Suzanne Romain, Craig Rose, Braden Moore, Kelsey Magrane

    Abstract: The goal of electronic monitoring (EM) of longline fishing is to monitor the fish catching activities on fishing vessels, either for the regulatory compliance or catch counting. Hierarchical classification based on videos allows for inexpensive and efficient fish species identification of catches from longline fishing, where fishes are under severe deformation and self-occlusion during the catchin…

    Submitted 6 February, 2021; originally announced February 2021.

    Comments: To be published in CVAUI2020 in conjunction with ICPR2020

  26. arXiv:2101.10545  [pdf, other]

    cs.CL cs.AI

    RESPER: Computationally Modelling Resisting Strategies in Persuasive Conversations

    Authors: Ritam Dutt, Sayan Sinha, Rishabh Joshi, Surya Shekhar Chakraborty, Meredith Riggs, Xinru Yan, Haogang Bao, Carolyn Penstein Rosé

    Abstract: Modelling persuasion strategies as predictors of task outcome has several real-world applications and has received considerable attention from the computational linguistics community. However, previous research has failed to account for the resisting strategies employed by an individual to foil such persuasion attempts. Grounded in prior literature in cognitive and social psychology, we propose a…

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted as a long paper at the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)

  27. arXiv:2010.08695  [pdf, other]

    cs.DC

    RECEIPT: REfine CoarsE-grained IndePendent Tasks for Parallel Tip decomposition of Bipartite Graphs

    Authors: Kartik Lakhotia, Rajgopal Kannan, Viktor Prasanna, Cesar A. F. De Rose

    Abstract: Tip decomposition is a crucial kernel for mining dense subgraphs in bipartite networks, with applications in spam detection, analysis of affiliation networks etc. It creates a hierarchy of vertex-induced subgraphs with varying densities determined by the participation of vertices in butterflies (2,2-bicliques). To build the hierarchy, existing algorithms iteratively follow a delete-update(peeling)…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: To appear in Proceedings of VLDB Vol. 14

  28. arXiv:2010.05738  [pdf, other]

    cs.CL cs.AI cs.LG

    Using Type Information to Improve Entity Coreference Resolution

    Authors: Sopan Khosla, Carolyn Rose

    Abstract: Coreference resolution (CR) is an essential part of discourse analysis. Most recently, neural approaches have been proposed to improve over SOTA models from earlier paradigms. So far none of the published neural models leverage external semantic knowledge such as type information. This paper offers the first such model and evaluation, demonstrating modest gains in accuracy by introducing either go…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Accepted as Long Paper at CODI workshop EMNLP 2020

  29. arXiv:2010.02246  [pdf, other]

    cs.CL cs.LG

    MedFilter: Improving Extraction of Task-relevant Utterances from Doctor-Patient Conversations through Integration of Discourse Structure and Ontological Knowledge

    Authors: Sopan Khosla, Shikhar Vashishth, Jill Fain Lehman, Carolyn Rose

    Abstract: Information extraction from conversational data is particularly challenging because the task-centric nature of conversation allows for effective communication of implicit information by humans, but is challenging for machines. The challenges may differ between utterances depending on the role of the speaker within the conversation, especially when relevant expertise is distributed asymmetrically a…

    Submitted 21 June, 2022; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted as Long Paper to EMNLP 2020

  30. arXiv:2009.10815  [pdf, other]

    cs.CL

    Keeping Up Appearances: Computational Modeling of Face Acts in Persuasion Oriented Discussions

    Authors: Ritam Dutt, Rishabh Joshi, Carolyn Penstein Rose

    Abstract: The notion of face refers to the public self-image of an individual that emerges both from the individual's own actions as well as from the interaction with others. Modeling face and understanding its state changes throughout a conversation is critical to the study of maintenance of basic human needs in and through interaction. Grounded in the politeness theory of Brown and Levinson (1978), we pro…

    Submitted 23 September, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: To appear at Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP, 2020) as a full paper

  31. arXiv:2009.09150  [pdf, other]

    q-bio.PE cs.SI eess.SY

    Population Susceptibility Variation and Its Effect on Contagion Dynamics

    Authors: Christopher Rose, Andrew J. Medford, C. Franklin Goldsmith, Tejs Vegge, Joshua Weitz, Andrew A. Peterson

    Abstract: Susceptibility governs the dynamics of contagion. The classical SIR model is one of the simplest compartmental models of contagion spread, assuming a single shared susceptibility level. However, variation in susceptibility over a population can fundamentally affect the dynamics of contagion and thus the ultimate outcome of a pandemic. We develop mathematical machinery which explicitly considers su…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: 12 pages, 2 figures

  32. arXiv:2008.09266  [pdf, other]

    cs.CL

    Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

    Authors: Aakanksha Naik, Jill Lehman, Carolyn Rose

    Abstract: We tackle the task of adapting event extractors to new domains without labeled data, by aligning the marginal distributions of source and target domains. As a testbed, we create two new event extraction datasets using English texts from two medical domains: (i) clinical notes, and (ii) doctor-patient conversations. We test the efficacy of three marginal alignment techniques: (i) adversarial domain…

    Submitted 20 August, 2020; originally announced August 2020.

  33. arXiv:2005.12890  [pdf, other]

    cond-mat.stat-mech cond-mat.dis-nn cs.LG

    A reinforcement learning approach to rare trajectory sampling

    Authors: Dominic C. Rose, Jamie F. Mair, Juan P. Garrahan

    Abstract: Very often when studying non-equilibrium systems one is interested in analysing dynamical behaviour that occurs with very low probability, so called rare events. In practice, since rare events are by definition atypical, they are often difficult to access in a statistically significant way. What are required are strategies to "make rare events typical" so that they can be generated on demand. Here…

    Submitted 25 November, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: 55+6 pages, 7+1 figures

    Journal ref: New J. Phys. (2020)

  34. arXiv:2005.11355  [pdf, other]

    cs.CL

    Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation

    Authors: Aakanksha Naik, Carolyn Rosé

    Abstract: We tackle the task of building supervised event trigger identification models which can generalize better across domains. Our work leverages the adversarial domain adaptation (ADA) framework to introduce domain-invariance. ADA uses adversarial training to construct representations that are predictive for trigger identification, but not predictive of the example's domain. It requires no labeled dat…

    Submitted 22 May, 2020; originally announced May 2020.

    Comments: To appear at ACL 2020

  35. arXiv:2005.10682  [pdf, other]

    cs.IT

    Capacities and Optimal Input Distributions for Particle-Intensity Channels

    Authors: Nariman Farsad, Will Chuang, Andrea Goldsmith, Christos Komninakis, Muriel Médard, Christopher Rose, Lieven Vandenberghe, Emily E. Wesel, Richard D. Wesel

    Abstract: This work introduces the particle-intensity channel (PIC) as a model for molecular communication systems and characterizes the capacity limits as well as properties of the optimal (capacity-achieving) input distributions for such channels. In the PIC, the transmitter encodes information, in symbols of a given duration, based on the probability of particle release, and the receiver detects and deco…

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: arXiv admin note: text overlap with arXiv:1705.08040

  36. arXiv:2005.04171  [pdf, other]

    cs.NE cs.LG

    Hyperparameter Optimization in Binary Communication Networks for Neuromorphic Deployment

    Authors: Maryam Parsa, Catherine D. Schuman, Prasanna Date, Derek C. Rose, Bill Kay, J. Parker Mitchell, Steven R. Young, Ryan Dellana, William Severa, Thomas E. Potok, Kaushik Roy

    Abstract: Training neural networks for neuromorphic deployment is non-trivial. There have been a variety of approaches proposed to adapt back-propagation or back-propagation-like algorithms appropriate for training. Considering that these networks often have very different performance characteristics than traditional neural networks, it is often unclear how to set either the network topology or the hyperpar…

    Submitted 20 April, 2020; originally announced May 2020.

    Comments: 9 pages, 3 figures, To appear in WCCI 2020

  37. Improving Broad-Coverage Medical Entity Linking with Semantic Type Prediction and Large-Scale Datasets

    Authors: Shikhar Vashishth, Denis Newman-Griffis, Rishabh Joshi, Ritam Dutt, Carolyn Rose

    Abstract: Medical entity linking is the task of identifying and standardizing medical concepts referred to in an unstructured text. Most of the existing methods adopt a three-step approach of (1) detecting mentions, (2) generating a list of candidate concepts, and finally (3) picking the best concept among them. In this paper, we probe into alleviating the problem of overgeneration of candidate concepts in…

    Submitted 22 August, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: 44 pages

    Journal ref: Journal of Biomedical Informatics 2021

  38. arXiv:2002.05185  [pdf, other]

    cond-mat.stat-mech cs.LG quant-ph

    A Tensor Network Approach to Finite Markov Decision Processes

    Authors: Edward Gillman, Dominic C. Rose, Juan P. Garrahan

    Abstract: Tensor network (TN) techniques - often used in the context of quantum many-body physics - have shown promise as a tool for tackling machine learning (ML) problems. The application of TNs to ML, however, has mostly focused on supervised and unsupervised learning. Yet, with their direct connection to hidden Markov chains, TNs are also naturally suited to Markov decision processes (MDPs) which provid…

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 10 pages, 2 figures

  39. arXiv:1912.10204  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    A Machine Learning Framework for Authorship Identification From Texts

    Authors: Rahul Radhakrishnan Iyer, Carolyn Penstein Rose

    Abstract: Authorship identification is a process in which the author of a text is identified. Most known literary texts can easily be attributed to a certain author because they are, for example, signed. Yet sometimes we find unfinished pieces of work or a whole bunch of manuscripts with a wide variety of possible authors. In order to assess the importance of such a manuscript, it is vital to know who wrote…

    Submitted 21 December, 2019; originally announced December 2019.

    Comments: 8 pages, 2 figures

  40. arXiv:1909.12291  [pdf, other]

    cs.LG cs.DC stat.ML

    Exascale Deep Learning to Accelerate Cancer Research

    Authors: Robert M. Patton, J. Travis Johnston, Steven R. Young, Catherine D. Schuman, Thomas E. Potok, Derek C. Rose, Seung-Hwan Lim, Junghoon Chae, Le Hou, Shahira Abousamra, Dimitris Samaras, Joel Saltz

    Abstract: Deep learning, through the use of neural networks, has demonstrated remarkable ability to automate many routine tasks when presented with sufficient data for training. The neural network architecture (e.g. number of layers, types of layers, connections between layers, etc.) plays a critical role in determining what, if anything, the neural network is able to learn from the training data. The trend…

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: Submitted to IEEE Big Data

  41. arXiv:1905.02187  [pdf, other]

    cs.ET cond-mat.other q-bio.MN

    Principles of Information Storage in Small-Molecule Mixtures

    Authors: Jacob K. Rosenstein, Christopher Rose, Sherief Reda, Peter M. Weber, Eunsuk Kim, Jason Sello, Joseph Geiser, Eamonn Kennedy, Christopher Arcadia, Amanda Dombroski, Kady Oakley, Shui Ling Chen, Hokchhay Tann, Brenda M. Rubenstein

    Abstract: Molecular data systems have the potential to store information at dramatically higher density than existing electronic media. Some of the first experimental demonstrations of this idea have used DNA, but nature also uses a wide diversity of smaller non-polymeric molecules to preserve, process, and transmit information. In this paper, we present a general framework for quantifying chemical memory,…

    Submitted 6 May, 2019; originally announced May 2019.

  42. arXiv:1905.00422  [pdf, other]

    cs.CL cs.DL

    Time-series Insights into the Process of Passing or Failing Online University Courses using Neural-Induced Interpretable Student States

    Authors: Byungsoo Jeon, Eyal Shafran, Luke Breitfeller, Jason Levin, Carolyn P. Rose

    Abstract: This paper addresses a key challenge in Educational Data Mining, namely to model student behavioral trajectories in order to provide a means for identifying students most at-risk, with the goal of providing supportive interventions. While many forms of data including clickstream data or data from sensors have been used extensively in time series models for such purposes, in this paper we explore t…

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: 11 pages, conference

  43. arXiv:1901.03735  [pdf, other]

    cs.CL

    EQUATE: A Benchmark Evaluation Framework for Quantitative Reasoning in Natural Language Inference

    Authors: Abhilasha Ravichander, Aakanksha Naik, Carolyn Rose, Eduard Hovy

    Abstract: Quantitative reasoning is a higher-order reasoning skill that any intelligent natural language understanding system can reasonably be expected to handle. We present EQUATE (Evaluating Quantitative Understanding Aptitude in Textual Entailment), a new framework for quantitative reasoning in textual entailment. We benchmark the performance of 9 published NLI models on EQUATE, and find that on average…

    Submitted 26 October, 2019; v1 submitted 11 January, 2019; originally announced January 2019.

    Comments: To appear at CoNLL 2019

  44. arXiv:1810.05214  [pdf, ps, other]

    cs.ET cond-mat.other physics.chem-ph q-bio.MN

    Parallelized Linear Classification with Volumetric Chemical Perceptrons

    Authors: Christopher E. Arcadia, Hokchhay Tann, Amanda Dombroski, Kady Ferguson, Shui Ling Chen, Eunsuk Kim, Christopher Rose, Brenda M. Rubenstein, Sherief Reda, Jacob K. Rosenstein

    Abstract: In this work, we introduce a new type of linear classifier that is implemented in a chemical form. We propose a novel encoding technique which simultaneously represents multiple datasets in an array of microliter-scale chemical mixtures. Parallel computations on these datasets are performed as robotic liquid handling sequences, whose outputs are analyzed by high-performance liquid chromatography.…

    Submitted 11 October, 2018; originally announced October 2018.

    Comments: Accepted to 2018 IEEE International Conference on Rebooting Computing

  45. arXiv:1806.04552  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Combining Model-Free Q-Ensembles and Model-Based Approaches for Informed Exploration

    Authors: Sreecharan Sankaranarayanan, Raghuram Mandyam Annasamy, Katia Sycara, Carolyn Penstein Rosé

    Abstract: Q-Ensembles are a model-free approach where input images are fed into different Q-networks and exploration is driven by the assumption that uncertainty is proportional to the variance of the output Q-values obtained. They have been shown to perform relatively well compared to other exploration strategies. Further, model-based approaches, such as encoder-decoder models have been used successfully f…

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: Submitted to the Thirty-Second Annual Conference on Neural Information Processing Systems (NIPS 2018)

  46. arXiv:1806.00692  [pdf, ps, other]

    cs.CL

    Stress Test Evaluation for Natural Language Inference

    Authors: Aakanksha Naik, Abhilasha Ravichander, Norman Sadeh, Carolyn Rose, Graham Neubig

    Abstract: Natural language inference (NLI) is the task of determining if a natural language hypothesis can be inferred from a given premise in a justifiable manner. NLI was proposed as a benchmark task for natural language understanding. Existing models perform well at standard datasets for NLI, achieving impressive results across different genres of text. However, the extent to which these models understan…

    Submitted 13 June, 2018; v1 submitted 2 June, 2018; originally announced June 2018.

    Comments: COLING 2018

  47. arXiv:1804.00065  [pdf, other]

    cs.CL cs.CY

    Attentive Interaction Model: Modeling Changes in View in Argumentation

    Authors: Yohan Jo, Shivani Poddar, Byungsoo Jeon, Qinlan Shen, Carolyn P. Rose, Graham Neubig

    Abstract: We present a neural architecture for modeling argumentative dialogue that explicitly models the interplay between an Opinion Holder's (OH's) reasoning and a challenger's argument, with the goal of predicting if the argument successfully changes the OH's view. The model has two components: (1) vulnerable region detection, an attention model that identifies parts of the OH's reasoning that are amena…

    Submitted 18 April, 2018; v1 submitted 30 March, 2018; originally announced April 2018.

    Comments: 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

  48. arXiv:1707.04546  [pdf, other]

    cs.CL cs.SI

    Linguistic Markers of Influence in Informal Interactions

    Authors: Shrimai Prabhumoye, Samridhi Choudhary, Evangelia Spiliopoulou, Christopher Bogart, Carolyn Penstein Rose, Alan W Black

    Abstract: There has been a long standing interest in understanding 'Social Influence' both in Social Sciences and in Computational Linguistics. In this paper, we present a novel approach to study and measure interpersonal influence in daily interactions. Motivated by the basic principles of influence, we attempt to identify indicative linguistic features of the posts in an online knitting community. We pres…

    Submitted 14 July, 2017; originally announced July 2017.

    Comments: 10 pages, Accepted in NLP+CSS workshop for ACL (Association for Computational Linguistics) 2017

  49. arXiv:1705.08040  [pdf, other]

    cs.IT cs.ET

    Capacity of Molecular Channels with Imperfect Particle-Intensity Modulation and Detection

    Authors: Nariman Farsad, Christopher Rose, Muriel Médard, Andrea Goldsmith

    Abstract: This work introduces the particle-intensity channel (PIC) as a model for molecular communication systems and characterizes the properties of the optimal input distribution and the capacity limits for this system. In the PIC, the transmitter encodes information, in symbols of a given duration, based on the number of particles released, and the receiver detects and decodes the message based on the n…

    Submitted 22 May, 2017; originally announced May 2017.

    Comments: Accepted at IEEE International Symposium on Information Theory (ISIT)

  50. arXiv:1704.05543  [pdf]

    cs.CY cs.AI cs.CL cs.HC

    Coordinating Collaborative Chat in Massive Open Online Courses

    Authors: Gaurav Singh Tomar, Sreecharan Sankaranarayanan, Xu Wang, Carolyn Penstein Rosé

    Abstract: An earlier study of a collaborative chat intervention in a Massive Open Online Course (MOOC) identified negative effects on attrition stemming from a requirement for students to be matched with exactly one partner prior to beginning the activity. That study raised questions about how to orchestrate a collaborative chat intervention in a MOOC context in order to provide the benefit of synchronous s…

    Submitted 18 April, 2017; originally announced April 2017.

    Comments: 8 pages

    Journal ref: Proceedings of the International Conference of the Learning Sciences 2016, Volume 1, pp 607-614