Skip to main content

Showing 1–14 of 14 results for author: Carton, S

.
  1. arXiv:2406.05348  [pdf, other

    cs.CL cs.AI cs.IR

    Toward Reliable Ad-hoc Scientific Information Extraction: A Case Study on Two Materials Datasets

    Authors: Satanu Ghosh, Neal R. Brodnik, Carolina Frey, Collin Holgate, Tresa M. Pollock, Samantha Daly, Samuel Carton

    Abstract: We explore the ability of GPT-4 to perform ad-hoc schema based information extraction from scientific literature. We assess specifically whether it can, with a basic prompting approach, replicate two existing material science datasets, given the manuscripts from which they were originally manually extracted. We employ materials scientists to perform a detailed manual error analysis to assess where… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  2. arXiv:2404.10486  [pdf, other

    astro-ph.GA astro-ph.SR

    Discovery of a dormant 33 solar-mass black hole in pre-release Gaia astrometry

    Authors: Gaia Collaboration, P. Panuzzo, T. Mazeh, F. Arenou, B. Holl, E. Caffau, A. Jorissen, C. Babusiaux, P. Gavras, J. Sahlmann, U. Bastian, Ł. Wyrzykowski, L. Eyer, N. Leclerc, N. Bauchet, A. Bombrun, N. Mowlavi, G. M. Seabroke, D. Teyssier, E. Balbinot, A. Helmi, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne , et al. (390 additional authors not shown)

    Abstract: Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is exp… ▽ More

    Submitted 19 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 23 pages, accepted fro publication in A&A Letters. New version with small fixes

  3. arXiv:2310.06551  [pdf, other

    astro-ph.SR astro-ph.GA

    Gaia Focused Product Release: Sources from Service Interface Function image analysis -- Half a million new sources in omega Centauri

    Authors: Gaia Collaboration, K. Weingrill, A. Mints, J. Castañeda, Z. Kostrzewa-Rutkowska, M. Davidson, F. De Angeli, J. Hernández, F. Torra, M. Ramos-Lerate, C. Babusiaux, M. Biermann, C. Crowley, D. W. Evans, L. Lindegren, J. M. Martín-Fleitas, L. Palaversa, D. Ruz Mieres, K. Tisanić, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, A. Barbier , et al. (378 additional authors not shown)

    Abstract: Gaia's readout window strategy is challenged by very dense fields in the sky. Therefore, in addition to standard Gaia observations, full Sky Mapper (SM) images were recorded for nine selected regions in the sky. A new software pipeline exploits these Service Interface Function (SIF) images of crowded fields (CFs), making use of the availability of the full two-dimensional (2D) information. This ne… ▽ More

    Submitted 8 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Journal ref: A&A 680, A35 (2023)

  4. arXiv:2310.06295  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM

    Gaia Focused Product Release: A catalogue of sources around quasars to search for strongly lensed quasars

    Authors: Gaia Collaboration, A. Krone-Martins, C. Ducourant, L. Galluccio, L. Delchambre, I. Oreshina-Slezak, R. Teixeira, J. Braine, J. -F. Le Campion, F. Mignard, W. Roux, A. Blazere, L. Pegoraro, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, C. Babusiaux, A. Barbier, M. Biermann, O. L. Creevey, D. W. Evans, L. Eyer, R. Guerra , et al. (376 additional authors not shown)

    Abstract: Context. Strongly lensed quasars are fundamental sources for cosmology. The Gaia space mission covers the entire sky with the unprecedented resolution of $0.18$" in the optical, making it an ideal instrument to search for gravitational lenses down to the limiting magnitude of 21. Nevertheless, the previous Gaia Data Releases are known to be incomplete for small angular separations such as those ex… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 35 pages, 60 figures, accepted for publication by Astronomy and Astrophysics

    Journal ref: A&A 685, A130 (2024)

  5. arXiv:2310.06051  [pdf, other

    astro-ph.SR

    Gaia Focused Product Release: Radial velocity time series of long-period variables

    Authors: Gaia Collaboration, Gaia Collaboration, M. Trabucchi, N. Mowlavi, T. Lebzelter, I. Lecoeur-Taibi, M. Audard, L. Eyer, P. García-Lario, P. Gavras, B. Holl, G. Jevardat de Fombelle, K. Nienartowicz, L. Rimoldini, P. Sartoretti, R. Blomme, Y. Frémat, O. Marchal, Y. Damerdji, A. G. A. Brown, A. Guerrier, P. Panuzzo, D. Katz, G. M. Seabroke, K. Benson , et al. (382 additional authors not shown)

    Abstract: The third Gaia Data Release (DR3) provided photometric time series of more than 2 million long-period variable (LPV) candidates. Anticipating the publication of full radial-velocity (RV) in DR4, this Focused Product Release (FPR) provides RV time series for a selection of LPVs with high-quality observations. We describe the production and content of the Gaia catalog of LPV RV time series, and the… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 36 pages, 38 figures

  6. arXiv:2205.11551  [pdf, other

    cs.CL cs.AI

    Learning to Ignore Adversarial Attacks

    Authors: Yiming Zhang, Yangqiaoyu Zhou, Samuel Carton, Chenhao Tan

    Abstract: Despite the strong performance of current NLP models, they can be brittle against adversarial attacks. To enable effective learning against adversarial inputs, we introduce the use of rationale models that can explicitly learn to ignore attack tokens. We find that the rationale models can successfully ignore over 90% of attack tokens. This approach leads to consistent sizable improvements ($\sim$1… ▽ More

    Submitted 20 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: EACL 2023, code is available at https://github.com/ChicagoHAI/rationalization-robustness

  7. arXiv:2204.11788  [pdf, other

    cs.AI cs.HC cs.LG

    Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation

    Authors: Vivian Lai, Samuel Carton, Rajat Bhatnagar, Q. Vera Liao, Yunfeng Zhang, Chenhao Tan

    Abstract: Despite impressive performance in many benchmark datasets, AI models can still make mistakes, especially among out-of-distribution examples. It remains an open question how such imperfect models can be used effectively in collaboration with humans. Prior work has focused on AI assistance that helps people make individual high-stakes decisions, which is not scalable for a large amount of relatively… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 18 pages, 44 figures

  8. arXiv:2112.00071  [pdf, other

    cs.LG

    What to Learn, and How: Toward Effective Learning from Rationales

    Authors: Samuel Carton, Surya Kanoria, Chenhao Tan

    Abstract: Learning from rationales seeks to augment model prediction accuracy using human-annotated rationales (i.e. subsets of input tokens) that justify their chosen labels, often in the form of intermediate or multitask supervision. While intuitive, this idea has proven elusive in practice. We make two observations about human rationales via empirical analyses: 1) maximizing rationale supervision accurac… ▽ More

    Submitted 28 March, 2022; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: Accepted to ACL Findings 2022 13 pages, 8 figures

  9. arXiv:2010.04736  [pdf, other

    cs.CL cs.AI cs.CY cs.HC cs.LG

    Evaluating and Characterizing Human Rationales

    Authors: Samuel Carton, Anirudh Rathore, Chenhao Tan

    Abstract: Two main approaches for evaluating the quality of machine-generated rationales are: 1) using human rationales as a gold standard; and 2) automated metrics based on how rationales affect model behavior. An open question, however, is how human rationales fare with these automatic metrics. Analyzing a variety of datasets and models, we find that human rationales do not necessarily perform well on the… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 14 pages, 15 figures, to appear in EMNLP 2020. Code is available at https://github.com/BoulderDS/evaluating-human-rationales

  10. arXiv:2007.15823  [pdf, other

    cs.CL cs.AI cs.LG

    Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification

    Authors: Cristina Garbacea, Mengtian Guo, Samuel Carton, Qiaozhu Mei

    Abstract: Text simplification reduces the language complexity of professional content for accessibility purposes. End-to-end neural network models have been widely adopted to directly generate the simplified version of input text, usually functioning as a blackbox. We show that text simplification can be decomposed into a compact pipeline of tasks to ensure the transparency and explainability of the process… ▽ More

    Submitted 6 July, 2021; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: ACL 2021

  11. arXiv:2003.07370  [pdf, ps, other

    cs.HC cs.AI cs.CL cs.CY

    Harnessing Explanations to Bridge AI and Humans

    Authors: Vivian Lai, Samuel Carton, Chenhao Tan

    Abstract: Machine learning models are increasingly integrated into societally critical applications such as recidivism prediction and medical diagnosis, thanks to their superior predictive power. In these applications, however, full automation is often not desired due to ethical and legal concerns. The research community has thus ventured into develo** interpretable methods that explain machine prediction… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 4 pages, CHI 2020 Fair & Responsible AI Workshop

  12. arXiv:1901.00398  [pdf, other

    cs.CL cs.LG stat.ML

    Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation

    Authors: Cristina Garbacea, Samuel Carton, Shiyan Yan, Qiaozhu Mei

    Abstract: We conduct a large-scale, systematic study to evaluate the existing evaluation methods for natural language generation in the context of generating online product reviews. We compare human-based evaluators with a variety of automated evaluation procedures, including discriminative evaluators that measure how well machine-generated text can be distinguished from human-written text, as well as word… ▽ More

    Submitted 5 September, 2019; v1 submitted 2 January, 2019; originally announced January 2019.

  13. arXiv:1809.01499  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    Extractive Adversarial Networks: High-Recall Explanations for Identifying Personal Attacks in Social Media Posts

    Authors: Samuel Carton, Qiaozhu Mei, Paul Resnick

    Abstract: We introduce an adversarial method for producing high-recall explanations of neural text classifier decisions. Building on an existing architecture for extractive explanations via hard attention, we add an adversarial layer which scans the residual of the attention for remaining predictive signal. Motivated by the important domain of detecting personal attacks in social media comments, we addition… ▽ More

    Submitted 19 October, 2018; v1 submitted 31 August, 2018; originally announced September 2018.

    Comments: Accepted to EMNLP 2018 Code and data available at https://github.com/shcarton/rcnn

  14. arXiv:1611.09900  [pdf, other

    cs.CL

    Context-aware Natural Language Generation with Recurrent Neural Networks

    Authors: Jian Tang, Yifan Yang, Sam Carton, Ming Zhang, Qiaozhu Mei

    Abstract: This paper studied generating natural languages at particular contexts or situations. We proposed two novel approaches which encode the contexts into a continuous semantic representation and then decode the semantic representation into text sequences with recurrent neural networks. During decoding, the context information are attended through a gating mechanism, addressing the problem of long-rang… ▽ More

    Submitted 29 November, 2016; originally announced November 2016.