Showing 1–50 of 270 results for author: Schwartz, R

  1. arXiv:2407.00861  [pdf, other]

    cond-mat.soft physics.bio-ph physics.chem-ph

    Enantiospecificity in NMR Enabled by Chirality-Induced Spin Selectivity

    Authors: T. Georgiou, J. L. Palma, V. Mujica, S. Varela, M. Galante, V. Santamaría García, L. Mboning, R. N. Schwartz, G. Cuniberti, L. -S. Bouchard

    Abstract: Spin polarization in chiral molecules is a magnetic molecular response associated with electron transport and enantioselective bond polarization that occurs even in the absence of an external magnetic field. An unexpected finding by Santos and co-workers reported enantiospecific NMR responses in solid-state cross-polarization (CP) experiments, suggesting a possible additional contribution to the i…

    Submitted 2 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 102 pages, 16 figures, 40 tables

  2. arXiv:2406.15936  [pdf, other]

    cs.CY cs.AI cs.DB cs.LG

    An Automated SQL Query Grading System Using An Attention-Based Convolutional Neural Network

    Authors: Donald R. Schwartz, Pablo Rivas

    Abstract: Grading SQL queries can be a time-consuming, tedious and challenging task, especially as the number of student submissions increases. Several systems have been introduced in an attempt to mitigate these challenges, but those systems have their own limitations. This paper describes our novel approach to automating the process of grading SQL queries. Unlike previous approaches, we employ a unique co…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 12 pages, 8 figures, paper accepted at "The 18th International Conference on Frontiers in Education: Computer Science and Computer Engineering"

    ACM Class: I.2.6; H.2.3; K.3.2

  3. arXiv:2406.06386  [pdf, other]

    cs.CV

    FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography

    Authors: Julia Yang, Alina Jade Barnett, Jon Donnelly, Satvik Kishore, Jerry Fang, Fides Regina Schwartz, Chaofan Chen, Joseph Y. Lo, Cynthia Rudin

    Abstract: Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable ("black box") deep learning models are unsuitable and there is a call in these fields to make interpretable models. Recent work in interpretable computer vision provides transparency t…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures, Accepted for oral presentation at the 2024 CVPR Workshop on Domain adaptation, Explainability, Fairness in AI for Medical Image Analysis (DEF-AI-MIA)

  4. arXiv:2405.06563  [pdf, other]

    cs.CL

    What Can Natural Language Processing Do for Peer Review?

    Authors: Ilia Kuznetsov, Osama Mohammed Afzal, Koen Dercksen, Nils Dycke, Alexander Goldberg, Tom Hope, Dirk Hovy, Jonathan K. Kummerfeld, Anne Lauscher, Kevin Leyton-Brown, Sheng Lu, Mausam, Margot Mieskes, Aurélie Névéol, Danish Pruthi, Lizhen Qu, Roy Schwartz, Noah A. Smith, Thamar Solorio, Jingyan Wang, Xiaodan Zhu, Anna Rogers, Nihar B. Shah, Iryna Gurevych

    Abstract: The number of scientific articles produced every year is growing rapidly. Providing quality control over them is crucial for scientists and, ultimately, for the public good. In modern science, this process is largely delegated to peer review -- a distributed procedure in which each submission is evaluated by several independent experts in the field. Peer review is widely used, yet it is hard, time…

    Submitted 10 May, 2024; originally announced May 2024.

  5. arXiv:2405.04304  [pdf, other]

    cs.CL

    Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models

    Authors: Jonathan Mamou, Oren Pereg, Daniel Korat, Moshe Berchansky, Nadav Timor, Moshe Wasserblat, Roy Schwartz

    Abstract: Speculative decoding is commonly used for reducing the inference latency of large language models. Its effectiveness depends highly on the speculation lookahead (SL), the number of tokens generated by the draft model at each iteration. In this work we show that the common practice of using the same SL for all iterations (static SL) is suboptimal. We introduce DISCO (DynamIc SpeCulation lookahead Op…

    Submitted 23 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2405.02743  [pdf, other]

    cs.CL

    Beyond Performance: Quantifying and Mitigating Label Bias in LLMs

    Authors: Yuval Reif, Roy Schwartz

    Abstract: Large language models (LLMs) have shown remarkable adaptability to diverse tasks, by leveraging context prompts containing instructions, or minimal input-output examples. However, recent work revealed they also exhibit label bias -- an undesirable preference toward predicting certain answers over others. Still, detecting and measuring this bias reliably and at scale has remained relatively unexplo…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: NAACL 2024

  7. arXiv:2404.00725  [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

    Authors: Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi

    Abstract: It is a common belief that large language models (LLMs) are better than smaller-sized ones. However, larger models also require significantly more time and compute during inference. This begs the question: what happens when both models operate under the same budget? (e.g., compute, run-time). To address this question, we analyze code generation LLMs of various sizes and make comparisons such as ru…

    Submitted 31 March, 2024; originally announced April 2024.

  8. arXiv:2403.05735  [pdf, ps, other]

    math.DS math.CO

    The Flapping Birds in the Pentagram Zoo

    Authors: Richard Evan Schwartz

    Abstract: We study the $(k+1,k)$ diagonal map for $k=2,3,4,...$. We call this map $Δ_k$. The map $Δ_1$ is the pentagram map and $Δ_k$ is a generalization. $Δ_k$ does not preserve convexity, but we prove that $Δ_k$ preserves a subset $B_k$ of certain star-shaped polygons which we call $k$-birds. The action of $Δ_k$ on $B_k$ seems similar to the action of $Δ_1$ on the space of convex polygons. We show that so…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 60 pages, computer experiment inspired but mostly traditional math

  9. arXiv:2401.17356  [pdf, other]

    astro-ph.EP

    A New Database of Giant Impacts over a Wide Range of Masses and with Material Strength: A First Analysis of Outcomes

    Authors: Alexandre Emsenhuber, Erik Asphaug, Saverio Cambioni, Travis S. J. Gabriel, Stephen R. Schwartz, Robert E. Melikyan, C. Adeene Denton

    Abstract: In the late stage of terrestrial planet formation, planets are predicted to undergo pairwise collisions known as giant impacts. Here we present a high-resolution database of giant impacts for differentiated colliding bodies of iron-silicate composition, with target masses ranging from 10^-4 M_Earth up to super-Earths (5 M_Earth). We vary impactor-to-target mass ratio, core-mantle (iron-silicate) f…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in PSJ; Table 2 is available in full in an ancillary file

  10. arXiv:2401.06104  [pdf, other]

    cs.CL

    Transformers are Multi-State RNNs

    Authors: Matanel Oren, Michael Hassid, Nir Yarden, Yossi Adi, Roy Schwartz

    Abstract: Transformers are considered conceptually different from the previous generation of state-of-the-art NLP models - recurrent neural networks (RNNs). In this work, we demonstrate that decoder-only transformers can in fact be conceptualized as unbounded multi-state RNNs - an RNN variant with unlimited hidden state size. We further show that transformers can be converted into $\textit{bounded}$ multi-s…

    Submitted 18 June, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: preprint

  11. arXiv:2312.07275  [pdf, other]

    astro-ph.GA

    The SARAO MeerKAT 1.3 GHz Galactic Plane Survey

    Authors: S. Goedhart, W. D. Cotton, F. Camilo, M. A. Thompson, G. Umana, M. Bietenholz, P. A. Woudt, L. D. Anderson, C. Bordiu, D. A. H. Buckley, C. S. Buemi, F. Bufano, F. Cavallaro, H. Chen, J. O. Chibueze, D. Egbo, B. S. Frank, M. G. Hoare, A. Ingallinera, T. Irabor, R. C. Kraan-Korteweg, S. Kurapati, P. Leto, S. Loru, M. Mutale, et al. (105 additional authors not shown)

    Abstract: We present the SARAO MeerKAT Galactic Plane Survey (SMGPS), a 1.3 GHz continuum survey of almost half of the Galactic Plane (251° $\le l \le$ 358° and 2° $\le l \le$ 61° at $|b| \le 1.5°$). SMGPS is the largest, most sensitive and highest angular resolution 1 GHz survey of the Plane yet carried out, with an angular resolution of 8" and a broadband RMS sensitivity of $\sim$10--20 $μ$Jy/beam. Here we d…

    Submitted 2 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in MNRAS. The data release is live and links can be found in the Data Availability Statement in the paper

  12. arXiv:2311.15639  [pdf, other]

    cs.DS

    On Approximating Cutwidth and Pathwidth

    Authors: Nikhil Bansal, Dor Katzelnick, Roy Schwartz

    Abstract: We study graph ordering problems with a min-max objective. A classical problem of this type is cutwidth, where given a graph we want to order its vertices such that the number of edges crossing any point is minimized. We give a $ \log^{1+o(1)}(n)$ approximation for the problem, substantially improving upon the previous poly-logarithmic guarantees based on the standard recursive balanced partitioni…

    Submitted 12 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  13. arXiv:2311.02251  [pdf]

    cs.LG cs.AI eess.SP

    The Potential of Wearable Sensors for Assessing Patient Acuity in Intensive Care Unit (ICU)

    Authors: Jessica Sena, Mohammad Tahsin Mostafiz, Jiaqing Zhang, Andrea Davidson, Sabyasachi Bandyopadhyay, Ren Yuanfang, Tezcan Ozrazgat-Baslanti, Benjamin Shickel, Tyler Loftus, William Robson Schwartz, Azra Bihorac, Parisa Rashidi

    Abstract: Acuity assessments are vital in critical care settings to provide timely interventions and fair resource allocation. Traditional acuity scores rely on manual assessments and documentation of physiological states, which can be time-consuming, intermittent, and difficult to use for healthcare providers. Furthermore, such scores do not incorporate granular information such as patients' mobility level…

    Submitted 3 November, 2023; originally announced November 2023.

  14. arXiv:2311.00400  [pdf, other]

    cs.CV

    Open-Set Face Recognition with Maximal Entropy and Objectosphere Loss

    Authors: Rafael Henrique Vareto, Yu Linghu, Terrance E. Boult, William Robson Schwartz, Manuel Günther

    Abstract: Open-set face recognition characterizes a scenario where unknown individuals, unseen during the training and enrollment stages, appear on operation time. This work concentrates on watchlists, an open-set task that is expected to operate at a low False Positive Identification Rate and generally includes only a few enrollment samples per identity. We introduce a compact adapter network that benefits…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in Image and Vision Computing 2023

  15. arXiv:2310.18877  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition

    Authors: Isaac Slaughter, Craig Greenberg, Reva Schwartz, Aylin Caliskan

    Abstract: Previous work has established that a person's demographics and speech style affect how well speech processing models perform for them. But where does this bias come from? In this work, we present the Speech Embedding Association Test (SpEAT), a method for detecting bias in one type of model used for many speech tasks: pre-trained models. The SpEAT is inspired by word embedding association tests in…

    Submitted 28 October, 2023; originally announced October 2023.

  16. arXiv:2310.10000  [pdf, ps, other]

    math.MG

    The Crisscross and the Cup: Two Short 3-Twist Paper Moebius Bands

    Authors: Brienne Elisabeth Brown, Richard Evan Schwartz

    Abstract: We introduce the crisscross and the cup, both of which are immersed $3$-twist polygonal paper Moebius bands of aspect ratio $3$. We explain why these two objects are limits of smooth embedded paper Moebius bands having knotted boundary. We conjecture that any smooth embedded paper Moebius band with knotted boundary has aspect ratio greater than $3$. The crisscross is planar but the cup is not.

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: This is, in some sense, a sequel to arXiv 2308.12641. However, the material here is independent from the earlier paper

  17. arXiv:2309.14033  [pdf, ps, other]

    math.MG

    The Optimal Twisted Paper Cylinder

    Authors: Richard Evan Schwartz

    Abstract: A smooth twisted paper cylinder of aspect ratio $λ$ is an isometric embedding of a $1 \times λ$ cylinder into $\pmb{R}^3$ such that the images of the boundary components are linked. We prove that for such an object to exist we must have $λ>2$ and that this bound is sharp. We also show that any sequence of examples having aspect ratio converging to $2$ must converge, up to isometries, to a certain…

    Submitted 14 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: This paper is a sequel to my paper about the optimal paper Moebius band, arXiv:2308.12641. This version is the same as the previous one except that (1) I correct a misstatement about the uniqueness of the right isosceles cylinder and (2) I mention the connection to folded ribbon knots

  18. arXiv:2309.09784  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Rapid spin depolarization in the layered 2D Ruddlesden Popper perovskite (BA)(MA)PbI

    Authors: Michael Kempf, Philipp Moser, Maximilian Tomoscheit, Julian Schröer, Jean-Christophe Blancon, Rico Schwartz, Swarup Deb, Aditya Mohite, Andreas V. Stier, Jonathan J. Finley, Tobias Korn

    Abstract: We report temperature-dependent spectroscopy on the layered (n=4) two-dimensional (2D) Ruddlesden-Popper perovskite (BA)(MA)PbI. Helicity-resolved steady-state photoluminescence (PL) reveals no optical degree of polarization. Time-resolved PL shows a photocarrier lifetime on the order of nanoseconds. From simultaneously recorded time-resolved differential reflectivity (TR$Δ$R) and time-resolved K…

    Submitted 18 September, 2023; originally announced September 2023.

    Journal ref: ACS nano 2023

  19. arXiv:2309.05088  [pdf]

    cs.CY q-bio.OT

    Towards Trustworthy Artificial Intelligence for Equitable Global Health

    Authors: Hong Qin, Jude Kong, Wandi Ding, Ramneek Ahluwalia, Christo El Morr, Zeynep Engin, Jake Okechukwu Effoduh, Rebecca Hwa, Serena Jingchuan Guo, Laleh Seyyed-Kalantari, Sylvia Kiwuwa Muyingo, Candace Makeda Moore, Ravi Parikh, Reva Schwartz, Dongxiao Zhu, Xiaoqian Wang, Yiye Zhang

    Abstract: Artificial intelligence (AI) can potentially transform global health, but algorithmic bias can exacerbate social inequities and disparity. Trustworthy AI entails the intentional design to ensure equity and mitigate potential biases. To advance trustworthy AI in global health, we convened a workshop on Fairness in Machine Intelligence for Global Health (FairMI4GH). The event brought together a glob…

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 7 pages

  20. arXiv:2308.12641  [pdf, ps, other]

    math.MG

    The Optimal Paper Moebius Band

    Authors: Richard Evan Schwartz

    Abstract: In this paper we prove that a smooth embedded paper Moebius band must have aspect ratio greater than $\sqrt 3$. We also prove that any sequence of smooth embedded paper Moebius bands whose aspect ratio converges to $\sqrt 3$ must converge, up to isometry, to the famous triangular Moebius band. These results answer the minimum aspect ratio question discussed by W. Wunderlich in 1962 and prove the m…

    Submitted 21 May, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: The very polished revision is based partly on feedback I got from Annals of Mathematics referees. Most likely this paper will be published in the Annals, though it is still pending. Some of the commentary and auxiliary material here might get chopped out of the published version

  21. arXiv:2308.12371  [pdf, other]

    cs.CV cs.AI cs.LG

    Open-set Face Recognition with Neural Ensemble, Maximal Entropy Loss and Feature Augmentation

    Authors: Rafael Henrique Vareto, Manuel Günther, William Robson Schwartz

    Abstract: Open-set face recognition refers to a scenario in which biometric systems have incomplete knowledge of all existing subjects. Therefore, they are expected to prevent face samples of unregistered subjects from being identified as previously enrolled identities. This watchlist context adds an arduous requirement that calls for the dismissal of irrelevant faces by focusing mainly on subjects of inter…

    Submitted 23 August, 2023; originally announced August 2023.

    Journal ref: 36th Conference on Graphics, Patterns and Images (SIBGRAPI 2023)

  22. arXiv:2308.07746  [pdf, other]

    cs.DS

    A Tight Competitive Ratio for Online Submodular Welfare Maximization

    Authors: Amit Ganz, Pranav Nuti, Roy Schwartz

    Abstract: In this paper we consider the online Submodular Welfare (SW) problem. In this problem we are given $n$ bidders each equipped with a general (not necessarily monotone) submodular utility and $m$ items that arrive online. The goal is to assign each item, once it arrives, to a bidder or discard it, while maximizing the sum of utilities. When an adversary determines the items' arrival order we present…

    Submitted 15 August, 2023; originally announced August 2023.

  23. Open-set Face Recognition using Ensembles trained on Clustered Data

    Authors: Rafael Henrique Vareto, William Robson Schwartz

    Abstract: Open-set face recognition describes a scenario where unknown subjects, unseen during the training stage, appear at test time. Not only does it require methods that accurately identify individuals of interest, but it also demands approaches that effectively deal with unfamiliar faces. This work details a scalable open-set face identification approach to galleries composed of hundreds and thousands of subj…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: [Original paper title: Unconstrained Face Identification using Ensembles trained on Clustered Data] [2020 IEEE International Joint Conference on Biometrics (IJCB)] [https://ieeexplore.ieee.org/document/9304882]

  24. arXiv:2308.03516  [pdf, other]

    cs.DS

    An Improved Approximation Algorithm for the Max-$3$-Section Problem

    Authors: Dor Katzelnick, Aditya Pillai, Roy Schwartz, Mohit Singh

    Abstract: We consider the Max-$3$-Section problem, where we are given an undirected graph $ G=(V,E)$ equipped with non-negative edge weights $w :E\rightarrow \mathbb{R}_+$ and the goal is to find a partition of $V$ into three equisized parts while maximizing the total weight of edges crossing between different parts. Max-$3$-Section is closely related to other well-studied graph partitioning problems, e.g.,…

    Submitted 7 August, 2023; originally announced August 2023.

  25. arXiv:2307.12259  [pdf, ps, other]

    math.DS

    Symplectic Tiling Billiards, Planar Linkages, and Hyperbolic Geometry

    Authors: Richard Evan Schwartz

    Abstract: The purpose of this paper is to unite two games, symplectic billiards and tiling billiards. The new game is called symplectic tiling billiards. I will prove a result about periodic orbits of symplectic tiling billiards in a very special case and then show how this result is related to planar linkages and hyperbolic geometry.

    Submitted 4 November, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: This version is quite similar to the previous one. I eliminated a typo and also (in Sect 3.1) I added a remark describing a very nice observation of Jannik Westermann and some forthcoming joint work of mine and Jannik's that is based on this observation

  26. arXiv:2307.04532  [pdf, other]

    cs.CV cs.AI cs.CL eess.AS

    Read, Look or Listen? What's Needed for Solving a Multimodal Dataset

    Authors: Netta Madvil, Yonatan Bitton, Roy Schwartz

    Abstract: The prevalence of large-scale multimodal datasets presents unique challenges in assessing dataset quality. We propose a two-step method to analyze multimodal datasets, which leverages a small seed of human annotation to map each multimodal instance to the modalities required to process it. Our method sheds light on the importance of different modalities in datasets, as well as the relationship bet…

    Submitted 6 July, 2023; originally announced July 2023.

  27. arXiv:2306.16900  [pdf, other]

    cs.CL

    Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research

    Authors: Ji-Ung Lee, Haritz Puerto, Betty van Aken, Yuki Arase, Jessica Zosa Forde, Leon Derczynski, Andreas Rücklé, Iryna Gurevych, Roy Schwartz, Emma Strubell, Jesse Dodge

    Abstract: Many recent improvements in NLP stem from the development and use of large pre-trained language models (PLMs) with billions of parameters. Large model sizes make computational cost one of the main limiting factors for training and evaluating such models, and have raised severe concerns about the sustainability, reproducibility, and inclusiveness of researching PLMs. These concerns are often based…

    Submitted 9 November, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

  28. Morphosyntactic probing of multilingual BERT models

    Authors: Judit Acs, Endre Hamerlik, Roy Schwartz, Noah A. Smith, Andras Kornai

    Abstract: We introduce an extensive dataset for multilingual probing of morphological information in language models (247 tasks across 42 languages from 10 families), each consisting of a sentence with a target word and a morphological tag as the desired label, derived from the Universal Dependencies treebanks. We find that pre-trained Transformer models (mBERT and XLM-RoBERTa) learn features that attain st…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: to appear in the Journal of Natural Language Engineering

  29. arXiv:2306.02307  [pdf, other]

    cs.CL cs.AI cs.LG

    Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings

    Authors: Daniel Rotem, Michael Hassid, Jonathan Mamou, Roy Schwartz

    Abstract: Adaptive inference is a simple method for reducing inference costs. The method works by maintaining multiple classifiers of different capacities, and allocating resources to each test instance according to its difficulty. In this work, we compare the two main approaches for adaptive inference, Early-Exit and Multi-Model, when training data is limited. First, we observe that for models with the sam…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Proceedings of ACL 2023

  30. arXiv:2305.18917  [pdf, other]

    cs.CL

    Fighting Bias with Bias: Promoting Model Robustness by Amplifying Dataset Biases

    Authors: Yuval Reif, Roy Schwartz

    Abstract: NLP models often rely on superficial cues known as dataset biases to achieve impressive performance, and can fail on examples where these biases do not hold. Recent work sought to develop robust, unbiased models by filtering biased examples from training sets. In this work, we argue that such filtering can obscure the true capabilities of models to overcome biases, which might never be removed in…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  31. Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers

    Authors: Valfride Nascimento, Rayson Laroca, Jorge de A. Lambert, William Robson Schwartz, David Menotti

    Abstract: Recent years have seen significant developments in the field of License Plate Recognition (LPR) through the integration of deep learning techniques and the increasing availability of training data. Nevertheless, reconstructing license plates (LPs) from low-resolution (LR) surveillance footage remains challenging. To address this issue, we introduce a Single-Image Super-Resolution (SISR) approach t…

    Submitted 26 May, 2023; originally announced May 2023.

    Journal ref: Computers & Graphics, vol. 113, pp. 69-76, 2023

  32. arXiv:2305.13009  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Textually Pretrained Speech Language Models

    Authors: Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Defossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi

    Abstract: Speech language models (SpeechLMs) process and generate acoustic data only, without textual supervision. In this work, we propose TWIST, a method for training SpeechLMs using a warm-start from a pretrained textual language model. We show using both automatic and human evaluations that TWIST outperforms a cold-start SpeechLM across the board. We empirically analyze the effect of different model de…

    Submitted 30 January, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  33. MCTrans++: A 0-D Model for Centrifugal Mirrors

    Authors: Nick R. Schwartz, Ian G. Abel, Adil B. Hassam, Myles Kelly, Carlos A. Romero-Talamas

    Abstract: The centrifugal mirror confinement scheme incorporates supersonic rotation of a plasma into a magnetic mirror device. This concept has been shown experimentally to drastically decrease parallel losses and increase plasma stability as compared to prior axisymmetric mirrors. MCTrans++ is a 0D scoping tool which rapidly models experimental operating points in the Centrifugal Mirror Fusion Experiment…

    Submitted 11 March, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Submitted to Journal of Plasma Physics

  34. arXiv:2303.07274  [pdf, other]

    cs.CV cs.AI cs.CL

    Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images

    Authors: Nitzan Bitton-Guetta, Yonatan Bitton, Jack Hessel, Ludwig Schmidt, Yuval Elovici, Gabriel Stanovsky, Roy Schwartz

    Abstract: Weird, unusual, and uncanny images pique the curiosity of observers because they challenge commonsense. For example, an image released during the 2022 world cup depicts the famous soccer stars Lionel Messi and Cristiano Ronaldo playing chess, which playfully violates our expectation that their competition should occur on the football field. Humans can easily recognize and interpret these unconvent…

    Submitted 12 August, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Accepted to ICCV 2023. Website: whoops-benchmark.github.io

  35. arXiv:2303.02300  [pdf, other]

    cond-mat.mes-hall quant-ph

    Quantum Gates Between Mesoscopic Spin Ensembles

    Authors: Mohamad Niknam, Robert N. Schwartz, Louis-S. Bouchard

    Abstract: Quantum algorithmics with single spins poses serious technological challenges such as precision fabrication, rapid decoherence, atomic-scale addressing and readout. To circumvent atomic-scale challenges, we examine the case of fully polarized mesoscopic spin ensembles (spin-coherent states) whose total angular momenta states map to qudit submanifolds. We show that in the limit where the size of th…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 15 pages, 3 figures

    Journal ref: Phys. Rev. A 107, 032601 (2023)

  36. Successful Kinetic Impact into an Asteroid for Planetary Defense

    Authors: R. Terik Daly, Carolyn M. Ernst, Olivier S. Barnouin, Nancy L. Chabot, Andrew S. Rivkin, Andrew F. Cheng, Elena Y. Adams, Harrison F. Agrusa, Elisabeth D. Abel, Amy L. Alford, Erik I. Asphaug, Justin A. Atchison, Andrew R. Badger, Paul Baki, Ronald-L. Ballouz, Dmitriy L. Bekker, Julie Bellerose, Shyam Bhaskaran, Bonnie J. Buratti, Saverio Cambioni, Michelle H. Chen, Steven R. Chesley, George Chiu, Gareth S. Collins, Matthew W. Cox, et al. (76 additional authors not shown)

    Abstract: While no known asteroid poses a threat to Earth for at least the next century, the catalog of near-Earth asteroids is incomplete for objects whose impacts would produce regional devastation. Several approaches have been proposed to potentially prevent an asteroid impact with Earth by deflecting or disrupting an asteroid. A test of kinetic impact technology was identified as the highest priority sp…

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted by Nature

  37. Ejecta from the DART-produced active asteroid Dimorphos

    Authors: Jian-Yang Li, Masatoshi Hirabayashi, Tony L. Farnham, Jessica M. Sunshine, Matthew M. Knight, Gonzalo Tancredi, Fernando Moreno, Brian Murphy, Cyrielle Opitom, Steve Chesley, Daniel J. Scheeres, Cristina A. Thomas, Eugene G. Fahnestock, Andrew F. Cheng, Linda Dressel, Carolyn M. Ernst, Fabio Ferrari, Alan Fitzsimmons, Simone Ieva, Stavro L. Ivanovski, Teddy Kareta, Ludmilla Kolokolova, Tim Lister, Sabina D. Raducan, Andrew S. Rivkin, et al. (39 additional authors not shown)

    Abstract: Some active asteroids have been proposed to be the result of impact events. Because active asteroids are generally discovered serendipitously only after their tail formation, the process of the impact ejecta evolving into a tail has never been directly observed. NASA's Double Asteroid Redirection Test (DART) mission, apart from having successfully changed the orbital period of Dimorphos, demonstra…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: accepted by Nature

  38. arXiv:2301.05090  [pdf, ps, other]

    math.MG

    Divide and Conquer: A Distributed Approach to Five Point Energy Minimization

    Authors: Richard Evan Schwartz

    Abstract: This work rigorously verifies the phase transition in 5-point energy minimization first observed by Melnyk-Knop-Smith in 1977. More precisely, we prove that there is a constant S = [15+24/512,15+25/512] such that the triangular bi-pyramid is the energy minimizer with respect to the s-power law potential for all s in (0,S) and some pyramid with square base is the unique minimizer for all s in (S,15…

    Submitted 22 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: 77 pages long. This is a computer assisted proof, and the code is available from my website as indicated in the text. I further shortened the proof (and removed typos). My goal is to make this thing as short and as easily verifiable as possible

  39. arXiv:2212.04542  [pdf, other]

    cs.CV cs.AI cs.CL

    VASR: Visual Analogies of Situation Recognition

    Authors: Yonatan Bitton, Ron Yosef, Eli Strugo, Dafna Shahaf, Roy Schwartz, Gabriel Stanovsky

    Abstract: A core process in human cognition is analogical mapping: the ability to identify a similar relational structure between different situations. We introduce a novel task, Visual Analogies of Situation Recognition, adapting the classical word-analogy task into the visual domain. Given a triplet of images, the task is to select an image candidate B' that completes the analogy (A to A' is like B to wha…

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI 2023. Website: https://vasr-dataset.github.io/

  40. arXiv:2211.03495  [pdf, other]

    cs.CL cs.LG

    How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

    Authors: Michael Hassid, Hao Peng, Daniel Rotem, Jungo Kasai, Ivan Montero, Noah A. Smith, Roy Schwartz

    Abstract: The attention mechanism is considered the backbone of the widely-used Transformer architecture. It contextualizes the input by computing input-specific attention matrices. We find that this mechanism, while powerful and elegant, is not as important as typically thought for pretrained language models. We introduce PAPA, a new probing method that replaces the input-dependent attention matrices with…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Findings of EMNLP 2022

  41. Combining Attention Module and Pixel Shuffle for License Plate Super-Resolution

    Authors: Valfride Nascimento, Rayson Laroca, Jorge de A. Lambert, William Robson Schwartz, David Menotti

    Abstract: The License Plate Recognition (LPR) field has made impressive advances in the last decade due to novel deep learning approaches combined with the increased availability of training data. However, it still has some open issues, especially when the data come from low-resolution (LR) and low-quality images/videos, as in surveillance systems. This work focuses on license plate (LP) reconstruction in L…

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  42. arXiv:2209.11873  [pdf]

    astro-ph.EP

    After DART: Using the first full-scale test of a kinetic impactor to inform a future planetary defense mission

    Authors: Thomas S. Statler, Sabina D. Raducan, Olivier S. Barnouin, Mallory E. DeCoster, Steven R. Chesley, Brent Barbee, Harrison F. Agrusa, Saverio Cambioni, Andrew F. Cheng, Elisabetta Dotto, Siegfried Eggl, Eugene G. Fahnestock, Fabio Ferrari, Dawn Graninger, Alain Herique, Isabel Herreros, Masatoshi Hirabayashi, Stavro Ivanovski, Martin Jutzi, Özgür Karatekin, Alice Lucchetti, Robert Luther, Rahil Makadia, Francesco Marzari, Patrick Michel, et al. (16 additional authors not shown)

    Abstract: NASA's Double Asteroid Redirection Test (DART) is the first full-scale test of an asteroid deflection technology. Results from the hypervelocity kinetic impact and Earth-based observations, coupled with LICIACube and the later Hera mission, will result in measurement of the momentum transfer efficiency accurate to ~10% and characterization of the Didymos binary system. But DART is a single experim…

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: 30 pages, 7 figures. Planetary Science Journal, in press, accepted 2022 September 22

  43. arXiv:2209.06659  [pdf]

    astro-ph.EP

    Effects of impact and target parameters on the results of a kinetic impactor: predictions for the Double Asteroid Redirection Test (DART) mission

    Authors: Angela M. Stickle, Mallory E. DeCoster, Christoph Burger, Wendy K. Caldwell, Dawn Graninger, Kathryn M. Kumamoto, Robert Luther, Jens Ormö, Sabina Raducan, Emma Rainey, Christoph M. Schäfer, James D. Walker, Yun Zhang, Patrick Michel, J. Michael Owen, Olivier Barnouin, Andy F. Cheng, Sidney Cochron, Gareth S. Collins, Thomas M. Davison, Elisabetta Dotto, Fabio Ferrari, M. Isabel Herreros, Stavro L. Ivanovski, Martin Jutzi, et al. (8 additional authors not shown)

    Abstract: The Double Asteroid Redirection Test (DART) spacecraft will impact into the asteroid Dimorphos on September 26, 2022 as a test of the kinetic impactor technique for planetary defense. The efficiency of the deflection following a kinetic impactor can be represented using the momentum enhancement factor, Beta, which is dependent on factors such as impact geometry and the specific target material pro…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted to PSJ Didymos-DART Focus Issue

  44. arXiv:2209.00099  [pdf, other]

    cs.CL

    Efficient Methods for Natural Language Processing: A Survey

    Authors: Marcos Treviso, Ji-Ung Lee, Tianchu Ji, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Colin Raffel, Pedro H. Martins, André F. T. Martins, Jessica Zosa Forde, Peter Milder, Edwin Simpson, Noam Slonim, Jesse Dodge, Emma Strubell, Niranjan Balasubramanian, Leon Derczynski, Iryna Gurevych, Roy Schwartz

    Abstract: Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require few…

    Submitted 24 March, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

    Comments: Accepted at TACL, pre publication version

  45. arXiv:2208.05254  [pdf, ps, other]

    math.CO math.NT

    Continued Fractions and the 4-Color Theorem

    Authors: Richard Evan Schwartz

    Abstract: We study the geometry of some proper 4-colorings of the vertices of sphere triangulations with degree sequence 6,...,6,2,2,2. Such triangulations are the simplest examples which have non-negative combinatorial curvature. The examples we construct, which are roughly extremal in some sense, are based on a novel geometric interpretation of continued fractions. We also present a conjectural sharp "iso…

    Submitted 29 December, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: This version is the same as the previous one, except that I edited it to remove typos and other little glitches

  46. arXiv:2207.12576  [pdf, other]

    cs.CL cs.AI cs.CV cs.HC

    WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models

    Authors: Yonatan Bitton, Nitzan Bitton Guetta, Ron Yosef, Yuval Elovici, Mohit Bansal, Gabriel Stanovsky, Roy Schwartz

    Abstract: While vision-and-language models perform well on tasks such as visual question answering, they struggle when it comes to basic human commonsense reasoning skills. In this work, we introduce WinoGAViL: an online game of vision-and-language associations (e.g., between werewolves and a full moon), used as a dynamic evaluation benchmark. Inspired by the popular card game Codenames, a spymaster gives a…

    Submitted 11 October, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted to NeurIPS 2022, Datasets and Benchmarks. Website: https://winogavil.github.io/

  47. Predictions for the Dynamical States of the Didymos System before and after the Planned DART Impact

    Authors: Derek C. Richardson, Harrison F. Agrusa, Brent Barbee, William F. Bottke, Andrew F. Cheng, Siegfried Eggl, Fabio Ferrari, Masatoshi Hirabayashi, Özgür Karatekin, Jay McMahon, Stephen R. Schwartz, Ronald-Louis Ballouz, Adriano Campo Bagatin, Elisabetta Dotto, Eugene G. Fahnestock, Oscar Fuentes-Muñoz, Ioannis Gkolias, Douglas P. Hamilton, Seth A. Jacobson, Martin Jutzi, Josh Lyzhoft, Rahil Makadia, Alex J. Meyer, Patrick Michel, Ryota Nakano, et al. (11 additional authors not shown)

    Abstract: NASA's Double Asteroid Redirection Test (DART) spacecraft is planned to impact the natural satellite of (65803) Didymos, Dimorphos, around 23:14 UTC on 26 September 2022, causing a reduction in its orbital period that will be measurable with ground-based observations. This test of kinetic impactor technology will provide the first estimate of the momentum transfer enhancement factor $β$ at a reali…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: 23 pages, 13 figures, published in PSJ

    Journal ref: Planet. Sci. J. 3 157 (2022)

  48. arXiv:2206.09860  [pdf, other]

    cs.CL

    Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender Bias

    Authors: Yarden Tal, Inbal Magar, Roy Schwartz

    Abstract: The size of pretrained models is increasing, and so is their performance on a variety of NLP tasks. However, as their memorization capacity grows, they might pick up more social biases. In this work, we examine the connection between model size and its gender bias (specifically, occupational gender bias). We measure bias in three masked language model families (RoBERTa, DeBERTa, and T5) in two set…

    Submitted 20 June, 2022; originally announced June 2022.

  49. arXiv:2206.05229  [pdf, other]

    cs.LG

    Measuring the Carbon Intensity of AI in Cloud Instances

    Authors: Jesse Dodge, Taylor Prewitt, Remi Tachet Des Combes, Erika Odmark, Roy Schwartz, Emma Strubell, Alexandra Sasha Luccioni, Noah A. Smith, Nicole DeCario, Will Buchanan

    Abstract: By providing unprecedented access to computational resources, cloud computing has enabled rapid growth in technologies such as machine learning, the computational demands of which incur a high energy cost and a commensurate carbon footprint. As a result, recent scholarship has called for better estimates of the greenhouse gas impact of AI: data scientists today do not have easy or reliable access…

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: In ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT) 2022

  50. arXiv:2205.00595  [pdf, ps, other]

    math.AT

    Trisecting the 9-vertex complex projective plane

    Authors: Richard Evan Schwartz

    Abstract: In this paper we will give a short and direct proof that Wolfgang Kuehnel's 9-vertex triangulation of the complex projective plane really is the complex projective plane. The idea of our proof is to recall the trisection of the complex projective plane into 3 bi-disks and then to see this trisection inside a symmetry-breaking subdivision of the triangulation. Following the basic proof, we will ela…

    Submitted 4 July, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: This is the version that will appear as an article in the Mathematical Intelligencer. I revised the paper according to the many helpful comments of a referee who had a supernatural understanding of this complex