Search | arXiv e-print repository

arXiv:2405.05437 [pdf, other]

Measurement of Coherent Vibrational Dynamics with X-ray Transient Absorption Spectroscopy Simultaneously at the Carbon K- and Chlorine L$_{2,3}$- Edges

Authors: Andrew D. Ross, Diptarka Hait, Valeriu Scutelnic, Daniel M. Neumark, Martin Head-Gordon, Stephen R. Leone

Abstract: X-ray Transient Absorption Spectroscopy near the carbon K-edge (1s, $\sim$ 285 eV) and chlorine L$_{2,3}$ edges (2p, $\sim$ 200 eV) is used to study the nuclear dynamics of CCl$_4$ vibrationally activated by impulsive stimulated Raman scattering with a few-cycle 800 nm pump pulse. The totally symmetric stretching mode leads to a strong response in the inner-shell spectra, with the concerted elongation (contraction) in bond lengths leading to a red (blue) shift in the X-ray absorption energies associated with core-to-antibonding excitations. The relative slopes of the potential energy surfaces associated with the relevant core-excited states along the symmetric stretching mode are experimentally measured and compared to results from restricted open-shell Kohn-Sham calculations. A combination of experiment and theory indicates that the slope of the core-excited potential energy surface vs totally symmetric bond elongation is $-11.1 \pm 0.8$ eV/Å for the Cl 2p$\to7a_1^*$ excitation, $-9.0\pm0.6$ eV/Å for the Cl 2p$\to8t_2^*$ excitation and $-5.2\pm 0.4$ eV/Å for the C 1s$\to8t_2^*$ excitation, to 95% confidence. The much larger slopes for the Cl 2p excitations compared to the C 1s state are attributed to greater contributions from Cl to the $7a_1^*$ or $8t_2^*$ antibonding orbitals to which the inner-shell electrons are being excited. No net displacement of the center of the vibrational wavefunction along the other vibrational modes is induced by the pump pulse, leading to absence of transient signal. The results highlight the ability of X-ray Transient Absorption Spectroscopy to reveal nuclear dynamics involving tiny ($<0.01$ Å) atomic displacements and also provide direct measurement of forces on core-excited potential energy surfaces.

Submitted 8 May, 2024; originally announced May 2024.

Comments: ADR and DH contributed equally to this work

arXiv:2405.04552 [pdf, ps, other]

Nonstandard arguments for results about infinite systems of equations in infinitely many variables

Authors: David A. Ross

Abstract: Short nonstandard proofs are given for some results about infinite systems of equations in infinitely many variables.

Submitted 6 May, 2024; originally announced May 2024.

MSC Class: 26E35; 46S20; 15A06; 40H05; 46A45; 12E12

arXiv:2405.01766 [pdf, ps, other]

Multiplicative polynomial equations in infinitely many variables

Authors: Melvyn B. Nathanson, David A. Ross

Abstract: This paper describes infinite sets of polynomial equations in infinitely many variables with the property that the existence of a solution or even an approximate solution for every finite subset of the equations implies the existence of a solution for the infinite set of equations.

Submitted 2 May, 2024; originally announced May 2024.

Comments: 16 pages

MSC Class: 12D10; 12E12; 15A06; 40H05; 46A45; 54B10; 54C30

arXiv:2404.02388 [pdf, other]

CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation

Authors: Townim Faisal Chowdhury, Kewen Liao, Vu Minh Hieu Phan, Minh-Son To, Yutong Xie, Kevin Hung, David Ross, Anton van den Hengel, Johan W. Verjans, Zhibin Liao

Abstract: Deep Neural Networks (DNNs) are widely used for visual classification tasks, but their complex computation process and black-box nature hinder decision transparency and interpretability. Class activation maps (CAMs) and recent variants provide ways to visually explain the DNN decision-making process by displaying 'attention' heatmaps of the DNNs. Nevertheless, the CAM explanation only offers relative attention information, that is, on an attention heatmap, we can interpret which image region is more or less important than the others. However, these regions cannot be meaningfully compared across classes, and the contribution of each region to the model's class prediction is not revealed. To address these challenges that ultimately lead to better DNN Interpretation, in this paper, we propose CAPE, a novel reformulation of CAM that provides a unified and probabilistically meaningful assessment of the contributions of image regions. We quantitatively and qualitatively compare CAPE with state-of-the-art CAM methods on CUB and ImageNet benchmark datasets to demonstrate enhanced interpretability. We also test on a cytology imaging dataset depicting a challenging Chronic Myelomonocytic Leukemia (CMML) diagnosis problem. Code is available at: https://github.com/AIML-MED/CAPE.

Submitted 4 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

arXiv:2403.03212 [pdf, other]

Performance of a modular ton-scale pixel-readout liquid argon time projection chamber

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, et al. (1340 additional authors not shown)

Abstract: The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations.

Submitted 5 March, 2024; originally announced March 2024.

Comments: 47 pages, 41 figures

Report number: FERMILAB-PUB-24-0073-LBNF

arXiv:2403.01332 [pdf, other]

Chaining thoughts and LLMs to learn DNA structural biophysics

Authors: Tyler D. Ross, Ashwin Gopinath

Abstract: The future development of an AI scientist, a tool that is capable of integrating a variety of experimental data and generating testable hypotheses, holds immense potential. So far, bespoke machine learning models have been created to specialize in singular scientific tasks, but otherwise lack the flexibility of a general purpose model. Here, we show that a general purpose large language model, chatGPT 3.5-turbo, can be fine-tuned to learn the structural biophysics of DNA. We find that both fine-tuning models to return chain-of-thought responses and chaining together models fine-tuned for subtasks have an enhanced ability to analyze and design DNA sequences and their structures.

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2403.01248 [pdf, other]

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Authors: Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi

Abstract: This paper introduces SceneCraft, a Large Language Model (LLM) Agent converting text descriptions into Blender-executable Python scripts which render complex scenes with up to a hundred 3D assets. This process requires complex spatial planning and arrangement. We tackle these challenges through a combination of advanced abstraction, strategic planning, and library learning. SceneCraft first models a scene graph as a blueprint, detailing the spatial relationships among assets in the scene. SceneCraft then writes Python scripts based on this graph, translating relationships into numerical constraints for asset layout. Next, SceneCraft leverages the perceptual strengths of vision-language foundation models like GPT-V to analyze rendered images and iteratively refine the scene. On top of this process, SceneCraft features a library learning mechanism that compiles common script functions into a reusable library, facilitating continuous self-improvement without expensive LLM parameter tuning. Our evaluation demonstrates that SceneCraft surpasses existing LLM-based agents in rendering complex scenes, as shown by its adherence to constraints and favorable human assessments. We also showcase the broader application potential of SceneCraft by reconstructing detailed 3D scenes from the Sintel movie and guiding a video generative model with generated scenes as intermediary control signal.

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2402.13217 [pdf, other]

VideoPrism: A Foundational Visual Encoder for Video Understanding

Authors: Long Zhao, Nitesh B. Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

Abstract: We introduce VideoPrism, a general-purpose video encoder that tackles diverse video understanding tasks with a single frozen model. We pretrain VideoPrism on a heterogeneous corpus containing 36M high-quality video-caption pairs and 582M video clips with noisy parallel text (e.g., ASR transcripts). The pretraining approach improves upon masked autoencoding by global-local distillation of semantic video embeddings and a token shuffling scheme, enabling VideoPrism to focus primarily on the video modality while leveraging the invaluable text associated with videos. We extensively test VideoPrism on four broad groups of video understanding tasks, from web video question answering to CV for science, achieving state-of-the-art performance on 31 out of 33 video understanding benchmarks.

Submitted 15 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

Comments: Accepted to ICML 2024. v2: added retrieval results on MSRVTT (1K-A), more data analyses, and ablation studies

arXiv:2402.01568 [pdf, other]

Doping Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar Es-sghir, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, et al. (1300 additional authors not shown)

Abstract: Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon doping can substantially recover light losses due to contamination of the liquid argon by nitrogen.

Submitted 9 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: 35 pages, 20 figures

Report number: CERN-EP-2024-024; FERMILAB-PUB-23-0819-LBNF

arXiv:2312.14125 [pdf, other]

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Authors: Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Josh Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, et al. (6 additional authors not shown)

Abstract: We present VideoPoet, a language model capable of synthesizing high-quality video, with matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder-only transformer architecture that processes multimodal inputs -- including images, videos, text, and audio. The training protocol follows that of Large Language Models (LLMs), consisting of two stages: pretraining and task-specific adaptation. During pretraining, VideoPoet incorporates a mixture of multimodal generative objectives within an autoregressive Transformer framework. The pretrained LLM serves as a foundation that can be adapted for a range of video generation tasks. We present empirical results demonstrating the model's state-of-the-art capabilities in zero-shot video generation, specifically highlighting VideoPoet's ability to generate high-fidelity motions. Project page: http://sites.research.google/videopoet/

Submitted 4 June, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: To appear at ICML 2024; Project page: http://sites.research.google/videopoet/

arXiv:2312.03130 [pdf, other]

The DUNE Far Detector Vertical Drift Technology, Technical Design Report

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, et al. (1304 additional authors not shown)

Abstract: DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precision measurements of the PMNS matrix parameters, including the CP-violating phase. It will also stand ready to observe supernova neutrino bursts, and seeks to observe nucleon decay as a signature of a grand unified theory underlying the standard model. The DUNE far detector implements liquid argon time-projection chamber (LArTPC) technology, and combines the many tens-of-kiloton fiducial mass necessary for rare event searches with the sub-centimeter spatial resolution required to image those events with high precision. The addition of a photon detection system enhances physics capabilities for all DUNE physics drivers and opens prospects for further physics explorations. Given its size, the far detector will be implemented as a set of modules, with LArTPC designs that differ from one another as newer technologies arise. In the vertical drift LArTPC design, a horizontal cathode bisects the detector, creating two stacked drift volumes in which ionization charges drift towards anodes at either the top or bottom. The anodes are composed of perforated PCB layers with conductive strips, enabling reconstruction in 3D. Light-trap-style photon detection modules are placed both on the cryostat's side walls and on the central cathode where they are optically powered. This Technical Design Report describes in detail the technical implementations of each subsystem of this LArTPC that, together with the other far detector modules and the near detector, will enable DUNE to achieve its physics goals.

Submitted 5 December, 2023; originally announced December 2023.

Comments: 425 pages; 281 figures. Central editing team: A. Heavey, S. Kettell, A. Marchionni, S. Palestini, S. Rajogopalan, R. J. Wilson

Report number: Fermilab Report no: TM-2813-LBNF

arXiv:2310.05737 [pdf, other]

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Authors: Lijun Yu, José Lezama, Nitesh B. Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Vighnesh Birodkar, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang

Abstract: While Large Language Models (LLMs) are the dominant models for generative tasks in language, they do not perform as well as diffusion models on image and video generation. To effectively use LLMs for visual generation, one crucial component is the visual tokenizer that maps pixel-space inputs to discrete tokens appropriate for LLM learning. In this paper, we introduce MAGVIT-v2, a video tokenizer designed to generate concise and expressive tokens for both videos and images using a common token vocabulary. Equipped with this new tokenizer, we show that LLMs outperform diffusion models on standard image and video generation benchmarks including ImageNet and Kinetics. In addition, we demonstrate that our tokenizer surpasses the previously top-performing video tokenizer on two more tasks: (1) video compression comparable to the next-generation video codec (VCC) according to human evaluations, and (2) learning effective representations for action recognition tasks.

Submitted 29 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

Comments: ICLR 2024

arXiv:2309.09560 [pdf, other]

doi 10.22323/1.444.0675

Evaluation of the effective mirror area of CTA Small-Sized Telescopes for camera design and Monte Carlo simulation

Authors: Akira Okumura, Duncan Ross, Francesco G. Saturni, Giorgia Sironi, Richard White

Abstract: The effective mirror area of an imaging atmospheric Cherenkov telescope is a crucial key parameter for trigger threshold determination and energy calibration. It is usually calculated by 3D ray-tracing simulation using a simplified telescope model, and the result is used in Monte Carlo simulations. However, simplified telescope and camera models are not adequate for the Schwarzschild-Couder configuration to be used in Small-Sized Telescopes (SSTs) of the Cherenkov Telescope Array. This is because the complex 3D structure of the secondary mirror, telescope masts, and camera body block a significant fraction of Cherenkov and night-sky photons. To evaluate the effective mirror area of an SST and to finalize its camera body design with minimal shadowing, a complex 3D model was built and simulated using the ROBAST ray-tracing library. A camera body size of 570 mm and a window size of 430 mm were selected for the final camera design based on the evaluation of shadowing by simulation. A non-axisymmetric effective area distribution was determined via the modeling of the complex telescope structure, while meeting the SST effective area requirement.

Submitted 18 September, 2023; originally announced September 2023.

Comments: Presented at the 38th International Cosmic Ray Conference (ICRC 2023), 2023 (arXiv:2309.08219)

Report number: CTA-ICRC/2023/6

arXiv:2308.11062 [pdf, other]

UnLoc: A Unified Framework for Video Localization Tasks

Authors: Shen Yan, Xuehan Xiong, Arsha Nagrani, Anurag Arnab, Zhonghao Wang, Weina Ge, David Ross, Cordelia Schmid

Abstract: While large-scale image-text pretrained models such as CLIP have been used for multiple video-level tasks on trimmed videos, their use for temporal localization in untrimmed videos is still a relatively unexplored task. We design a new approach for this called UnLoc, which uses pretrained image and text towers, and feeds tokens to a video-text fusion model. The output of the fusion module are then used to construct a feature pyramid in which each level connects to a head to predict a per-frame relevancy score and start/end time displacements. Unlike previous works, our architecture enables Moment Retrieval, Temporal Localization, and Action Segmentation with a single stage model, without the need for action proposals, motion based pretrained features or representation masking. Unlike specialized models, we achieve state of the art results on all three different localization tasks with a unified approach. Code will be available at: https://github.com/google-research/scenic.

Submitted 21 August, 2023; originally announced August 2023.

Comments: ICCV 2023

arXiv:2306.17842 [pdf, other]

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs

Authors: Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin Murphy, Alexander G. Hauptmann, Lu Jiang

Abstract: In this work, we introduce Semantic Pyramid AutoEncoder (SPAE) for enabling frozen LLMs to perform both understanding and generation tasks involving non-linguistic modalities such as images or videos. SPAE converts between raw pixels and interpretable lexical tokens (or words) extracted from the LLM's vocabulary. The resulting tokens capture both the semantic meaning and the fine-grained details needed for visual reconstruction, effectively translating the visual content into a language comprehensible to the LLM, and empowering it to perform a wide array of multimodal tasks. Our approach is validated through in-context learning experiments with frozen PaLM 2 and GPT 3.5 on a diverse set of image understanding and generation tasks. Our method marks the first successful attempt to enable a frozen LLM to generate image content while surpassing state-of-the-art performance in image understanding tasks, under the same setting, by over 25%.

Submitted 28 October, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

Comments: NeurIPS 2023 spotlight

arXiv:2306.08129 [pdf, other]

AVIS: Autonomous Visual Information Seeking with Large Language Model Agent

Authors: Ziniu Hu, Ahmet Iscen, Chen Sun, Kai-Wei Chang, Yizhou Sun, David A Ross, Cordelia Schmid, Alireza Fathi

Abstract: In this paper, we propose an autonomous information seeking visual question answering framework, AVIS. Our method leverages a Large Language Model (LLM) to dynamically strategize the utilization of external tools and to investigate their outputs, thereby acquiring the indispensable knowledge needed to provide answers to the posed questions. Responding to visual questions that necessitate external knowledge, such as "What event is commemorated by the building depicted in this image?", is a complex task. This task presents a combinatorial search space that demands a sequence of actions, including invoking APIs, analyzing their responses, and making informed decisions. We conduct a user study to collect a variety of instances of human decision-making when faced with this task. This data is then used to design a system comprised of three components: an LLM-powered planner that dynamically determines which tool to use next, an LLM-powered reasoner that analyzes and extracts key information from the tool outputs, and a working memory component that retains the acquired information throughout the process. The collected user behavior serves as a guide for our system in two key ways. First, we create a transition graph by analyzing the sequence of decisions made by users. This graph delineates distinct states and confines the set of actions available at each state. Second, we use examples of user decision-making to provide our LLM-powered planner and reasoner with relevant contextual instances, enhancing their capacity to make informed decisions. We show that AVIS achieves state-of-the-art results on knowledge-intensive visual question answering benchmarks such as Infoseek and OK-VQA.

Submitted 2 November, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

Comments: Published on NeurIPS 2023

arXiv:2306.01736 [pdf, other]

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Authors: Xiuye Gu, Yin Cui, Jonathan Huang, Abdullah Rashwan, Xuan Yang, Xingyi Zhou, Golnaz Ghiasi, Weicheng Kuo, Huizhong Chen, Liang-Chieh Chen, David A Ross

Abstract: Observing the close relationship among panoptic, semantic and instance segmentation tasks, we propose to train a universal multi-dataset multi-task segmentation model: DaTaSeg. We use a shared representation (mask proposals with class predictions) for all tasks. To tackle task discrepancy, we adopt different merge operations and post-processing for different tasks. We also leverage weak-supervision, allowing our segmentation model to benefit from cheaper bounding box annotations. To share knowledge across datasets, we use text embeddings from the same semantic embedding space as classifiers and share all network parameters among datasets. We train DaTaSeg on ADE semantic, COCO panoptic, and Objects365 detection datasets. DaTaSeg improves performance on all datasets, especially small-scale datasets, achieving 54.0 mIoU on ADE semantic and 53.5 PQ on COCO panoptic. DaTaSeg also enables weakly-supervised knowledge transfer on ADE panoptic and Objects365 instance segmentation. Experiments show DaTaSeg scales with the number of training datasets and enables open-vocabulary segmentation through direct transfer. In addition, we annotate an Objects365 instance segmentation set of 1,000 images and will release it as a public benchmark.

Submitted 2 June, 2023; originally announced June 2023.

arXiv:2304.13176 [pdf, ps, other]

Lorentzian fans

Authors: Dustin Ross

Abstract: We introduce the notion of Lorentzian fans, which form a special class of tropical fans that are particularly well-suited for proving Alexandrov-Fenchel type inequalities. To demonstrate the utility of Lorentzian fans, we prove a practical characterization of them in terms of their two-dimensional star fans. We also show that Lorentzian fans are closed under many common tropical fan operations, and we discuss how the Lorentzian property descends to the underlying tropical variety, allowing us to deduce Alexandrov-Fenchel type inequalities in the general setting of tropical intersection theory on tropical fan varieties.

Submitted 25 April, 2023; originally announced April 2023.

Comments: 37 pages, comments welcome

arXiv:2302.01328 [pdf, other]

IC3: Image Captioning by Committee Consensus

Authors: David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, John Canny

Abstract: If you ask a human to describe an image, they might do so in a thousand different ways. Traditionally, image captioning models are trained to generate a single "best" (most like a reference) image caption. Unfortunately, doing so encourages captions that are "informationally impoverished," and focus on only a subset of the possible details, while ignoring other potentially useful information in the scene. In this work, we introduce a simple, yet novel, method: "Image Captioning by Committee Consensus" (IC3), designed to generate a single caption that captures high-level details from several annotator viewpoints. Humans rate captions produced by IC3 at least as helpful as baseline SOTA models more than two thirds of the time, and IC3 can improve the performance of SOTA automated recall systems by up to 84%, outperforming single human-generated reference captions, and indicating significant improvements over SOTA approaches for visual description. Code is available at https://davidmchan.github.io/caption-by-committee/

Submitted 19 October, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

Comments: To Appear at EMNLP 2023

arXiv:2301.05278 [pdf, other]

Mixed volumes of normal complexes

Authors: Lauren Nowak, Patrick O'Melveny, Dustin Ross

Abstract: Normal complexes are orthogonal truncations of polyhedral fans. In this paper, we develop the study of mixed volumes for normal complexes. Our main result is a sufficiency condition that ensures when the mixed volumes of normal complexes associated to a given fan satisfy the Alexandrov-Fenchel inequalities. By specializing to Bergman fans of matroids, we give a new proof of the Heron-Rota-Welsh Conjecture as a consequence of the Alexandrov-Fenchel inequalities for normal complexes.

Submitted 12 January, 2023; originally announced January 2023.

Comments: 37 pages, 17 images, comments welcome

arXiv:2212.12875 [pdf, other]

doi 10.1126/science.adg4421

Femtosecond Symmetry Breaking and Coherent Relaxation of Methane Cations at the Carbon K-Edge

Authors: Enrico Ridente, Diptarka Hait, Eric A. Haugen, Andrew D. Ross, Daniel M. Neumark, Martin Head-Gordon, Stephen R. Leone

Abstract: Understanding the relaxation pathways of photoexcited molecules is essential to gain atomistic level insight into photochemistry. Herein, we perform a time-resolved study of ultrafast molecular symmetry breaking via geometric relaxation (Jahn-Teller distortion) on the methane cation. Attosecond transient absorption spectroscopy with soft X-rays at the carbon K-edge reveals that the distortion occurs within $10\pm 2$ femtoseconds after few-femtosecond strong-field ionization of methane. The distortion activates coherent oscillations in the scissoring vibrational mode of the symmetry broken cation, which are detected in the X-ray signal. These oscillations are damped within $58\pm13$ femtoseconds, as vibrational coherence is lost with the energy redistributing into lower-frequency vibrational modes. This study completely reconstructs the molecular relaxation dynamics of this prototypical example and opens new avenues for exploring complex systems.

Submitted 25 December, 2022; originally announced December 2022.

Journal ref: Science 380,713-717(2023)

arXiv:2212.12785 [pdf, other]

zkFaith: Soonami's Zero-Knowledge Identity Protocol

Authors: Mina Namazi, Duncan Ross, Xiaojie Zhu, Erman Ayday

Abstract: Individuals are encouraged to prove their eligibility to access specific services regularly. However, providing various organizations with personal data spreads sensitive information and endangers people's privacy. Hence, privacy-preserving identification systems that enable individuals to prove they are permitted to use specific services are required to fill the gap. Cryptographic techniques are deployed to construct identity proofs across the internet; nonetheless, they do not offer complete control over personal data or prevent users from forging and submitting fake data. In this paper, we design a privacy-preserving identity protocol called "zkFaith," a new approach to obtaining a verified zero-knowledge identity unique to each individual. The protocol verifies the integrity of the documents provided by the individuals and issues a zero-knowledge-based id without revealing any information to the authenticator or verifier. zkFaith leverages an aggregated version of the Camenisch-Lysyanskaya (CL) signature scheme to sign the user's commitment to the verified personal data. Users with a zero-knowledge proof system can then prove that they own the attributes required by the access criterion of the requested service providers. Vector commitments and their position-binding property enable us to later update the commitments when the personal data is modified, and hence to update the issued zkFaith id without restarting the protocol from scratch. We show that the design and implementation of zkFaith, with the generated proofs in real-world scenarios, are scalable and comparable with state-of-the-art schemes.

Submitted 24 December, 2022; originally announced December 2022.

arXiv:2212.10596 [pdf, other]

Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features

Authors: Vivek Rathod, Bryan Seybold, Sudheendra Vijayanarasimhan, Austin Myers, Xiuye Gu, Vighnesh Birodkar, David A. Ross

Abstract: Detecting actions in untrimmed videos should not be limited to a small, closed set of classes. We present a simple, yet effective strategy for open-vocabulary temporal action detection utilizing pretrained image-text co-embeddings. Despite being trained on static images rather than videos, we show that image-text co-embeddings enable open-vocabulary performance competitive with fully-supervised models. We show that the performance can be further improved by ensembling the image-text features with features encoding local motion, like optical flow based features, or other modalities, like audio. In addition, we propose a more reasonable open-vocabulary evaluation setting for the ActivityNet data set, where the category splits are based on similarity rather than random assignment.

Submitted 10 January, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

arXiv:2212.05221 [pdf, other]

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory

Authors: Ziniu Hu, Ahmet Iscen, Chen Sun, Zirui Wang, Kai-Wei Chang, Yizhou Sun, Cordelia Schmid, David A. Ross, Alireza Fathi

Abstract: In this paper, we propose an end-to-end Retrieval-Augmented Visual Language Model (REVEAL) that learns to encode world knowledge into a large-scale memory, and to retrieve from it to answer knowledge-intensive queries. REVEAL consists of four key components: the memory, the encoder, the retriever and the generator. The large-scale memory encodes various sources of multimodal world knowledge (e.g. image-text pairs, question answering pairs, knowledge graph triplets, etc.) via a unified encoder. The retriever finds the most relevant knowledge entries in the memory, and the generator fuses the retrieved knowledge with the input query to produce the output. A key novelty in our approach is that the memory, encoder, retriever and generator are all pre-trained end-to-end on a massive amount of data. Furthermore, our approach can use a diverse set of multimodal knowledge sources, which is shown to result in significant gains. We show that REVEAL achieves state-of-the-art results on visual question answering and image captioning.

Submitted 3 April, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

Comments: Published on CVPR 2023

arXiv:2210.04741 [pdf, ps, other]

doi 10.1051/epjconf/202328404020

Comprehensive investigation of fission yields by using spallation- and (p,2p)-induced fission reactions in inverse kinematics

Authors: J. L. Rodríguez-Sánchez, A. Graña-González, J. Benlliure, A. Chatillon, G. García-Jiménez, J. Taieb, H. Alvarez-Pol, L. Atar, L. Audouin, G. Authelet, A. Besteiro, G. Blanchon, K. Boretzky, P. Cabanelas, E. Casarejos, J. Cederkall, D. Cortina-Gil, A. Corsi, E. De Filippo, M. Feijoo, D. Galaviz, I. Gasparic, R. Gernhäuser, E. Haettner, M. Heil, et al. (44 additional authors not shown)

Abstract: In the last decades, measurements of spallation, fragmentation and Coulex induced fission reactions in inverse kinematics have provided valuable data to accurately investigate the fission dynamics and nuclear structure at large deformations of a large variety of stable and non-stable heavy nuclei. To go a step further, we now propose to induce fission by the use of quasi-free (p,2p) scattering reactions in inverse kinematics, which allows us to reconstruct the excitation energy of the compound fissioning system by using the four-momenta of the two outgoing protons. Therefore, this new approach might make it possible to correlate the excitation energy with the charge and mass distributions of the fission fragments and with the fission probabilities, giving for the first time direct access to the simultaneous measurement of the fission yield dependence on temperature and fission barrier heights of exotic heavy nuclei, respectively. The first experiment based on this methodology was realized recently at the GSI/FAIR facility and a detailed description of the experimental setup is given here.

Submitted 10 October, 2022; originally announced October 2022.

Comments: 4 pages, 15th International Conference on Nuclear Data for Science and Technology (ND2022)

arXiv:2210.00113 [pdf]

Institutional Foundations of Adaptive Planning: Exploration of Flood Planning in the Lower Rio Grande Valley, Texas, USA

Authors: Ashley D. Ross, Ali Nejat, Virgie Greb

Abstract: Adaptive planning is ideally suited for the deep uncertainties presented by climate change. While there is a robust scholarship on the theory and methods of adaptive planning, this has largely neglected how adaptive planning is affected by existing planning institutions and how to move forward within the constraints of traditional planning organizations. This study asks: How do existing traditional planning institutions support adaptive planning? We explore this for flood planning in the Lower Rio Grande Valley of Texas, United States. We draw on county hazard plan and regional flood plan documents as well as transcripts of regional flood planning meetings to explore the emergent topics of these institutional outputs. Using Natural Language Processing to analyze this large amount of text, we find that hazard plans and discussions developing these plans are largely lacking an adaptive approach.

Submitted 30 September, 2022; originally announced October 2022.

arXiv:2209.07518 [pdf, other]

Distribution Aware Metrics for Conditional Natural Language Generation

Authors: David M Chan, Yiming Ni, David A Ross, Sudheendra Vijayanarasimhan, Austin Myers, John Canny

Abstract: Traditional automated metrics for evaluating conditional natural language generation use pairwise comparisons between a single generated text and the best-matching gold-standard ground truth text. When multiple ground truths are available, scores are aggregated using an average or max operation across references. While this approach works well when diversity in the ground truth data (i.e. dispersion of the distribution of conditional texts) can be ascribed to noise, such as in automated speech recognition, it does not allow for robust evaluation in the case where diversity in the ground truths represents signal for the model. In this work we argue that existing metrics are not appropriate for domains such as visual description or summarization where ground truths are semantically diverse, and where the diversity in those captions captures useful additional information about the context. We propose a novel paradigm for multi-candidate evaluation of conditional language generation models, and a new family of metrics that compare the distributions of reference and model-generated caption sets using small sample sets of each. We demonstrate the utility of our approach with a case study in visual description: where we show that existing models optimize for single-description quality over diversity, and gain some insights into how sampling methods and temperature impact description quality and diversity.

Submitted 29 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

arXiv:2209.06277 [pdf, other]

Spatiotemporal patterning of extensile active stresses in microtubule-based active fluids

Authors: Linnea M. Lemma, Minu Varghese, Tyler D. Ross, Matt Thomson, Aparana Baskaran, Zvonimir Dogic

Abstract: Active stresses, which are collectively generated by the motion of energy-consuming rod-like constituents, generate chaotic autonomous flows. Controlling active stresses in space and time is an essential prerequisite for controlling the intrinsically chaotic dynamics of extensile active fluids. We design single-headed kinesin molecular motors that exhibit optically enhanced clustering, and thus enable precise and repeatable spatial and temporal control of extensile active stresses. Such motors enable rapid, reversible switching between flowing and quiescent states. In turn, spatio-temporal patterning of the active stress controls the evolution of the ubiquitous bend-instability of extensile active fluids and determines its critical length dependence. Combining optically controlled clusters with conventional kinesin motors enables one-time switching from contractile to extensile active stresses. These results open a path towards real-time control of the autonomous flows generated by active fluids.

Submitted 13 September, 2022; originally announced September 2022.

arXiv:2209.04061 [pdf, other]

im2nerf: Image to Neural Radiance Field in the Wild

Authors: Lu Mi, Abhijit Kundu, David Ross, Frank Dellaert, Noah Snavely, Alireza Fathi

Abstract: We propose im2nerf, a learning framework that predicts a continuous neural object representation given a single input image in the wild, supervised by only segmentation output from off-the-shelf recognition methods. The standard approach to constructing neural radiance fields takes advantage of multi-view consistency and requires many calibrated views of a scene, a requirement that cannot be satisfied when learning on large-scale image data in the wild. We take a step towards addressing this shortcoming by introducing a model that encodes the input image into a disentangled object representation that contains a code for object shape, a code for object appearance, and an estimated camera pose from which the object image is captured. Our model conditions a NeRF on the predicted object representation and uses volume rendering to generate images from novel views. We train the model end-to-end on a large collection of input images. As the model is only provided with single-view images, the problem is highly under-constrained. Therefore, in addition to using a reconstruction loss on the synthesized input view, we use an auxiliary adversarial loss on the novel rendered views. Furthermore, we leverage object symmetry and cycle camera pose consistency. We conduct extensive quantitative and qualitative experiments on the ShapeNet dataset as well as qualitative experiments on the Open Images dataset. We show that in all cases, im2nerf achieves state-of-the-art performance for novel view synthesis from a single-view unposed image in the wild.

Submitted 8 September, 2022; originally announced September 2022.

Comments: 12 pages, 8 figures, 4 tables

arXiv:2207.00123 [pdf, ps, other]

Yet another proof that the roots of a polynomial depend continuously on the coefficients

Authors: David A. Ross

Abstract: The roots of a complex polynomial depend continuously on the coefficients; that is, an infinitesimal perturbation of the coefficients results in an infinitesimal perturbation of the roots. A short, straightforward proof of this is possible using infinitesimals.

Submitted 6 July, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

Comments: 6 pages. Clarifies some statements in an earlier version

MSC Class: 12D10; 03H05; 12L15

arXiv:2206.15125 [pdf, other]

doi 10.1103/PhysRevA.108.012805

Core-excited states of SF$_{6}$ probed with soft X-ray femtosecond transient absorption of vibrational wavepackets

Authors: Lou Barreau, Andrew D. Ross, Victor Kimberg, Pavel Krasnov, Svyatoslav Blinov, Daniel M. Neumark, Stephen R. Leone

Abstract: A vibrational wavepacket in SF$_6$, created by impulsive stimulated Raman scattering with a few-cycle infrared pulse, is mapped onto five sulfur core-excited states using table-top soft X-ray transient absorption spectroscopy between 170-200 eV. The amplitudes of the X-ray energy shifts of the femtosecond oscillations depend strongly on the nature of the state. The prepared wavepacket is controlled with the pump laser intensity to probe the core-excited levels for various extensions of the S-F stretching motion. This allows the determination of the relative core-level potential energy gradients, in good agreement with TDDFT calculations. This experiment demonstrates a new means of characterizing core-excited potential energy surfaces.

Submitted 30 June, 2022; originally announced June 2022.

arXiv:2206.13013 [pdf, ps, other]

Continuity of the roots of a polynomial

Authors: Melvyn B. Nathanson, David A. Ross

Abstract: Let $K$ be an algebraically closed field with an absolute value. This note gives an elementary proof of the classical result that the roots of a polynomial with coefficients in $K$ are continuous functions of the coefficients of the polynomial.

Submitted 10 November, 2023; v1 submitted 26 June, 2022; originally announced June 2022.

Comments: 10 pages; minor improvements

MSC Class: 12D10; 12E05; 12J10; 12L15

arXiv:2205.06253 [pdf, other]

What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics

Authors: David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, Bryan Seybold, John F. Canny

Abstract: While there have been significant gains in the field of automated video description, the generalization performance of automated description models to novel domains remains a major barrier to using these systems in the real world. Most visual description methods are known to capture and exploit patterns in the training data leading to evaluation metric increases, but what are those patterns? In this work, we examine several popular visual description datasets, and capture, analyze, and understand the dataset-specific linguistic patterns that models exploit but do not generalize to new domains. At the token level, sample level, and dataset level, we find that caption diversity is a major driving factor behind the generation of generic and uninformative captions. We further show that state-of-the-art models even outperform held-out ground truth captions on modern metrics, and that this effect is an artifact of linguistic diversity in datasets. Understanding this linguistic diversity is key to building strong captioning models; we recommend several methods and approaches for maintaining diversity in the collection of new data, and dealing with the consequences of limited diversity when using current models and metrics.

Submitted 12 January, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

Comments: The 1st Workshop on Vision Datasets Understanding, IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022

arXiv:2204.13800 [pdf, other]

doi 10.1039/D2SC02402K

Jahn-Teller Distortion and Dissociation of CCl$_4^+$ by Transient X-ray Spectroscopy Simultaneously at the Carbon K- and Chlorine L-Edge

Authors: Andrew D. Ross, Diptarka Hait, Valeriu Scutelnic, Eric A. Haugen, Enrico Ridente, Mikias B. Balkew, Daniel M. Neumark, Martin Head-Gordon, Stephen R. Leone

Abstract: X-ray Transient Absorption Spectroscopy (XTAS) and theoretical calculations are used to study CCl$_4^+$ prepared by 800 nm strong-field ionization. XTAS simultaneously probes atoms at the carbon K-edge (280-300 eV) and chlorine L-edge (195-220 eV). Comparison of experiment to X-ray spectra computed by orbital-optimized density functional theory (OO-DFT) indicates that after ionization, CCl$_4^+$ undergoes symmetry breaking driven by Jahn-Teller distortion away from the initial tetrahedral structure (T$_d$) in 6$\pm$2 fs. The resultant symmetry-broken covalently bonded form subsequently separates to a noncovalently bound complex between CCl$_3^+$ and Cl over 90$\pm$10 fs, which is again predicted by theory. Finally, after more than 800 fs, L-edge signals for atomic Cl are observed, indicating dissociation to free CCl$_3^+$ and Cl. The results for Jahn-Teller distortion to the symmetry-broken form of CCl$_4^+$ and formation of the Cl -- CCl$_3^+$ complex characterize previously unobserved new species along the route to dissociation.

Submitted 6 June, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

Journal ref: Chem. Sci., 2022,13, 9310-9320

arXiv:2204.03510 [pdf, other]

A Fermionic Portal to Vector Dark Matter from a New Gauge Sector

Authors: Alexander Belyaev, Aldo Deandrea, Stefano Moretti, Luca Panizzi, Douglas A. Ross, Nakorn Thongyoi

Abstract: We present a new class of Dark Matter (DM) models wherein the Standard Model (SM) is extended with a new $SU(2)_D$ dark gauge sector. In this framework the stability of DM is provided by the conservation of a $U(1)$ global symmetry, which upon appropriate charge assignments for the $SU(2)_D$ multiplets, effectively leads to a $\mathbb{Z}_2$ symmetry subgroup. The origin of the global $U(1)$ symmetry which ensures the stability of DM can be justified in the form of a dark EW sector or through an underlying composite structure. The key ingredient of the model is a Vector-Like (VL) fermion doublet of $SU(2)_D$, the members of which are singlets of the SM Electro-Weak (EW) gauge group, which mediate the interactions between the dark sector and the SM, via new Yukawa interactions. This class of models, labelled as Fermion Portal Vector DM (FPVDM), allows multiple realisations, depending on the properties of the VL partner and the scalar potential. After spontaneous breaking of the $SU(2)_D$ symmetry via a new scalar doublet, the ensuing massive vector bosons with non-zero dark-isospin are DM candidates. The new class of FPVDM models suggested here has numerous phenomenological implications for collider and non-collider studies. As a practical example, we discuss here in detail a realisation involving a VL top partner assuming no mixing between the two physical scalars of the theory, the SM Higgs boson and its counterpart in the dark sector. We thus provide bounds on this setup from both collider and astroparticle observables.

Submitted 26 September, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

Comments: 37 pages, 14 figures, 4 tables. Version accepted by PRD

arXiv:2203.08169 [pdf, other]

doi 10.1117/1.JATIS.8.1.014007

Design and Performance of the Prototype Schwarzschild-Couder Telescope Camera

Authors: Colin B. Adams, Giovanni Ambrosi, Michelangelo Ambrosio, Carla Aramo, Timothy Arlen, Wystan Benbow, Bruna Bertucci, Elisabetta Bissaldi, Jonathan Biteau, Massimiliano Bitossi, Alfonso Boiano, Carmela Bonavolontà, Richard Bose, Aurelien Bouvier, Mario Buscemi, Aryeh Brill, Anthony M. Brown, James H. Buckley, Rodolfo Canestrari, Massimo Capasso, Mirco Caprai, Paolo Coppi, Corbin E. Covault, Davide Depaoli, Leonardo Di Venere, et al. (64 additional authors not shown)

Abstract: The prototype Schwarzschild-Couder Telescope (pSCT) is a candidate for a medium-sized telescope in the Cherenkov Telescope Array. The pSCT is based on a novel dual mirror optics design which reduces the plate scale and allows for the use of silicon photomultipliers as photodetectors. The prototype pSCT camera currently has only the central sector instrumented with 25 camera modules (1600 pixels), providing a 2.68$^{\circ}$ field of view (FoV). The camera electronics are based on custom TARGET (TeV array readout with GSa/s sampling and event trigger) application specific integrated circuits. Field programmable gate arrays sample incoming signals at a gigasample per second. A single backplane provides camera-wide triggers. An upgrade of the pSCT camera is in progress, which will fully populate the focal plane. This will increase the number of pixels to 11,328, the number of backplanes to 9, and the FoV to 8.04$^{\circ}$. Here we give a detailed description of the pSCT camera, including the basic concept, mechanical design, detectors, electronics, current status and first light.

Submitted 15 March, 2022; originally announced March 2022.

Journal ref: J. Astron. Telesc. Instrum. Syst. 8(1), 014007 (2022)

arXiv:2202.14010

Proceedings of the Artificial Intelligence for Cyber Security (AICS) Workshop at AAAI 2022

Authors: James Holt, Edward Raff, Ahmad Ridley, Dennis Ross, Arunesh Sinha, Diane Staheli, William Streilen, Milind Tambe, Yevgeniy Vorobeychik, Allan Wollaber

Abstract: The workshop will focus on the application of AI to problems in cyber security. Cyber systems generate large volumes of data, utilizing this effectively is beyond human capabilities. Additionally, adversaries continue to develop new attacks. Hence, AI methods are required to understand and protect the cyber domain. These challenges are widely studied in enterprise networks, but there are many gaps in research and practice as well as novel problems in other domains. In general, AI techniques are still not widely adopted in the real world. Reasons include: (1) a lack of certification of AI for security, (2) a lack of formal study of the implications of practical constraints (e.g., power, memory, storage) for AI systems in the cyber domain, (3) known vulnerabilities such as evasion, poisoning attacks, (4) lack of meaningful explanations for security analysts, and (5) lack of analyst trust in AI solutions. There is a need for the research community to develop novel solutions for these practical issues.

Submitted 1 March, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

arXiv:2201.04585 [pdf, ps, other]

Pseudostable Hodge integrals

Authors: Renzo Cavalieri, Joel Gallegos, Dustin Ross, Brandon Van Over, Jonathan Wise

Abstract: This paper initiates a study of Hodge integrals on moduli spaces of pseudostable curves. We prove an explicit comparison formula that allows one to effectively compute any pseudostable Hodge integral in terms of intersection numbers on moduli spaces of stable curves, and we use this comparison to prove that pseudostable Hodge integrals are equal to their stable counterparts when they are linear in lambda classes, but not when they are nonlinear. This suggests that pseudostable Gromov-Witten invariants are equal to usual Gromov-Witten invariants for target curves, but not for higher-dimensional target varieties.

Submitted 12 January, 2022; originally announced January 2022.

Comments: 20 pages, comments welcome!

arXiv:2111.05098 [pdf, other]

Scalable platform for nanocrystal-based quantum electronics

Authors: Joachim E. Sestoft, Aske N. Gejl, Thomas Kanne, Rasmus D. Schlosser, Daniel Ross, Daniel Kjær, Kasper Grove-Rasmussen, Jesper Nygård

Abstract: Unlocking the full potential of nanocrystals in electronic devices requires scalable and deterministic manufacturing techniques. A platform offering promising alternative paths to scalable production is microtomy, the technique of cutting thin lamellae with large areas containing embedded nanostructures. This platform has so far not been used for fabrication of electronic quantum devices. Here, we combine microtomy with vapor-liquid-solid growth of III/V nanowires to create a scalable platform that can deterministically transfer large arrays of single and fused nanocrystals - offering single unit control and free choice of target substrate. We fabricate electronic devices on cross-sectioned InAs nanowires with good yield and demonstrate their ability to exhibit quantum phenomena such as conductance quantization, single electron charging, and wave interference. Finally, we devise how the platform can host rationally designed semiconductor/superconductor networks relevant for emerging quantum technologies.

Submitted 9 November, 2021; originally announced November 2021.

Report number: NBI QDEV 2021

arXiv:2110.08647 [pdf, other]

Tropical fans and normal complexes

Authors: Anastasia Nathanson, Dustin Ross

Abstract: Associated to any divisor in the Chow ring of a simplicial tropical fan, we construct a family of polytopal complexes, called normal complexes, which we propose as an analogue of the well-studied notion of normal polytopes from the setting of complete fans. We describe certain closed convex polyhedral cones of divisors for which the "volume" of each divisor in the cone - that is, the degree of its top power - is equal to the volume of the associated normal complexes. For the Bergman fan of any matroid with building set, we prove that there exists an open family of such cones of divisors with nonempty interiors. We view the theory of normal complexes developed in this paper as a polytopal model underlying the combinatorial Hodge theory pioneered by Adiprasito, Huh, and Katz.

Submitted 11 March, 2023; v1 submitted 16 October, 2021; originally announced October 2021.

Comments: 39 pages, 12 figures; minor revisions in Version 3; to appear in Advances in Mathematics

arXiv:2109.12163 [pdf, other]

doi 10.3390/math9212731

Is the Finite-Time Lyapunov Exponent Field a Koopman Eigenfunction?

Authors: Erik M. Bollt, Shane D. Ross

Abstract: This work serves as a bridge between two approaches to analysis of dynamical systems: the local, geometric analysis and the global, operator theoretic, Koopman analysis. We explicitly construct vector fields where the instantaneous Lyapunov exponent field is a Koopman eigenfunction. Restricting ourselves to polynomial vector fields to make this construction easier, we find that such vector fields do exist, and we explore whether such vector fields have a special structure, thus making a link between the geometric theory and the transfer operator theory.

Submitted 24 September, 2021; originally announced September 2021.

Comments: 30 pages, 4 figures

Journal ref: Mathematics 9 (2021), 2731

arXiv:2109.09620 [pdf, other]

NASA Space Robotics Challenge 2 Qualification Round: An Approach to Autonomous Lunar Rover Operations

Authors: Cagri Kilic, Bernardo Martinez R. Jr., Christopher A. Tatsch, Jared Beard, Jared Strader, Shounak Das, Derek Ross, Yu Gu, Guilherme A. S. Pereira, Jason N. Gross

Abstract: Plans for establishing a long-term human presence on the Moon will require substantial increases in robot autonomy and multi-robot coordination to support establishing a lunar outpost. To achieve these objectives, algorithm design choices for the software developments need to be tested and validated for expected scenarios such as autonomous in-situ resource utilization (ISRU), localization in challenging environments, and multi-robot coordination. However, real-world experiments are extremely challenging and limited for extraterrestrial environment. Also, realistic simulation demonstrations in these environments are still rare and demanded for initial algorithm testing capabilities. To help some of these needs, the NASA Centennial Challenges program established the Space Robotics Challenge Phase 2 (SRC2) which consist of virtual robotic systems in a realistic lunar simulation environment, where a group of mobile robots were tasked with reporting volatile locations within a global map, excavating and transporting these resources, and detecting and localizing a target of interest. The main goal of this article is to share our team's experiences on the design trade-offs to perform autonomous robotic operations in a virtual lunar environment and to share strategies to complete the mission requirements posed by NASA SRC2 competition during the qualification round. Of the 114 teams that registered for participation in the NASA SRC2, team Mountaineers finished as one of only six teams to receive the top qualification round prize.

Submitted 20 September, 2021; originally announced September 2021.

Comments: 15 pages, 15 figures, 5 tables. Accepted for publications in IEEE Aerospace and Electronic Systems Magazine, 2021. (preprint version)

arXiv:2109.06360 [pdf, other]

doi 10.1021/acsnano.2c05015

Ray Optics for Gliders

Authors: Tyler D. Ross, Dino Osmanović, John F. Brady, Paul W. K. Rothemund

Abstract: Control of self-propelled particles is central to the development of many microrobotic technologies, from dynamically reconfigurable materials to advanced lab-on-a-chip systems. However, there are few physical principles by which particle trajectories can be specified and can be used to generate a wide range of behaviors. Within the field of ray optics, a single principle for controlling the trajectory of light -- Snell's law -- yields an intuitive framework for engineering a broad range of devices, from microscopes to cameras and telescopes. Here we show that the motion of self-propelled particles gliding across a resistance discontinuity is governed by a variant of Snell's law, and develop a corresponding ray optics for gliders. Just as the ratio of refractive indexes sets the path of a light ray, the ratio of resistance coefficients is shown to determine the trajectories of gliders. The magnitude of refraction depends on the glider's shape, in particular its aspect ratio, which serves as an analog to the wavelength of light. This enables the demixing of a polymorphic, many-shaped, beam of gliders into distinct monomorphic, single-shaped, beams through a friction prism. In turn, beams of monomorphic gliders can be focused by spherical and gradient friction lenses. Alternatively, the critical angle for total internal reflection can be used to create shape-selective glider traps.
Overall our work suggests that furthering the analogy between light and microscopic gliders will result in a wide range of new devices for sorting, concentrating, and analyzing self-propelled particles.

Submitted 8 June, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

arXiv:2109.05127 [pdf, other]

doi 10.22323/1.395.0748

Design and performance of the prototype Schwarzschild-Couder Telescope camera

Authors: C. B. Adams, G. Ambrosi, M. Ambrosio, C. Aramo, P. I. Batista, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolonta, R. Bose, A. Brill, A. M. Brown, J. H. Buckley, R. A. Cameron, M. Capasso, M. Caprai, C. E. Covault, D. Depaoli, L. Di Venere, M. Errando, S. Fegan, Q. Feng, E. Fiandrini, et al. (49 additional authors not shown)

Abstract: The Cherenkov Telescope Array (CTA) is the next-generation ground-based observatory for very-high-energy gamma-ray astronomy. An innovative 9.7 m aperture, dual-mirror Schwarzschild-Couder Telescope (SCT) design is a candidate design for CTA Medium-Sized Telescopes. A prototype SCT (pSCT) has been constructed at the Fred Lawrence Whipple Observatory in Arizona, USA. Its camera is currently partially instrumented with 1600 pixels covering a field of view of 2.7 degrees square. The small plate scale of the optical system allows densely packed silicon photomultipliers to be used, which combined with high-density trigger and waveform readout electronics enable the high-resolution camera. The camera's electronics are capable of imaging air shower development at a rate of one billion samples per second. We describe the commissioning and performance of the pSCT camera, including trigger and waveform readout performance, calibration, and absolute GPS time stamping. We also present the upgrade to the camera, which is currently underway. The upgrade will fully populate the focal plane, increasing the field of view to 8 degree diameter, and lower the front-end electronics noise, enabling a lower trigger threshold and improved reconstruction and background rejection.

Submitted 10 September, 2021; originally announced September 2021.

Comments: 8 pages, 5 figures, Proceedings of the 37th International Cosmic Ray Conference (ICRC 2021), Berlin, Germany

arXiv:2106.09251 [pdf, other]

Optical Mouse: 3D Mouse Pose From Single-View Video

Authors: Bo Hu, Bryan Seybold, Shan Yang, David Ross, Avneesh Sud, Graham Ruby, Yi Liu

Abstract: We present a method to infer the 3D pose of mice, including the limbs and feet, from monocular videos. Many human clinical conditions and their corresponding animal models result in abnormal motion, and accurately measuring 3D motion at scale offers insights into health. The 3D poses improve classification of health-related attributes over 2D representations. The inferred poses are accurate enough to estimate stride length even when the feet are mostly occluded. This method could be applied as part of a continuous monitoring system to non-invasively measure animal health.

Submitted 17 June, 2021; originally announced June 2021.

arXiv:2106.01339 [pdf, other]

doi 10.1088/1751-8121/ac16c7

Transition criteria and phase space structures in a three degree of freedom system with dissipation

Authors: Jun Zhong, Shane D. Ross

Abstract: Escape from a potential well through an index-1 saddle can be widely found in some important physical systems. Knowing the criteria and phase space geometry that govern escape events plays an important role in making use of such phenomenon, particularly when realistic frictional or dissipative forces are present. We aim to extend the study the escape dynamics around the saddle from two degrees of freedom to three degrees of freedom, presenting both a methodology and phase space structures. Both the ideal conservative system and a perturbed, dissipative system are considered. We define the five-dimensional transition region, $\mathcal{T}_h$, as the set of initial conditions of a given initial energy $h$ for which the trajectories will escape from one side of the saddle to another. Invariant manifold arguments demonstrate that in the six-dimensional phase space, the boundary of the transition region, $\partial \mathcal{T}_h$, is topologically a four-dimensional hyper-cylinder in the conservative system, and a four-dimensional hyper-sphere in the dissipative system. The transition region $\mathcal{T}_h$ can be constructed by a solid three-dimensional ellipsoid (solid three-dimensional cylinder) in the three-dimensional configuration space, where at each point, there is a cone of velocity -- the velocity directions leading to transition are given by cones, with velocity magnitude given by the initial energy and the direction by two spherical angles with given limits.
To illustrate our analysis, we consider an example system which has two potential minima connected by an index 1 saddle.

Submitted 1 June, 2021; originally announced June 2021.

Comments: 26 pages, 6 figures

arXiv:2105.11342 [pdf, ps, other]

doi 10.1016/j.hal.2021.102149

Beach-level 24-hour forecasts of Florida red tide-induced respiratory irritation

Authors: Shane D. Ross, Jeremie Fish, Klaus Moeltner, Erik M. Bollt, Landon Bilyeu, Tracy Fanara

Abstract: An accurate forecast of the red tide respiratory irritation level would improve the lives of many people living in areas affected by algal blooms. Using a decades-long database of daily beach conditions, two conceptually different models to forecast the respiratory irritation risk level one day ahead of time are trained. One model is wind-based, using the current days' respiratory level and the predicted wind direction of the following day. The other model is a probabilistic self-exciting Hawkes process model. Both models are trained on beaches in Florida during 2011-2017 and applied to the red tide bloom during 2018-2019. For beaches where there is enough historical data to develop a model, the model which performs best depends on the beach. The wind-based model is the most accurate at half the beaches, correctly predicting the respiratory risk level on average about 84% of the time. The Hawkes model is the most accurate (81% accuracy) at nearly all of the remaining beaches.

Submitted 11 October, 2021; v1 submitted 24 May, 2021; originally announced May 2021.

Comments: 31 pages, 9 figures

Journal ref: Harmful Algae 111:102149 (2022)

arXiv:2104.13254

Proceedings - AI/ML for Cybersecurity: Challenges, Solutions, and Novel Ideas at SIAM Data Mining 2021

Authors: John Emanuello, Kimberly Ferguson-Walter, Erik Hemberg, Una-May O Reilly, Ahmad Ridley, Dennis Ross, Diane Staheli, William Streilein

Abstract: Malicious cyber activity is ubiquitous and its harmful effects have dramatic and often irreversible impacts on society. Given the shortage of cybersecurity professionals, the ever-evolving adversary, the massive amounts of data which could contain evidence of an attack, and the speed at which defensive actions must be taken, innovations which enable autonomy in cybersecurity must continue to expand, in order to move away from a reactive defense posture and towards a more proactive one. The challenges in this space are quite different from those associated with applying AI in other domains such as computer vision. The environment suffers from an incredibly high degree of uncertainty, stemming from the intractability of ingesting all the available data, as well as the possibility that malicious actors are manipulating the data. Another unique challenge in this space is the dynamism of the adversary causes the indicators of compromise to change frequently and without warning. In spite of these challenges, machine learning has been applied to this domain and has achieved some success in the realm of detection. While this aspect of the problem is far from solved, a growing part of the commercial sector is providing ML-enhanced capabilities as a service. Many of these entities also provide platforms which facilitate the deployment of these automated solutions. Academic research in this space is growing and continues to influence current solutions, as well as strengthen foundational knowledge which will make autonomous agents in this space a possibility.

Submitted 1 June, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

arXiv:2103.13938 [pdf, other]

Double nanowires for hybrid quantum devices

Authors: Thomas Kanne, Dags Olsteins, Mikelis Marnauza, Alexandros Vekris, Juan Carlos Estrada Saldana, Sara Loric, Rasmus D. Schlosser, Daniel Ross, Szabolcs Csonka, Kasper Grove-Rasmussen, Jesper Nygård

Abstract: Parallel one-dimensional semiconductor channels connected by a superconducting strip constitute the core platform in several recent quantum device proposals that rely e.g. on Andreev processes or topological effects. In order to realize these proposals, the actual material systems must have high crystalline purity and the coupling between the different elements should be controllable in terms of their interfaces and geometry. We present a strategy for synthesizing double InAs nanowires by the vapor-liquid-solid mechanism using III-V molecular beam epitaxy. A superconducting layer is deposited onto nanowires without breaking vacuum, ensuring pristine interfaces between the superconductor and the two semiconductor nanowires. The method allows for a high yield of merged as well as separate parallel nanowires, with full or half-shell superconductor coatings. We demonstrate their utility in complex quantum devices by electron transport measurements.

Submitted 25 March, 2021; originally announced March 2021.

Report number: NBI QDEV 2021

arXiv:2103.03137 [pdf, other]

doi 10.1103/PhysRevResearch.3.033210

Coherent energy exchange between carriers and phonons in Peierls-distorted bismuth unveiled by broadband XUV pulses

Authors: Romain Géneaux, Iurii Timrov, Christopher J. Kaplan, Andrew D. Ross, Peter M. Kraus, Stephen R. Leone

Abstract: In Peierls-distorted materials, photoexcitation leads to a strongly coupled transient response between structural and electronic degrees of freedom, always measured independently of each other. Here we use transient reflectivity in the extreme ultraviolet to quantify both responses in photoexcited bismuth in a single measurement. With the help of first-principles calculations based on density-functional theory (DFT) and time-dependent DFT, the real-space atomic motion and the temperature of both electrons and holes as a function of time are captured simultaneously, retrieving an anticorrelation between the $A_{1g}$ phonon dynamics and carrier temperature. The results reveal a coherent, bi-directional energy exchange between carriers and phonons, which is a dynamical counterpart of the static Peierls-Jones distortion, providing first-time validation of previous theoretical predictions.

Submitted 11 August, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

Journal ref: Phys. Rev. Research 3, 033210 (2021)

Showing 1–50 of 195 results for author: Ross, D