Search | arXiv e-print repository

BraTS-PEDs: Results of the Multi-Consortium International Pediatric Brain Tumor Segmentation Challenge 2023

Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Debanjan Haldar, Zhifan Jiang, Anna Zapaishchykova, Julija Pavaine, Lubdha M. Shah, Blaise V. Jones, Nakul Sheth, Sanjay P. Prabhu, Aaron S. McAllister, Wenxin Tu, Khanak K. Nandolia, Andres F. Rodriguez, Ibraheem Salman Shaikh, Mariana Sanchez Montano, Hollie Anne Lai, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Hannah Anderson, Syed Muhammed Anwar, Alejandro Aristizabal, Sina Bagheri , et al. (54 additional authors not shown)

Abstract: Pediatric central nervous system tumors are the leading cause of cancer-related deaths in children. The five-year survival rate for high-grade glioma in children is less than 20%. The development of new treatments is dependent upon multi-institutional collaborative clinical trials requiring reproducible and accurate centralized response assessment. We present the results of the BraTS-PEDs 2023 cha… ▽ More Pediatric central nervous system tumors are the leading cause of cancer-related deaths in children. The five-year survival rate for high-grade glioma in children is less than 20%. The development of new treatments is dependent upon multi-institutional collaborative clinical trials requiring reproducible and accurate centralized response assessment. We present the results of the BraTS-PEDs 2023 challenge, the first Brain Tumor Segmentation (BraTS) challenge focused on pediatric brain tumors. This challenge utilized data acquired from multiple international consortia dedicated to pediatric neuro-oncology and clinical trials. BraTS-PEDs 2023 aimed to evaluate volumetric segmentation algorithms for pediatric brain gliomas from magnetic resonance imaging using standardized quantitative performance evaluation metrics employed across the BraTS 2023 challenges. The top-performing AI approaches for pediatric tumor analysis included ensembles of nnU-Net and Swin UNETR, Auto3DSeg, or nnU-Net with a self-supervised framework. The BraTSPEDs 2023 challenge fostered collaboration between clinicians (neuro-oncologists, neuroradiologists) and AI/imaging scientists, promoting faster data sharing and the development of automated volumetric analysis techniques. These advancements could significantly benefit clinical trials and improve the care of children with brain tumors. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08028 [pdf, other]

AutoMate: Specialist and Generalist Assembly Policies over Diverse Geometries

Authors: Bingjie Tang, Iretiayo Akinola, Jie Xu, Bowen Wen, Ankur Handa, Karl Van Wyk, Dieter Fox, Gaurav S. Sukhatme, Fabio Ramos, Yashraj Narang

Abstract: Robotic assembly for high-mixture settings requires adaptivity to diverse parts and poses, which is an open challenge. Meanwhile, in other areas of robotics, large models and sim-to-real have led to tremendous progress. Inspired by such work, we present AutoMate, a learning framework and system that consists of 4 parts: 1) a dataset of 100 assemblies compatible with simulation and the real world,… ▽ More Robotic assembly for high-mixture settings requires adaptivity to diverse parts and poses, which is an open challenge. Meanwhile, in other areas of robotics, large models and sim-to-real have led to tremendous progress. Inspired by such work, we present AutoMate, a learning framework and system that consists of 4 parts: 1) a dataset of 100 assemblies compatible with simulation and the real world, along with parallelized simulation environments for policy learning, 2) a novel simulation-based approach for learning specialist (i.e., part-specific) policies and generalist (i.e., unified) assembly policies, 3) demonstrations of specialist policies that individually solve 80 assemblies with 80% or higher success rates in simulation, as well as a generalist policy that jointly solves 20 assemblies with an 80%+ success rate, and 4) zero-shot sim-to-real transfer that achieves similar (or better) performance than simulation, including on perception-initialized assembly. The key methodological takeaway is that a union of diverse algorithms from manufacturing engineering, character animation, and time-series analysis provides a generic and robust solution for a diverse range of robotic assembly problems.To our knowledge, AutoMate provides the first simulation-based framework for learning specialist and generalist policies over a wide range of assemblies, as well as the first system demonstrating zero-shot sim-to-real transfer over such a range. △ Less

Submitted 10 July, 2024; originally announced July 2024.

arXiv:2407.02274 [pdf, other]

DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Gras** with Geometric Fabrics

Authors: Tyler Ga Wei Lum, Martin Matak, Viktor Makoviychuk, Ankur Handa, Arthur Allshire, Tucker Hermans, Nathan D. Ratliff, Karl Van Wyk

Abstract: A pivotal challenge in robotics is achieving fast, safe, and robust dexterous gras** across a diverse range of objects, an important goal within industrial applications. However, existing methods often have very limited speed, dexterity, and generality, along with limited or no hardware safety guarantees. In this work, we introduce DextrAH-G, a depth-based dexterous gras** policy trained entir… ▽ More A pivotal challenge in robotics is achieving fast, safe, and robust dexterous gras** across a diverse range of objects, an important goal within industrial applications. However, existing methods often have very limited speed, dexterity, and generality, along with limited or no hardware safety guarantees. In this work, we introduce DextrAH-G, a depth-based dexterous gras** policy trained entirely in simulation that combines reinforcement learning, geometric fabrics, and teacher-student distillation. We address key challenges in joint arm-hand policy learning, such as high-dimensional observation and action spaces, the sim2real gap, collision avoidance, and hardware constraints. DextrAH-G enables a 23 motor arm-hand robot to safely and continuously grasp and transport a large variety of objects at high speed using multi-modal inputs including depth images, allowing generalization across object geometry. Videos at https://sites.google.com/view/dextrah-g. △ Less

Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

arXiv:2406.19616 [pdf, other]

The space coronagraph optical bench (SCoOB): 3. Mueller matrix polarimetry of a coronagraphic exit pupil

Authors: Jaren N. Ashcraft, Ewan S. Douglas, Ramya M. Anche, Kyle Van Gorkom, Emory Jenkins, William Melby, Maxwell A. Millar-Blanchaer

Abstract: High-contrast imaging in the next decade aims to image exoplanets at smaller angular separations and deeper contrasts than ever before. A problem that has recently garnered attention for telescopes equipped with high-contrast coronagraphs is polarization aberration arising from the optics. These aberrations manifest as low-order aberrations of different magnitudes for orthogonal polarization state… ▽ More High-contrast imaging in the next decade aims to image exoplanets at smaller angular separations and deeper contrasts than ever before. A problem that has recently garnered attention for telescopes equipped with high-contrast coronagraphs is polarization aberration arising from the optics. These aberrations manifest as low-order aberrations of different magnitudes for orthogonal polarization states and spread light into the dark hole of the coronagraph that cannot be fully corrected. The origin of polarization aberrations has been modeled at the telescope level. However, we don't fully understand how polarization aberrations arise at the instrument level. To directly measure this effect, we construct a dual-rotating-retarder polarimeter around the SCoOB high-contrast imaging testbed to measure its Mueller matrix. With this matrix, we directly characterize the diattenuation, retardance, and depolarization of the instrument as a function of position in the exit pupil. We measure the polarization aberrations in the Lyot plane to understand how polarization couples into high-contrast imaging residuals. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 16 pages, 13 figures

arXiv:2406.19028 [pdf, other]

Black Silicon BRDF and Polarization for Coronagraphic Pupil Masks

Authors: Emory L. Jenkins, Ramya M. Anche, Kyle J. Van Gorkom, A. J. Eldorado Riggs, Ewan S. Douglas

Abstract: Future space observatories will likely have segmented primaries, causing diffraction effects that reduce coronagraph performance. Reflective binary pupil apodizer masks can mitigate these, with the metamaterial black silicon (BSi) showing promise as a strong absorber. To bring contrast ratios to the $10^-{10}$ level as needed to observe Earth-like exoplanets, feature sizes on these BSi masks will… ▽ More Future space observatories will likely have segmented primaries, causing diffraction effects that reduce coronagraph performance. Reflective binary pupil apodizer masks can mitigate these, with the metamaterial black silicon (BSi) showing promise as a strong absorber. To bring contrast ratios to the $10^-{10}$ level as needed to observe Earth-like exoplanets, feature sizes on these BSi masks will need to be less than $5$ microns when paired with MEMS (micro-electromechanical systems) deformable mirrors. As scalar diffraction cannot reliably model this feature size, we developed a Finite-Difference Time-Domain (FDTD) model of BSi masks using Meep software. We characterize the FDTD-derived polarization-dependent bidirectional reflectance distribution function of BSi and discuss the model's shortcomings. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 8 pages, 10 figures, submitted to SPIE Astronomical Telescopes and Instrumentation (AS24)

arXiv:2406.18886 [pdf, other]

The Space Coronagraph Optical Bench (SCoOB): 5. End-to-end simulations of polarization aberrations

Authors: Ramya M Anche, Kyle J. Van Gorkom, Jaren N. Ashcraft, Ewan Douglas, Emory L Jenkins, Sebastiaan Y. Haffert, Maxwell A. Millar-Blanchaer

Abstract: Polarization aberrations originating from the telescope and high-contrast imaging instrument optics introduce polarization-dependent speckles and associated errors in the image plane, affecting the measured exoplanet signal. Understanding this effect is critical for future space-based high-contrast imaging instruments that aim to image the Earth analogs with 1e-10 raw contrast and characterize the… ▽ More Polarization aberrations originating from the telescope and high-contrast imaging instrument optics introduce polarization-dependent speckles and associated errors in the image plane, affecting the measured exoplanet signal. Understanding this effect is critical for future space-based high-contrast imaging instruments that aim to image the Earth analogs with 1e-10 raw contrast and characterize their atmospheres. We present end-to-end modeling of the polarization aberrations for a high-contrast imaging testbed, SCoOB. We use a vector vortex coronagraph (VVC) as the focal plane mask, incorporate polarization filtering, and estimate the peak contrast in the dark hole region. The dominant polarization aberrations in the system are retardance defocus and tilt due to the OAPs and fold mirrors. Although the mean contrast in the dark hole region remains unaffected by the polarization aberrations, we see brighter speckles limiting the contrast to 1e-9 at smaller inner working angles. We extend the simulations using the measured retardance maps for the VVC. We find that the mean contrast in SCoOB is more sensitive to the VVC and the QWP retardance errors than the polarization aberrations. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 11 pages, 10 figures, SPIE Astronomical Telescopes and Instruments conference 2024, Paper no: 13092-189

arXiv:2406.18885 [pdf, other]

The space coronagraph optical bench (SCoOB): 4. vacuum performance of a high contrast imaging testbed

Authors: Kyle Van Gorkom, Ewan S Douglas, Kian Milani, Jaren N Ashcraft, Ramya M Anche, Emory Jenkins, Patrick Ingraham, Sebastiaan Haffert, Daewook Kim, Heejoo Choi, Olivier Durney

Abstract: The Space Coronagraph Optical Bench (SCoOB) is a high-contrast imaging testbed built to demonstrate starlight suppression techniques at visible wavelengths in a space-like vacuum environment. The testbed is designed to achieve ${<}10^{-8}$ contrast from $3-10λ/D$ in a one-sided dark hole using a liquid crystal vector vortex waveplate and a 952-actuator Kilo-C deformable mirror (DM) from Boston Mic… ▽ More The Space Coronagraph Optical Bench (SCoOB) is a high-contrast imaging testbed built to demonstrate starlight suppression techniques at visible wavelengths in a space-like vacuum environment. The testbed is designed to achieve ${<}10^{-8}$ contrast from $3-10λ/D$ in a one-sided dark hole using a liquid crystal vector vortex waveplate and a 952-actuator Kilo-C deformable mirror (DM) from Boston Micromachines (BMC). We have recently expanded the testbed to include a field stop for mitigation of stray/scattered light, a precision-fabricated pinhole in the source simulator, a Minus K passive vibration isolation table for jitter reduction, and a low-noise vacuum-compatible CMOS sensor. We report the latest contrast performance achieved using implicit electric field conjugation (iEFC) at a vacuum of ${\sim}10^{-6}$ Torr and over a range of bandpasses with central wavelengths from 500 to 650nm and bandwidths (BW) from $\ll 1\%$ to 15\%. Our jitter in vacuum is $<3\times10^{-3} λ/D$, and the best contrast performance to-date in a half-sided D-shaped dark hole is $2.2\times10^{-9}$ in a $\ll 1 \%$ BW, $4\times10^{-9}$ in a 2\% BW, and $2.5\times10^{-8}$ in a 15\% BW. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 16 pages, 12 figures, SPIE Astronomical Telescopes and Instrumentation 2024

arXiv:2406.17716 [pdf, other]

ViANLI: Adversarial Natural Language Inference for Vietnamese

Authors: Tin Van Huynh, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Abstract: The development of Natural Language Processing (NLI) datasets and models has been inspired by innovations in annotation design. With the rapid development of machine learning models today, the performance of existing machine learning models has quickly reached state-of-the-art results on a variety of tasks related to natural language processing, including natural language inference tasks. By using… ▽ More The development of Natural Language Processing (NLI) datasets and models has been inspired by innovations in annotation design. With the rapid development of machine learning models today, the performance of existing machine learning models has quickly reached state-of-the-art results on a variety of tasks related to natural language processing, including natural language inference tasks. By using a pre-trained model during the annotation process, it is possible to challenge current NLI models by having humans produce premise-hypothesis combinations that the machine model cannot correctly predict. To remain attractive and challenging in the research of natural language inference for Vietnamese, in this paper, we introduce the adversarial NLI dataset to the NLP research community with the name ViANLI. This data set contains more than 10K premise-hypothesis pairs and is built by a continuously adjusting process to obtain the most out of the patterns generated by the annotators. ViANLI dataset has brought many difficulties to many current SOTA models when the accuracy of the most powerful model on the test set only reached 48.4%. Additionally, the experimental results show that the models trained on our dataset have significantly improved the results on other Vietnamese NLI datasets. △ Less

Submitted 1 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.14858 [pdf, other]

A mechanism for quantum-critical Planckian metal phase in high-temperature cuprate superconductors

Authors: Yung-Yeh Chang, Khoe Van Nguyen, Kim Remund, Chung-Hou Chung

Abstract: The mysterious metallic phase showing perfect $T$-linear resistivity and a universal scattering rate $1/τ= α_P k_B T /\hbar$ with a universal prefactor $α_P \sim 1$ and logarithmic-in-temperature singular specific heat coefficient, so-called Planckian metal phase was observed in various overdoped high-$T_c$ cuprate superconductors over a finite range in do**. Here, we propose a microscopic mecha… ▽ More The mysterious metallic phase showing perfect $T$-linear resistivity and a universal scattering rate $1/τ= α_P k_B T /\hbar$ with a universal prefactor $α_P \sim 1$ and logarithmic-in-temperature singular specific heat coefficient, so-called Planckian metal phase was observed in various overdoped high-$T_c$ cuprate superconductors over a finite range in do**. Here, we propose a microscopic mechanism for this exotic state based on quantum-critical bosonic charge Kondo fluctuations coupled to both spinon and a heavy conduction-electron Fermi surfaces within the heavy-fermion formulation of the slave-boson $t$-$J$ model. Using a controlled perturbative renormalization group (RG) analysis, we examine the competition between the pseudogap phase, characterized by Anderson's Resonating-Valence-Bond spin-liquid, and the Fermi-liquid state, characterized by the electron ho** (effective charge Kondo effect). We find a quantum-critical metallic phase with a universal Planckian $\hbar ω/k_B T$ scaling in scattering rate near a localized-delocalized (pseudogap-to-Fermi liquid) charge Kondo breakdown transition. Our results are in excellent agreement with the recent experimental observations on optical conductivity (without fine-tuning) in Nat. Commun. 14, 3033 (2023), universal do**-independent field-to-temperature scaling in magnetoresistance in Nature 595, 661 (2021), and the marginal Fermi-liquid spectral function observed in ARPES (Science 366, 1099 (2019)) as well as Hall coefficient in various overdoped cuprates in Nature 595, 661 (2021) and Annu. Rev. Condens. Matter Phys. 10, 409 (2019). Our mechanism offers a microscopic understanding of the quantum-critical Planckian metal phase observed in cuprates d-wave superconducting, and Fermi liquid phases. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: 43 pages, 13 figures

arXiv:2406.08586 [pdf, other]

Schrödinger Unitary Cellular Automata

Authors: Kees van Berkel, Jan de Graaf, Kees van Hee

Abstract: We propose a class of cellular automata for the Hamiltonian of a free particle. It is based on a two-step unitary evolution operator in discrete time and space. Various experiments with one and two-dimensional cellular automata are used to analyze 1) phase velocities of plane waves, 2) dispersion and group velocities of wavepackets, 3) energy levels of infinite potential wells and harmonic oscilla… ▽ More We propose a class of cellular automata for the Hamiltonian of a free particle. It is based on a two-step unitary evolution operator in discrete time and space. Various experiments with one and two-dimensional cellular automata are used to analyze 1) phase velocities of plane waves, 2) dispersion and group velocities of wavepackets, 3) energy levels of infinite potential wells and harmonic oscillators, and 4) interference from double-slit diffraction. Some of the differences between their known (analytical) results and the cellular-automata approximations are intriguing. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.05211 [pdf, other]

Binary neutron star mergers in massive scalar-tensor theory: Properties of post-merger remnants

Authors: Alan Tsz-Lok Lam, Hao-Jui Kuan, Masaru Shibata, Karim Van Aelst, Kenta Kiuchi

Abstract: We investigate the properties of post-merger remnants of binary neutron star mergers in the framework of Damour-Esposito-Farese-type scalar-tensor theory of gravity with a massive scalar field by numerical relativity simulation. It is found that the threshold mass for prompt collapse is raised in the presence of the excited scalar field. Our simulation results also suggest the existence of long-li… ▽ More We investigate the properties of post-merger remnants of binary neutron star mergers in the framework of Damour-Esposito-Farese-type scalar-tensor theory of gravity with a massive scalar field by numerical relativity simulation. It is found that the threshold mass for prompt collapse is raised in the presence of the excited scalar field. Our simulation results also suggest the existence of long-lived $φ-$mode in hypermassive neutron stars due to the presence of the massive scalar field which enhances the quasi-radial oscillation in the remnant. We investigate the descalarization condition in hypermassive neutron stars and discover a distinctive signature in post-merger gravitational waves. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 22 pages, 21 figures

arXiv:2406.00193 [pdf, other]

Learning topological states from randomized measurements using variational tensor network tomography

Authors: Yanting Teng, Rhine Samajdar, Katherine Van Kirk, Frederik Wilde, Subir Sachdev, Jens Eisert, Ryan Sweke, Khadijeh Najafi

Abstract: Learning faithful representations of quantum states is crucial to fully characterizing the variety of many-body states created on quantum processors. While various tomographic methods such as classical shadow and MPS tomography have shown promise in characterizing a wide class of quantum states, they face unique limitations in detecting topologically ordered two-dimensional states. To address this… ▽ More Learning faithful representations of quantum states is crucial to fully characterizing the variety of many-body states created on quantum processors. While various tomographic methods such as classical shadow and MPS tomography have shown promise in characterizing a wide class of quantum states, they face unique limitations in detecting topologically ordered two-dimensional states. To address this problem, we implement and study a heuristic tomographic method that combines variational optimization on tensor networks with randomized measurement techniques. Using this approach, we demonstrate its ability to learn the ground state of the surface code Hamiltonian as well as an experimentally realizable quantum spin liquid state. In particular, we perform numerical experiments using MPS ansätze and systematically investigate the sample complexity required to achieve high fidelities for systems of sizes up to $48$ qubits. In addition, we provide theoretical insights into the scaling of our learning algorithm by analyzing the statistical properties of maximum likelihood estimation. Notably, our method is sample-efficient and experimentally friendly, only requiring snapshots of the quantum state measured randomly in the $X$ or $Z$ bases. Using this subset of measurements, our approach can effectively learn any real pure states represented by tensor networks, and we rigorously prove that random-$XZ$ measurements are tomographically complete for such states. △ Less

Submitted 28 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

Comments: 11+35 pages, 4+3 figures; Added additional references

arXiv:2406.00092 [pdf, other]

How Random is Random? Evaluating the Randomness and Humaness of LLMs' Coin Flips

Authors: Katherine Van Koevering, Jon Kleinberg

Abstract: One uniquely human trait is our inability to be random. We see and produce patterns where there should not be any and we do so in a predictable way. LLMs are supplied with human data and prone to human biases. In this work, we explore how LLMs approach randomness and where and how they fail through the lens of the well studied phenomena of generating binary random sequences. We find that GPT 4 and… ▽ More One uniquely human trait is our inability to be random. We see and produce patterns where there should not be any and we do so in a predictable way. LLMs are supplied with human data and prone to human biases. In this work, we explore how LLMs approach randomness and where and how they fail through the lens of the well studied phenomena of generating binary random sequences. We find that GPT 4 and Llama 3 exhibit and exacerbate nearly every human bias we test in this context, but GPT 3.5 exhibits more random behavior. This dichotomy of randomness or humaness is proposed as a fundamental question of LLMs and that either behavior may be useful in different circumstances. △ Less

Submitted 31 May, 2024; originally announced June 2024.

arXiv:2405.17711 [pdf, other]

doi 10.1145/3643834.3661631

RealityEffects: Augmenting 3D Volumetric Videos with Object-Centric Annotation and Dynamic Visual Effects

Authors: Jian Liao, Kevin Van, Zhijie Xia, Ryo Suzuki

Abstract: This paper introduces RealityEffects, a desktop authoring interface designed for editing and augmenting 3D volumetric videos with object-centric annotations and visual effects. RealityEffects enhances volumetric capture by introducing a novel method for augmenting captured physical motion with embedded, responsive visual effects, referred to as object-centric augmentation. In RealityEffects, users… ▽ More This paper introduces RealityEffects, a desktop authoring interface designed for editing and augmenting 3D volumetric videos with object-centric annotations and visual effects. RealityEffects enhances volumetric capture by introducing a novel method for augmenting captured physical motion with embedded, responsive visual effects, referred to as object-centric augmentation. In RealityEffects, users can interactively attach various visual effects to physical objects within the captured 3D scene, enabling these effects to dynamically move and animate in sync with the corresponding physical motion and body movements. The primary contribution of this paper is the development of a taxonomy for such object-centric augmentations, which includes annotated labels, highlighted objects, ghost effects, and trajectory visualization. This taxonomy is informed by an analysis of 120 edited videos featuring object-centric visual effects. The findings from our user study confirm that our direct manipulation techniques lower the barriers to editing and annotating volumetric captures, thereby enhancing interactive and engaging viewing experiences of 3D volumetric videos. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: DIS 2024

arXiv:2405.09787 [pdf, other]

Analysis of the BraTS 2023 Intracranial Meningioma Segmentation Challenge

Authors: Dominic LaBella, Ujjwal Baid, Omaditya Khanna, Shan McBurney-Lin, Ryan McLean, Pierre Nedelec, Arif Rashid, Nourel Hoda Tahon, Talissa Altes, Radhika Bhalerao, Yaseen Dhemesh, Devon Godfrey, Fathi Hilal, Scott Floyd, Anastasia Janas, Anahita Fathi Kazerooni, John Kirkpatrick, Collin Kent, Florian Kofler, Kevin Leu, Nazanin Maleki, Bjoern Menze, Maxence Pajot, Zachary J. Reitman, Jeffrey D. Rudie , et al. (96 additional authors not shown)

Abstract: We describe the design and results from the BraTS 2023 Intracranial Meningioma Segmentation Challenge. The BraTS Meningioma Challenge differed from prior BraTS Glioma challenges in that it focused on meningiomas, which are typically benign extra-axial tumors with diverse radiologic and anatomical presentation and a propensity for multiplicity. Nine participating teams each developed deep-learning… ▽ More We describe the design and results from the BraTS 2023 Intracranial Meningioma Segmentation Challenge. The BraTS Meningioma Challenge differed from prior BraTS Glioma challenges in that it focused on meningiomas, which are typically benign extra-axial tumors with diverse radiologic and anatomical presentation and a propensity for multiplicity. Nine participating teams each developed deep-learning automated segmentation models using image data from the largest multi-institutional systematically expert annotated multilabel multi-sequence meningioma MRI dataset to date, which included 1000 training set cases, 141 validation set cases, and 283 hidden test set cases. Each case included T2, T2/FLAIR, T1, and T1Gd brain MRI sequences with associated tumor compartment labels delineating enhancing tumor, non-enhancing tumor, and surrounding non-enhancing T2/FLAIR hyperintensity. Participant automated segmentation models were evaluated and ranked based on a scoring system evaluating lesion-wise metrics including dice similarity coefficient (DSC) and 95% Hausdorff Distance. The top ranked team had a lesion-wise median dice similarity coefficient (DSC) of 0.976, 0.976, and 0.964 for enhancing tumor, tumor core, and whole tumor, respectively and a corresponding average DSC of 0.899, 0.904, and 0.871, respectively. These results serve as state-of-the-art benchmarks for future pre-operative meningioma automated segmentation algorithms. Additionally, we found that 1286 of 1424 cases (90.3%) had at least 1 compartment voxel abutting the edge of the skull-stripped image edge, which requires further investigation into optimal pre-processing face anonymization steps. △ Less

Submitted 15 May, 2024; originally announced May 2024.

Comments: 16 pages, 11 tables, 10 figures, MICCAI

arXiv:2405.07615 [pdf, other]

ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source

Authors: Hung Tuan Le, Long Truong To, Manh Trong Nguyen, Kiet Van Nguyen

Abstract: Fact-checking is essential due to the explosion of misinformation in the media ecosystem. Although false information exists in every language and country, most research to solve the problem mainly concentrated on huge communities like English and Chinese. Low-resource languages like Vietnamese are necessary to explore corpora and models for fact verification. To bridge this gap, we construct ViWik… ▽ More Fact-checking is essential due to the explosion of misinformation in the media ecosystem. Although false information exists in every language and country, most research to solve the problem mainly concentrated on huge communities like English and Chinese. Low-resource languages like Vietnamese are necessary to explore corpora and models for fact verification. To bridge this gap, we construct ViWikiFC, the first manual annotated open-domain corpus for Vietnamese Wikipedia Fact Checking more than 20K claims generated by converting evidence sentences extracted from Wikipedia articles. We analyze our corpus through many linguistic aspects, from the new dependency rate, the new n-gram rate, and the new word rate. We conducted various experiments for Vietnamese fact-checking, including evidence retrieval and verdict prediction. BM25 and InfoXLM (Large) achieved the best results in two tasks, with BM25 achieving an accuracy of 88.30% for SUPPORTS, 86.93% for REFUTES, and only 56.67% for the NEI label in the evidence retrieval task, InfoXLM (Large) achieved an F1 score of 86.51%. Furthermore, we also conducted a pipeline approach, which only achieved a strict accuracy of 67.00% when using InfoXLM (Large) and BM25. These results demonstrate that our dataset is challenging for the Vietnamese language model in fact-checking tasks. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.03899 [pdf, other]

doi 10.1117/1.JATIS.10.2.029001

Modeling and performance analysis of Implicit Electric Field Conjugation with two deformable mirrors applied to the Roman Coronagraph

Authors: Kian Milani, Ewan S. Douglas, Sebastiaan Y. Haffert, Kyle Van Gorkom

Abstract: High-order wavefront sensing and control (HOWFSC) is key to create a dark hole region within the coronagraphic image plane where high contrasts are achieved. The Roman Coronagraph is expected to perform its HOWFSC with a ground-in-the-loop scheme due to the computational complexity of the Electric Field Conjugation (EFC) algorithm. This scheme provides the flexibility to alter the HOWFSC algorithm… ▽ More High-order wavefront sensing and control (HOWFSC) is key to create a dark hole region within the coronagraphic image plane where high contrasts are achieved. The Roman Coronagraph is expected to perform its HOWFSC with a ground-in-the-loop scheme due to the computational complexity of the Electric Field Conjugation (EFC) algorithm. This scheme provides the flexibility to alter the HOWFSC algorithm for given science objectives. The baseline HOWFSC scheme involves running EFC while observing a bright star such as ζ Puppis to create the initial dark hole followed by a slew to the science target. The new implicit EFC (iEFC) algorithm removes the optical diffraction model from the controller, making the final contrast independent of model accuracy. While previously demonstrated with a single DM, iEFC is extended to two deformable mirror systems in order to create annular dark holes. The algorithm is then applied to the Wide-Field-of-View Shaped Pupil Coronagraph (SPC-WFOV) mode designed for the Roman Space Telescope using end-to-end physical optics models. Initial monochromatic simulations demonstrate the efficacy of iEFC as well as the optimal choice of modes for the SPC-WFOV instrument. Further simulations with a 3.6% wavefront control bandpass and a broader 10% bandpass then demonstrate that iEFC can be used in broadband scenarios to achieve contrasts below 1E-8 with Roman. Finally, an EMCCD model is implemented to estimate calibration times and predict the controller's performance. Here, 1E-8 contrasts are achieved with a calibration time of about 6.8 hours assuming the reference star is ζ Puppis. The results here indicate that iEFC can be a valid HOWFSC method that can mitigate the risk of model errors associated with space-borne coronagraphs, but to maximize iEFC performance, lengthy calibration times will be required to mitigate the noise accumulated during calibration. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 20 pages, 15 figures

Report number: Volume 10 Issue 2

Journal ref: SPIE Journal of Astronomical Telescopes, Instruments, and Systems. April 2024

arXiv:2405.02250 [pdf, other]

Geometric Fabrics: a Safe Guiding Medium for Policy Learning

Authors: Karl Van Wyk, Ankur Handa, Viktor Makoviychuk, Yijie Guo, Arthur Allshire, Nathan D. Ratliff

Abstract: Robotics policies are always subjected to complex, second order dynamics that entangle their actions with resulting states. In reinforcement learning (RL) contexts, policies have the burden of deciphering these complicated interactions over massive amounts of experience and complex reward functions to learn how to accomplish tasks. Moreover, policies typically issue actions directly to controllers… ▽ More Robotics policies are always subjected to complex, second order dynamics that entangle their actions with resulting states. In reinforcement learning (RL) contexts, policies have the burden of deciphering these complicated interactions over massive amounts of experience and complex reward functions to learn how to accomplish tasks. Moreover, policies typically issue actions directly to controllers like Operational Space Control (OSC) or joint PD control, which induces straightline motion towards these action targets in task or joint space. However, straightline motion in these spaces for the most part do not capture the rich, nonlinear behavior our robots need to exhibit, shifting the burden of discovering these behaviors more completely to the agent. Unlike these simpler controllers, geometric fabrics capture a much richer and desirable set of behaviors via artificial, second order dynamics grounded in nonlinear geometry. These artificial dynamics shift the uncontrolled dynamics of a robot via an appropriate control law to form behavioral dynamics. Behavioral dynamics unlock a new action space and safe, guiding behavior over which RL policies are trained. Behavioral dynamics enable bang-bang-like RL policy actions that are still safe for real robots, simplify reward engineering, and help sequence real-world, high-performance policies. We describe the framework more generally and create a specific instantiation for the problem of dexterous, in-hand reorientation of a cube by a highly actuated robot hand. △ Less

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2405.00543 [pdf, other]

New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis

Authors: Quy Hoang Nguyen, Minh-Van Truong Nguyen, Kiet Van Nguyen

Abstract: The emergence of multimodal data on social media platforms presents new opportunities to better understand user sentiments toward a given aspect. However, existing multimodal datasets for Aspect-Category Sentiment Analysis (ACSA) often focus on textual annotations, neglecting fine-grained information in images. Consequently, these datasets fail to fully exploit the richness inherent in multimodal.… ▽ More The emergence of multimodal data on social media platforms presents new opportunities to better understand user sentiments toward a given aspect. However, existing multimodal datasets for Aspect-Category Sentiment Analysis (ACSA) often focus on textual annotations, neglecting fine-grained information in images. Consequently, these datasets fail to fully exploit the richness inherent in multimodal. To address this, we introduce a new Vietnamese multimodal dataset, named ViMACSA, which consists of 4,876 text-image pairs with 14,618 fine-grained annotations for both text and image in the hotel domain. Additionally, we propose a Fine-Grained Cross-Modal Fusion Framework (FCMF) that effectively learns both intra- and inter-modality interactions and then fuses these information to produce a unified multimodal representation. Experimental results show that our framework outperforms SOTA models on the ViMACSA dataset, achieving the highest F1 score of 79.73%. We also explore characteristics and challenges in Vietnamese multimodal sentiment analysis, including misspellings, abbreviations, and the complexities of the Vietnamese language. This work contributes both a benchmark dataset and a new framework that leverages fine-grained multimodal information to improve multimodal aspect-category sentiment analysis. Our dataset is available for research purposes: https://github.com/hoangquy18/Multimodal-Aspect-Category-Sentiment-Analysis. △ Less

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2404.18397 [pdf, other]

ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images

Authors: Huy Quang Pham, Thang Kien-Bao Nguyen, Quan Van Nguyen, Dan Quang Tran, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Abstract: Optical Character Recognition - Visual Question Answering (OCR-VQA) is the task of answering text information contained in images that have just been significantly developed in the English language in recent years. However, there are limited studies of this task in low-resource languages such as Vietnamese. To this end, we introduce a novel dataset, ViOCRVQA (Vietnamese Optical Character Recogniti… ▽ More Optical Character Recognition - Visual Question Answering (OCR-VQA) is the task of answering text information contained in images that have just been significantly developed in the English language in recent years. However, there are limited studies of this task in low-resource languages such as Vietnamese. To this end, we introduce a novel dataset, ViOCRVQA (Vietnamese Optical Character Recognition - Visual Question Answering dataset), consisting of 28,000+ images and 120,000+ question-answer pairs. In this dataset, all the images contain text and questions about the information relevant to the text in the images. We deploy ideas from state-of-the-art methods proposed for English to conduct experiments on our dataset, revealing the challenges and difficulties inherent in a Vietnamese dataset. Furthermore, we introduce a novel approach, called VisionReader, which achieved 0.4116 in EM and 0.6990 in the F1-score on the test set. Through the results, we found that the OCR system plays a very important role in VQA models on the ViOCRVQA dataset. In addition, the objects in the image also play a role in improving model performance. We open access to our dataset at link (https://github.com/qhnhynmm/ViOCRVQA.git) for further research in OCR-VQA task in Vietnamese. △ Less

Submitted 28 April, 2024; originally announced April 2024.

arXiv:2404.15009 [pdf, other]

The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Deep Gandhi, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Andrea Franson, Anurag Gottipati, Shuvanjan Haldar, Juan Eugenio Iglesias , et al. (46 additional authors not shown)

Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we pr… ▽ More Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we present the CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs challenge, focused on pediatric brain tumors with data acquired across multiple international consortia dedicated to pediatric neuro-oncology and clinical trials. The CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs challenge brings together clinicians and AI/imaging scientists to lead to faster development of automated segmentation techniques that could benefit clinical trials, and ultimately the care of children with brain tumors. △ Less

Submitted 11 July, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2305.17033

arXiv:2404.11150 [pdf, ps, other]

Automated, efficient and model-free inference for randomized clinical trials via data-driven covariate adjustment

Authors: Kelly Van Lancker, Iván Díaz, Stijn Vansteelandt

Abstract: In May 2023, the U.S. Food and Drug Administration (FDA) released guidance for industry on "Adjustment for Covariates in Randomized Clinical Trials for Drugs and Biological Products". Covariate adjustment is a statistical analysis method for improving precision and power in clinical trials by adjusting for pre-specified, prognostic baseline variables. Though recommended by the FDA and the European… ▽ More In May 2023, the U.S. Food and Drug Administration (FDA) released guidance for industry on "Adjustment for Covariates in Randomized Clinical Trials for Drugs and Biological Products". Covariate adjustment is a statistical analysis method for improving precision and power in clinical trials by adjusting for pre-specified, prognostic baseline variables. Though recommended by the FDA and the European Medicines Agency (EMA), many trials do not exploit the available information in baseline variables or make use only of the baseline measurement of the outcome. This is likely (partly) due to the regulatory mandate to pre-specify baseline covariates for adjustment, leading to challenges in determining appropriate covariates and their functional forms. We will explore the potential of automated data-adaptive methods, such as machine learning algorithms, for covariate adjustment, addressing the challenge of pre-specification. Specifically, our approach allows the use of complex models or machine learning algorithms without compromising the interpretation or validity of the treatment effect estimate and its corresponding standard error, even in the presence of misspecified outcome working models. This contrasts the majority of competing works which assume correct model specification for the validity of standard errors. Our proposed estimators either necessitate ultra-sparsity in the outcome model (which can be relaxed by limiting the number of predictors in the model) or necessitate integration with sample splitting to enhance their performance. As such, we will arrive at simple estimators and standard errors for the marginal treatment effect in randomized clinical trials, which exploit data-adaptive outcome predictions based on prognostic baseline covariates, and have low (or no) bias in finite samples even when those predictions are themselves biased. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.10652 [pdf, other]

ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images

Authors: Quan Van Nguyen, Dan Quang Tran, Huy Quang Pham, Thang Kien-Bao Nguyen, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Abstract: Visual Question Answering (VQA) is a complicated task that requires the capability of simultaneously processing natural language and images. Initially, this task was researched, focusing on methods to help machines understand objects and scene contexts in images. However, some text appearing in the image that carries explicit information about the full content of the image is not mentioned. Along… ▽ More Visual Question Answering (VQA) is a complicated task that requires the capability of simultaneously processing natural language and images. Initially, this task was researched, focusing on methods to help machines understand objects and scene contexts in images. However, some text appearing in the image that carries explicit information about the full content of the image is not mentioned. Along with the continuous development of the AI era, there have been many studies on the reading comprehension ability of VQA models in the world. As a develo** country, conditions are still limited, and this task is still open in Vietnam. Therefore, we introduce the first large-scale dataset in Vietnamese specializing in the ability to understand text appearing in images, we call it ViTextVQA (\textbf{Vi}etnamese \textbf{Text}-based \textbf{V}isual \textbf{Q}uestion \textbf{A}nswering dataset) which contains \textbf{over 16,000} images and \textbf{over 50,000} questions with answers. Through meticulous experiments with various state-of-the-art models, we uncover the significance of the order in which tokens in OCR text are processed and selected to formulate answers. This finding helped us significantly improve the performance of the baseline models on the ViTextVQA dataset. Our dataset is available at this \href{https://github.com/minhquan6203/ViTextVQA-Dataset}{link} for research purposes. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: Preprint submitted to IJCV

arXiv:2404.03723 [pdf, other]

Metropolitan-scale heralded entanglement of solid-state qubits

Authors: Arian J. Stolk, Kian L. van der Enden, Marie-Christine Slater, Ingmar te Raa-Derckx, Pieter Botma, Joris van Rantwijk, Benjamin Biemond, Ronald A. J. Hagen, Rodolf W. Herfst, Wouter D. Koek, Arjan J. H. Meskers, René Vollmer, Erwin J. van Zwet, Matthew Markham, Andrew M. Edmonds, Jan Fabian Geus, Florian Elsen, Bernd Jungbluth, Constantin Haefner, Christoph Tresp, Jürgen Stuhler, Stephan Ritter, Ronald Hanson

Abstract: A key challenge towards future quantum internet technology is connecting quantum processors at metropolitan scale. Here, we report on heralded entanglement between two independently operated quantum network nodes separated by 10km. The two nodes hosting diamond spin qubits are linked with a midpoint station via 25km of deployed optical fiber. We minimize the effects of fiber photon loss by quantum… ▽ More A key challenge towards future quantum internet technology is connecting quantum processors at metropolitan scale. Here, we report on heralded entanglement between two independently operated quantum network nodes separated by 10km. The two nodes hosting diamond spin qubits are linked with a midpoint station via 25km of deployed optical fiber. We minimize the effects of fiber photon loss by quantum frequency conversion of the qubit-native photons to the telecom L-band and by embedding the link in an extensible phase-stabilized architecture enabling the use of the loss-resilient single-photon entangling protocol. By capitalizing on the full heralding capabilities of the network link in combination with real-time feedback logic on the long-lived qubits, we demonstrate the delivery of a predefined entangled state on the nodes irrespective of the heralding detection pattern. Addressing key scaling challenges and being compatible with different qubit systems, our architecture establishes a generic platform for exploring metropolitan-scale quantum networks. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: 10 pages, 4 figures, supplementary materials

arXiv:2404.03093 [pdf, other]

Unified laser stabilization and isolation on a silicon chip

Authors: Alexander D. White, Geun Ho Ahn, Richard Luhtaru, Joel Guo, Theodore J. Morin, Abhi Saxena, Lin Chang, Arka Majumdar, Kasper Van Gasse, John E. Bowers, Jelena Vučković

Abstract: Rapid progress in photonics has led to an explosion of integrated devices that promise to deliver the same performance as table-top technology at the nanoscale; heralding the next generation of optical communications, sensing and metrology, and quantum technologies. However, the challenge of co-integrating the multiple components of high-performance laser systems has left application of these nano… ▽ More Rapid progress in photonics has led to an explosion of integrated devices that promise to deliver the same performance as table-top technology at the nanoscale; heralding the next generation of optical communications, sensing and metrology, and quantum technologies. However, the challenge of co-integrating the multiple components of high-performance laser systems has left application of these nanoscale devices thwarted by bulky laser sources that are orders of magnitude larger than the devices themselves. Here we show that the two main ingredients for high-performance lasers -- noise reduction and isolation -- currently requiring serial combination of incompatible technologies, can be sourced simultaneously from a single, passive, CMOS-compatible nanophotonic device. To do this, we take advantage of both the long photon lifetime and the nonreciprocal Kerr nonlinearity of a high quality factor silicon nitride ring resonator to self-injection lock a semiconductor laser chip while also providing isolation. Additionally, we identify a previously unappreciated power regime limitation of current on-chip laser architectures which our system overcomes. Using our device, which we term a unified laser stabilizer, we demonstrate an on-chip integrated laser system with built-in isolation and noise reduction that operates with turnkey reliability. This approach departs from efforts to directly miniaturize and integrate traditional laser system components and serves to bridge the gap to fully integrated optical technologies. △ Less

Submitted 24 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

arXiv:2404.00732 [pdf, other]

An Abundance of Katherines: The Game Theory of Baby Naming

Authors: Katy Blumer, Kate Donahue, Katie Fritz, Kate Ivanovich, Katherine Lee, Katie Luo, Cathy Meng, Katie Van Koevering

Abstract: In this paper, we study the highly competitive arena of baby naming. Through making several Extremely Reasonable Assumptions (namely, that parents are myopic, perfectly knowledgeable agents who pick a name based solely on its uniquness), we create a model which is not only tractable and clean, but also perfectly captures the real world. We then extend our investigation with numerical experiments,… ▽ More In this paper, we study the highly competitive arena of baby naming. Through making several Extremely Reasonable Assumptions (namely, that parents are myopic, perfectly knowledgeable agents who pick a name based solely on its uniquness), we create a model which is not only tractable and clean, but also perfectly captures the real world. We then extend our investigation with numerical experiments, as well as analysis of large language model tools. We conclude by discussing avenues for future research. △ Less

Submitted 1 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

Comments: Accepted at SIGBOVIK 2024

arXiv:2403.15883 [pdf, other]

From Raw Data to Safety: Reducing Conservatism by Set Expansion

Authors: Mohammad Bajelani, Klaske van Heusden

Abstract: In response to safety concerns associated with learning-based algorithms, safety filters have been proposed as a modular technique. Generally, these filters heavily rely on the system's model, which is contradictory if they are intended to enhance a data-driven or end-to-end learning solution. This paper extends our previous work, a purely Data-Driven Safety Filter (DDSF) based on Willems' lemma,… ▽ More In response to safety concerns associated with learning-based algorithms, safety filters have been proposed as a modular technique. Generally, these filters heavily rely on the system's model, which is contradictory if they are intended to enhance a data-driven or end-to-end learning solution. This paper extends our previous work, a purely Data-Driven Safety Filter (DDSF) based on Willems' lemma, to an extremely short-sighted and non-conservative solution. Specifically, we propose online and offline sample-based methods to expand the safe set of DDSF and reduce its conservatism. Since this method is defined in an input-output framework, it can systematically handle both unknown and time-delay LTI systems using only one single batch of data. To evaluate its performance, we apply the proposed method to a time-delay system under various settings. The simulation results validate the effectiveness of the set expansion algorithm in generating a notably large input-output safe set, resulting in safety filters that are not conservative, even with an extremely short prediction horizon. △ Less

Submitted 23 March, 2024; originally announced March 2024.

arXiv:2403.15882 [pdf, other]

VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding

Authors: Phong Nguyen-Thuan Do, Son Quoc Tran, Phu Gia Hoang, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Abstract: The success of Natural Language Understanding (NLU) benchmarks in various languages, such as GLUE for English, CLUE for Chinese, KLUE for Korean, and IndoNLU for Indonesian, has facilitated the evaluation of new NLU models across a wide range of tasks. To establish a standardized set of benchmarks for Vietnamese NLU, we introduce the first Vietnamese Language Understanding Evaluation (VLUE) benchm… ▽ More The success of Natural Language Understanding (NLU) benchmarks in various languages, such as GLUE for English, CLUE for Chinese, KLUE for Korean, and IndoNLU for Indonesian, has facilitated the evaluation of new NLU models across a wide range of tasks. To establish a standardized set of benchmarks for Vietnamese NLU, we introduce the first Vietnamese Language Understanding Evaluation (VLUE) benchmark. The VLUE benchmark encompasses five datasets covering different NLU tasks, including text classification, span extraction, and natural language understanding. To provide an insightful overview of the current state of Vietnamese NLU, we then evaluate seven state-of-the-art pre-trained models, including both multilingual and Vietnamese monolingual models, on our proposed VLUE benchmark. Furthermore, we present CafeBERT, a new state-of-the-art pre-trained model that achieves superior results across all tasks in the VLUE benchmark. Our model combines the proficiency of a multilingual pre-trained model with Vietnamese linguistic knowledge. CafeBERT is developed based on the XLM-RoBERTa model, with an additional pretraining step utilizing a significant amount of Vietnamese textual data to enhance its adaptation to the Vietnamese language. For the purpose of future research, CafeBERT is made publicly available for research purposes. △ Less

Submitted 23 March, 2024; originally announced March 2024.

Comments: Accepted at NAACL 2024 (Findings)

arXiv:2403.15854 [pdf, other]

A Modular Safety Filter for Safety-Certified Cyber-Physical Systems

Authors: Mohammad Bajelani, Walter Lucia, Klaske van Heusden

Abstract: Nowadays, many control systems are networked and embed communication and computation capabilities. Such control architectures are prone to cyber attacks on the cyberinfrastructure. Consequently, there is an impellent need to develop solutions to preserve the plant's safety against potential attacks. To ensure safety, this paper introduces a modular safety filter approach that is effective for a va… ▽ More Nowadays, many control systems are networked and embed communication and computation capabilities. Such control architectures are prone to cyber attacks on the cyberinfrastructure. Consequently, there is an impellent need to develop solutions to preserve the plant's safety against potential attacks. To ensure safety, this paper introduces a modular safety filter approach that is effective for a variety of cyber-attack types. This solution can be implemented in combination with existing control and detection algorithms, effectively separating safety from performance. The safety filter does not require information on the reliability of the received command or the feature of the used anomaly detector. It can be implemented in conjunction with high-performance, resilient controllers, to achieve both high performance during normal operation and safety during an attack. As an illustrative example, we have shown the effectiveness of the proposed design considering a multi-agent formation task involving 20 mobile robots. The simulation results testify that the safety filter operates effectively during false data injection and intelligent attacks. △ Less

Submitted 23 March, 2024; originally announced March 2024.

arXiv:2403.12912 [pdf]

Strangers in a foreign land: 'Yeastizing' plant enzymes

Authors: Kristen Van Gelder, Steffen N. Lindner, Andrew D. Hanson, Juannan Zhou

Abstract: Expressing plant metabolic pathways in microbial platforms is an efficient, cost-effective solution for producing many desired plant compounds. As eukaryotic organisms, yeasts are often the preferred platform. However, expression of plant enzymes in a yeast frequently leads to failure because the enzymes are poorly adapted to the foreign yeast cellular environment. Here we first summarize current… ▽ More Expressing plant metabolic pathways in microbial platforms is an efficient, cost-effective solution for producing many desired plant compounds. As eukaryotic organisms, yeasts are often the preferred platform. However, expression of plant enzymes in a yeast frequently leads to failure because the enzymes are poorly adapted to the foreign yeast cellular environment. Here we first summarize current engineering approaches for optimizing performance of plant enzymes in yeast. A critical limitation of these approaches is that they are labor-intensive and must be customized for each individual enzyme, which significantly hinders the establishment of plant pathways in cellular factories. In response to this challenge, we propose the development of a cost-effective computational pipeline to redesign plant enzymes for better adaptation to the yeast cellular milieu. This proposition is underpinned by compelling evidence that plant and yeast enzymes exhibit distinct sequence features that are generalizable across enzyme families. Consequently, we introduce a data-driven machine learning framework designed to extract 'yeastizing' rules from natural protein sequence variations, which can be broadly applied to all enzymes. Additionally, we discuss the potential to integrate the machine learning model into a full design-build-test-cycle. △ Less

Submitted 19 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

Comments: 37 pages, 3 figures

arXiv:2403.04376 [pdf, other]

Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases

Authors: Yuqi Liu, Guanyi Chen, Kees van Deemter

Abstract: Theoretical linguists have suggested that some languages (e.g., Chinese and Japanese) are "cooler" than other languages based on the observation that the intended meaning of phrases in these languages depends more on their contexts. As a result, many expressions in these languages are shortened, and their meaning is inferred from the context. In this paper, we focus on the omission of the pluralit… ▽ More Theoretical linguists have suggested that some languages (e.g., Chinese and Japanese) are "cooler" than other languages based on the observation that the intended meaning of phrases in these languages depends more on their contexts. As a result, many expressions in these languages are shortened, and their meaning is inferred from the context. In this paper, we focus on the omission of the plurality and definiteness markers in Chinese noun phrases (NPs) to investigate the predictability of their intended meaning given the contexts. To this end, we built a corpus of Chinese NPs, each of which is accompanied by its corresponding context, and by labels indicating its singularity/plurality and definiteness/indefiniteness. We carried out corpus assessments and analyses. The results suggest that Chinese speakers indeed drop plurality and definiteness markers very frequently. Building on the corpus, we train a bank of computational models using both classic machine learning models and state-of-the-art pre-trained language models to predict the plurality and definiteness of each NP. We report on the performance of these models and analyse their behaviours. △ Less

Submitted 7 March, 2024; originally announced March 2024.

Comments: Accepted to LREC-COLING 2024

arXiv:2403.02144 [pdf, other]

Improved Tests for Mediation

Authors: Grant Hillier, Kees Jan van Garderen, Noud van Giersbergen

Abstract: Testing for a mediation effect is important in many disciplines, but is made difficult - even asymptotically - by the influence of nuisance parameters. Classical tests such as likelihood ratio (LR) and Wald (Sobel) tests have very poor power properties in parts of the parameter space, and many attempts have been made to produce improved tests, with limited success. In this paper we show that augme… ▽ More Testing for a mediation effect is important in many disciplines, but is made difficult - even asymptotically - by the influence of nuisance parameters. Classical tests such as likelihood ratio (LR) and Wald (Sobel) tests have very poor power properties in parts of the parameter space, and many attempts have been made to produce improved tests, with limited success. In this paper we show that augmenting the critical region of the LR test can produce a test with much improved behavior everywhere. In fact, we first show that there exists a test of this type that is (asymptotically) exact for certain test levels $α$, including the common choices $α=.01,.05,.10.$ The critical region of this exact test has some undesirable properties. We go on to show that there is a very simple class of augmented LR critical regions which provides tests that are nearly exact, and avoid the issues inherent in the exact test. We suggest an optimal and coherent member of this class, provide the table needed to implement the test and to report p-values if desired. Simulation confirms validity with non-Gaussian disturbances, under heteroskedasticity, and in a nonlinear (logit) model. A short application of the method to an entrepreneurial attitudes study is included for illustration. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: This is a revised version of the paper by Grant Hillier, Kees Jan van Garderen, Noud van Giersbergen (2022): Improved tests for mediation, cemmap working paper, No. CWP01/22, Centre for Microdata Methods and Practice (cemmap), London, https://doi.org/10.47004/wp.cem.2022.0122

arXiv:2402.07432 [pdf, other]

Intrinsic Task-based Evaluation for Referring Expression Generation

Authors: Guanyi Chen, Fahime Same, Kees van Deemter

Abstract: Recently, a human evaluation study of Referring Expression Generation (REG) models had an unexpected conclusion: on \textsc{webnlg}, Referring Expressions (REs) generated by the state-of-the-art neural models were not only indistinguishable from the REs in \textsc{webnlg} but also from the REs generated by a simple rule-based system. Here, we argue that this limitation could stem from the use of a… ▽ More Recently, a human evaluation study of Referring Expression Generation (REG) models had an unexpected conclusion: on \textsc{webnlg}, Referring Expressions (REs) generated by the state-of-the-art neural models were not only indistinguishable from the REs in \textsc{webnlg} but also from the REs generated by a simple rule-based system. Here, we argue that this limitation could stem from the use of a purely ratings-based human evaluation (which is a common practice in Natural Language Generation). To investigate these issues, we propose an intrinsic task-based evaluation for REG models, in which, in addition to rating the quality of REs, participants were asked to accomplish two meta-level tasks. One of these tasks concerns the referential success of each RE; the other task asks participants to suggest a better alternative for each RE. The outcomes suggest that, in comparison to previous evaluations, the new evaluation protocol assesses the performance of each REG model more comprehensively and makes the participants' ratings more reliable and discriminable. △ Less

Submitted 12 February, 2024; originally announced February 2024.

arXiv:2402.03482 [pdf, ps, other]

On time-fractional partial differential equations of time-dependent piecewise constant order

Authors: Yavar Kian, Marián Slodička, Éric Soccorsi, Karel Van Bockstal

Abstract: This contribution considers the time-fractional subdiffusion with a time-dependent variable-order fractional operator of order $β(t)$. It is assumed that $β(t)$ is a piecewise constant function with a finite number of jumps. A proof technique based on the Fourier method and results from constant-order fractional subdiffusion equations has been designed. This novel approach results in the well-pose… ▽ More This contribution considers the time-fractional subdiffusion with a time-dependent variable-order fractional operator of order $β(t)$. It is assumed that $β(t)$ is a piecewise constant function with a finite number of jumps. A proof technique based on the Fourier method and results from constant-order fractional subdiffusion equations has been designed. This novel approach results in the well-posedness of the problem. △ Less

Submitted 5 February, 2024; originally announced February 2024.

MSC Class: 35R11

arXiv:2402.03148 [pdf, ps, other]

Proof Theory and Decision Procedures for Deontic STIT Logics

Authors: Tim S. Lyon, Kees van Berkel

Abstract: This paper addresses the automation of reasoning with deontic STIT logics by means of proof theory. Our methodology consists of leveraging sound and cut-free complete sequent-style calculi to write a proof-search algorithm deciding deontic, multi-agent STIT logics with (un)limited choice. In order to ensure the termination of our proof-search algorithm, we introduce a special loop-checking mechani… ▽ More This paper addresses the automation of reasoning with deontic STIT logics by means of proof theory. Our methodology consists of leveraging sound and cut-free complete sequent-style calculi to write a proof-search algorithm deciding deontic, multi-agent STIT logics with (un)limited choice. In order to ensure the termination of our proof-search algorithm, we introduce a special loop-checking mechanism. Despite the acknowledged potential for deontic reasoning in the context of autonomous vehicles and other areas of AI, this work is the first to provide a syntactic decision procedure for deontic STIT logics. Our proof-search procedures are designed to provide verifiable witnesses/certificates of the (in)validity of formulae, which permit an analysis of the (non)theoremhood of formulae and act as explanations thereof. We utilize our proof-search algorithm to address agent-based normative reasoning tasks such as compliance checking. △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.02655 [pdf, other]

VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension

Authors: Thinh Phuoc Ngo, Khoa Tran Anh Dang, Son T. Luu, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

Abstract: This paper presents the development process of a Vietnamese spoken language corpus for machine reading comprehension (MRC) tasks and provides insights into the challenges and opportunities associated with using real-world data for machine reading comprehension tasks. The existing MRC corpora in Vietnamese mainly focus on formal written documents such as Wikipedia articles, online newspapers, or te… ▽ More This paper presents the development process of a Vietnamese spoken language corpus for machine reading comprehension (MRC) tasks and provides insights into the challenges and opportunities associated with using real-world data for machine reading comprehension tasks. The existing MRC corpora in Vietnamese mainly focus on formal written documents such as Wikipedia articles, online newspapers, or textbooks. In contrast, the VlogQA consists of 10,076 question-answer pairs based on 1,230 transcript documents sourced from YouTube -- an extensive source of user-uploaded content, covering the topics of food and travel. By capturing the spoken language of native Vietnamese speakers in natural settings, an obscure corner overlooked in Vietnamese research, the corpus provides a valuable resource for future research in reading comprehension tasks for the Vietnamese language. Regarding performance evaluation, our deep-learning models achieved the highest F1 score of 75.34% on the test set, indicating significant progress in machine reading comprehension for Vietnamese spoken language data. In terms of EM, the highest score we accomplished is 53.97%, which reflects the challenge in processing spoken-based content and highlights the need for further improvement. △ Less

Submitted 6 April, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

Comments: To appear as the main conference paper at EACL 2024

arXiv:2401.16403 [pdf, other]

ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text

Authors: Thanh-Nhi Nguyen, Thanh-Phong Le, Kiet Van Nguyen

Abstract: Lexical normalization, a fundamental task in Natural Language Processing (NLP), involves the transformation of words into their canonical forms. This process has been proven to benefit various downstream NLP tasks greatly. In this work, we introduce Vietnamese Lexical Normalization (ViLexNorm), the first-ever corpus developed for the Vietnamese lexical normalization task. The corpus comprises over… ▽ More Lexical normalization, a fundamental task in Natural Language Processing (NLP), involves the transformation of words into their canonical forms. This process has been proven to benefit various downstream NLP tasks greatly. In this work, we introduce Vietnamese Lexical Normalization (ViLexNorm), the first-ever corpus developed for the Vietnamese lexical normalization task. The corpus comprises over 10,000 pairs of sentences meticulously annotated by human annotators, sourced from public comments on Vietnam's most popular social media platforms. Various methods were used to evaluate our corpus, and the best-performing system achieved a result of 57.74% using the Error Reduction Rate (ERR) metric (van der Goot, 2019a) with the Leave-As-Is (LAI) baseline. For extrinsic evaluation, employing the model trained on ViLexNorm demonstrates the positive impact of the Vietnamese lexical normalization task on other NLP tasks. Our corpus is publicly available exclusively for research purposes. △ Less

Submitted 31 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

Comments: Accepted at the EACL 2024 Main Conference

arXiv:2401.16325 [pdf, other]

Making the unmodulated Pyramid wavefront sensor smart. Closed-loop demonstration of neural network wavefront reconstruction with MagAO-X

Authors: Rico Landman, Sebastiaan Haffert, Jared Males, Laird Close, Warren Foster, Kyle Van Gorkom, Olivier Guyon, Alex Hedglen, Maggie Kautz, Jay Kueny, Joseph Long, Jennifer Lumbres, Eden McEwen, Avalon McLeod, Lauren Schatz

Abstract: Almost all current and future high-contrast imaging instruments will use a Pyramid wavefront sensor (PWFS) as a primary or secondary wavefront sensor. The main issue with the PWFS is its nonlinear response to large phase aberrations, especially under strong atmospheric turbulence. Most instruments try to increase its linearity range by using dynamic modulation, but this leads to decreased sensitiv… ▽ More Almost all current and future high-contrast imaging instruments will use a Pyramid wavefront sensor (PWFS) as a primary or secondary wavefront sensor. The main issue with the PWFS is its nonlinear response to large phase aberrations, especially under strong atmospheric turbulence. Most instruments try to increase its linearity range by using dynamic modulation, but this leads to decreased sensitivity, most prominently for low-order modes, and makes it blind to petal-piston modes. In the push toward high-contrast imaging of fainter stars and deeper contrasts, there is a strong interest in using the PWFS in its unmodulated form. Here, we present closed-loop lab results of a nonlinear reconstructor for the unmodulated PWFS of the Magellan Adaptive Optics eXtreme (MagAO-X) system based on convolutional neural networks (CNNs). We show that our nonlinear reconstructor has a dynamic range of >600 nm root-mean-square (RMS), significantly outperforming the linear reconstructor that only has a 50 nm RMS dynamic range. The reconstructor behaves well in closed loop and can obtain >80% Strehl at 875 nm under a large variety of conditions and reaches higher Strehl ratios than the linear reconstructor under all simulated conditions. The CNN reconstructor also achieves the theoretical sensitivity limit of a PWFS, showing that it does not lose its sensitivity in exchange for dynamic range. The current CNN's computational time is 690 microseconds, which enables loop speeds of >1 kHz. On-sky tests are foreseen soon and will be important for pushing future high-contrast imaging instruments toward their limits. △ Less

Submitted 29 January, 2024; originally announced January 2024.

Comments: Accepted for publication in A&A

arXiv:2401.09549 [pdf, other]

Interferometric Single-Shot Parity Measurement in an InAs-Al Hybrid Device

Authors: Morteza Aghaee, Alejandro Alcaraz Ramirez, Zulfi Alam, Rizwan Ali, Mariusz Andrzejczuk, Andrey Antipov, Mikhail Astafev, Amin Barzegar, Bela Bauer, Jonathan Becker, Umesh Kumar Bhaskar, Alex Bocharov, Srini Boddapati, David Bohn, Jouri Bommer, Leo Bourdet, Arnaud Bousquet, Samuel Boutin, Lucas Casparis, Benjamin James Chapman, Sohail Chatoor, Anna Wulff Christensen, Cassandra Chua, Patrick Codd, William Cole , et al. (137 additional authors not shown)

Abstract: The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostruct… ▽ More The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostructures with a gate-defined nanowire. The interferometer is formed by tunnel-coupling the proximitized nanowire to quantum dots. The nanowire causes a state-dependent shift of these quantum dots' quantum capacitance of up to 1 fF. Our quantum capacitance measurements show flux h/2e-periodic bimodality with a signal-to-noise ratio of 1 in 3.7 $μ$s at optimal flux values. From the time traces of the quantum capacitance measurements, we extract a dwell time in the two associated states that is longer than 1 ms at in-plane magnetic fields of approximately 2 T. These results are consistent with a measurement of the fermion parity encoded in a pair of Majorana zero modes that are separated by approximately 3 $μ$m and subjected to a low rate of poisoning by non-equilibrium quasiparticles. The large capacitance shift and long poisoning time enable a parity measurement error probability of 1%. △ Less

Submitted 2 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

Comments: Added data on a second measurement of device A and a measurement of device B, expanded discussion of a trivial scenario. Refs added, author list updated

arXiv:2401.09041 [pdf, other]

Textual Summarisation of Large Sets: Towards a General Approach

Authors: Kittipitch Kuptavanich, Ehud Reiter, Kees Van Deemter, Advaith Siddharthan

Abstract: We are develo** techniques to generate summary descriptions of sets of objects. In this paper, we present and evaluate a rule-based NLG technique for summarising sets of bibliographical references in academic papers. This extends our previous work on summarising sets of consumer products and shows how our model generalises across these two very different domains. We are develo** techniques to generate summary descriptions of sets of objects. In this paper, we present and evaluate a rule-based NLG technique for summarising sets of bibliographical references in academic papers. This extends our previous work on summarising sets of consumer products and shows how our model generalises across these two very different domains. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2401.08745 [pdf, other]

Wake Forces

Authors: Ken Van Tilburg

Abstract: Two particles can exert forces on each other when embedded in a sea of weakly-coupled particles. These "wake forces'' occur whenever the source and target particles have quadratic interactions with the mediating particles; they are proportional to the ambient energy density, and typically have a range of order the characteristic de Broglie wavelength of the background. The effect can be understood… ▽ More Two particles can exert forces on each other when embedded in a sea of weakly-coupled particles. These "wake forces'' occur whenever the source and target particles have quadratic interactions with the mediating particles; they are proportional to the ambient energy density, and typically have a range of order the characteristic de Broglie wavelength of the background. The effect can be understood as source particles causing a disturbance in the background waves -- a wake -- which subsequently interacts with the target particles. Wake forces can be mediated by bosons or fermions, can have spin dependence, may be attractive or repulsive, and have a generally anisotropic spatial profile and range that depends on the phase-space distribution of the ambient particles. In this work, I investigate the application of wake forces to dark matter searches, recast existing limits on short-range forces into leading constraints on dark matter with quadratic couplings, and sketch out potential experimental modifications to optimize sensitivity. Wake forces occur in the Standard Model: the presence of the cosmic neutrino background induces a millimeter-range force about 22 orders of magnitude weaker than gravity. Wake forces may also be relevant in condensed-matter and atomic physics. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: 15+7 pages, 11 figures

arXiv:2401.07897 [pdf, ps, other]

The Pitfalls of Defining Hallucination

Authors: Kees van Deemter

Abstract: Despite impressive advances in Natural Language Generation (NLG) and Large Language Models (LLMs), researchers are still unclear about important aspects of NLG evaluation. To substantiate this claim, I examine current classifications of hallucination and omission in Data-text NLG, and I propose a logic-based synthesis of these classfications. I conclude by highlighting some remaining limitations o… ▽ More Despite impressive advances in Natural Language Generation (NLG) and Large Language Models (LLMs), researchers are still unclear about important aspects of NLG evaluation. To substantiate this claim, I examine current classifications of hallucination and omission in Data-text NLG, and I propose a logic-based synthesis of these classfications. I conclude by highlighting some remaining limitations of all current thinking about hallucination and by discussing implications for LLMs. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: Accepted for publication in Computational Linguistics on 30 Dec. 2023. (9 Pages.)

arXiv:2401.05570 [pdf, other]

Siamese Networks with Soft Labels for Unsupervised Lesion Detection and Patch Pretraining on Screening Mammograms

Authors: Kevin Van Vorst, Li Shen

Abstract: Self-supervised learning has become a popular way to pretrain a deep learning model and then transfer it to perform downstream tasks. However, most of these methods are developed on large-scale image datasets that contain natural objects with clear textures, outlines, and distinct color contrasts. It remains uncertain whether these methods are equally effective for medical imaging, where the regio… ▽ More Self-supervised learning has become a popular way to pretrain a deep learning model and then transfer it to perform downstream tasks. However, most of these methods are developed on large-scale image datasets that contain natural objects with clear textures, outlines, and distinct color contrasts. It remains uncertain whether these methods are equally effective for medical imaging, where the regions of interest often blend subtly and indistinctly with the surrounding tissues. In this study, we propose an alternative method that uses contralateral mammograms to train a neural network to encode similar embeddings when a pair contains both normal images and different embeddings when a pair contains normal and abnormal images. Our approach leverages the natural symmetry of human body as weak labels to learn to distinguish abnormal lesions from background tissues in a fully unsupervised manner. Our findings suggest that it's feasible by incorporating soft labels derived from the Euclidean distances between the embeddings of the image pairs into the Siamese network loss. Our method demonstrates superior performance in mammogram patch classification compared to existing self-supervised learning methods. This approach not only leverages a vast amount of image data effectively but also minimizes reliance on costly labels, a significant advantage particularly in the field of medical imaging. △ Less

Submitted 10 January, 2024; originally announced January 2024.

arXiv:2312.11476 [pdf]

The geometry of flow: Advancing predictions of river geometry with multi-model machine learning

Authors: Shuyu Y Chang, Zahra Ghahremani, Laura Manuel, Mohammad Erfani, Chaopeng Shen, Sagy Cohen, Kimberly Van Meter, Jennifer L Pierce, Ehab A Meselhe, Erfan Goharian

Abstract: Hydraulic geometry parameters describing river hydrogeomorphic is important for flood forecasting. Although well-established, power-law hydraulic geometry curves have been widely used to understand riverine systems and map** flooding inundation worldwide for the past 70 years, we have become increasingly aware of the limitations of these approaches. In the present study, we have moved beyond the… ▽ More Hydraulic geometry parameters describing river hydrogeomorphic is important for flood forecasting. Although well-established, power-law hydraulic geometry curves have been widely used to understand riverine systems and map** flooding inundation worldwide for the past 70 years, we have become increasingly aware of the limitations of these approaches. In the present study, we have moved beyond these traditional power-law relationships for river geometry, testing the ability of machine-learning models to provide improved predictions of river width and depth. For this work, we have used an unprecedentedly large river measurement dataset (HYDRoSWOT) as well as a suite of watershed predictor data to develop novel data-driven approaches to better estimate river geometries over the contiguous United States (CONUS). Our Random Forest, XGBoost, and neural network models out-performed the traditional, regionalized power law-based hydraulic geometry equations for both width and depth, providing R-squared values of as high as 0.75 for width and as high as 0.67 for depth, compared with R-squared values of 0.57 for width and 0.18 for depth from the regional hydraulic geometry equations. Our results also show diverse performance outcomes across stream orders and geographical regions for the different machine-learning models, demonstrating the value of using multi-model approaches to maximize the predictability of river geometry. The developed models have been used to create the newly publicly available STREAM-geo dataset, which provides river width, depth, width/depth ratio, and river and stream surface area (%RSSA) for nearly 2.7 million NHDPlus stream reaches across the rivers and streams across the contiguous US. △ Less

Submitted 27 November, 2023; originally announced December 2023.

Comments: 30 pages, 10 figures

arXiv:2312.11264 [pdf, other]

Enabling agency: trade-offs between regional and integrated energy systems design flexibility

Authors: Koen van Greevenbroek, Aleksander Grochowicz, Marianne Zeyringer, Fred Espen Benth

Abstract: Europe as a whole as well as individual countries have many distinct pathways to net carbon neutrality by 2050. We use novel near-optimal modelling techniques to illuminate trade-offs and interactions between national and continental energy transitions under uncertainty. Our results reveal extensive and robust flexibility at a regional level in renewable and hydrogen investments as well as in hydr… ▽ More Europe as a whole as well as individual countries have many distinct pathways to net carbon neutrality by 2050. We use novel near-optimal modelling techniques to illuminate trade-offs and interactions between national and continental energy transitions under uncertainty. Our results reveal extensive and robust flexibility at a regional level in renewable and hydrogen investments as well as in hydrogen and electricity exports. However, Europe's energy interconnections lead to significant cross-border effects of national energy strategies. Wind and hydrogen investments can easily be shifted geographically within Europe, and Northern Europe's capacity as energy exporter or importer can shape and be shaped by the remaining system. Solar in Southern Europe and Germany comes out as an enabler, and can unlock design flexibility for the rest of the system. Quantifying these regional trade-offs in energy system planning is crucial in order to facilitate meaningful policy discussion and enable a fair energy transition. △ Less

Submitted 18 December, 2023; originally announced December 2023.

Comments: 26 pages, 15 figures, 1 table

arXiv:2312.11173 [pdf, other]

Capillary adhesion of stick insects

Authors: Guillermo J. Amador, Brett Klaassen van Oorschot, Uddalok Sen, Benjamin Karman, Rutger Leenders

Abstract: Scientific progress within the last few decades has revealed the functional morphology of an insect's sticky footpads -- a soft, sponge-like pad that secretes a thin liquid film. However, the physico-chemical mechanisms underlying their adhesion remain elusive. Here, we explore these underlying mechanisms by simultaneously measuring adhesive force and contact geometry of the adhesive footpads of l… ▽ More Scientific progress within the last few decades has revealed the functional morphology of an insect's sticky footpads -- a soft, sponge-like pad that secretes a thin liquid film. However, the physico-chemical mechanisms underlying their adhesion remain elusive. Here, we explore these underlying mechanisms by simultaneously measuring adhesive force and contact geometry of the adhesive footpads of live, tethered Indian stick insects, \textit{Carausius morosus}, spanning more than two orders of magnitude in body mass. We find that the adhesive force we measure is similar to previous measurements that use a centrifuge. Our measurements afford use the opportunity to directly probe the adhesive stress \textit{in vivo}, and use existing theory on capillary adhesion to predict the surface tension of the secreted liquid and compare it to previous assumptions. From our predictions, we find that the surface tension required to generate the adhesive stresses we observed ranges between 0.68 mN/m and 12 mN/m. The low surface tension of the liquid would enhance the wetting of the stick insect's footpads and promote their ability to conform to various substrates. Our insights may inform the biomimetic design of capillary-based, reversible adhesives and motivate future studies on the capillary properties of the secreted liquid. △ Less

Submitted 2 April, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

Comments: 14 pages, 5 figures

arXiv:2312.07284 [pdf, other]

Your Vulnerability Disclosure Is Important To Us: An Analysis of Coordinated Vulnerability Disclosure Responses Using a Real Security Issue

Authors: Koen van Hove, Jeroen van der Ham-de Vos, Roland van Rijswijk-Deij

Abstract: It is a public secret that doing email securely is fraught with challenges. We found a vulnerability present at many email providers, allowing us to spoof email on behalf of many organisations. As email vulnerabilities are ten a penny, instead of focusing on yet another email vulnerability we ask a different question: how do organisations react to the disclosure of such a security issue in the wil… ▽ More It is a public secret that doing email securely is fraught with challenges. We found a vulnerability present at many email providers, allowing us to spoof email on behalf of many organisations. As email vulnerabilities are ten a penny, instead of focusing on yet another email vulnerability we ask a different question: how do organisations react to the disclosure of such a security issue in the wild? We specifically focus on organisations from the public and critical infrastructure sector who are required to respond to such notifications by law. We find that many organisations are difficult to reach when it concerns security issues, even if they have a security contact point. Additionally, our findings show that having policy in place improves the response and resolution rate, but that even with a policy in place, half of our reports remain unanswered and unsolved after 90~days. Based on these findings we provide recommendations to organisations and bodies such as ENISA to improve future coordinated vulnerability disclosure processes. △ Less

Submitted 12 December, 2023; originally announced December 2023.

Comments: 15 pages, 15 figures

arXiv:2312.00256 [pdf, other]

doi 10.1038/s41586-024-07457-2

Titanium:Sapphire-on-insulator for broadband tunable lasers and high-power amplifiers on chip

Authors: Joshua Yang, Kasper Van Gasse, Daniil M. Lukin, Melissa A. Guidry, Geun Ho Ahn, Alexander D. White, Jelena Vučković

Abstract: Titanium:Sapphire (Ti:Sa) lasers have been essential for advancing fundamental research and technological applications. Ti:Sa lasers are unmatched in bandwidth and tuning range, yet their use is severely restricted due to their large size, cost, and need for high optical pump powers. Here, we demonstrate a monocrystalline Ti:Sa-on-insulator (Ti:SaOI) photonics platform which enables dramatic minia… ▽ More Titanium:Sapphire (Ti:Sa) lasers have been essential for advancing fundamental research and technological applications. Ti:Sa lasers are unmatched in bandwidth and tuning range, yet their use is severely restricted due to their large size, cost, and need for high optical pump powers. Here, we demonstrate a monocrystalline Ti:Sa-on-insulator (Ti:SaOI) photonics platform which enables dramatic miniaturization, cost-reduction, and scalability of Ti:Sa technology. First, through fabrication of low-loss whispering gallery mode resonators, we realize a Ti:Sa laser operating with an ultra-low lasing threshold of 290 $μ$W. Then, through orders-of-magnitude improvement in mode confinement in Ti:SaOI waveguides, we realize the first integrated solid-state (i.e., non-semiconductor) optical amplifier operating below 1 $μ$m, with an ultra-wide bandwidth of 700 - 950 nm and peak gain of 64 dB/cm. We demonstrate unprecedented 17 dB distortion-free amplification of picosecond pulses to up to 2.3 nJ pulse energy, corresponding to a peak power of 1.0 kW. Finally, we demonstrate the first tunable integrated Ti:Sa laser, featuring narrow linewidths and a 24.7 THz tuning range, which, for the first time, can be pumped with low-cost, miniature, off-the-shelf green laser diodes. This opens doors to new modalities of Ti:Sa lasers (now occupying a footprint less than 0.15 mm$^2$), such as massively-scalable Ti:Sa laser array systems for a variety of applications. As a proof-of-concept demonstration, we employ a Ti:SaOI laser array as the sole optical control for a cavity quantum electrodynamics experiment with artificial atoms in silicon carbide. This work is a key step towards the democratization of Ti:Sa technology through a three orders-of-magnitude reduction in cost and footprint, as well as the introduction of solid-state broadband amplification of sub-micron wavelength light. △ Less

Submitted 30 November, 2023; originally announced December 2023.

arXiv:2311.15831 [pdf, other]

Temporal Action Localization for Inertial-based Human Activity Recognition

Authors: Marius Bock, Michael Moeller, Kristof Van Laerhoven

Abstract: A persistent trend in Deep Learning has been the applicability of machine learning concepts to other areas than originally introduced for. As of today, state-of-the-art activity recognition from wearable sensors relies on classifiers being trained on fixed windows of data. Contrarily, video-based Human Activity Recognition has followed a segment-based prediction approach, localizing activity occur… ▽ More A persistent trend in Deep Learning has been the applicability of machine learning concepts to other areas than originally introduced for. As of today, state-of-the-art activity recognition from wearable sensors relies on classifiers being trained on fixed windows of data. Contrarily, video-based Human Activity Recognition has followed a segment-based prediction approach, localizing activity occurrences from start to end. This paper is the first to systematically demonstrate the applicability of state-of-the-art TAL models for wearable Human Activity Recongition (HAR) using raw inertial data as input. Our results show that state-of-the-art TAL models are able to outperform popular inertial models on 4 out of 6 wearable activity recognition benchmark datasets, with improvements ranging as much as 25% in F1-score. Introducing the TAL community's most popular metric to inertial-based HAR, namely mean Average Precision, our analysis shows that TAL models are able to produce more coherent segments along with an overall higher NULL-class accuracy across all datasets. Being the first to provide such an analysis, the TAL community offers an interesting new perspective to inertial-based HAR with yet to be explored design choices and training concepts, which could be of significant value for the inertial-based HAR community. △ Less

Submitted 27 November, 2023; originally announced November 2023.

Comments: 20 pages, 7 figures, 2 tables

arXiv:2311.06851 [pdf, other]

Automatic Textual Normalization for Hate Speech Detection

Authors: Anh Thi-Hoang Nguyen, Dung Ha Nguyen, Nguyet Thi Nguyen, Khanh Thanh-Duy Ho, Kiet Van Nguyen

Abstract: Social media data is a valuable resource for research, yet it contains a wide range of non-standard words (NSW). These irregularities hinder the effective operation of NLP tools. Current state-of-the-art methods for the Vietnamese language address this issue as a problem of lexical normalization, involving the creation of manual rules or the implementation of multi-staged deep learning frameworks,… ▽ More Social media data is a valuable resource for research, yet it contains a wide range of non-standard words (NSW). These irregularities hinder the effective operation of NLP tools. Current state-of-the-art methods for the Vietnamese language address this issue as a problem of lexical normalization, involving the creation of manual rules or the implementation of multi-staged deep learning frameworks, which necessitate extensive efforts to craft intricate rules. In contrast, our approach is straightforward, employing solely a sequence-to-sequence (Seq2Seq) model. In this research, we provide a dataset for textual normalization, comprising 2,181 human-annotated comments with an inter-annotator agreement of 0.9014. By leveraging the Seq2Seq model for textual normalization, our results reveal that the accuracy achieved falls slightly short of 70%. Nevertheless, textual normalization enhances the accuracy of the Hate Speech Detection (HSD) task by approximately 2%, demonstrating its potential to improve the performance of complex NLP tasks. Our dataset is accessible for research purposes. △ Less

Submitted 4 December, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

Comments: Accepted to present at 2023 International Conference on Intelligent Systems Design and Applications (ISDA2023)

Showing 1–50 of 772 results for author: Van, K