Search | arXiv e-print repository

Conditional Forecasts in Large Bayesian VARs with Multiple Equality and Inequality Constraints

Authors: Joshua C. C. Chan, Davide Pettenuzzo, Aubrey Poon, Dan Zhu

Abstract: Conditional forecasts, i.e. projections of a set of variables of interest on the future paths of some other variables, are used routinely by empirical macroeconomists in a number of applied settings. In spite of this, the existing algorithms used to generate conditional forecasts tend to be very computationally intensive, especially when working with large Vector Autoregressions or when multiple l… ▽ More Conditional forecasts, i.e. projections of a set of variables of interest on the future paths of some other variables, are used routinely by empirical macroeconomists in a number of applied settings. In spite of this, the existing algorithms used to generate conditional forecasts tend to be very computationally intensive, especially when working with large Vector Autoregressions or when multiple linear equality and inequality constraints are imposed at once. We introduce a novel precision-based sampler that is fast, scales well, and yields conditional forecasts from linear equality and inequality constraints. We show in a simulation study that the proposed method produces forecasts that are identical to those from the existing algorithms but in a fraction of the time. We then illustrate the performance of our method in a large Bayesian Vector Autoregression where we simultaneously impose a mix of linear equality and inequality constraints on the future trajectories of key US macroeconomic indicators over the 2020--2022 period. △ Less

Submitted 2 July, 2024; originally announced July 2024.

arXiv:2407.00440 [pdf]

Three-dimensional non-reciprocal transport in photonic topological heterostructure of arbitrary shape

Authors: Mudi Wang, Ruo-Yang Zhang, Chenyu Zhang, Haoran Xue, Hongwei Jia, **g Hu, Dongyang Wang, Tianshu Jiang, C. T. Chan

Abstract: Electromagnetic wave propagation in three-dimensional space typically suffers omnidirectional scattering when encountering obstacles. In this study, we employed Chern vectors to construct a topological heterostructure, where large-volume non-reciprocal topological transport in three-dimension is achieved. The shape of the cross-section in the heterostructure can be arbitrary designed, and we exper… ▽ More Electromagnetic wave propagation in three-dimensional space typically suffers omnidirectional scattering when encountering obstacles. In this study, we employed Chern vectors to construct a topological heterostructure, where large-volume non-reciprocal topological transport in three-dimension is achieved. The shape of the cross-section in the heterostructure can be arbitrary designed, and we experimentally observed the distinctive cross-shaped field pattern transport, non-reciprocal energy harvesting, and most importantly, the remarkable ability of electromagnetic wave to traverse obstacles and abrupt structure changes without encountering reflections in 3D space. △ Less

Submitted 29 June, 2024; originally announced July 2024.

Comments: 17 pages, 3 figures

arXiv:2406.19620 [pdf]

Monolithic lithium niobate photonic chip for efficient terahertz-optic modulation and terahertz generation

Authors: Yiwen Zhang, **gwei Yang, Zhaoxi Chen, Hanke Feng, Sha Zhu, Kam-Man Shum, Chi Hou Chan, Cheng Wang

Abstract: The terahertz (THz) frequency range, bridging the gap between microwave and infrared frequencies, presents unparalleled opportunities for advanced imaging, sensing, communications, and spectroscopy applications. Terahertz photonics, in analogy with microwave photonics, is a promising solution to address the critical challenges in THz technologies through optical methods. Despite its vast potential… ▽ More The terahertz (THz) frequency range, bridging the gap between microwave and infrared frequencies, presents unparalleled opportunities for advanced imaging, sensing, communications, and spectroscopy applications. Terahertz photonics, in analogy with microwave photonics, is a promising solution to address the critical challenges in THz technologies through optical methods. Despite its vast potential, key technical challenges remain in effectively interfacing THz signals with the optical domain, especially THz-optic modulation and optical generation of THz waves. Here, we address these challenges using a monolithic integrated photonic chip designed to support efficient bidirectional interaction between THz and optical waves. Leveraging the significant second-order optical nonlinearity and strong optical and THz confinement in a thin-film lithium niobate on quartz platform, the chip supports both efficient THz-optic modulation and continuous THz wave generation at up to 500 GHz. The THz-optic modulator features a radio frequency (RF) half-wave voltage of 8V at 500 GHz, representing more than an order of magnitude reduction in modulation power consumption from previous works. The measured continuous wave THz generation efficiency of 4.8*10-6 /W at 500 GHz also marks a tenfold improvement over existing tunable THz generation devices based on lithium niobate. We further leverage the coherent nature of the optical THz generation process and mature optical modulation techniques to realize high-speed electro-THz modulation at frequencies up to 35 GHz. The chip-scale THz-photonic platform paves the way for more compact, efficient, and cost-effective THz systems with potential applications in THz communications, remote sensing, and spectroscopy. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.18463 [pdf, ps, other]

Complexity Aversion

Authors: Yuan Gu, Chao Hung Chan

Abstract: This paper proposes a model of decision-making under uncertainty in which an agent is constrained in her cognitive ability to consider complex acts. We identify the complexity of an act according to the corresponding partition of state space. The agent ranks acts according to the expected utility net of complexity cost. A key feature of this model is that the agent is able to update her complexity… ▽ More This paper proposes a model of decision-making under uncertainty in which an agent is constrained in her cognitive ability to consider complex acts. We identify the complexity of an act according to the corresponding partition of state space. The agent ranks acts according to the expected utility net of complexity cost. A key feature of this model is that the agent is able to update her complexity cost function after the arrival of new information. The main result characterizes axiomatically an updating rule for complexity cost function, the Minimal Complexity Aversion representation. According to this rule, the agent measures the complexity cost of an act conditional on the new information by using the cost of another act that gives exactly the same partition of the event but with the lowest ex-ante cost. △ Less

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.15653 [pdf, other]

Circular Polarization of Simulated Images of Black Holes

Authors: Abhishek V. Joshi, Ben S. Prather, Chi-kwan Chan, Maciek Wielgus, Charles F. Gammie

Abstract: Models of the resolved Event Horizon Telescope (EHT) sources Sgr A* and M87* are constrained by observations at multiple wavelengths, resolutions, polarizations, and time cadences. In this paper we compare unresolved circular polarization (CP) measurements to a library of models, where each model is characterized by a distribution of CP over time. In the library we vary the spin of the black hole,… ▽ More Models of the resolved Event Horizon Telescope (EHT) sources Sgr A* and M87* are constrained by observations at multiple wavelengths, resolutions, polarizations, and time cadences. In this paper we compare unresolved circular polarization (CP) measurements to a library of models, where each model is characterized by a distribution of CP over time. In the library we vary the spin of the black hole, the magnetic field strength at the horizon (i.e. both SANE and MAD models), the observer inclination, a parameter for the maximum ion-electron temperature ratio assuming a thermal plasma, and the direction of the magnetic field dipole moment. We find that ALMA observations of Sgr A* are inconsistent with all edge-on ($i = 90^\circ$) models. Restricting attention to the magnetically arrested disk (MAD) models favored by earlier EHT studies of Sgr A*, we find that only models with magnetic dipole moment pointing away from the observer are consistent with ALMA data. We also note that in 26 of the 27 passing MAD models the accretion flow rotates clockwise on the sky. We provide a table of the mean and standard deviation of the CP distributions for all model parameters along with their trends. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: 33 pages, 17 figures, 2 tables. Accepted for publication in ApJ

arXiv:2406.15621 [pdf, other]

On the 96-well plate coverglass tilt and curvature suppression in 96-camera imaging system

Authors: Antony C Chan

Abstract: The 96-eyes instrument is capable of computational extended depth of focus (eDOF) of up to +/- 30 micrometer in the phase channel, and conventional depth of field (DOF) of +/- 5 micrometer in the fluorescence channel. However, it requires minimal plate-to-plate cover glass depth variation to function. Plate depths are measured using a third-party plate scanner (Opera Phenix) grouped by plate types… ▽ More The 96-eyes instrument is capable of computational extended depth of focus (eDOF) of up to +/- 30 micrometer in the phase channel, and conventional depth of field (DOF) of +/- 5 micrometer in the fluorescence channel. However, it requires minimal plate-to-plate cover glass depth variation to function. Plate depths are measured using a third-party plate scanner (Opera Phenix) grouped by plate types (Greiner UV-Star, Cell-Star, and Eppendorf meniscus-free). The two-dimensional (2D) depth dataset is aggregated through principal component analysis to obtain the top eight dominating 2D surface deformation modes. More than 90% of the variation can be explained by the plate's absolute depth and tilt (Pitch, Gradient-Y, and Gradient-X), followed by (~= 2%) the cover glass's curvature (Curve-Y and Curve-XY). Plate-to-plate average depth and tilt variations are suppressed by a customized kinematic mount anchoring the plate's cover glass at the instrument's imaging plane. The plate's average curvature is compensated by manually aligning all 96-eyes microscope objective lenses to track the plate's surface; an one-off calibration procedure aided by the backlash-free piezo-flexure z-stage. Design validation is conducted in silico, with the proof of concept experiment conducted on the 96-eyes with new mounting bracket retrofits. △ Less

Submitted 13 March, 2024; originally announced June 2024.

arXiv:2406.12917 [pdf, other]

The Black Hole Explorer: Motivation and Vision

Authors: Michael D. Johnson, Kazunori Akiyama, Rebecca Baturin, Bryan Bilyeu, Lindy Blackburn, Don Boroson, Alejandro Cardenas-Avendano, Andrew Chael, Chi-kwan Chan, Dominic Chang, Peter Cheimets, Cathy Chou, Sheperd S. Doeleman, Joseph Farah, Peter Galison, Ronald Gamble, Charles F. Gammie, Zachary Gelles, Jose L. Gomez, Samuel E. Gralla, Paul Grimes, Leonid I. Gurvits, Shahar Hadar, Kari Haworth, Kazuhiro Hada , et al. (43 additional authors not shown)

Abstract: We present the Black Hole Explorer (BHEX), a mission that will produce the sharpest images in the history of astronomy by extending submillimeter Very-Long-Baseline Interferometry (VLBI) to space. BHEX will discover and measure the bright and narrow "photon ring" that is predicted to exist in images of black holes, produced from light that has orbited the black hole before esca**. This discovery… ▽ More We present the Black Hole Explorer (BHEX), a mission that will produce the sharpest images in the history of astronomy by extending submillimeter Very-Long-Baseline Interferometry (VLBI) to space. BHEX will discover and measure the bright and narrow "photon ring" that is predicted to exist in images of black holes, produced from light that has orbited the black hole before esca**. This discovery will expose universal features of a black hole's spacetime that are distinct from the complex astrophysics of the emitting plasma, allowing the first direct measurements of a supermassive black hole's spin. In addition to studying the properties of the nearby supermassive black holes M87* and Sgr A*, BHEX will measure the properties of dozens of additional supermassive black holes, providing crucial insights into the processes that drive their creation and growth. BHEX will also connect these supermassive black holes to their relativistic jets, elucidating the power source for the brightest and most efficient engines in the universe. BHEX will address fundamental open questions in the physics and astrophysics of black holes that cannot be answered without submillimeter space VLBI. The mission is enabled by recent technological breakthroughs, including the development of ultra-high-speed downlink using laser communications, and it leverages billions of dollars of existing ground infrastructure. We present the motivation for BHEX, its science goals and associated requirements, and the pathway to launch within the next decade. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: Proceedings for SPIE Astronomical Telescopes and Instrumentation

arXiv:2406.05168 [pdf, other]

doi 10.1103/PhysRevLett.132.223802

Topological photonic alloy

Authors: Tiantao Qu, Mudi Wang, Xiaoyu Cheng, Xiaohan Cui, Ruo-Yang Zhang, Zhao-Qing Zhang, Lei Zhang, Jun Chen, C. T. Chan

Abstract: We present the new concept of photonic alloy as a non-periodic topological material. By mixing non-magnetized and magnetized rods in a non-periodic 2D photonic crystal configuration, we realized photonic alloys in the microwave regime. Our experimental findings reveal that the photonic alloy sustains non-reciprocal chiral edge states (CESs) even at very low concentration of magnetized rods. The no… ▽ More We present the new concept of photonic alloy as a non-periodic topological material. By mixing non-magnetized and magnetized rods in a non-periodic 2D photonic crystal configuration, we realized photonic alloys in the microwave regime. Our experimental findings reveal that the photonic alloy sustains non-reciprocal chiral edge states (CESs) even at very low concentration of magnetized rods. The non-trivial topology and the associated edge states of these non-periodic systems can be characterized by the winding of the reflection phase. Our results indicate that the threshold concentrations for the investigated system within the first non-trivial band gap to exhibit topological behavior approach zero in the thermodynamic limit for substitutional alloys, while the threshold remains non-zero for interstitial alloys. At low concentration, the system exhibits an inhomogeneous structure characterized by isolated patches of non-percolating magnetic domains that are spaced far apart within a topologically trivial photonic crystal. Surprisingly, the system manifests CESs despite a local breakdown of time-reversal symmetry rather than a global one. Photonic alloys represent a new category of disordered topological materials, offering exciting opportunities for exploring topological materials with adjustable gaps. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 6 pages, 4 figures

Journal ref: Phys. Rev. Lett. 132, 223802 (2024)

arXiv:2406.04398 [pdf, other]

lenscat: a Public and Community-Contributed Catalog of Known Strong Gravitational Lenses

Authors: L. Vujeva, R. K. L. Lo, J. M. Ezquiaga, J. C. L. Chan

Abstract: We present lenscat, a public and community-contributed catalog of strong gravitational lenses found by electromagnetic surveys. The main objective of lenscat is to compile a simple, easy-to-access catalog that can be used in a variety of lensing studies, such as facilitating the search for the host galaxy of a candidate strongly lensed transient event. We also provide a python package to interact… ▽ More We present lenscat, a public and community-contributed catalog of strong gravitational lenses found by electromagnetic surveys. The main objective of lenscat is to compile a simple, easy-to-access catalog that can be used in a variety of lensing studies, such as facilitating the search for the host galaxy of a candidate strongly lensed transient event. We also provide a python package to interact with tools commonly used by the community. This allows end users both with and without lensing expertise to obtain a list of known strong lenses within a given search area, and to also rank them by their respective searched probabilities. Here, we exemplify this by crossmatching the gravitational wave joint sky localization region of an interesting pair of events GW170104-GW170814. Other examples with short gamma-ray bursts are given. Thanks to the open and simple infrastructure of lenscat, members of the lensing community can directly add newly found lenses from their own studies to help create a long-lasting catalog that is as exhaustive and accessible as possible. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 7 pages, 2 figures

arXiv:2405.17462 [pdf, other]

Ferrari: Federated Feature Unlearning via Optimizing Feature Sensitivity

Authors: Hanlin Gu, WinKent Ong, Chee Seng Chan, Lixin Fan

Abstract: The advent of Federated Learning (FL) highlights the practical necessity for the 'right to be forgotten' for all clients, allowing them to request data deletion from the machine learning model's service provider. This necessity has spurred a growing demand for Federated Unlearning (FU). Feature unlearning has gained considerable attention due to its applications in unlearning sensitive features, b… ▽ More The advent of Federated Learning (FL) highlights the practical necessity for the 'right to be forgotten' for all clients, allowing them to request data deletion from the machine learning model's service provider. This necessity has spurred a growing demand for Federated Unlearning (FU). Feature unlearning has gained considerable attention due to its applications in unlearning sensitive features, backdoor features, and bias features. Existing methods employ the influence function to achieve feature unlearning, which is impractical for FL as it necessitates the participation of other clients in the unlearning process. Furthermore, current research lacks an evaluation of the effectiveness of feature unlearning. To address these limitations, we define feature sensitivity in the evaluation of feature unlearning according to Lipschitz continuity. This metric characterizes the rate of change or sensitivity of the model output to perturbations in the input feature. We then propose an effective federated feature unlearning framework called Ferrari, which minimizes feature sensitivity. Extensive experimental results and theoretical analysis demonstrate the effectiveness of Ferrari across various feature unlearning scenarios, including sensitive, backdoor, and biased features. △ Less

Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: TLDR: The need for a "right to be forgotten" in Federated Learning has led to the development of the Ferrari framework, which efficiently unlearns sensitive features using a Lipschitz continuity-based metric, proven effective in extensive testing

arXiv:2405.16695 [pdf, other]

Oscillations in neuronal activity: a neuron-centered spatiotemporal model of the Unfolded Protein Response in prion diseases

Authors: Elliot M. Miller, Tat Chung D. Chan, Carlos Montes-Matamoros, Omar Sharif, Laurent Pujo-Menjouet, Michael R. Lindstrom

Abstract: Many neurodegenerative diseases (NDs) are characterized by the slow spatial spread of toxic protein species in the brain. The toxic proteins can induce neuronal stress, triggering the Unfolded Protein Response (UPR), which slows or stops protein translation and can indirectly reduce the toxic load. However, the UPR may also trigger processes leading to apoptotic cell death and the UPR is implicate… ▽ More Many neurodegenerative diseases (NDs) are characterized by the slow spatial spread of toxic protein species in the brain. The toxic proteins can induce neuronal stress, triggering the Unfolded Protein Response (UPR), which slows or stops protein translation and can indirectly reduce the toxic load. However, the UPR may also trigger processes leading to apoptotic cell death and the UPR is implicated in the progression of several NDs. In this paper, we develop a novel mathematical model to describe the spatiotemporal dynamics of the UPR mechanism for prion diseases. Our model is centered around a single neuron, with representative proteins P (healthy) and S (toxic) interacting with heterodimer dynamics (S interacts with P to form two S's). The model takes the form of a coupled system of nonlinear reaction-diffusion equations with a delayed, nonlinear flux for P (delay from the UPR). Through the delay, we find parameter regimes that exhibit oscillations in the P- and S-protein levels. We find that oscillations are more pronounced when the S-clearance rate and S-diffusivity are small in comparison to the P-clearance rate and P-diffusivity, respectively. The oscillations become more pronounced as delays in initiating the UPR increase. We also consider quasi-realistic clinical parameters to understand how possible drug therapies can alter the course of a prion disease. We find that decreasing the production of P, decreasing the recruitment rate, increasing the diffusivity of S, increasing the UPR S-threshold, and increasing the S clearance rate appear to be the most powerful modifications to reduce the mean UPR intensity and potentially moderate the disease progression. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: 35 pages, 11 tables, 13 figures

arXiv:2405.16021 [pdf, other]

VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

Authors: Michael Ahn, Montserrat Gonzalez Arenas, Matthew Bennice, Noah Brown, Christine Chan, Byron David, Anthony Francis, Gavin Gonzalez, Rainer Hessmer, Tomas Jackson, Nikhil J Joshi, Daniel Lam, Tsang-Wei Edward Lee, Alex Luong, Sharath Maddineni, Harsh Patel, Jodilyn Peralta, Jornell Quiambao, Diego Reyes, Rosario M Jauregui Ruano, Dorsa Sadigh, Pannag Sanketi, Leila Takayama, Pavel Vodenski, Fei Xia

Abstract: Robots today can exploit the rich world knowledge of large language models to chain simple behavioral skills into long-horizon tasks. However, robots often get interrupted during long-horizon tasks due to primitive skill failures and dynamic environments. We propose VADER, a plan, execute, detect framework with seeking help as a new skill that enables robots to recover and complete long-horizon ta… ▽ More Robots today can exploit the rich world knowledge of large language models to chain simple behavioral skills into long-horizon tasks. However, robots often get interrupted during long-horizon tasks due to primitive skill failures and dynamic environments. We propose VADER, a plan, execute, detect framework with seeking help as a new skill that enables robots to recover and complete long-horizon tasks with the help of humans or other robots. VADER leverages visual question answering (VQA) modules to detect visual affordances and recognize execution errors. It then generates prompts for a language model planner (LMP) which decides when to seek help from another robot or human to recover from errors in long-horizon task execution. We show the effectiveness of VADER with two long-horizon robotic tasks. Our pilot study showed that VADER is capable of performing complex long-horizon tasks by asking for help from another robot to clear a table. Our user study showed that VADER is capable of performing complex long-horizon tasks by asking for help from a human to clear a path. We gathered feedback from people (N=19) about the performance of the VADER performance vs. a robot that did not ask for help. https://google-vader.github.io/ △ Less

Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

Comments: 9 pages, 4 figures

arXiv:2405.13322 [pdf, ps, other]

The Meyers-Serrin theorem on Riemannian manifolds: a survey

Authors: Chi Hin Chan, Magdalena Czubak

Abstract: We revisit the questions of density of smooth functions, and differential forms, in Sobolev spaces on Riemannian manifolds. We carefully show equivalence of weak covariant derivatives to weak partial derivatives. We revisit the questions of density of smooth functions, and differential forms, in Sobolev spaces on Riemannian manifolds. We carefully show equivalence of weak covariant derivatives to weak partial derivatives. △ Less

Submitted 27 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

Comments: 13 pages, minor updates (added some definitions, fixed typos);

arXiv:2405.09510 [pdf, other]

The Instrumental Variable Model with Categorical Instrument, Treatment and Outcome

Authors: Yilin Song, K. C. Gary Chan, Thomas S. Richardson

Abstract: Instrumental variable models are central to the inference of causal effects in many settings. We consider the instrumental variable model with discrete variables where the instrument (Z), exposure (X) and outcome (Y) take Q, K, and M levels respectively. We assume that the instrument is randomized and that there is no direct effect of Z on Y so that Y(x,z) = Y(x). We first provide a simple charact… ▽ More Instrumental variable models are central to the inference of causal effects in many settings. We consider the instrumental variable model with discrete variables where the instrument (Z), exposure (X) and outcome (Y) take Q, K, and M levels respectively. We assume that the instrument is randomized and that there is no direct effect of Z on Y so that Y(x,z) = Y(x). We first provide a simple characterization of the set of joint distributions of the potential outcomes P(Y(x=1), ..., Y(x=K)) compatible with a given observed distribution P(X, Y | Z). We then discuss the variation (in)dependence property of the marginal probability distribution of the potential outcomes P(Y(x=1)), ..., P(Y(x=K)) which has direct implications for partial identification of average causal effect contrasts such as E[Y(x=i) - Y(x=j)]. We also include simulation results on the volume of the observed distributions not compatible with the IV model as K and Q change. △ Less

Submitted 27 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.07675 [pdf]

Super-concentrated alkali hydroxide electrolytes for rechargeable Zn batteries

Authors: Yilin Ma, Jiajia Huang, Shengyong Gao, iangyu Li, Zhibin Yi, Diwen Xiao, Cheuk Kai Kevin Chan, Ding Pan, Qing Chen

Abstract: Rechargeable Zn batteries offer safe, inexpensive energy storage, but when deeply discharged to compete with lithium-ion batteries, they are plagued by parasitic reactions at the Zn anodes. We apply super-concentrated alkaline electrolytes to suppress two key parasitic reactions, hydrogen evolution and ZnO passivation. An electrolyte with 15 M KOH displays a broad electrochemical window (>2.5 V on… ▽ More Rechargeable Zn batteries offer safe, inexpensive energy storage, but when deeply discharged to compete with lithium-ion batteries, they are plagued by parasitic reactions at the Zn anodes. We apply super-concentrated alkaline electrolytes to suppress two key parasitic reactions, hydrogen evolution and ZnO passivation. An electrolyte with 15 M KOH displays a broad electrochemical window (>2.5 V on Au), a high ZnO solubility (>1.5 M), and an exceptionally high ionic conductivity (>0.27 S/cm at 25 C). Spectroscopies and ab-initio molecular dynamics simulation suggest K+-OH- pairs and a tightened water network to underpin the stability. The simulation further reveals unique triggered proton hop** that offsets the lack of water wires to sustain the conductivity. Low hydrogen evolution, confirmed via online mass spectroscopy, and slow passivation enable a NiOOH||Zn battery to deliver a cumulative capacity of 8.4 Ah cm-2 and a Zn-air battery to last for over 110 hours. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.07667 [pdf, other]

Backdoor Removal for Generative Large Language Models

Authors: Haoran Li, Yulin Chen, Zihao Zheng, Qi Hu, Chunkit Chan, Heshan Liu, Yangqiu Song

Abstract: With rapid advances, generative large language models (LLMs) dominate various Natural Language Processing (NLP) tasks from understanding to reasoning. Yet, language models' inherent vulnerabilities may be exacerbated due to increased accessibility and unrestricted model training on massive textual data from the Internet. A malicious adversary may publish poisoned data online and conduct backdoor a… ▽ More With rapid advances, generative large language models (LLMs) dominate various Natural Language Processing (NLP) tasks from understanding to reasoning. Yet, language models' inherent vulnerabilities may be exacerbated due to increased accessibility and unrestricted model training on massive textual data from the Internet. A malicious adversary may publish poisoned data online and conduct backdoor attacks on the victim LLMs pre-trained on the poisoned data. Backdoored LLMs behave innocuously for normal queries and generate harmful responses when the backdoor trigger is activated. Despite significant efforts paid to LLMs' safety issues, LLMs are still struggling against backdoor attacks. As Anthropic recently revealed, existing safety training strategies, including supervised fine-tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF), fail to revoke the backdoors once the LLM is backdoored during the pre-training stage. In this paper, we present Simulate and Eliminate (SANDE) to erase the undesired backdoored map**s for generative LLMs. We initially propose Overwrite Supervised Fine-tuning (OSFT) for effective backdoor removal when the trigger is known. Then, to handle the scenarios where the trigger patterns are unknown, we integrate OSFT into our two-stage framework, SANDE. Unlike previous works that center on the identification of backdoors, our safety-enhanced LLMs are able to behave normally even when the exact triggers are activated. We conduct comprehensive experiments to show that our proposed SANDE is effective against backdoor attacks while bringing minimal harm to LLMs' powerful capability without any additional access to unbackdoored clean models. We will release the reproducible code. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.05847 [pdf, other]

Learned feature representations are biased by complexity, learning order, position, and more

Authors: Andrew Kyle Lampinen, Stephanie C. Y. Chan, Katherine Hermann

Abstract: Representation learning, and interpreting learned representations, are key areas of focus in machine learning and neuroscience. Both fields generally use representations as a means to understand or improve a system's computations. In this work, however, we explore surprising dissociations between representation and computation that may pose challenges for such efforts. We create datasets in which… ▽ More Representation learning, and interpreting learned representations, are key areas of focus in machine learning and neuroscience. Both fields generally use representations as a means to understand or improve a system's computations. In this work, however, we explore surprising dissociations between representation and computation that may pose challenges for such efforts. We create datasets in which we attempt to match the computational role that different features play, while manipulating other properties of the features or the data. We train various deep learning architectures to compute these multiple abstract features about their inputs. We find that their learned feature representations are systematically biased towards representing some features more strongly than others, depending upon extraneous properties such as feature complexity, the order in which features are learned, and the distribution of features over the inputs. For example, features that are simpler to compute or learned first tend to be represented more strongly and densely than features that are more complex or learned later, even if all features are learned equally well. We also explore how these biases are affected by architectures, optimizers, and training regimes (e.g., in transformers, features decoded earlier in the output sequence also tend to be represented more strongly). Our results help to characterize the inductive biases of gradient-based representation learning. These results also highlight a key challenge for interpretability $-$ or for comparing the representations of models and brains $-$ disentangling extraneous biases from the computationally important aspects of a system's internal representations. △ Less

Submitted 6 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.03141 [pdf, other]

Automatic Ultrasound Curve Angle Measurement via Affinity Clustering for Adolescent Idiopathic Scoliosis Evaluation

Authors: Yihao Zhou, Timothy Tin-Yan Lee, Kelly Ka-Lee Lai, Chonglin Wu, Hin Ting Lau, De Yang, Chui-Yi Chan, Winnie Chiu-Wing Chu, Jack Chun-Yiu Cheng, Tsz-** Lam, Yong-** Zheng

Abstract: The current clinical gold standard for evaluating adolescent idiopathic scoliosis (AIS) is X-ray radiography, using Cobb angle measurement. However, the frequent monitoring of the AIS progression using X-rays poses a challenge due to the cumulative radiation exposure. Although 3D ultrasound has been validated as a reliable and radiation-free alternative for scoliosis assessment, the process of mea… ▽ More The current clinical gold standard for evaluating adolescent idiopathic scoliosis (AIS) is X-ray radiography, using Cobb angle measurement. However, the frequent monitoring of the AIS progression using X-rays poses a challenge due to the cumulative radiation exposure. Although 3D ultrasound has been validated as a reliable and radiation-free alternative for scoliosis assessment, the process of measuring spinal curvature is still carried out manually. Consequently, there is a considerable demand for a fully automatic system that can locate bony landmarks and perform angle measurements. To this end, we introduce an estimation model for automatic ultrasound curve angle (UCA) measurement. The model employs a dual-branch network to detect candidate landmarks and perform vertebra segmentation on ultrasound coronal images. An affinity clustering strategy is utilized within the vertebral segmentation area to illustrate the affinity relationship between candidate landmarks. Subsequently, we can efficiently perform line delineation from a clustered affinity map for UCA measurement. As our method is specifically designed for UCA calculation, this method outperforms other state-of-the-art methods for landmark and line detection tasks. The high correlation between the automatic UCA and Cobb angle (R$^2$=0.858) suggests that our proposed method can potentially replace manual UCA measurement in ultrasound scoliosis assessment. △ Less

Submitted 6 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

arXiv:2405.01356 [pdf, other]

Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance

Authors: Kelvin C. K. Chan, Yang Zhao, Xuhui Jia, Ming-Hsuan Yang, Huisheng Wang

Abstract: In subject-driven text-to-image synthesis, the synthesis process tends to be heavily influenced by the reference images provided by users, often overlooking crucial attributes detailed in the text prompt. In this work, we propose Subject-Agnostic Guidance (SAG), a simple yet effective solution to remedy the problem. We show that through constructing a subject-agnostic condition and applying our pr… ▽ More In subject-driven text-to-image synthesis, the synthesis process tends to be heavily influenced by the reference images provided by users, often overlooking crucial attributes detailed in the text prompt. In this work, we propose Subject-Agnostic Guidance (SAG), a simple yet effective solution to remedy the problem. We show that through constructing a subject-agnostic condition and applying our proposed dual classifier-free guidance, one could obtain outputs consistent with both the given subject and input text prompts. We validate the efficacy of our approach through both optimization-based and encoder-based methods. Additionally, we demonstrate its applicability in second-order customization methods, where an encoder-based model is fine-tuned with DreamBooth. Our approach is conceptually simple and requires only minimal code modifications, but leads to substantial quality improvements, as evidenced by our evaluations and user studies. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: Accepted to CVPR 2024

arXiv:2405.00671 [pdf, ps, other]

The scalar product formula for parahoric Deligne--Lusztig induction

Authors: Charlotte Chan

Abstract: Parahoric Deligne--Lusztig induction gives rise to positive-depth representations of parahoric subgroups of $p$-adic groups. The most fundamental basic question about parahoric Deligne--Lusztig induction is whether it satisfies the scalar product formula. We resolve this conjecture for all split-generic pairs $(T,θ)$ -- in particular, for all characters $θ$ if $T$ is elliptic. Parahoric Deligne--Lusztig induction gives rise to positive-depth representations of parahoric subgroups of $p$-adic groups. The most fundamental basic question about parahoric Deligne--Lusztig induction is whether it satisfies the scalar product formula. We resolve this conjecture for all split-generic pairs $(T,θ)$ -- in particular, for all characters $θ$ if $T$ is elliptic. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 22 pages

arXiv:2404.14291 [pdf, ps, other]

Classification of a class of planar polynomials

Authors: Chin Hei Chan, Maosheng Xiong

Abstract: Let $p$ be an odd prime, $k,\ell$ be positive integers, $q=p^k, Q=p^{\ell}$. In this paper we characterise planar functions of the form $f_{\underline{c}}(X)=c_0X^{qQ+q}+c_1X^{qQ+1}+c_2X^{Q+q}+c_3X^{Q+1}$ over $\mathbb{F}_{q^2}$ for any $\underline{c}=(c_0,c_1,c_2,c_3) \in \mathbb{F}_{q^2}^4$ in terms of linear equivalence. Let $p$ be an odd prime, $k,\ell$ be positive integers, $q=p^k, Q=p^{\ell}$. In this paper we characterise planar functions of the form $f_{\underline{c}}(X)=c_0X^{qQ+q}+c_1X^{qQ+1}+c_2X^{Q+q}+c_3X^{Q+1}$ over $\mathbb{F}_{q^2}$ for any $\underline{c}=(c_0,c_1,c_2,c_3) \in \mathbb{F}_{q^2}^4$ in terms of linear equivalence. △ Less

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.14215 [pdf, other]

Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction

Authors: Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu Song

Abstract: The task of condensing large chunks of textual information into concise and structured tables has gained attention recently due to the emergence of Large Language Models (LLMs) and their potential benefit for downstream tasks, such as text summarization and text mining. Previous approaches often generate tables that directly replicate information from the text, limiting their applicability in broa… ▽ More The task of condensing large chunks of textual information into concise and structured tables has gained attention recently due to the emergence of Large Language Models (LLMs) and their potential benefit for downstream tasks, such as text summarization and text mining. Previous approaches often generate tables that directly replicate information from the text, limiting their applicability in broader contexts, as text-to-table generation in real-life scenarios necessitates information extraction, reasoning, and integration. However, there is a lack of both datasets and methodologies towards this task. In this paper, we introduce LiveSum, a new benchmark dataset created for generating summary tables of competitions based on real-time commentary texts. We evaluate the performances of state-of-the-art LLMs on this task in both fine-tuning and zero-shot settings, and additionally propose a novel pipeline called $T^3$(Text-Tuple-Table) to improve their performances. Extensive experimental results demonstrate that LLMs still struggle with this task even after fine-tuning, while our approach can offer substantial performance gains without explicit training. Further analyses demonstrate that our method exhibits strong generalization abilities, surpassing previous approaches on several other text-to-table datasets. Our code and data can be found at https://github.com/HKUST-KnowComp/LiveSum-TTT. △ Less

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.14135 [pdf, other]

Text in the Dark: Extremely Low-Light Text Image Enhancement

Authors: Che-Tsung Lin, Chun Chet Ng, Zhi Qin Tan, Wan Jun Nah, Xinyu Wang, Jie Long Kew, Pohao Hsu, Shang Hong Lai, Chee Seng Chan, Christopher Zach

Abstract: Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text t… ▽ More Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text tasks. Further research is also hindered by the lack of extremely low-light text datasets. To address these limitations, we propose a novel encoder-decoder framework with an edge-aware attention module to focus on scene text regions during enhancement. Our proposed method uses novel text detection and edge reconstruction losses to emphasize low-level scene text features, leading to successful text extraction. Additionally, we present a Supervised Deep Curve Estimation (Supervised-DCE) model to synthesize extremely low-light images based on publicly available scene text datasets such as ICDAR15 (IC15). We also labeled texts in the extremely low-light See In the Dark (SID) and ordinary LOw-Light (LOL) datasets to allow for objective assessment of extremely low-light image enhancement through scene text tasks. Extensive experiments show that our model outperforms state-of-the-art methods in terms of both image quality and scene text metrics on the widely-used LOL, SID, and synthetic IC15 datasets. Code and dataset will be released publicly at https://github.com/chunchet-ng/Text-in-the-Dark. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: The first two authors contributed equally to this work

arXiv:2404.13944 [pdf, other]

Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas

Authors: Jia Wei Sii, Chee Seng Chan

Abstract: Contemporary makeup transfer methods primarily focus on replicating makeup from one face to another, considerably limiting their use in creating diverse and creative character makeup essential for visual storytelling. Such methods typically fail to address the need for uniqueness and contextual relevance, specifically aligning with character and story settings as they depend heavily on existing fa… ▽ More Contemporary makeup transfer methods primarily focus on replicating makeup from one face to another, considerably limiting their use in creating diverse and creative character makeup essential for visual storytelling. Such methods typically fail to address the need for uniqueness and contextual relevance, specifically aligning with character and story settings as they depend heavily on existing facial makeup in reference images. This approach also presents a significant challenge when attempting to source a perfectly matched facial makeup style, further complicating the creation of makeup designs inspired by various story elements, such as theme, background, and props that do not necessarily feature faces. To address these limitations, we introduce $Gorgeous$, a novel diffusion-based makeup application method that goes beyond simple transfer by innovatively crafting unique and thematic facial makeup. Unlike traditional methods, $Gorgeous$ does not require the presence of a face in the reference images. Instead, it draws artistic inspiration from a minimal set of three to five images, which can be of any type, and transforms these elements into practical makeup applications directly on the face. Our comprehensive experiments demonstrate that $Gorgeous$ can effectively generate distinctive character facial makeup inspired by the chosen thematic reference images. This approach opens up new possibilities for integrating broader story elements into character makeup, thereby enhancing the narrative depth and visual impact in storytelling. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: Project page: https://github.com/JiaWeiSii/gorgeous/

arXiv:2404.13627 [pdf, other]

NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding

Authors: Chunkit Chan, Cheng Jiayang, Yauwai Yim, Zheye Deng, Wei Fan, Haoran Li, Xin Liu, Hongming Zhang, Weiqi Wang, Yangqiu Song

Abstract: Large Language Models (LLMs) have sparked substantial interest and debate concerning their potential emergence of Theory of Mind (ToM) ability. Theory of mind evaluations currently focuses on testing models using machine-generated data or game settings prone to shortcuts and spurious correlations, which lacks evaluation of machine ToM ability in real-world human interaction scenarios. This poses a… ▽ More Large Language Models (LLMs) have sparked substantial interest and debate concerning their potential emergence of Theory of Mind (ToM) ability. Theory of mind evaluations currently focuses on testing models using machine-generated data or game settings prone to shortcuts and spurious correlations, which lacks evaluation of machine ToM ability in real-world human interaction scenarios. This poses a pressing demand to develop new real-world scenario benchmarks. We introduce NegotiationToM, a new benchmark designed to stress-test machine ToM in real-world negotiation surrounding covered multi-dimensional mental states (i.e., desires, beliefs, and intentions). Our benchmark builds upon the Belief-Desire-Intention (BDI) agent modeling theory and conducts the necessary empirical experiments to evaluate large language models. Our findings demonstrate that NegotiationToM is challenging for state-of-the-art LLMs, as they consistently perform significantly worse than humans, even when employing the chain-of-thought (CoT) method. △ Less

Submitted 21 April, 2024; originally announced April 2024.

arXiv:2404.13287 [pdf, other]

Spontaneous emission decay and excitation in photonic temporal crystals

Authors: Jagang Park, Kyungmin Lee, Ruo-Yang Zhang, Hee-Chul Park, Jung-Wan Ryu, Gil Young Cho, Min Yeul Lee, Zhaoqing Zhang, Namkyoo Park, Wonju Jeon, Jonghwa Shin, C. T. Chan, Bumki Min

Abstract: Over the last few decades, the prominent strategies for controlling spontaneous emission has been the use of resonant or space-periodic photonic structures. This approach, initially articulated by Purcell and later expanded upon by Yablonovitch in the context of photonic crystals, leverages the spatial surroundings to modify the spontaneous emission decay rate of atoms or quantum emitters. However… ▽ More Over the last few decades, the prominent strategies for controlling spontaneous emission has been the use of resonant or space-periodic photonic structures. This approach, initially articulated by Purcell and later expanded upon by Yablonovitch in the context of photonic crystals, leverages the spatial surroundings to modify the spontaneous emission decay rate of atoms or quantum emitters. However, the rise of time-varying photonics has compelled a reevaluation of the spontaneous emission process within dynamically changing environments, especially concerning photonic temporal crystals where optical properties undergo time-periodic modulation. Here, we apply classical light-matter interaction theory along with Floquet analysis to reveal a substantial enhancement in the spontaneous emission decay rate at the momentum gap frequency in photonic temporal crystals. This enhancement is attributed to time-periodicity-induced loss and gain mechanisms, as well as the non-orthogonality of Floquet eigenstates that are inherent to photonic temporal crystals. Intriguingly, our findings also suggest that photonic temporal crystals enable the spontaneous excitation of an atom from its ground state to an excited state, accompanied by the concurrent emission of a photon. △ Less

Submitted 20 April, 2024; originally announced April 2024.

arXiv:2404.11475 [pdf, other]

AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters

Authors: Hao-Wei Chen, Yu-Syuan Xu, Kelvin C. K. Chan, Hsien-Kai Kuo, Chun-Yi Lee, Ming-Hsuan Yang

Abstract: Existing image restoration approaches typically employ extensive networks specifically trained for designated degradations. Despite being effective, such methods inevitably entail considerable storage costs and computational overheads due to the reliance on task-specific networks. In this work, we go beyond this well-established framework and exploit the inherent commonalities among image restorat… ▽ More Existing image restoration approaches typically employ extensive networks specifically trained for designated degradations. Despite being effective, such methods inevitably entail considerable storage costs and computational overheads due to the reliance on task-specific networks. In this work, we go beyond this well-established framework and exploit the inherent commonalities among image restoration tasks. The primary objective is to identify components that are shareable across restoration tasks and augment the shared components with modules specifically trained for individual tasks. Towards this goal, we propose AdaIR, a novel framework that enables low storage cost and efficient training without sacrificing performance. Specifically, a generic restoration network is first constructed through self-supervised pre-training using synthetic degradations. Subsequent to the pre-training phase, adapters are trained to adapt the pre-trained network to specific degradations. AdaIR requires solely the training of lightweight, task-specific modules, ensuring a more efficient storage and training regimen. We have conducted extensive experiments to validate the effectiveness of AdaIR and analyze the influence of the pre-training strategy on discovering shareable components. Extensive experimental results show that AdaIR achieves outstanding results on multi-task restoration while utilizing significantly fewer parameters (1.9 MB) and less training time (7 hours) for each restoration task. The source codes and trained models will be released. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.10179 [pdf, other]

Scaling Instructable Agents Across Many Simulated Worlds

Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructions across a diverse range of virtual 3D environments, including curated research environments as well as open-ended, commercial video games. Our goal is to develop an instructable agent that can accomplish anything a human can do in any simulated 3D environment. Our approach focuses on language-driven generality while imposing minimal assumptions. Our agents interact with environments in real-time using a generic, human-like interface: the inputs are image observations and language instructions and the outputs are keyboard-and-mouse actions. This general approach is challenging, but it allows agents to ground language across many visually complex and semantically rich environments while also allowing us to readily run agents in new environments. In this paper we describe our motivation and goal, the initial progress we have made, and promising preliminary results on several diverse research environments and a variety of commercial video games. △ Less

Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

arXiv:2404.07129 [pdf, other]

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Authors: Aaditya K. Singh, Ted Moskovitz, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe

Abstract: In-context learning is a powerful emergent ability in transformer models. Prior work in mechanistic interpretability has identified a circuit element that may be critical for in-context learning -- the induction head (IH), which performs a match-and-copy operation. During training of large transformers on natural language data, IHs emerge around the same time as a notable phase change in the loss.… ▽ More In-context learning is a powerful emergent ability in transformer models. Prior work in mechanistic interpretability has identified a circuit element that may be critical for in-context learning -- the induction head (IH), which performs a match-and-copy operation. During training of large transformers on natural language data, IHs emerge around the same time as a notable phase change in the loss. Despite the robust evidence for IHs and this interesting coincidence with the phase change, relatively little is known about the diversity and emergence dynamics of IHs. Why is there more than one IH, and how are they dependent on each other? Why do IHs appear all of a sudden, and what are the subcircuits that enable them to emerge? We answer these questions by studying IH emergence dynamics in a controlled setting by training on synthetic data. In doing so, we develop and share a novel optogenetics-inspired causal framework for modifying activations throughout training. Using this framework, we delineate the diverse and additive nature of IHs. By clam** subsets of activations throughout training, we then identify three underlying subcircuits that interact to drive IH formation, yielding the phase change. Furthermore, these subcircuits shed light on data-dependent properties of formation, such as phase change timing, already showing the promise of this more in-depth understanding of subcircuits that need to "go right" for an induction head. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: 26 pages, 18 figures

arXiv:2404.00610 [pdf, other]

RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

Authors: Chi-Min Chan, Chunpu Xu, Ruibin Yuan, Hongyin Luo, Wei Xue, Yike Guo, Jie Fu

Abstract: Large Language Models (LLMs) exhibit remarkable capabilities but are prone to generating inaccurate or hallucinatory responses. This limitation stems from their reliance on vast pretraining datasets, making them susceptible to errors in unseen scenarios. To tackle these challenges, Retrieval-Augmented Generation (RAG) addresses this by incorporating external, relevant documents into the response g… ▽ More Large Language Models (LLMs) exhibit remarkable capabilities but are prone to generating inaccurate or hallucinatory responses. This limitation stems from their reliance on vast pretraining datasets, making them susceptible to errors in unseen scenarios. To tackle these challenges, Retrieval-Augmented Generation (RAG) addresses this by incorporating external, relevant documents into the response generation process, thus leveraging non-parametric knowledge alongside LLMs' in-context learning abilities. However, existing RAG implementations primarily focus on initial input for context retrieval, overlooking the nuances of ambiguous or complex queries that necessitate further clarification or decomposition for accurate responses. To this end, we propose learning to Refine Query for Retrieval Augmented Generation (RQ-RAG) in this paper, endeavoring to enhance the model by equip** it with capabilities for explicit rewriting, decomposition, and disambiguation. Our experimental results indicate that our method, when applied to a 7B Llama2 model, surpasses the previous state-of-the-art (SOTA) by an average of 1.9\% across three single-hop QA datasets, and also demonstrates enhanced performance in handling complex, multi-hop QA datasets. Our code is available at https://github.com/chanchimin/RQ-RAG. △ Less

Submitted 31 March, 2024; originally announced April 2024.

arXiv:2404.00543 [pdf, other]

Dynamic Transfer Policies for Parallel Queues

Authors: Timothy C. Y. Chan, Jangwon Park, Vahid Sarhangian

Abstract: We consider the problem of load balancing in parallel queues by transferring customers between them at discrete points in time. Holding costs accrue as customers wait in the queue, while transfer decisions incur both fixed (setup) and variable costs proportional to the number and direction of transfers. Our work is primarily motivated by inter-facility patient transfers between hospitals during a… ▽ More We consider the problem of load balancing in parallel queues by transferring customers between them at discrete points in time. Holding costs accrue as customers wait in the queue, while transfer decisions incur both fixed (setup) and variable costs proportional to the number and direction of transfers. Our work is primarily motivated by inter-facility patient transfers between hospitals during a surge in demand for hospitalization (e.g., during a pandemic). By analyzing an associated fluid control problem, we show that under fairly general assumptions including time-varying arrivals and convex increasing holding costs, the optimal policy in each period partitions the state-space into a well-defined $\textit{no-transfer region}$ and its complement, such that transferring is optimal if and only if the system is sufficiently imbalanced. In the absence of fixed transfer costs, an optimal policy moves the state to the no-transfer region's boundary; in contrast, with fixed costs, the state is moved to the no-transfer region's relative interior. We further leverage the fluid control problem to provide insights on the trade-off between holding and transfer costs, emphasizing the importance of preventing excessive idleness when transfers are not feasible in continuous-time. Using simulation experiments, we investigate the performance and robustness of the fluid policy for the stochastic system. In particular, our case study calibrated using data during the pandemic in the Greater Toronto Area demonstrates that transferring patients between hospitals could result in up to 27.7% reduction in total cost with relatively few transfers. △ Less

Submitted 30 March, 2024; originally announced April 2024.

arXiv:2404.00209 [pdf, other]

EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs

Authors: Cheng Jiayang, Lin Qiu, Chunkit Chan, Xin Liu, Yangqiu Song, Zheng Zhang

Abstract: Narrative reasoning relies on the understanding of eventualities in story contexts, which requires a wealth of background world knowledge. To help machines leverage such knowledge, existing solutions can be categorized into two groups. Some focus on implicitly modeling eventuality knowledge by pretraining language models (LMs) with eventuality-aware objectives. However, this approach breaks down k… ▽ More Narrative reasoning relies on the understanding of eventualities in story contexts, which requires a wealth of background world knowledge. To help machines leverage such knowledge, existing solutions can be categorized into two groups. Some focus on implicitly modeling eventuality knowledge by pretraining language models (LMs) with eventuality-aware objectives. However, this approach breaks down knowledge structures and lacks interpretability. Others explicitly collect world knowledge of eventualities into structured eventuality-centric knowledge graphs (KGs). However, existing research on leveraging these knowledge sources for free-texts is limited. In this work, we propose an initial comprehensive framework called EventGround, which aims to tackle the problem of grounding free-texts to eventuality-centric KGs for contextualized narrative reasoning. We identify two critical problems in this direction: the event representation and sparsity problems. We provide simple yet effective parsing and partial information extraction methods to tackle these problems. Experimental results demonstrate that our approach consistently outperforms baseline models when combined with graph neural network (GNN) or large language model (LLM) based graph reasoning models. Our framework, incorporating grounded knowledge, achieves state-of-the-art performance while providing interpretable evidence. △ Less

Submitted 29 March, 2024; originally announced April 2024.

arXiv:2403.19471 [pdf, other]

Network Flow Models for Robust Binary Optimization with Selective Adaptability

Authors: Merve Bodur, Timothy C. Y. Chan, Ian Yihang Zhu

Abstract: Adaptive robust optimization problems have received significant attention in recent years, but remain notoriously difficult to solve when recourse decisions are discrete in nature. In this paper, we propose new reformulation techniques for adaptive robust binary optimization (ARBO) problems with objective uncertainty. Without loss of generality, we focus on ARBO problems with "selective adaptabili… ▽ More Adaptive robust optimization problems have received significant attention in recent years, but remain notoriously difficult to solve when recourse decisions are discrete in nature. In this paper, we propose new reformulation techniques for adaptive robust binary optimization (ARBO) problems with objective uncertainty. Without loss of generality, we focus on ARBO problems with "selective adaptability", a term we coin to describe a common class of linking constraints between first-stage and second-stage solutions. Our main contribution revolves around a collection of exact and approximate network flow reformulations for the ARBO problem, which we develop by building upon ideas from the decision diagram literature. Our proposed models can generate feasible solutions, primal bounds and dual bounds, while their size and approximation quality can be precisely controlled through user-specified parameters. Furthermore, and in contrast with existing solution methods, these models are easy to implement and can be solved directly with standard off-the-shelf solvers. Through an extensive set of computational experiments, we show that our models can generate high-quality solutions and dual bounds in significantly less time than popular benchmark methods, often by orders of magnitude. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.12943 [pdf, other]

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Authors: Vidhi Jain, Maria Attarian, Nikhil J Joshi, Ayzaan Wahid, Danny Driess, Quan Vuong, Pannag R Sanketi, Pierre Sermanet, Stefan Welker, Christine Chan, Igor Gilitschenski, Yonatan Bisk, Debidatta Dwibedi

Abstract: While large-scale robotic systems typically rely on textual instructions for tasks, this work explores a different approach: can robots infer the task directly from observing humans? This shift necessitates the robot's ability to decode human intent and translate it into executable actions within its physical constraints and environment. We introduce Vid2Robot, a novel end-to-end video-based learn… ▽ More While large-scale robotic systems typically rely on textual instructions for tasks, this work explores a different approach: can robots infer the task directly from observing humans? This shift necessitates the robot's ability to decode human intent and translate it into executable actions within its physical constraints and environment. We introduce Vid2Robot, a novel end-to-end video-based learning framework for robots. Given a video demonstration of a manipulation task and current visual observations, Vid2Robot directly produces robot actions. This is achieved through a unified representation model trained on a large dataset of human video and robot trajectory. The model leverages cross-attention mechanisms to fuse prompt video features to the robot's current state and generate appropriate actions that mimic the observed task. To further improve policy performance, we propose auxiliary contrastive losses that enhance the alignment between human and robot video representations. We evaluate Vid2Robot on real-world robots, demonstrating a 20% improvement in performance compared to other video-conditioned policies when using human demonstration videos. Additionally, our model exhibits emergent capabilities, such as successfully transferring observed motions from one object to another, and long-horizon composition, thus showcasing its potential for real-world applications. Project website: vid2robot.github.io △ Less

Submitted 19 March, 2024; originally announced March 2024.

Comments: Robot learning: Imitation Learning, Robot Perception, Sensing & Vision, Gras** & Manipulation

arXiv:2403.06219 [pdf, other]

Affine Semigroup Algebras And Their Fibered Sums

Authors: C-Y. Jean Chan, I-Chiau Huang, Jung-Chen Liu

Abstract: We study affine semigroup rings as algebras over subsemigroup rings. From this relative viewpoint with respect to a given subsemigroup ring, the fibered sum of two affine semigroup algebras is constructed. Such a construction is compared to the tensor product and to the classical gluings of affine semigroup rings as defined in Rosales (1997). While fibered sum can always be achieved, gluings of… ▽ More We study affine semigroup rings as algebras over subsemigroup rings. From this relative viewpoint with respect to a given subsemigroup ring, the fibered sum of two affine semigroup algebras is constructed. Such a construction is compared to the tensor product and to the classical gluings of affine semigroup rings as defined in Rosales (1997). While fibered sum can always be achieved, gluings of affine semigroup rings do not always exist. Therefore, we further investigate when the fibered sum of affine semigroup algebras gives rise to a gluing. A criterion is recovered in terms of the defining semigroups under which the gluing may take place. △ Less

Submitted 10 March, 2024; originally announced March 2024.

MSC Class: 13F65; 13B10; 20M25; 20M50

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2402.18827 [pdf, other]

Measurement of the photometric Baryon Acoustic Oscillations with self-calibrated redshift distribution

Authors: Ruiyu Song, Kwan Chuen Chan, Haojie Xu, Weilun Zheng

Abstract: We use a galaxy sample derived from the DECaLS DR9 to measure the Baryonic Acoustic Oscillations (BAO). The magnitude-limited sample consists of 10.6 million galaxies in an area of 4974 deg$^2$ over the redshift range of [0.6, 1]. A key novelty of this work is that the true redshift distribution of the photo-$z$ sample is derived from the self calibration method, which determines the true redshift… ▽ More We use a galaxy sample derived from the DECaLS DR9 to measure the Baryonic Acoustic Oscillations (BAO). The magnitude-limited sample consists of 10.6 million galaxies in an area of 4974 deg$^2$ over the redshift range of [0.6, 1]. A key novelty of this work is that the true redshift distribution of the photo-$z$ sample is derived from the self calibration method, which determines the true redshift distribution using the clustering information of the photometric data alone. Through the angular correlation function in four tomographic bins, we constrain the BAO scale dilation parameter $α$ to be $1.025\pm 0.033 $, consistent with the fiducial Planck cosmology. Alternatively, the ratio between the comoving angular diameter distance and the sound horizon, $D_{\rm M} / r_{\rm s}$ is constrained to be $18.94 \pm 0.61 $ at the effective redshift of 0.749. We corroborate our results with the true redshift distribution obtained from a weighted spectroscopic sample, finding very good agreement. We have conducted a series of tests to demonstrate the robustness of the measurement. Our work demonstrates that the self calibration method can effectively constrain the true redshift distribution in cosmological applications, especially in the context of photometric BAO measurement. △ Less

Submitted 12 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: 13 pages, 10 figures, matched to the published version

arXiv:2402.17757 [pdf, other]

Reducing leakage of single-qubit gates for superconducting quantum processors using analytical control pulse envelopes

Authors: Eric Hyyppä, Antti Vepsäläinen, Miha Papič, Chun Fai Chan, Sinan Inel, Alessandro Landra, Wei Liu, Jürgen Luus, Fabian Marxer, Caspar Ockeloen-Korppi, Sebastian Orbell, Brian Tarasinski, Johannes Heinsoo

Abstract: Improving the speed and fidelity of quantum logic gates is essential to reach quantum advantage with future quantum computers. However, fast logic gates lead to increased leakage errors in superconducting quantum processors based on qubits with low anharmonicity, such as transmons. To reduce leakage errors, we propose and experimentally demonstrate two new analytical methods, Fourier ansatz spectr… ▽ More Improving the speed and fidelity of quantum logic gates is essential to reach quantum advantage with future quantum computers. However, fast logic gates lead to increased leakage errors in superconducting quantum processors based on qubits with low anharmonicity, such as transmons. To reduce leakage errors, we propose and experimentally demonstrate two new analytical methods, Fourier ansatz spectrum tuning derivative removal by adiabatic gate (FAST DRAG) and higher-derivative (HD) DRAG, both of which enable sha** single-qubit control pulses in the frequency domain to achieve stronger suppression of leakage transitions compared to previously demonstrated pulse shapes. Using the new methods to suppress the $ef$-transition of a transmon qubit with an anharmonicity of -212 MHz, we implement $R_X(π/2)$-gates with a leakage error below $3.0 \times 10^{-5}$ down to a gate duration of 6.25 ns, which corresponds to a 20-fold reduction in leakage compared to a conventional Cosine DRAG pulse. Employing the FAST DRAG method, we further achieve an error per gate of $(1.56 \pm 0.07)\times 10^{-4}$ at a 7.9-ns gate duration, outperforming conventional pulse shapes both in terms of error and gate speed. Furthermore, we study error-amplifying measurements for the characterization of temporal microwave control pulse distortions, and demonstrate that non-Markovian coherent errors caused by such distortions may be a significant source of error for sub-10-ns single-qubit gates unless corrected using predistortion. △ Less

Submitted 27 February, 2024; originally announced February 2024.

Comments: 23 pages, 5 figures in main text, 7 figures in Appendix

arXiv:2402.17209 [pdf]

Deep Learning-based Kinetic Analysis in Paper-based Analytical Cartridges Integrated with Field-effect Transistors

Authors: Hyun-June Jang, Hyou-Arm Joung, Artem Goncharov, Anastasia Gant Kanegusuku, Clarence W. Chan, Kiang-Teck Jerry Yeo, Wen Zhuang, Aydogan Ozcan, Junhong Chen

Abstract: This study explores the fusion of a field-effect transistor (FET), a paper-based analytical cartridge, and the computational power of deep learning (DL) for quantitative biosensing via kinetic analyses. The FET sensors address the low sensitivity challenge observed in paper analytical devices, enabling electrical measurements with kinetic data. The paper-based cartridge eliminates the need for sur… ▽ More This study explores the fusion of a field-effect transistor (FET), a paper-based analytical cartridge, and the computational power of deep learning (DL) for quantitative biosensing via kinetic analyses. The FET sensors address the low sensitivity challenge observed in paper analytical devices, enabling electrical measurements with kinetic data. The paper-based cartridge eliminates the need for surface chemistry required in FET sensors, ensuring economical operation (cost < $0.15/test). The DL analysis mitigates chronic challenges of FET biosensors such as sample matrix interference, by leveraging kinetic data from target-specific bioreactions. In our proof-of-concept demonstration, our DL-based analyses showcased a coefficient of variation of < 6.46% and a decent concentration measurement correlation with an r2 value of > 0.976 for cholesterol testing when blindly compared to results obtained from a CLIA-certified clinical laboratory. These integrated technologies can create a new generation of FET-based biosensors, potentially transforming point-of-care diagnostics and at-home testing through enhanced accessibility, ease-of-use, and accuracy. △ Less

Submitted 27 February, 2024; originally announced February 2024.

Comments: 18 pages, 4 figures

arXiv:2402.12689 [pdf]

Janus Bound States in the Continuum with Asymmetric Topological Charges and Intrinsic Chirality

Authors: Meng Kang, Meng Xiao, C. T. Chan

Abstract: We propose a novel topological defect called Janus bound states in the continuum (BICs), featuring asymmetric topological charges in upward and downward radiation channels. Our approach involves a photonic crystal slab (PCS) that initially exhibits both out-of-plane and in-plane mirror symmetry, and this PCS possesses one BIC at the $Γ$ point and two BICs off the $Γ$ point. By introducing perturba… ▽ More We propose a novel topological defect called Janus bound states in the continuum (BICs), featuring asymmetric topological charges in upward and downward radiation channels. Our approach involves a photonic crystal slab (PCS) that initially exhibits both out-of-plane and in-plane mirror symmetry, and this PCS possesses one BIC at the $Γ$ point and two BICs off the $Γ$ point. By introducing perturbations that break the out-of-plane mirror symmetry, the two off-$Γ$ BICs decompose into four circularly polarized states (C points) with identical topological charges. Then, we selectively manipulate the four C points associated with downward radiation channel to converge at the at-$Γ$ BIC, forming a Janus BIC with Janus topological charges. By further introducing in-plane mirror symmetry perturbation, we can bring two of the C points with the same handedness and identical topological charges for upward radiation to merge into the Janus BIC. This process results in a Janus chiral BIC which exhibits large intrinsic chirality and an infinite Q factor. Janus BICs can induce distinct Pancharatnam-Berry phase singularities in momentum space for different incident channels, providing a new approach to control optical angular momentum. Janus chiral BICs hold promise in enhancing direction-dependent and spin-dependent asymmetric light-matter interaction, opening new pathways for improving chirality-dependent operation for on-chip devices. △ Less

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.11590 [pdf, other]

Designing interactive data visualizations representing recovery progress for patients after stroke

Authors: Alicia Ouskine, Adrian D. C. Chan, Fateme Rajabiyazdi

Abstract: Stroke is one of the leading causes of disability worldwide. The efficacy of recovery is determined by a variety of factors, including patient adherence to rehabilitation programs. One way to increase patient adherence to their rehabilitation program is to show patients their progress that is visualized in a simple and intuitive way. We begin to gather preliminary information on Functional Capacit… ▽ More Stroke is one of the leading causes of disability worldwide. The efficacy of recovery is determined by a variety of factors, including patient adherence to rehabilitation programs. One way to increase patient adherence to their rehabilitation program is to show patients their progress that is visualized in a simple and intuitive way. We begin to gather preliminary information on Functional Capacity, Motor Function, and Mood/cognition from occupational Therapists at the Bruyere Hospital to gain a better understanding of how stroke recovery data is collected within in-patient stroke rehabilitation centers. The future aim is to design, develop, and evaluate a data visualization tool representing progress made by patients recovering from stroke. △ Less

Submitted 18 February, 2024; originally announced February 2024.

Comments: 2 pages

arXiv:2402.10697 [pdf, other]

Dark Energy Survey: Galaxy Sample for the Baryonic Acoustic Oscillation Measurement from the Final Dataset

Authors: J. Mena-Fernández, M. Rodríguez-Monroy, S. Avila, A. Porredon, K. C. Chan, H. Camacho, N. Weaverdyck, I. Sevilla-Noarbe, E. Sanchez, L. Toribio San Cipriano, J. De Vicente, I. Ferrero, R. Cawthon, A. Carnero Rosell, J. Elvin-Poole, G. Giannini, M. Adamow, K. Bechtol, A. Drlica-Wagner, R. A. Gruendl, W. G. Hartley, A. Pieres, A. J. Ross, E. S. Rykoff, E. Sheldon , et al. (63 additional authors not shown)

Abstract: In this paper we present and validate the galaxy sample used for the analysis of the baryon acoustic oscillation (BAO) signal in the Dark Energy Survey (DES) Y6 data. The definition is based on a color and redshift-dependent magnitude cut optimized to select galaxies at redshifts higher than 0.6, while ensuring a high-quality photo-$z$ determination. The optimization is performed using a Fisher fo… ▽ More In this paper we present and validate the galaxy sample used for the analysis of the baryon acoustic oscillation (BAO) signal in the Dark Energy Survey (DES) Y6 data. The definition is based on a color and redshift-dependent magnitude cut optimized to select galaxies at redshifts higher than 0.6, while ensuring a high-quality photo-$z$ determination. The optimization is performed using a Fisher forecast algorithm, finding the optimal $i$-magnitude cut to be given by $i$<19.64+2.894$z_{\rm ph}$. For the optimal sample, we forecast an increase in precision in the BAO measurement of $\sim$25% with respect to the Y3 analysis. Our BAO sample has a total of 15,937,556 galaxies in the redshift range 0.6<$z_{\rm ph}$<1.2, and its angular mask covers 4,273.42 deg${}^2$ to a depth of $i$=22.5. We validate its redshift distributions with three different methods: directional neighborhood fitting algorithm (DNF), which is our primary photo-$z$ estimation; direct calibration with spectroscopic redshifts from VIPERS; and clustering redshift using SDSS galaxies. The fiducial redshift distribution is a combination of these three techniques performed by modifying the mean and width of the DNF distributions to match those of VIPERS and clustering redshift. In this paper we also describe the methodology used to mitigate the effect of observational systematics, which is analogous to the one used in the Y3 analysis. This paper is one of the two dedicated to the analysis of the BAO signal in DES Y6. In its companion paper, we present the angular diameter distance constraints obtained through the fitting to the BAO scale. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: 23 pages, 10 figures. Submitted to PRD

Report number: FERMILAB-PUB-24-0072-PPD

arXiv:2402.10696 [pdf, other]

Dark Energy Survey: A 2.1% measurement of the angular Baryonic Acoustic Oscillation scale at redshift $z_{\rm eff}$=0.85 from the final dataset

Authors: DES Collaboration, T. M. C. Abbott, M. Adamow, M. Aguena, S. Allam, O. Alves, A. Amon, F. Andrade-Oliveira, J. Asorey, S. Avila, D. Bacon, K. Bechtol, G. M. Bernstein, E. Bertin, J. Blazek, S. Bocquet, D. Brooks, D. L. Burke, H. Camacho, A. Carnero Rosell, D. Carollo, J. Carretero, F. J. Castander, R. Cawthon, K. C. Chan , et al. (83 additional authors not shown)

Abstract: We present the angular diameter distance measurement obtained with the Baryonic Acoustic Oscillation feature from galaxy clustering in the completed Dark Energy Survey, consisting of six years (Y6) of observations. We use the Y6 BAO galaxy sample, optimized for BAO science in the redshift range 0.6<$z$<1.2, with an effective redshift at $z_{\rm eff}$=0.85 and split into six tomographic bins. The s… ▽ More We present the angular diameter distance measurement obtained with the Baryonic Acoustic Oscillation feature from galaxy clustering in the completed Dark Energy Survey, consisting of six years (Y6) of observations. We use the Y6 BAO galaxy sample, optimized for BAO science in the redshift range 0.6<$z$<1.2, with an effective redshift at $z_{\rm eff}$=0.85 and split into six tomographic bins. The sample has nearly 16 million galaxies over 4,273 square degrees. Our consensus measurement constrains the ratio of the angular distance to sound horizon scale to $D_M(z_{\rm eff})/r_d$ = 19.51$\pm$0.41 (at 68.3% confidence interval), resulting from comparing the BAO position in our data to that predicted by Planck $Λ$CDM via the BAO shift parameter $α=(D_M/r_d)/(D_M/r_d)_{\rm Planck}$. To achieve this, the BAO shift is measured with three different methods, Angular Correlation Function (ACF), Angular Power Spectrum (APS), and Projected Correlation Function (PCF) obtaining $α=$ 0.952$\pm$0.023, 0.962$\pm$0.022, and 0.955$\pm$0.020, respectively, which we combine to $α=$ 0.957$\pm$0.020, including systematic errors. When compared with the $Λ$CDM model that best fits Planck data, this measurement is found to be 4.3% and 2.1$σ$ below the angular BAO scale predicted. To date, it represents the most precise angular BAO measurement at $z$>0.75 from any survey and the most precise measurement at any redshift from photometric surveys. The analysis was performed blinded to the BAO position and it is shown to be robust against analysis choices, data removal, redshift calibrations and observational systematics. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: Submitted to PRD, 39 pages, 12 figures

Report number: FERMILAB-PUB-24-0027-PPD

arXiv:2402.09679 [pdf, other]

Design and Visual Servoing Control of a Hybrid Dual-Segment Flexible Neurosurgical Robot for Intraventricular Biopsy

Authors: Jian Chen, Mingcong Chen, Qingxiang Zhao, Shuai Wang, Yihe Wang, Ying Xiao, Jian Hu, Danny Tat Ming Chan, Kam Tong Leo Yeung, David Yuen Chung Chan, Hongbin Liu

Abstract: Traditional rigid endoscopes have challenges in flexibly treating tumors located deep in the brain, and low operability and fixed viewing angles limit its development. This study introduces a novel dual-segment flexible robotic endoscope MicroNeuro, designed to perform biopsies with dexterous surgical manipulation deep in the brain. Taking into account the uncertainty of the control model, an imag… ▽ More Traditional rigid endoscopes have challenges in flexibly treating tumors located deep in the brain, and low operability and fixed viewing angles limit its development. This study introduces a novel dual-segment flexible robotic endoscope MicroNeuro, designed to perform biopsies with dexterous surgical manipulation deep in the brain. Taking into account the uncertainty of the control model, an image-based visual servoing with online robot Jacobian estimation has been implemented to enhance motion accuracy. Furthermore, the application of model predictive control with constraints significantly bolsters the flexible robot's ability to adaptively track mobile objects and resist external interference. Experimental results underscore that the proposed control system enhances motion stability and precision. Phantom testing substantiates its considerable potential for deployment in neurosurgery. △ Less

Submitted 23 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA) 2024, 7 pages, 9 figures

arXiv:2402.08873 [pdf, ps, other]

Balancing Method for Non-monotone Missing Data

Authors: Jianing Dong, Raymond K. W. Wong, Kwun Chuen Gary Chan

Abstract: Covariate balancing methods have been widely applied to single or monotone missing patterns and have certain advantages over likelihood-based methods and inverse probability weighting approaches based on standard logistic regression. In this paper, we consider non-monotone missing data under the complete-case missing variable condition (CCMV), which is a case of missing not at random (MNAR). Using… ▽ More Covariate balancing methods have been widely applied to single or monotone missing patterns and have certain advantages over likelihood-based methods and inverse probability weighting approaches based on standard logistic regression. In this paper, we consider non-monotone missing data under the complete-case missing variable condition (CCMV), which is a case of missing not at random (MNAR). Using relationships between each missing pattern and the complete-case subsample, a weighted estimator can be used for estimation, where the weight is a sum of ratios of conditional probability of observing a particular missing pattern versus that of observing the complete-case pattern, given the variables observed in the corresponding missing pattern. Plug-in estimators of the propensity ratios, however, can be unbounded and lead to unstable estimation. Using further relations between propensity ratios and balancing of moments across missing patterns, we employ tailored loss functions each encouraging empirical balance across patterns to estimate propensity ratios flexibly using functional basis expansion. We propose two penalizations to separately control propensity ratio model complexity and covariate imbalance. We study the asymptotic properties of the proposed estimators and show that they are consistent under mild smoothness assumptions. Asymptotic normality and efficiency are also developed. Numerical simulation results show that the proposed method achieves smaller mean squared errors than other methods. △ Less

Submitted 13 February, 2024; originally announced February 2024.

arXiv:2402.05492 [pdf, other]

Cosmological Forecast of the Void Size Function Measurement from the CSST Spectroscopic Survey

Authors: Yingxiao Song, Qi Xiong, Yan Gong, Furen Deng, Kwan Chuen Chan, Xuelei Chen, Qi Guo, Jiaxin Han, Guoliang Li, Ming Li, Yun Liu, Yu Luo, Wenxiang Pei, Chengliang Wei

Abstract: Void size function (VSF) contains information of the cosmic large-scale structure (LSS), and can be used to derive the properties of dark energy and dark matter. We predict the VSFs measured from the spectroscopic galaxy survey operated by the China Space Station Telescope (CSST), and study the strength of cosmological constraint. We employ a high-resolution Jiutian simulation to get CSST galaxy m… ▽ More Void size function (VSF) contains information of the cosmic large-scale structure (LSS), and can be used to derive the properties of dark energy and dark matter. We predict the VSFs measured from the spectroscopic galaxy survey operated by the China Space Station Telescope (CSST), and study the strength of cosmological constraint. We employ a high-resolution Jiutian simulation to get CSST galaxy mock samples based on an improved semi-analytical model. We identify voids from this galaxy catalog using the watershed algorithm without assuming a spherical shape, and estimate the VSFs at different redshift bins from $z=0.5$ to 1.1. We propose a void selection method based on the ellipticity, and assume the void linear underdensity threshold $δ_{\rm v}$ in the theoretical model is redshift-dependent and set it as a free parameter in each redshift bin. The Markov Chain Monte Carlo (MCMC) method is adopted to implement the constraints on the cosmological and void parameters. We find that the CSST VSF measurement can constrain the cosmological parameters to a few percent level. The best-fit values of $δ_{\rm v}$ are ranging from $\sim-0.4$ to $-0.1$ as the redshift increases from 0.5 to 1.1, which has a distinct difference from the theoretical calculation with $δ_{\rm v}\simeq-2.7$ assuming the spherical evolution and using particles as tracer. Our method can provide a good reference for void identification and selection in the VSF analysis of the spectroscopic galaxy surveys. △ Less

Submitted 24 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 10 pages, 7 figures, 3 tables. Accepted for publication in MNRAS

arXiv:2402.03670 [pdf, ps, other]

Large order behavior near the AD point: the case of $\mathcal{N} =2$, $su(2)$, $N_f =2$

Authors: Chuan-Tsung Chan, Hiroshi Itoyama, Reiji Yoshioka

Abstract: A non-perturbative effect in $κ$ (renormalized string coupling) obtained from the large order behavior in the vicinity of the prototypical Argyres-Douglas critical point of $su(2)$, $N_f =2$, $\mathcal{N} =2$ susy gauge theory can be studied in the GWW unitary matrix model with the log term: the one as the work done against the barrier of the effective potential by a single eigenvalue lifted from… ▽ More A non-perturbative effect in $κ$ (renormalized string coupling) obtained from the large order behavior in the vicinity of the prototypical Argyres-Douglas critical point of $su(2)$, $N_f =2$, $\mathcal{N} =2$ susy gauge theory can be studied in the GWW unitary matrix model with the log term: the one as the work done against the barrier of the effective potential by a single eigenvalue lifted from the sea and the other as a non-perturbative function contained in the solutions of the nonlinear differential equation PII that goes beyond the asymptotic series. The leading behaviors are of the form $\exp (-\frac{4}{3}\frac{1}κ \, (1, \left(\frac{s}{K}\right)^{\frac{3}{2}} ))$ respectively. We make comments on their agreement. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: 14 pages

Report number: NITEP 196

arXiv:2402.01489 [pdf, other]

Conformal Inverse Optimization

Authors: Bo Lin, Erick Delage, Timothy C. Y. Chan

Abstract: Inverse optimization has been increasingly used to estimate unknown parameters in an optimization model based on decision data. We show that such a point estimation is insufficient in a prescriptive setting where the estimated parameters are used to prescribe new decisions. The prescribed decisions may be low-quality and misaligned with human intuition and thus are unlikely to be adopted. To tackl… ▽ More Inverse optimization has been increasingly used to estimate unknown parameters in an optimization model based on decision data. We show that such a point estimation is insufficient in a prescriptive setting where the estimated parameters are used to prescribe new decisions. The prescribed decisions may be low-quality and misaligned with human intuition and thus are unlikely to be adopted. To tackle this challenge, we propose conformal inverse optimization, which seeks to learn an uncertainty set for the unknown parameters and then solve a robust optimization model to prescribe new decisions. Under mild assumptions, we show that our method enjoys provable guarantees on solution quality, as evaluated using both the ground-truth parameters and the decision maker's perception of the unknown parameters. Our method demonstrates strong empirical performance compared to classic inverse optimization. △ Less

Submitted 15 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

arXiv:2402.00927 [pdf, other]

doi 10.1051/0004-6361/202348308

Ordered magnetic fields around the 3C 84 central black hole

Authors: G. F. Paraschos, J. -Y. Kim, M. Wielgus, J. Röder, T. P. Krichbaum, E. Ros, I. Agudo, I. Myserlis, M. Moscibrodzka, E. Traianou, J. A. Zensus, L. Blackburn, C. -K. Chan, S. Issaoun, M. Janssen, M. D. Johnson, V. L. Fish, K. Akiyama, A. Alberdi, W. Alef, J. C. Algaba, R. Anantua, K. Asada, R. Azulay, U. Bach , et al. (258 additional authors not shown)

Abstract: 3C84 is a nearby radio source with a complex total intensity structure, showing linear polarisation and spectral patterns. A detailed investigation of the central engine region necessitates the use of VLBI above the hitherto available maximum frequency of 86GHz. Using ultrahigh resolution VLBI observations at the highest available frequency of 228GHz, we aim to directly detect compact structures a… ▽ More 3C84 is a nearby radio source with a complex total intensity structure, showing linear polarisation and spectral patterns. A detailed investigation of the central engine region necessitates the use of VLBI above the hitherto available maximum frequency of 86GHz. Using ultrahigh resolution VLBI observations at the highest available frequency of 228GHz, we aim to directly detect compact structures and understand the physical conditions in the compact region of 3C84. We used EHT 228GHz observations and, given the limited (u,v)-coverage, applied geometric model fitting to the data. We also employed quasi-simultaneously observed, multi-frequency VLBI data for the source in order to carry out a comprehensive analysis of the core structure. We report the detection of a highly ordered, strong magnetic field around the central, SMBH of 3C84. The brightness temperature analysis suggests that the system is in equipartition. We determined a turnover frequency of $ν_m=(113\pm4)$GHz, a corresponding synchrotron self-absorbed magnetic field of $B_{SSA}=(2.9\pm1.6)$G, and an equipartition magnetic field of $B_{eq}=(5.2\pm0.6)$G. Three components are resolved with the highest fractional polarisation detected for this object ($m_\textrm{net}=(17.0\pm3.9)$%). The positions of the components are compatible with those seen in low-frequency VLBI observations since 2017-2018. We report a steeply negative slope of the spectrum at 228GHz. We used these findings to test models of jet formation, propagation, and Faraday rotation in 3C84. The findings of our investigation into different flow geometries and black hole spins support an advection-dominated accretion flow in a magnetically arrested state around a rapidly rotating supermassive black hole as a model of the jet-launching system in the core of 3C84. However, systematic uncertainties due to the limited (u,v)-coverage, however, cannot be ignored. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: 15 pages, 6 figures, published in A&A

Journal ref: Issue: A&A Volume 682, February 2024; Article number: L3; Number of pages: 15

arXiv:2401.09495

IPR-NeRF: Ownership Verification meets Neural Radiance Field

Authors: Win Kent Ong, Kam Woh Ng, Chee Seng Chan, Yi Zhe Song, Tao Xiang

Abstract: Neural Radiance Field (NeRF) models have gained significant attention in the computer vision community in the recent past with state-of-the-art visual quality and produced impressive demonstrations. Since then, technopreneurs have sought to leverage NeRF models into a profitable business. Therefore, NeRF models make it worth the risk of plagiarizers illegally copying, re-distributing, or misusing… ▽ More Neural Radiance Field (NeRF) models have gained significant attention in the computer vision community in the recent past with state-of-the-art visual quality and produced impressive demonstrations. Since then, technopreneurs have sought to leverage NeRF models into a profitable business. Therefore, NeRF models make it worth the risk of plagiarizers illegally copying, re-distributing, or misusing those models. This paper proposes a comprehensive intellectual property (IP) protection framework for the NeRF model in both black-box and white-box settings, namely IPR-NeRF. In the black-box setting, a diffusion-based solution is introduced to embed and extract the watermark via a two-stage optimization process. In the white-box setting, a designated digital signature is embedded into the weights of the NeRF model by adopting the sign loss objective. Our extensive experiments demonstrate that not only does our approach maintain the fidelity (\ie, the rendering quality) of IPR-NeRF models, but it is also robust against both ambiguity and removal attacks compared to prior arts. △ Less

Submitted 22 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

Comments: Error on result tabulation of state of the art method which might cause misleading to readers

Showing 1–50 of 1,058 results for author: Chan, C