-
LLM-based policy generation for intent-based management of applications
Authors:
Kristina Dzeparoska,
Jieyu Lin,
Ali Tizghadam,
Alberto Leon-Garcia
Abstract:
Automated management requires decomposing high-level user requests, such as intents, to an abstraction that the system can understand and execute. This is challenging because even a simple intent requires performing a number of ordered steps. And the task of identifying and adapting these steps (as conditions change) requires a decomposition approach that cannot be exactly pre-defined beforehand.…
▽ More
Automated management requires decomposing high-level user requests, such as intents, to an abstraction that the system can understand and execute. This is challenging because even a simple intent requires performing a number of ordered steps. And the task of identifying and adapting these steps (as conditions change) requires a decomposition approach that cannot be exactly pre-defined beforehand. To tackle these challenges and support automated intent decomposition and execution, we explore the few-shot capability of Large Language Models (LLMs). We propose a pipeline that progressively decomposes intents by generating the required actions using a policy-based abstraction. This allows us to automate the policy execution by creating a closed control loop for the intent deployment. To do so, we generate and map the policies to APIs and form application management loops that perform the necessary monitoring, analysis, planning and execution. We evaluate our proposal with a use-case to fulfill and assure an application service chain of virtual network functions. Using our approach, we can generalize and generate the necessary steps to realize intents, thereby enabling intent automation for application management.
△ Less
Submitted 22 January, 2024;
originally announced February 2024.
-
GraphiQ: Quantum circuit design for photonic graph states
Authors:
Jie Lin,
Benjamin MacLellan,
Sobhan Ghanbari,
Julie Belleville,
Khuong Tran,
Luc Robichaud,
Roger G. Melko,
Hoi-Kwong Lo,
Piotr Roztocki
Abstract:
GraphiQ is a versatile open-source framework for designing photonic graph state generation schemes, with a particular emphasis on photon-emitter hybrid circuits. Built in Python, GraphiQ consists of a suite of design tools, including multiple simulation backends and optimization methods. The library supports scheme optimization in the presence of circuit imperfections, as well as user-defined opti…
▽ More
GraphiQ is a versatile open-source framework for designing photonic graph state generation schemes, with a particular emphasis on photon-emitter hybrid circuits. Built in Python, GraphiQ consists of a suite of design tools, including multiple simulation backends and optimization methods. The library supports scheme optimization in the presence of circuit imperfections, as well as user-defined optimization goals. Our framework thus represents a valuable tool for the development of practical schemes adhering to experimentally-relevant constraints. As graph states are a key resource for measurement-based quantum computing, all-photonic quantum repeaters, and robust quantum metrology, among others, we envision GraphiQ's broad impact for advancing quantum technologies.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Subwavelength Photorefractive Grating in a Thin-Film Lithium Niobate Microcavity
Authors:
Jiankun Hou,
Jiefu Zhu,
Ruixin Ma,
Boyi Xue,
Yicheng Zhu,
**tian Lin,
Xiaoshun Jiang,
Xianfeng Chen,
Ya Cheng,
Li Ge,
Yuanlin Zheng,
Wenjie Wan
Abstract:
Subwavelength gratings play a fundamental and pivotal role in numerous science and applications for wave manipulation, exhibiting distinctive features such as filtering, phase manipulation, and anti-reflection. However, conventional fabrication methods for ultrasmall periodic structures are constrained by the fundamental optical diffraction limit, making it challenging to produce subwavelength gra…
▽ More
Subwavelength gratings play a fundamental and pivotal role in numerous science and applications for wave manipulation, exhibiting distinctive features such as filtering, phase manipulation, and anti-reflection. However, conventional fabrication methods for ultrasmall periodic structures are constrained by the fundamental optical diffraction limit, making it challenging to produce subwavelength gratings for optics. Here, we demonstrate a novel technique to build a reconfigurable subwavelength photorefractive grating (SPG) in a thin-film lithium niobate on the platform of an optical microcavity. Such SPGs are optically induced through the photorefractive effect and the subwavelength features originate from the spatial phase modulations of the pump's standing wave. The resulting SPGs lead to the mode splitting of two counter-propagating modes inside the microcavity, exhibiting an Electromagnetically Induced Transparency (EIT)-like transmission spectrum. Moreover, the unique subwavelength characteristic of SPGs enables first-order quasi-phase-matching for backward second-harmonic generation, a long-standing problem in nonlinear optics. Also, free-space-to-chip vertical nonlinear frequency conversion can be achieved in a similar manner. These results provide a flexible approach towards fabricating subwavelength gratings, which holds significant potential in various applications such as nonlinear frequency conversion, optical communication, sensing, and quantum technologies.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
New constraints on ultraheavy dark matter from the LZ experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew
, et al. (174 additional authors not shown)
Abstract:
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal f…
▽ More
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal freeze-out, but they have generally been less explored experimentally. In this work, we present a re-analysis of the first science run (SR1) of the LZ experiment, with an exposure of $0.9$ tonne$\times$year, to search for ultraheavy particle dark matter. The signal topology consists of multiple energy deposits in the active region of the detector forming a straight line, from which the velocity of the incoming particle can be reconstructed on an event-by-event basis. Zero events with this topology were observed after applying the data selection calibrated on a simulated sample of signal-like events. New experimental constraints are derived, which rule out previously unexplored regions of the dark matter parameter space of spin-independent interactions beyond a mass of 10$^{17}$ GeV/$c^2$.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models
Authors:
Maurice Diesendruck,
Jianzhe Lin,
Shima Imani,
Gayathri Mahalingam,
Mingyang Xu,
Jie Zhao
Abstract:
When LLMs perform zero-shot inference, they typically use a prompt with a task specification, and generate a completion. However, there is no work to explore the possibility of the reverse - going from completion to task specification. In this paper, we employ both directions to perform cycle-supervised learning entirely in-context. Our goal is to create a forward map f : X -> Y (e.g. image -> gen…
▽ More
When LLMs perform zero-shot inference, they typically use a prompt with a task specification, and generate a completion. However, there is no work to explore the possibility of the reverse - going from completion to task specification. In this paper, we employ both directions to perform cycle-supervised learning entirely in-context. Our goal is to create a forward map f : X -> Y (e.g. image -> generated caption), coupled with a backward map g : Y -> X (e.g. caption -> generated image) to construct a cycle-consistency "loss" (formulated as an update to the prompt) to enforce g(f(X)) ~= X. The technique, called CyclePrompt, uses cycle-consistency as a free supervisory signal to iteratively craft the prompt. Importantly, CyclePrompt reinforces model performance without expensive fine-tuning, without training data, and without the complexity of external environments (e.g. compilers, APIs). We demonstrate CyclePrompt in two domains: code generation and image captioning. Our results on the HumanEval coding benchmark put us in first place on the leaderboard among models that do not rely on extra training data or usage of external environments, and third overall. Compared to the GPT4 baseline, we improve accuracy from 80.5% to 87.2%. In the vision-language space, we generate detailed image captions which outperform baseline zero-shot GPT4V captions, when tested against natural (VQAv2) and diagrammatic (FigureQA) visual question-answering benchmarks. To the best of our knowledge, this is the first use of self-supervised learning for prompting.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Coherent Imaging with Photonic Lanterns
Authors:
Yoo Jung Kim,
Michael P. Fitzgerald,
Jonathan Lin,
Steph Sallum,
Yinzi Xin,
Nemanja Jovanovic,
Sergio Leon-Saval
Abstract:
Photonic Lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers (SMFs). They can efficiently couple multi-mode telescope light into a multi-mode fiber entrance at the focal plane and convert it into multiple single-mode beams. Thus, each SMF samples its unique mode (lantern principal mode) of the telescope light in the pup…
▽ More
Photonic Lanterns (PLs) are tapered waveguides that gradually transition from a multi-mode fiber geometry to a bundle of single-mode fibers (SMFs). They can efficiently couple multi-mode telescope light into a multi-mode fiber entrance at the focal plane and convert it into multiple single-mode beams. Thus, each SMF samples its unique mode (lantern principal mode) of the telescope light in the pupil, analogous to subapertures in aperture masking interferometry (AMI). Coherent imaging with PLs can be enabled by interfering SMF outputs and applying phase modulation, which can be achieved using a photonic chip beam combiner at the backend (e.g., the ABCD beam combiner). In this study, we investigate the potential of coherent imaging by interfering SMF outputs of a PL with a single telescope. We demonstrate that the visibilities that can be measured from a PL are mutual intensities incident on the pupil weighted by the cross-correlation of a pair of lantern modes. From numerically simulated lantern principal modes of a 6-port PL, we find that interferometric observables using a PL behave similarly to separated-aperture visibilities for simple models on small angular scales ($<λ/D$) but with greater sensitivity to symmetries and capability to break phase angle degeneracies. Furthermore, we present simulated observations with wavefront errors and compare them to AMI. Despite the redundancy caused by extended lantern principal modes, spatial filtering offers stability to wavefront errors. Our simulated observations suggest that PLs may offer significant benefits in the photon noise-limited regime and in resolving small angular scales at low contrast regime.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
A magnetic reconnection model for the hot explosion with both ultraviolet and Hα wing emissions
Authors:
Guanchong Cheng,
Lei Ni,
Yajie Chen,
Jun Lin
Abstract:
Ellerman bombs (EBs) with significant H$α$ wing emissions and ultraviolet bursts (UV bursts) with strong Si IV emissions are two kinds of small transient brightening events that occur in the low solar atmosphere.We numerically investigated the magnetic reconnection process between the emerging arch magnetic field and the lower atmospheric background magnetic field. We aim to find out if the hot UV…
▽ More
Ellerman bombs (EBs) with significant H$α$ wing emissions and ultraviolet bursts (UV bursts) with strong Si IV emissions are two kinds of small transient brightening events that occur in the low solar atmosphere.We numerically investigated the magnetic reconnection process between the emerging arch magnetic field and the lower atmospheric background magnetic field. We aim to find out if the hot UV emissions and much colder H$α$ wing emissions can both appear in the same reconnection process and how they are located in the reconnection region. The open-source code NIRVANA was applied to perform the 2.5D magnetohydrodynamic (MHD) simulation. We developed the related sub-codes to include the more realistic radiative cooling process for the photosphere and chromosphere and the time-dependent ionization degree of hydrogen. The initial background magnetic field is 600 G, and the emerged magnetic field in the solar atmosphere is of the same magnitude, meaning that it results in a low- $β$ magnetic reconnection environment. We also used the radiative transfer code RH1.5D to synthesize the Si IV and H$α$ spectral line profiles based on the MHD simulation results. Magnetic reconnection between emerged and background magnetic fields creates a thin, curved current sheet, which then leads to the formation of plasmoid instability and the nonuniform density distributions. The mix of hot tenuous and much cooler dense plasmas in the turbulent reconnection region can appear at about the same height, or even in the same plasmoid. The turbulent current sheet is always in a dense plasma environment with an optical depth larger than 6.5$\times$10$^{-5}$ due to the emerged magnetic field pushing high-density plasmas upward.
△ Less
Submitted 20 February, 2024; v1 submitted 11 February, 2024;
originally announced February 2024.
-
Real-time Dynamics of the Schwinger Model as an Open Quantum System with Neural Density Operators
Authors:
Joshua Lin,
Di Luo,
Xiaojun Yao,
Phiala E. Shanahan
Abstract:
Ab-initio simulations of multiple heavy quarks propagating in a Quark-Gluon Plasma are computationally difficult to perform due to the large dimension of the space of density matrices. This work develops machine learning algorithms to overcome this difficulty by approximating exact quantum states with neural network parametrisations, specifically Neural Density Operators. As a proof of principle d…
▽ More
Ab-initio simulations of multiple heavy quarks propagating in a Quark-Gluon Plasma are computationally difficult to perform due to the large dimension of the space of density matrices. This work develops machine learning algorithms to overcome this difficulty by approximating exact quantum states with neural network parametrisations, specifically Neural Density Operators. As a proof of principle demonstration in a QCD-like theory, the approach is applied to solve the Lindblad master equation in the 1+1d lattice Schwinger Model as an open quantum system. Neural Density Operators enable the study of in-medium dynamics on large lattice volumes, where multiple-string interactions and their effects on string-breaking and recombination phenomena can be studied. Thermal properties of the system at equilibrium can also be probed with these methods by variationally constructing the steady state of the Lindblad master equation. Scaling of this approach with system size is studied, and numerical demonstrations on up to 32 spatial lattice sites and with up to 3 interacting strings are performed.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
Enhanced Frequency Conversion in Parity-Time Symmetry Line
Authors:
Jiankun Hou,
Jiefu Zhu,
Ruixin Ma,
Boyi Xue,
Yicheng Zhu,
**tian Lin,
Xiaoshun Jiang,
Yuanlin Zheng,
Xianfeng Chen,
Ya Cheng,
Li Ge,
Wenjie Wan
Abstract:
Non-Hermitian degeneracies reveal intriguing and non-trivial behaviors in open physical systems. Examples like Parity-Time (PT) symmetry breaking, topological encircling chirality, and enhanced sensing near an exceptional point (EP) are often associated with the abrupt nature of the phase transition around these degeneracies. Here we experimentally observe a cavity-enhanced second-harmonic frequen…
▽ More
Non-Hermitian degeneracies reveal intriguing and non-trivial behaviors in open physical systems. Examples like Parity-Time (PT) symmetry breaking, topological encircling chirality, and enhanced sensing near an exceptional point (EP) are often associated with the abrupt nature of the phase transition around these degeneracies. Here we experimentally observe a cavity-enhanced second-harmonic frequency (SHG) conversion on a PT symmetry line, i.e. a set consisting of open-ended isofrequency or isoloss lines, both terminated at EPs on the Riemann surface in parameter space. The enhancement factor can reach as high as 300, depending on the crossing point whether in the symmetry or the broken phase of the PT line. Moreover, such enhancement of SHG enables sensitive distance sensing with a nanometer resolution. Our works may pave the way for practical applications in sensing, frequency conversion, and coherent wave control.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay
Authors:
Daya Bay Collaboration,
F. P. An,
W. D. Bai,
A. B. Balantekin,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
H. Y. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
Z. Y. Chen,
J. Cheng,
Y. C. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
X. Y. Ding
, et al. (177 additional authors not shown)
Abstract:
Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546…
▽ More
Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546$\pm$0.076 for $^8$He, and 6.73$\pm$0.73, 6.75$\pm$0.70, and 13.74$\pm$0.82 for $^9$Li at average muon energies of 63.9~GeV, 64.7~GeV, and 143.0~GeV, respectively. The measured production rate of $^8$He isotopes is more than an order of magnitude lower than any other measurement of cosmogenic isotope production. It replaces the results of previous attempts to determine the ratio of $^8$He to $^9$Li production that yielded a wide range of limits from 0 to 30\%. The results provide future liquid-scintillator-based experiments with improved ability to predict cosmogenic backgrounds.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Searching for Giant Exoplanets around M-dwarf Stars (GEMS) I: Survey Motivation
Authors:
Shubham Kanodia,
Caleb I. Cañas,
Suvrath Mahadevan,
Eric B. Ford,
Ravit Helled,
Dana E. Anderson,
Alan Boss,
William D. Cochran,
Megan Delamer,
Te Han,
Jessica E. Libby-Roberts,
Andrea S. J. Lin,
Simon Müller,
Paul Robertson,
Guðmundur Stefánsson,
Johanna Teske
Abstract:
Recent discoveries of transiting giant exoplanets around M-dwarf stars (GEMS), aided by the all-sky coverage of TESS, are starting to stretch theories of planet formation through the core-accretion scenario. Recent upper limits on their occurrence suggest that they decrease with lower stellar masses, with fewer GEMS around lower-mass stars compared to solar-type. In this paper, we discuss existing…
▽ More
Recent discoveries of transiting giant exoplanets around M-dwarf stars (GEMS), aided by the all-sky coverage of TESS, are starting to stretch theories of planet formation through the core-accretion scenario. Recent upper limits on their occurrence suggest that they decrease with lower stellar masses, with fewer GEMS around lower-mass stars compared to solar-type. In this paper, we discuss existing GEMS both through confirmed planets, as well as protoplanetary disk observations, and a combination of tests to reconcile these with theoretical predictions. We then introduce the \textit{Searching for GEMS} survey, where we utilize multi-dimensional nonparameteric statistics to simulate hypothetical survey scenarios to predict the required sample size of transiting GEMS with mass measurements to robustly compare their bulk-density with canonical hot-Jupiters orbiting FGK stars. Our Monte-Carlo simulations predict that a robust comparison requires about 40 transiting GEMS (compared to the existing sample of $\sim$ 15) with 5-$σ$ mass measurements. Furthermore, we discuss the limitations of existing occurrence estimates for GEMS, and provide a brief description of our planned systematic search to improve the occurrence rate estimates for GEMS.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Authors:
Gilles Baechler,
Srinivas Sunkara,
Maria Wang,
Fedir Zubach,
Hassan Mansoor,
Vincent Etter,
Victor Cărbune,
Jason Lin,
**dong Chen,
Abhanshu Sharma
Abstract:
Screen user interfaces (UIs) and infographics, sharing similar visual language and design principles, play important roles in human communication and human-machine interaction. We introduce ScreenAI, a vision-language model that specializes in UI and infographics understanding. Our model improves upon the PaLI architecture with the flexible patching strategy of pix2struct and is trained on a uniqu…
▽ More
Screen user interfaces (UIs) and infographics, sharing similar visual language and design principles, play important roles in human communication and human-machine interaction. We introduce ScreenAI, a vision-language model that specializes in UI and infographics understanding. Our model improves upon the PaLI architecture with the flexible patching strategy of pix2struct and is trained on a unique mixture of datasets. At the heart of this mixture is a novel screen annotation task in which the model has to identify the type and location of UI elements. We use these text annotations to describe screens to Large Language Models and automatically generate question-answering (QA), UI navigation, and summarization training datasets at scale. We run ablation studies to demonstrate the impact of these design choices. At only 5B parameters, ScreenAI achieves new state-of-the-artresults on UI- and infographics-based tasks (Multi-page DocVQA, WebSRC, MoTIF and Widget Captioning), and new best-in-class performance on others (Chart QA, DocVQA, and InfographicVQA) compared to models of similar size. Finally, we release three new datasets: one focused on the screen annotation task and two others focused on question answering.
△ Less
Submitted 4 July, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
PAC-Bayesian Adversarially Robust Generalization Bounds for Graph Neural Network
Authors:
Tan Sun,
Junhong Lin
Abstract:
Graph neural networks (GNNs) have gained popularity for various graph-related tasks. However, similar to deep neural networks, GNNs are also vulnerable to adversarial attacks. Empirical studies have shown that adversarially robust generalization has a pivotal role in establishing effective defense algorithms against adversarial attacks. In this paper, we contribute by providing adversarially robus…
▽ More
Graph neural networks (GNNs) have gained popularity for various graph-related tasks. However, similar to deep neural networks, GNNs are also vulnerable to adversarial attacks. Empirical studies have shown that adversarially robust generalization has a pivotal role in establishing effective defense algorithms against adversarial attacks. In this paper, we contribute by providing adversarially robust generalization bounds for two kinds of popular GNNs, graph convolutional network (GCN) and message passing graph neural network, using the PAC-Bayesian framework. Our result reveals that spectral norm of the diffusion matrix on the graph and spectral norm of the weights as well as the perturbation factor govern the robust generalization bounds of both models. Our bounds are nontrivial generalizations of the results developed in (Liao et al., 2020) from the standard setting to adversarial setting while avoiding exponential dependence of the maximum node degree. As corollaries, we derive better PAC-Bayesian robust generalization bounds for GCN in the standard setting, which improve the bounds in (Liao et al., 2020) by avoiding exponential dependence on the maximum node degree.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions
Authors:
Yusu Hong,
Junhong Lin
Abstract:
The Adaptive Momentum Estimation (Adam) algorithm is highly effective in training various deep learning tasks. Despite this, there's limited theoretical understanding for Adam, especially when focusing on its vanilla form in non-convex smooth scenarios with potential unbounded gradients and affine variance noise. In this paper, we study vanilla Adam under these challenging conditions. We introduce…
▽ More
The Adaptive Momentum Estimation (Adam) algorithm is highly effective in training various deep learning tasks. Despite this, there's limited theoretical understanding for Adam, especially when focusing on its vanilla form in non-convex smooth scenarios with potential unbounded gradients and affine variance noise. In this paper, we study vanilla Adam under these challenging conditions. We introduce a comprehensive noise model which governs affine variance noise, bounded noise and sub-Gaussian noise. We show that Adam can find a stationary point with a $\mathcal{O}(\text{poly}(\log T)/\sqrt{T})$ rate in high probability under this general noise model where $T$ denotes total number iterations, matching the lower rate of stochastic first-order algorithms up to logarithm factors. More importantly, we reveal that Adam is free of tuning step-sizes with any problem-parameters, yielding a better adaptation property than the Stochastic Gradient Descent under the same conditions. We also provide a probabilistic convergence result for Adam under a generalized smooth condition which allows unbounded smoothness parameters and has been illustrated empirically to more accurately capture the smooth property of many practical objective functions.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
CAMBranch: Contrastive Learning with Augmented MILPs for Branching
Authors:
Jiacheng Lin,
Meng Xu,
Zhihua Xiong,
Huangang Wang
Abstract:
Recent advancements have introduced machine learning frameworks to enhance the Branch and Bound (B\&B) branching policies for solving Mixed Integer Linear Programming (MILP). These methods, primarily relying on imitation learning of Strong Branching, have shown superior performance. However, collecting expert samples for imitation learning, particularly for Strong Branching, is a time-consuming en…
▽ More
Recent advancements have introduced machine learning frameworks to enhance the Branch and Bound (B\&B) branching policies for solving Mixed Integer Linear Programming (MILP). These methods, primarily relying on imitation learning of Strong Branching, have shown superior performance. However, collecting expert samples for imitation learning, particularly for Strong Branching, is a time-consuming endeavor. To address this challenge, we propose \textbf{C}ontrastive Learning with \textbf{A}ugmented \textbf{M}ILPs for \textbf{Branch}ing (CAMBranch), a framework that generates Augmented MILPs (AMILPs) by applying variable shifting to limited expert data from their original MILPs. This approach enables the acquisition of a considerable number of labeled expert samples. CAMBranch leverages both MILPs and AMILPs for imitation learning and employs contrastive learning to enhance the model's ability to capture MILP features, thereby improving the quality of branching decisions. Experimental results demonstrate that CAMBranch, trained with only 10\% of the complete dataset, exhibits superior performance. Ablation studies further validate the effectiveness of our method.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
A Priori Error Estimation of Physics-Informed Neural Networks Solving Allen--Cahn and Cahn--Hilliard Equations
Authors:
Guangtao Zhang,
Jiani Lin,
Qijia Zhai,
Huiyu Yang,
Xujun Chen,
Xiaoning Zheng,
Ieng Tak Leong
Abstract:
This paper aims to analyze errors in the implementation of the Physics-Informed Neural Network (PINN) for solving the Allen--Cahn (AC) and Cahn--Hilliard (CH) partial differential equations (PDEs). The accuracy of PINN is still challenged when dealing with strongly non-linear and higher-order time-varying PDEs. To address this issue, we introduce a stable and bounded self-adaptive weighting scheme…
▽ More
This paper aims to analyze errors in the implementation of the Physics-Informed Neural Network (PINN) for solving the Allen--Cahn (AC) and Cahn--Hilliard (CH) partial differential equations (PDEs). The accuracy of PINN is still challenged when dealing with strongly non-linear and higher-order time-varying PDEs. To address this issue, we introduce a stable and bounded self-adaptive weighting scheme known as Residuals-RAE, which ensures fair training and effectively captures the solution. By incorporating this new training loss function, we conduct numerical experiments on 1D and 2D AC and CH systems to validate our theoretical findings. Our theoretical analysis demonstrates that feedforward neural networks with two hidden layers and tanh activation function effectively bound the PINN approximation errors for the solution field, temporal derivative, and nonlinear term of the AC and CH equations by the training loss and number of collocation points.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
Minute-Cadence Observations of the LAMOST Fields with the TMTS V. Machine Learning Classification of TMTS Catalogues of Periodic Variable Stars
Authors:
Fangzhou Guo,
Jie Lin,
Xiaofeng Wang,
Xiaodian Chen,
Tanda Li,
Liyang Chen,
Qiqi Xia,
Jun Mo,
Gaobo Xi,
Jicheng Zhang,
Qichun Liu,
Xiaojun Jiang,
Shengyu Yan,
Haowei Peng,
Jialian Liu,
Wenxiong Li,
Weili Lin,
Danfeng Xiang,
Xiaoran Ma,
Yongzhi Cai
Abstract:
Periodic variables are always of great scientific interest in astrophysics. Thanks to the rapid advancement of modern large-scale time-domain surveys, the number of reported variable stars has experienced substantial growth for several decades, which significantly deepened our comprehension of stellar structure and binary evolution. The Tsinghua University-Ma Huateng Telescopes for Survey (TMTS) h…
▽ More
Periodic variables are always of great scientific interest in astrophysics. Thanks to the rapid advancement of modern large-scale time-domain surveys, the number of reported variable stars has experienced substantial growth for several decades, which significantly deepened our comprehension of stellar structure and binary evolution. The Tsinghua University-Ma Huateng Telescopes for Survey (TMTS) has started to monitor the LAMOST sky areas since 2020, with a cadence of 1 minute. During the period from 2020 to 2022, this survey has resulted in densely sampled light curves for ~ 30,000 variables of the maximum powers in the Lomb-Scargle periodogram above the 5sigma threshold. In this paper, we classified 11,638 variable stars into 6 main types using XGBoost and Random Forest classifiers with accuracies of 98.83% and 98.73%, respectively. Among them, 5301 (45.55%) variables are newly discovered, primarily consisting of Delta Scuti stars, demonstrating the capability of TMTS in searching for short-period variables. We cross-matched the catalogue with Gaia's second Data Release (DR2) and LAMOST's seventh Data Release (DR7) to obtain important physical parameters of the variables. We identified 5504 Delta Scuti stars (including 4876 typical Delta Scuti stars and 628 high-amplitude Delta Scuti stars), 5899 eclipsing binaries (including EA-, EB- and EW-type) and 226 candidates of RS Canum Venaticorum. Leveraging the metal abundance data provided by LAMOST and the Galactic latitude, we discovered 8 candidates of SX Phe stars within the class of "Delta Scuti stars". Moreover, with the help of Gaia color-magnitude diagram, we identified 9 ZZ ceti stars.
△ Less
Submitted 4 February, 2024;
originally announced February 2024.
-
Benchmarking Spiking Neural Network Learning Methods with Varying Locality
Authors:
Jiaqi Lin,
Sen Lu,
Malyaban Bal,
Abhronil Sengupta
Abstract:
Spiking Neural Networks (SNNs), providing more realistic neuronal dynamics, have shown to achieve performance comparable to Artificial Neural Networks (ANNs) in several machine learning tasks. Information is processed as spikes within SNNs in an event-based mechanism that significantly reduces energy consumption. However, training SNNs is challenging due to the non-differentiable nature of the spi…
▽ More
Spiking Neural Networks (SNNs), providing more realistic neuronal dynamics, have shown to achieve performance comparable to Artificial Neural Networks (ANNs) in several machine learning tasks. Information is processed as spikes within SNNs in an event-based mechanism that significantly reduces energy consumption. However, training SNNs is challenging due to the non-differentiable nature of the spiking mechanism. Traditional approaches, such as Backpropagation Through Time (BPTT), have shown effectiveness but comes with additional computational and memory costs and are biologically implausible. In contrast, recent works propose alternative learning methods with varying degrees of locality, demonstrating success in classification tasks. In this work, we show that these methods share similarities during the training process, while they present a trade-off between biological plausibility and performance. Further, this research examines the implicitly recurrent nature of SNNs and investigates the influence of addition of explicit recurrence to SNNs. We experimentally prove that the addition of explicit recurrent weights enhances the robustness of SNNs. We also investigate the performance of local learning methods under gradient and non-gradient based adversarial attacks.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
3DG: A Framework for Using Generative AI for Handling Sparse Learner Performance Data From Intelligent Tutoring Systems
Authors:
Liang Zhang,
Jionghao Lin,
Conrad Borchers,
Meng Cao,
Xiangen Hu
Abstract:
Learning performance data (e.g., quiz scores and attempts) is significant for understanding learner engagement and knowledge mastery level. However, the learning performance data collected from Intelligent Tutoring Systems (ITSs) often suffers from sparsity, impacting the accuracy of learner modeling and knowledge assessments. To address this, we introduce the 3DG framework (3-Dimensional tensor f…
▽ More
Learning performance data (e.g., quiz scores and attempts) is significant for understanding learner engagement and knowledge mastery level. However, the learning performance data collected from Intelligent Tutoring Systems (ITSs) often suffers from sparsity, impacting the accuracy of learner modeling and knowledge assessments. To address this, we introduce the 3DG framework (3-Dimensional tensor for Densification and Generation), a novel approach combining tensor factorization with advanced generative models, including Generative Adversarial Network (GAN) and Generative Pre-trained Transformer (GPT), for enhanced data imputation and augmentation. The framework operates by first representing the data as a three-dimensional tensor, capturing dimensions of learners, questions, and attempts. It then densifies the data through tensor factorization and augments it using Generative AI models, tailored to individual learning patterns identified via clustering. Applied to data from an AutoTutor lesson by the Center for the Study of Adult Literacy (CSAL), the 3DG framework effectively generated scalable, personalized simulations of learning performance. Comparative analysis revealed GAN's superior reliability over GPT-4 in this context, underscoring its potential in addressing data sparsity challenges in ITSs and contributing to the advancement of personalized educational technology.
△ Less
Submitted 29 January, 2024;
originally announced February 2024.
-
Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion
Authors:
Zexi Li,
Zhiqi Li,
Jie Lin,
Tao Shen,
Tao Lin,
Chao Wu
Abstract:
In deep learning, stochastic gradient descent often yields functionally similar yet widely scattered solutions in the weight space even under the same initialization, causing barriers in the Linear Mode Connectivity (LMC) landscape. Overcoming these barriers is crucial for understanding deep learning dynamics and enhancing model-fusion algorithms. Previous studies highlight the role of permutation…
▽ More
In deep learning, stochastic gradient descent often yields functionally similar yet widely scattered solutions in the weight space even under the same initialization, causing barriers in the Linear Mode Connectivity (LMC) landscape. Overcoming these barriers is crucial for understanding deep learning dynamics and enhancing model-fusion algorithms. Previous studies highlight the role of permutation symmetry in reducing post-training barriers through network permutation. However, these post-hoc methods, demanding extra computations, are less effective for larger, complex models (e.g., ViT, LLM) due to numerous permutation matrices. Thus, in this paper, we study training-time neuron alignment. Our hypothesis suggests that training-time permutation subspace can reduce LMC barriers for free. We find that pruning at initialization supports this. Beyond pruning, we introduce TNA-PFN, a simple yet lossless algorithm using a partial gradient mask during training. TNA-PFN is theoretically and empirically validated for reducing LMC barriers. It excels in wide model fusion applications, especially in federated learning, two algorithms based on TNA-FPN that are proposed to show its prospects even under heterogeneous datasets. Moreover, TNA-PFN can enhance the generalization of model soup for vision transformers and ColD fusion for pretrained language models.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Deep Room Impulse Response Completion
Authors:
Jackie Lin,
Georg Götz,
Sebastian J. Schlecht
Abstract:
Rendering immersive spatial audio in virtual reality (VR) and video games demands a fast and accurate generation of room impulse responses (RIRs) to recreate auditory environments plausibly. However, the conventional methods for simulating or measuring long RIRs are either computationally intensive or challenged by low signal-to-noise ratios. This study is propelled by the insight that direct soun…
▽ More
Rendering immersive spatial audio in virtual reality (VR) and video games demands a fast and accurate generation of room impulse responses (RIRs) to recreate auditory environments plausibly. However, the conventional methods for simulating or measuring long RIRs are either computationally intensive or challenged by low signal-to-noise ratios. This study is propelled by the insight that direct sound and early reflections encapsulate sufficient information about room geometry and absorption characteristics. Building upon this premise, we propose a novel task termed "RIR completion," aimed at synthesizing the late reverberation given only the early portion (50 ms) of the response. To this end, we introduce DECOR, Deep Exponential Completion Of Room impulse responses, a deep neural network structured as an autoencoder designed to predict multi-exponential decay envelopes of filtered noise sequences. The interpretability of DECOR's output facilitates its integration with diverse rendering techniques. The proposed method is compared against an adapted state-of-the-art network, and comparable performance shows promising results supporting the feasibility of the RIR completion task. The RIR completion can be widely adapted to enhance RIR generation tasks where fast late reverberation approximation is required.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
GUMsley: Evaluating Entity Salience in Summarization for 12 English Genres
Authors:
Jessica Lin,
Amir Zeldes
Abstract:
As NLP models become increasingly capable of understanding documents in terms of coherent entities rather than strings, obtaining the most salient entities for each document is not only an important end task in itself but also vital for Information Retrieval (IR) and other downstream applications such as controllable summarization. In this paper, we present and evaluate GUMsley, the first entity s…
▽ More
As NLP models become increasingly capable of understanding documents in terms of coherent entities rather than strings, obtaining the most salient entities for each document is not only an important end task in itself but also vital for Information Retrieval (IR) and other downstream applications such as controllable summarization. In this paper, we present and evaluate GUMsley, the first entity salience dataset covering all named and non-named salient entities for 12 genres of English text, aligned with entity types, Wikification links and full coreference resolution annotations. We promote a strict definition of salience using human summaries and demonstrate high inter-annotator agreement for salience based on whether a source entity is mentioned in the summary. Our evaluation shows poor performance by pre-trained SOTA summarization models and zero-shot LLM prompting in capturing salient entities in generated summaries. We also show that predicting or providing salient entities to several model architectures enhances performance and helps derive higher-quality summaries by alleviating the entity hallucination problem in existing abstractive summarization.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
Electronic conduction and superconducting properties of CoSi$_2$ films on silicon--an unconventional superconductor with technological potential
Authors:
Shao-Pin Chiu,
Chang-Jan Wang,
Yi-Chun Lin,
Shun-Tast Tu,
Shouray Sahu,
Ruey-Tay Wang,
Chih-Yuan Wu,
Sheng-Shiuan Yeh,
Stefan Kirchner,
Juhn-Jong Lin
Abstract:
We report observations of unusual normal-state electronic conduction properties and superconducting characteristics of high-quality CoSi$_2$/Si films grown on silicon Si(100) and Si(111) substrates. A good understanding of these features shall help to address the underlying physics of the unconventional pairing symmetry recently observed in transparent CoSi$_2$/TiSi$_2$ heterojunctions [S. P. Chiu…
▽ More
We report observations of unusual normal-state electronic conduction properties and superconducting characteristics of high-quality CoSi$_2$/Si films grown on silicon Si(100) and Si(111) substrates. A good understanding of these features shall help to address the underlying physics of the unconventional pairing symmetry recently observed in transparent CoSi$_2$/TiSi$_2$ heterojunctions [S. P. Chiu \textit{et al.}, Sci. Adv. \textbf{7}, eabg6569 (2021); Nanoscale \textbf{15}, 9179 (2023)], where CoSi$_2$/Si is a superconductor with a superconducting transition temperature $T_c \simeq$ (1.1--1.5) K, dependent on its dimensions, and TiSi$_2$ is a normal metal. In CoSi$_2$/Si films, we find a pronounced positive magnetoresistance caused by the weak-antilocalization effect, indicating a strong Rashba spin-orbit coupling (SOC). This SOC generates two-component superconductivity in CoSi$_2$/TiSi$_2$ heterojunctions. The CoSi$_2$/Si films are stable under ambient conditions and have ultralow 1/$f$ noise. Moreover, they can be patterned via the standard lithography techniques, which might be of considerable practical value for future scalable superconducting and quantum device fabrication.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
Poynting-Robertson dam** of laser beam driven lightsails
Authors:
Rhys Mackintosh,
Jadon Y. Lin,
Michael S. Wheatland,
Boris T. Kuhlmey
Abstract:
Lightsails using Earth-based lasers for propulsion require passive stabilization to stay within the beam. This can be achieved through the sail's scattering properties, creating optical restoring forces and torques. Undamped restoring forces produce uncontrolled oscillations, which could jeopardize the mission, but it is not obvious how to achieve dam** in the vacuum of space. Using a simple two…
▽ More
Lightsails using Earth-based lasers for propulsion require passive stabilization to stay within the beam. This can be achieved through the sail's scattering properties, creating optical restoring forces and torques. Undamped restoring forces produce uncontrolled oscillations, which could jeopardize the mission, but it is not obvious how to achieve dam** in the vacuum of space. Using a simple two-dimensional model we show that the Doppler effect and relativistic aberration of the propelling laser beam create dam** terms in the optical forces and torques. The effect is similar to the Poynting-Robertson effect causing loss of orbital momentum of dust particles around stars, but can be enhanced by design of the sail's geometry.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
A Scalable RISC-V Vector Processor Enabling Efficient Multi-Precision DNN Inference
Authors:
Chuanning Wang,
Chao Fang,
Xiao Wu,
Zhongfeng Wang,
Jun Lin
Abstract:
RISC-V processors encounter substantial challenges in deploying multi-precision deep neural networks (DNNs) due to their restricted precision support, constrained throughput, and suboptimal dataflow design. To tackle these challenges, a scalable RISC-V vector (RVV) processor, namely SPEED, is proposed to enable efficient multi-precision DNN inference by innovations from customized instructions, ha…
▽ More
RISC-V processors encounter substantial challenges in deploying multi-precision deep neural networks (DNNs) due to their restricted precision support, constrained throughput, and suboptimal dataflow design. To tackle these challenges, a scalable RISC-V vector (RVV) processor, namely SPEED, is proposed to enable efficient multi-precision DNN inference by innovations from customized instructions, hardware architecture, and dataflow map**. Firstly, dedicated customized RISC-V instructions are proposed based on RVV extensions, providing SPEED with fine-grained control over processing precision ranging from 4 to 16 bits. Secondly, a parameterized multi-precision systolic array unit is incorporated within the scalable module to enhance parallel processing capability and data reuse opportunities. Finally, a mixed multi-precision dataflow strategy, compatible with different convolution kernels and data precision, is proposed to effectively improve data utilization and computational efficiency. We perform synthesis of SPEED in TSMC 28nm technology. The experimental results demonstrate that SPEED achieves a peak throughput of 287.41 GOPS and an energy efficiency of 1335.79 GOPS/W at 4-bit precision condition, respectively. Moreover, when compared to the pioneer open-source vector processor Ara, SPEED provides an area efficiency improvement of 2.04$\times$ and 1.63$\times$ under 16-bit and 8-bit precision conditions, respectively, which shows SPEED's significant potential for efficient multi-precision DNN inference.
△ Less
Submitted 31 January, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
Spot the Error: Non-autoregressive Graphic Layout Generation with Wireframe Locator
Authors:
Jieru Lin,
Danqing Huang,
Tiejun Zhao,
Dechen Zhan,
Chin-Yew Lin
Abstract:
Layout generation is a critical step in graphic design to achieve meaningful compositions of elements. Most previous works view it as a sequence generation problem by concatenating element attribute tokens (i.e., category, size, position). So far the autoregressive approach (AR) has achieved promising results, but is still limited in global context modeling and suffers from error propagation since…
▽ More
Layout generation is a critical step in graphic design to achieve meaningful compositions of elements. Most previous works view it as a sequence generation problem by concatenating element attribute tokens (i.e., category, size, position). So far the autoregressive approach (AR) has achieved promising results, but is still limited in global context modeling and suffers from error propagation since it can only attend to the previously generated tokens. Recent non-autoregressive attempts (NAR) have shown competitive results, which provides a wider context range and the flexibility to refine with iterative decoding. However, current works only use simple heuristics to recognize erroneous tokens for refinement which is inaccurate. This paper first conducts an in-depth analysis to better understand the difference between the AR and NAR framework. Furthermore, based on our observation that pixel space is more sensitive in capturing spatial patterns of graphic layouts (e.g., overlap, alignment), we propose a learning-based locator to detect erroneous tokens which takes the wireframe image rendered from the generated layout sequence as input. We show that it serves as a complementary modality to the element sequence in object space and contributes greatly to the overall performance. Experiments on two public datasets show that our approach outperforms both AR and NAR baselines. Extensive studies further prove the effectiveness of different modules with interesting findings. Our code will be available at https://github.com/ffffatgoose/SpotError.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Evaluation of k-means time series clustering based on z-normalization and NP-Free
Authors:
Ming-Chang Lee,
Jia-Chun Lin,
Volker Stolz
Abstract:
Despite the widespread use of k-means time series clustering in various domains, there exists a gap in the literature regarding its comprehensive evaluation with different time series normalization approaches. This paper seeks to fill this gap by conducting a thorough performance evaluation of k-means time series clustering on real-world open-source time series datasets. The evaluation focuses on…
▽ More
Despite the widespread use of k-means time series clustering in various domains, there exists a gap in the literature regarding its comprehensive evaluation with different time series normalization approaches. This paper seeks to fill this gap by conducting a thorough performance evaluation of k-means time series clustering on real-world open-source time series datasets. The evaluation focuses on two distinct normalization techniques: z-normalization and NP-Free. The former is one of the most commonly used normalization approach for time series. The latter is a real-time time series representation approach, which can serve as a time series normalization approach. The primary objective of this paper is to assess the impact of these two normalization techniques on k-means time series clustering in terms of its clustering quality. The experiments employ the silhouette score, a well-established metric for evaluating the quality of clusters in a dataset. By systematically investigating the performance of k-means time series clustering with these two normalization techniques, this paper addresses the current gap in k-means time series clustering evaluation and contributes valuable insights to the development of time series clustering.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
Variable white dwarfs in TMTS: Asteroseismological analysis of a ZZ Ceti star, TMTS J17184064+2524314
Authors:
**cheng Guo,
Yanhui Chen,
Yonghui Yang,
Xiaofeng Wang,
Jie Lin,
Xiao-Yu Ma,
Gaobo Xi,
Jun Mo,
Alexei V. Filippenko,
Thomas G. Brink,
Weikai Zong,
Huahui Yan,
**gkun Zhao,
Xiangyun Zeng,
Zhihao Chen,
Ali Esamdin,
Fangzhou Guo,
Abdusamatjan Iskandar,
Xiaojun Jiang,
Wenxiong Li,
Cheng Liu,
Jianrong Shi,
Xuan Song,
Letian Wang,
Danfeng Xiang
, et al. (2 additional authors not shown)
Abstract:
The Tsinghua University-Ma Huateng Telescope for Survey (TMTS) has been constantly monitoring the northern sky since 2020 in search of rapidly variable stars. To find variable white dwarfs (WDs), the TMTS catalog is cross-matched with the WD catalog of Gaia EDR3, resulting in over 3000 light curves of WD candidates. The WD TMTS J17184064+2524314 (hereafter J1718) is the second ZZ~Ceti star discove…
▽ More
The Tsinghua University-Ma Huateng Telescope for Survey (TMTS) has been constantly monitoring the northern sky since 2020 in search of rapidly variable stars. To find variable white dwarfs (WDs), the TMTS catalog is cross-matched with the WD catalog of Gaia EDR3, resulting in over 3000 light curves of WD candidates. The WD TMTS J17184064+2524314 (hereafter J1718) is the second ZZ~Ceti star discovered among these common sources. Based on the light curves from TMTS, follow-up photometric observations, and TESS, 10 periods and 3 combination periods are detected. A rotation period of $25.12\pm0.18$ hr is derived, according to the identified rotational splitting. Our spectroscopic observation indicates that this WD belongs to DA type with $T_{\rm eff}=11,670\pm604$ K, log $g=8.16\pm0.36$, $M = 0.70\pm0.23$ M$_{\odot}$, and age=$0.51\pm0.34$ Gyr. Based on core-parameterized asteroseismological model grids ($\geqslant$ 14 million), we derive a best-fit solution of $T_{\rm eff}=11,640\pm20$ K, log $g=8.267\pm0.008$, and $M = 0.750\pm0.005$ M$_{\odot}$ for J1718, consistent with the spectral fitting results. For this WD, the corresponding carbon and oxygen abundances in the core are 0.43 and 0.57, respectively. The distance derived from the intrinsic luminosity given by asteroseismology is $64\pm15$ pc, in accord with the distance of $70.1\pm0.2$ pc from Gaia DR3 within the uncertainties.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Authors:
Tianhe Ren,
Shilong Liu,
Ailing Zeng,
**g Lin,
Kunchang Li,
He Cao,
Jiayu Chen,
Xinyu Huang,
Yukang Chen,
Feng Yan,
Zhaoyang Zeng,
Hao Zhang,
Feng Li,
Jie Yang,
Hongyang Li,
Qing Jiang,
Lei Zhang
Abstract:
We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to combine with the segment anything model (SAM). This integration enables the detection and segmentation of any regions based on arbitrary text inputs and opens a door to connecting various vision models. As shown in Fig.1, a wide range of vision tasks can be achieved by using the versatile Grounded SAM pipeline.…
▽ More
We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to combine with the segment anything model (SAM). This integration enables the detection and segmentation of any regions based on arbitrary text inputs and opens a door to connecting various vision models. As shown in Fig.1, a wide range of vision tasks can be achieved by using the versatile Grounded SAM pipeline. For example, an automatic annotation pipeline based solely on input images can be realized by incorporating models such as BLIP and Recognize Anything. Additionally, incorporating Stable-Diffusion allows for controllable image editing, while the integration of OSX facilitates promptable 3D human motion analysis. Grounded SAM also shows superior performance on open-vocabulary benchmarks, achieving 48.7 mean AP on SegInW (Segmentation in the wild) zero-shot benchmark with the combination of Grounding DINO-Base and SAM-Huge models.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Explicit evaluation of the Stokes matrices for certain quantum confluent hypergeometric equations
Authors:
**ghong Lin,
Xiaomeng Xu
Abstract:
In this paper, we compute the Stokes matrices of a special quantum confluent hypergeometric system with Poincaré rank one. The sources of the interests in the Stokes phenomenon of such system are from representation theory and the theory of isomonodromy deformation.
In this paper, we compute the Stokes matrices of a special quantum confluent hypergeometric system with Poincaré rank one. The sources of the interests in the Stokes phenomenon of such system are from representation theory and the theory of isomonodromy deformation.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Wave-graphene: a full-auxetic carbon semiconductor with high flexibility and optical UV absorption
Authors:
Linfeng Yu,
Yi Zhang,
Jianzhou Lin,
Kexin Dong,
Xiong Zheng,
Zhenzhen Qin,
Guangzhao Qin
Abstract:
The abundant bonding possibilities of Carbon stimulate the design of numerous carbon allotropes, promising the foundation for exploring structure-functionality relationships. Herein, utilizing the space bending strategy, we successfully engineered a two-dimensional carbon allotrope with pure sp2 hybridization, named "Wave-graphene" from the unique wave-like ripple structure. The novel Wave-graphen…
▽ More
The abundant bonding possibilities of Carbon stimulate the design of numerous carbon allotropes, promising the foundation for exploring structure-functionality relationships. Herein, utilizing the space bending strategy, we successfully engineered a two-dimensional carbon allotrope with pure sp2 hybridization, named "Wave-graphene" from the unique wave-like ripple structure. The novel Wave-graphene exhibits full-auxetic behavior due to anisotropic mechanical response, possessing both negative and zero Poisson's ratios. The fundamental mechanism can be attributed to the fact that highly buckled out-of-plane structures lead to anisotropic responses of in-plane nonlinear interactions, which further lead to anisotropy of lattice vibrations. In addition, Wave-graphene is found having quasi-direct wide bandgap of 2.01 eV, the excellent optical transparency and the high flexibility. The successful design of Wave-graphene with excellent outstanding multifunctional properties shows that the utilization of space bending strategies can provide more degrees of freedom for designing novel materials, further enriching the carbon material family and supplementing its versatility.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints
Authors:
Jiabin Lin,
Shana Moothedath
Abstract:
We present the problem of conservative distributed multi-task learning in stochastic linear contextual bandits with heterogeneous agents. This extends conservative linear bandits to a distributed setting where M agents tackle different but related tasks while adhering to stage-wise performance constraints. The exact context is unknown, and only a context distribution is available to the agents as…
▽ More
We present the problem of conservative distributed multi-task learning in stochastic linear contextual bandits with heterogeneous agents. This extends conservative linear bandits to a distributed setting where M agents tackle different but related tasks while adhering to stage-wise performance constraints. The exact context is unknown, and only a context distribution is available to the agents as in many practical applications that involve a prediction mechanism to infer context, such as stock market prediction and weather forecast. We propose a distributed upper confidence bound (UCB) algorithm, DiSC-UCB. Our algorithm constructs a pruned action set during each round to ensure the constraints are met. Additionally, it includes synchronized sharing of estimates among agents via a central server using well-structured synchronization steps. We prove the regret and communication bounds on the algorithm. We extend the problem to a setting where the agents are unaware of the baseline reward. For this setting, we provide a modified algorithm, DiSC-UCB2, and we show that the modified algorithm achieves the same regret and communication bounds. We empirically validated the performance of our algorithm on synthetic data and real-world Movielens-100K data.
△ Less
Submitted 9 April, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
Human-Centric and Integrative Lighting Asset Management in Public Libraries: Qualitative Insights and Challenges from a Swedish Field Study
Authors:
**g Lin,
Per Olof Hedekvist,
Nina Mylly,
Math Bollen,
**gchun Shen,
Jiawei Xiong,
Christofer Silfvenius
Abstract:
Traditional lighting source reliability evaluations, often covering just half of a lamp's volume, can misrepresent real-world performance. To overcome these limitations,adopting advanced asset management strategies for a more holistic evaluation is crucial. This paper investigates human-centric and integrative lighting asset management in Swedish public libraries. Through field observations, inter…
▽ More
Traditional lighting source reliability evaluations, often covering just half of a lamp's volume, can misrepresent real-world performance. To overcome these limitations,adopting advanced asset management strategies for a more holistic evaluation is crucial. This paper investigates human-centric and integrative lighting asset management in Swedish public libraries. Through field observations, interviews, and gap analysis, the study highlights a disparity between current lighting conditions and stakeholder expectations, with issues like eye strain suggesting significant improvement potential. We propose a shift towards more dynamic lighting asset management and reliability evaluations, emphasizing continuous enhancement and comprehensive training in human-centric and integrative lighting principles.
△ Less
Submitted 5 April, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Upper bound of the lifespan of the solution to the nonlinear fractional wave equations with time-dependent dam**
Authors:
Jiayun Lin,
Masahiro Ikeda
Abstract:
In this paper, we study the Cauchy problem of the nonlinear wave equation with fractional Laplacian and time-dependent dam**. Firstly, we derive the weighted Sobolev estimate of the solution operators for the linear wave equation with the dam** of constant coefficient, and prove the local existence and uniqueness in the weighted Sobolev space for the power-type nonlinearity and…
▽ More
In this paper, we study the Cauchy problem of the nonlinear wave equation with fractional Laplacian and time-dependent dam**. Firstly, we derive the weighted Sobolev estimate of the solution operators for the linear wave equation with the dam** of constant coefficient, and prove the local existence and uniqueness in the weighted Sobolev space for the power-type nonlinearity and $b(t)\in L^\infty$, by the contraction map** principle. Secondly, we consider the case of the source nonlinearity $f(u)\approx |u|^p$. In the subcritical and critical cases $1<p\leq p_c=1+\frac σN$, based on the blow-up result on the ordinary differential inequality, we could prove the blow-up of the solution and obtain the upper bound of the lifespan. And the upper bound of the lifespan in the critical case is independent on the coefficient of the time-dependent dam** and is completely new even if the classical case $b(t)=1$.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis
Authors:
Chao Zhang,
Yuren Mao,
Yijiang Fan,
Yu Mi,
Yunjun Gao,
Lu Chen,
Dongfang Lou,
**shu Lin
Abstract:
Text-to-SQL, which provides zero-code interface for operating relational databases, has gained much attention in financial analysis; because, financial professionals may not well-skilled in SQL programming. However, until now, there is no practical Text-to-SQL benchmark dataset for financial analysis, and existing Text-to-SQL methods have not considered the unique characteristics of databases in f…
▽ More
Text-to-SQL, which provides zero-code interface for operating relational databases, has gained much attention in financial analysis; because, financial professionals may not well-skilled in SQL programming. However, until now, there is no practical Text-to-SQL benchmark dataset for financial analysis, and existing Text-to-SQL methods have not considered the unique characteristics of databases in financial applications, such as commonly existing wide tables. To address these issues, we collect a practical Text-to-SQL benchmark dataset and propose a model-agnostic Large Language Model (LLMs)-based Text-to-SQL framework for financial analysis. The benchmark dataset, BULL, is collected from the practical financial analysis business of Hundsun Technologies Inc., including databases for fund, stock, and macro economy. Besides, the proposed LLMs-based Text-to-SQL framework, FinSQL, provides a systematic treatment for financial Text-to-SQL from the perspectives of prompt construction, parameter-efficient fine-tuning and output calibration. Extensive experimental results on BULL demonstrate that FinSQL achieves the state-of-the-art Text-to-SQL performance at a small cost; furthermore, FinSQL can bring up to 36.64% performance improvement in scenarios requiring few-shot cross-database model transfer.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
Authors:
Ju Lin,
Niko Moritz,
Yiteng Huang,
Ruiming Xie,
Ming Sun,
Christian Fuegen,
Frank Seide
Abstract:
Wearable devices like smart glasses are approaching the compute capability to seamlessly generate real-time closed captions for live conversations. We build on our recently introduced directional Automatic Speech Recognition (ASR) for smart glasses that have microphone arrays, which fuses multi-channel ASR with serialized output training, for wearer/conversation-partner disambiguation as well as s…
▽ More
Wearable devices like smart glasses are approaching the compute capability to seamlessly generate real-time closed captions for live conversations. We build on our recently introduced directional Automatic Speech Recognition (ASR) for smart glasses that have microphone arrays, which fuses multi-channel ASR with serialized output training, for wearer/conversation-partner disambiguation as well as suppression of cross-talk speech from non-target directions and noise.
When ASR work is part of a broader system-development process, one may be faced with changes to microphone geometries as system development progresses.
This paper aims to make multi-channel ASR insensitive to limited variations of microphone-array geometry. We show that a model trained on multiple similar geometries is largely agnostic and generalizes well to new geometries, as long as they are not too different. Furthermore, training the model this way improves accuracy for seen geometries by 15 to 28\% relative. Lastly, we refine the beamforming by a novel Non-Linearly Constrained Minimum Variance criterion.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Active Control of Ballistic Orbital Transport
Authors:
Sobhan Subhra Mishra,
James Lourembam,
Dennis **g Xiong Lin,
Ranjan Singh
Abstract:
Orbital current, defined as the orbital character of Bloch states in solids, can ballistically travel with larger coherence length through a broader range of materials than its spin counterpart, facilitating a robust, higher density and energy efficient information transmission. Hence, active control of orbital transport plays a pivotal role in propelling the progress of the evolving field of quan…
▽ More
Orbital current, defined as the orbital character of Bloch states in solids, can ballistically travel with larger coherence length through a broader range of materials than its spin counterpart, facilitating a robust, higher density and energy efficient information transmission. Hence, active control of orbital transport plays a pivotal role in propelling the progress of the evolving field of quantum information technology. Unlike spin angular momentum, orbital angular momentum (OAM), couples to phonon angular momentum (PAM) efficiently via orbital-crystal momentum (L-k) coupling, giving us the opportunity to control orbital transport through crystal field potential mediated angular momentum transfer. Here, leveraging the orbital dependant efficient L-k coupling, we have experimentally demonstrated the active control of orbital current velocity using THz emission spectroscopy. Our findings include the identification of a critical energy density required to overcome collisions in orbital transport, enabling a swifter flow of orbital current. The capability to actively control the ballistic orbital transport lays the groundwork for the development of ultrafast devices capable of efficiently transmitting information over extended distance.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Partial entanglement network and bulk geometry reconstruction in AdS/CFT
Authors:
Jiong Lin,
Yizhou Lu,
Qiang Wen
Abstract:
In the context of Anti-de Sitter / Conformal Field Theory (AdS/CFT) correspondence, we present a general scheme to reconstruct bulk geometric quantities in terms of a specific measure of the entanglement structure on the boundary CFT, the partial entanglement entropy (PEE). The PEE between any two points $\mathcal{I}(\vec x, \vec y)$ is the fundamental building block of the PEE structure. It can b…
▽ More
In the context of Anti-de Sitter / Conformal Field Theory (AdS/CFT) correspondence, we present a general scheme to reconstruct bulk geometric quantities in terms of a specific measure of the entanglement structure on the boundary CFT, the partial entanglement entropy (PEE). The PEE between any two points $\mathcal{I}(\vec x, \vec y)$ is the fundamental building block of the PEE structure. It can be geometrized into a bulk geodesic connecting the two boundary points $\vec x$ and $\vec y$, which we refer to as the PEE thread. Thus, we ave a network of the PEE threads in the bulk with a density of the threads determined by the boundary PEE structure \cite{Lin:2023rbd}.We demonstrate that, for any static boundary region $A$, the homologous surface $Σ_{A}$ that has the minimal flux of the PEE threads passing through it is exactly the Ryu-Takayanagi (RT) surface of $A$, and the minimal flux coincides with the holographic entanglement entropy of $A$.Furthermore, we show that the strength of the PEE flux at any bulk point along any direction is $1/4G$. Based on this observation, we prove that any area element in the bulk can be reconstructed by the PEE threads passing through it, which corresponds to a set of two-point PEEs on the CFT.
△ Less
Submitted 23 January, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
Nonconvex Deterministic Matrix Completion by Projected Gradient Descent Methods
Authors:
Hang Xu,
Song Li,
Junhong Lin
Abstract:
We study deterministic matrix completion problem, i.e., recovering a low-rank matrix from a few observed entries where the sampling set is chosen as the edge set of a Ramanujan graph. We first investigate projected gradient descent (PGD) applied to a Burer-Monteiro least-squares problem and show that it converges linearly to the incoherent ground-truth with respect to the condition number \k{appa}…
▽ More
We study deterministic matrix completion problem, i.e., recovering a low-rank matrix from a few observed entries where the sampling set is chosen as the edge set of a Ramanujan graph. We first investigate projected gradient descent (PGD) applied to a Burer-Monteiro least-squares problem and show that it converges linearly to the incoherent ground-truth with respect to the condition number \k{appa} of ground-truth under a benign initialization and large samples. We next apply the scaled variant of PGD to deal with the ill-conditioned case when \k{appa} is large, and we show the algorithm converges at a linear rate independent of the condition number \k{appa} under similar conditions. Finally, we provide numerical experiments to corroborate our results.
△ Less
Submitted 12 January, 2024;
originally announced January 2024.
-
A spectral data release for 104 Type II Supernovae from the Tsinghua Supernova Group
Authors:
Han Lin,
Xiaofeng Wang,
Jujia Zhang,
Danfeng Xiang,
Tianmeng Zhang,
Xulin Zhao,
Xinghan Zhang,
Hanna Sai,
Liming Rui,
Jun Mo,
Gaobo Xi,
Fang Huang,
Xue Li,
Yongzhi Cai,
Weili Lin,
Jie Lin,
Chengyuan Wu,
Jicheng Zhang,
Zhihao Chen,
Zhitong Li,
Wenxiong Li,
Linyi Li,
Kaicheng Zhang,
Cheng Miao,
Juncheng Chen
, et al. (11 additional authors not shown)
Abstract:
We present 206 unpublished optical spectra of 104 type II supernovae obtained by the Xinglong 2.16m telescope and Lijiang 2.4m telescope during the period from 2011 to 2018, spanning the phases from about 1 to 200 days after the SN explosion. The spectral line identifications, evolution of line velocities and pseudo equivalent widths, as well as correlations between some important spectral paramet…
▽ More
We present 206 unpublished optical spectra of 104 type II supernovae obtained by the Xinglong 2.16m telescope and Lijiang 2.4m telescope during the period from 2011 to 2018, spanning the phases from about 1 to 200 days after the SN explosion. The spectral line identifications, evolution of line velocities and pseudo equivalent widths, as well as correlations between some important spectral parameters are presented. Our sample displays a large range in expansion velocities. For instance, the Fe~{\sc ii} $5169$ velocities measured from spectra at $t\sim 50$ days after the explosion vary from ${\rm 2000\ km\ s^{-1}}$ to ${\rm 5500\ km\ s^{-1}}$, with an average value of ${\rm 3872 \pm 949\ km\ s^{-1}}$. Power-law functions can be used to fit the velocity evolution, with the power-law exponent quantifying the velocity decline rate. We found an anticorrelation existing between H$β$ velocity at mid-plateau phase and its velocity decay exponent, SNe II with higher velocities tending to have smaller velocity decay rate. Moreover, we noticed that the velocity decay rate inferred from the Balmer lines (i.e., H$α$ and H$β$) have moderate correlations with the ratio of absorption to emission for H$α$ (a/e). In our sample, two objects show possibly flash-ionized features at early phases. Besides, we noticed that multiple high-velocity components may exist on the blue side of hydrogen lines of SN 2013ab, possibly suggesting that these features arise from complex line forming region. All our spectra can be found in WISeREP and Zenodo.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Multi-User Chat Assistant (MUCA): a Framework Using LLMs to Facilitate Group Conversations
Authors:
Manqing Mao,
Paishun Ting,
Yijian Xiang,
Mingyang Xu,
Julia Chen,
Jianzhe Lin
Abstract:
Recent advancements in large language models (LLMs) have provided a new avenue for chatbot development, while most existing research has primarily centered on single-user chatbots that focus on deciding "What" to answer after user inputs. In this paper, we identified that multi-user chatbots have more complex 3W design dimensions -- "What" to say, "When" to respond, and "Who" to answer. Additional…
▽ More
Recent advancements in large language models (LLMs) have provided a new avenue for chatbot development, while most existing research has primarily centered on single-user chatbots that focus on deciding "What" to answer after user inputs. In this paper, we identified that multi-user chatbots have more complex 3W design dimensions -- "What" to say, "When" to respond, and "Who" to answer. Additionally, we proposed Multi-User Chat Assistant (MUCA), which is an LLM-based framework for chatbots specifically designed for group discussions. MUCA consists of three main modules: Sub-topic Generator, Dialog Analyzer, and Utterance Strategies Arbitrator. These modules jointly determine suitable response contents, timings, and the appropriate recipients. To make the optimizing process for MUCA easier, we further propose an LLM-based Multi-User Simulator (MUS) that can mimic real user behavior. This enables faster simulation of a conversation between the chatbot and simulated users, making the early development of the chatbot framework much more efficient. MUCA demonstrates effectiveness, including appropriate chime-in timing, relevant content, and improving user engagement, in group conversations with a small to medium number of participants, as evidenced by case studies and experimental results from user studies.
△ Less
Submitted 16 February, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
North-South asymmetries in the Galactic thin disk associated with the vertical phase spiral as seen using LAMOST-Gaia stars
Authors:
Jun Lin,
Rui Guo,
Sarah A. Bird,
Haijun Tian,
Chao Liu,
Chris Flynn,
Gaochao Liu,
Sheng Cui
Abstract:
We select 1,052,469 (754,635) thin disk stars from {\it Gaia} eDR3 and LAMOST DR7 in the range of Galactocentric radius $R$ (guiding center radius $R_\mathrm{g}$) from 8 to 11\,kpc to investigate the asymmetries between the North and South of the disk midplane. More specifically we analyze the vertical velocity dispersion profiles ($σ_{v_{z}}(z$)) in different bins of $R$ ($R_\mathrm{g}$) and…
▽ More
We select 1,052,469 (754,635) thin disk stars from {\it Gaia} eDR3 and LAMOST DR7 in the range of Galactocentric radius $R$ (guiding center radius $R_\mathrm{g}$) from 8 to 11\,kpc to investigate the asymmetries between the North and South of the disk midplane. More specifically we analyze the vertical velocity dispersion profiles ($σ_{v_{z}}(z$)) in different bins of $R$ ($R_\mathrm{g}$) and $[\mathrm{Fe/H}]$. We find troughs in the profiles of $σ_{v_{z}}(z)$ located in both the North ($z \sim 0.7$\,kpc) and South ($z \sim -0.5$\,kpc) of the disk at all radial and chemical bins studied. The difference between the Northern and Southern vertical velocity dispersion profiles ($Δσ_{v_{z}}(|z|)$) shows a shift between curves of different $R$ and $R_\mathrm{g}$. A similar shift exists in these NS asymmetry profiles further divided into different $[\mathrm{Fe/H}]$ ranges. The sample binned with $R_\mathrm{g}$ more clearly displays the features in the velocity dispersion profiles. The shift in the peaks of the $Δσ_{v_{z}}$ profiles and the variation in the phase spiral shape binned by metallicity indicate the variation of the vertical potential profiles and the radial metallicity gradient. The wave-like signal in NS asymmetry of $σ_{v_{z}}(z)$ largely originates from phase spiral; while the NS asymmetry profiles of [Fe/H] only display a weak wave-like feature near solar radius. We perform a test particle simulation to qualitatively reproduce the observed results. A quantitative explanation of the NS asymmetry in the metallicity profile needs careful consideration of the spiral shape and the perturbation model, and we leave this for future work.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
$\bar{B}_s^0 \to D_{s1}(2460)^+ K^-, D_{s1}(2536)^+ K^-$ and the nature of the two $D_{s1}$ resonance
Authors:
Jia-Xin Lin,
Hua-Xing Chen,
Wei-Hong Liang,
Chu-Wen Xiao,
Eulogio Oset
Abstract:
Starting from the molecular picture for the $D_{s1}(2460)$ and $D_{s1}(2536)$ resonances, which are dynamically generated by the interaction of coupled channels, the most important of which are the $D^*K$ for the $D_{s1}(2460)$ and $DK^*$ for the $D_{s1}(2536)$, we evaluate the ratio of decay widths for the $\bar{B}_s^0 \to D_{s1}(2460)^+ K^-$ and $\bar{B}_s^0 \to D_{s1}(2536)^+ K^-$ decays, the l…
▽ More
Starting from the molecular picture for the $D_{s1}(2460)$ and $D_{s1}(2536)$ resonances, which are dynamically generated by the interaction of coupled channels, the most important of which are the $D^*K$ for the $D_{s1}(2460)$ and $DK^*$ for the $D_{s1}(2536)$, we evaluate the ratio of decay widths for the $\bar{B}_s^0 \to D_{s1}(2460)^+ K^-$ and $\bar{B}_s^0 \to D_{s1}(2536)^+ K^-$ decays, the latter of which has been recently investigated by the LHCb collaboration, and we obtain a ratio of the order of unity. The present results should provide an incentive for the related decay into the $D_{s1}(2460)$ resonance to be performed, which would provide valuable information on the nature of these two resonances.
△ Less
Submitted 22 April, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Authors:
Jiquan Yuan,
Xinyan Cao,
**ming Che,
Qinyuan Wang,
Sen Liang,
Wei Ren,
**long Lin,
Xixin Cao
Abstract:
Recently, AIGC image quality assessment (AIGCIQA), which aims to assess the quality of AI-generated images (AIGIs) from a human perception perspective, has emerged as a new topic in computer vision. Unlike common image quality assessment tasks where images are derived from original ones distorted by noise, blur, and compression, \textit{etc.}, in AIGCIQA tasks, images are typically generated by ge…
▽ More
Recently, AIGC image quality assessment (AIGCIQA), which aims to assess the quality of AI-generated images (AIGIs) from a human perception perspective, has emerged as a new topic in computer vision. Unlike common image quality assessment tasks where images are derived from original ones distorted by noise, blur, and compression, \textit{etc.}, in AIGCIQA tasks, images are typically generated by generative models using text prompts. Considerable efforts have been made in the past years to advance AIGCIQA. However, most existing AIGCIQA methods regress predicted scores directly from individual generated images, overlooking the information contained in the text prompts of these images. This oversight partially limits the performance of these AIGCIQA methods. To address this issue, we propose a text-image encoder-based regression (TIER) framework. Specifically, we process the generated images and their corresponding text prompts as inputs, utilizing a text encoder and an image encoder to extract features from these text prompts and generated images, respectively. To demonstrate the effectiveness of our proposed TIER method, we conduct extensive experiments on several mainstream AIGCIQA databases, including AGIQA-1K, AGIQA-3K, and AIGCIQA2023. The experimental results indicate that our proposed TIER method generally demonstrates superior performance compared to baseline in most cases.
△ Less
Submitted 11 January, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
N$^{3}$-Map**: Normal Guided Neural Non-Projective Signed Distance Fields for Large-scale 3D Map**
Authors:
Shuangfu Song,
Junqiao Zhao,
Kai Huang,
Jiaye Lin,
Chen Ye,
Tiantian Feng
Abstract:
Accurate and dense map** in large-scale environments is essential for various robot applications. Recently, implicit neural signed distance fields (SDFs) have shown promising advances in this task. However, most existing approaches employ projective distances from range data as SDF supervision, introducing approximation errors and thus degrading the map** quality. To address this problem, we i…
▽ More
Accurate and dense map** in large-scale environments is essential for various robot applications. Recently, implicit neural signed distance fields (SDFs) have shown promising advances in this task. However, most existing approaches employ projective distances from range data as SDF supervision, introducing approximation errors and thus degrading the map** quality. To address this problem, we introduce N$^{3}$-Map**, an implicit neural map** system featuring normal-guided neural non-projective signed distance fields. Specifically, we directly sample points along the surface normal, instead of the ray, to obtain more accurate non-projective distance values from range data. Then these distance values are used as supervision to train the implicit map. For large-scale map**, we apply a voxel-oriented sliding window mechanism to alleviate the forgetting issue with a bounded memory footprint. Besides, considering the uneven distribution of measured point clouds, a hierarchical sampling strategy is designed to improve training efficiency. Experiments demonstrate that our method effectively mitigates SDF approximation errors and achieves state-of-the-art map** quality compared to existing approaches.
△ Less
Submitted 29 April, 2024; v1 submitted 7 January, 2024;
originally announced January 2024.
-
Using Large Language Models to Assess Tutors' Performance in Reacting to Students Making Math Errors
Authors:
Sanjit Kakarla,
Danielle Thomas,
Jionghao Lin,
Shivang Gupta,
Kenneth R. Koedinger
Abstract:
Research suggests that tutors should adopt a strategic approach when addressing math errors made by low-efficacy students. Rather than drawing direct attention to the error, tutors should guide the students to identify and correct their mistakes on their own. While tutor lessons have introduced this pedagogical skill, human evaluation of tutors applying this strategy is arduous and time-consuming.…
▽ More
Research suggests that tutors should adopt a strategic approach when addressing math errors made by low-efficacy students. Rather than drawing direct attention to the error, tutors should guide the students to identify and correct their mistakes on their own. While tutor lessons have introduced this pedagogical skill, human evaluation of tutors applying this strategy is arduous and time-consuming. Large language models (LLMs) show promise in providing real-time assessment to tutors during their actual tutoring sessions, yet little is known regarding their accuracy in this context. In this study, we investigate the capacity of generative AI to evaluate real-life tutors' performance in responding to students making math errors. By analyzing 50 real-life tutoring dialogues, we find both GPT-3.5-Turbo and GPT-4 demonstrate proficiency in assessing the criteria related to reacting to students making errors. However, both models exhibit limitations in recognizing instances where the student made an error. Notably, GPT-4 tends to overidentify instances of students making errors, often attributing student uncertainty or inferring potential errors where human evaluators did not. Future work will focus on enhancing generalizability by assessing a larger dataset of dialogues and evaluating learning transfer. Specifically, we will analyze the performance of tutors in real-life scenarios when responding to students' math errors before and after lesson completion on this crucial tutoring skill.
△ Less
Submitted 6 January, 2024;
originally announced January 2024.
-
Certain functional identities on division rings
Authors:
Tsiu-Kwen Lee,
Jheng-Huei Lin
Abstract:
We study the functional identity $G(x)f(x)=H(x)$ on a division ring $D$, where $f \colon D\to D$ is an additive map and $G(X)\ne 0, H(X)$ are generalized polynomials in the variable $X$ with coefficients in $D$. Precisely, it is proved that either $D$ is finite-dimensional over its center or $f$ is an elementary operator. Applying the result and its consequences, we prove that if $D$ is a noncommu…
▽ More
We study the functional identity $G(x)f(x)=H(x)$ on a division ring $D$, where $f \colon D\to D$ is an additive map and $G(X)\ne 0, H(X)$ are generalized polynomials in the variable $X$ with coefficients in $D$. Precisely, it is proved that either $D$ is finite-dimensional over its center or $f$ is an elementary operator. Applying the result and its consequences, we prove that if $D$ is a noncommutative division ring of characteristic not $2$, then the only solution of additive maps $f, g$ on $D$ satisfying the identity $f(x) = x^n g(x^{-1})$ with $n\ne 2$ a positive integer is the trivial case, that is, $f=0$ and $g=0$. This extends Catalano and Merchán's result in 2023 to get a complete solution.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
Rule-Guided Joint Embedding Learning over Knowledge Graphs
Authors:
Qisong Li,
Ji Lin,
Sijia Wei,
Neng Liu
Abstract:
Recent studies focus on embedding learning over knowledge graphs, which map entities and relations in knowledge graphs into low-dimensional vector spaces. While existing models mainly consider the aspect of graph structure, there exists a wealth of contextual and literal information that can be utilized for more effective embedding learning. This paper introduces a novel model that incorporates bo…
▽ More
Recent studies focus on embedding learning over knowledge graphs, which map entities and relations in knowledge graphs into low-dimensional vector spaces. While existing models mainly consider the aspect of graph structure, there exists a wealth of contextual and literal information that can be utilized for more effective embedding learning. This paper introduces a novel model that incorporates both contextual and literal information into entity and relation embeddings by utilizing graph convolutional networks. Specifically, for contextual information, we assess its significance through confidence and relatedness metrics. In addition, a unique rule-based method is developed to calculate the confidence metric, and the relatedness metric is derived from the literal information's representations. We validate our model performance with thorough experiments on two established benchmark datasets.
△ Less
Submitted 27 January, 2024; v1 submitted 1 December, 2023;
originally announced January 2024.
-
Charged-current non-standard neutrino interactions at Daya Bay
Authors:
Daya Bay collaboration,
F. P. An,
W. D. Bai,
A. B. Balantekin,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
H. Y. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
Z. Y. Chen,
J. Cheng,
Y. C. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
X. Y. Ding
, et al. (177 additional authors not shown)
Abstract:
The full data set of the Daya Bay reactor neutrino experiment is used to probe the effect of the charged current non-standard interactions (CC-NSI) on neutrino oscillation experiments. Two different approaches are applied and constraints on the corresponding CC-NSI parameters are obtained with the neutrino flux taken from the Huber-Mueller model with a $5\%$ uncertainty. For the quantum mechanics-…
▽ More
The full data set of the Daya Bay reactor neutrino experiment is used to probe the effect of the charged current non-standard interactions (CC-NSI) on neutrino oscillation experiments. Two different approaches are applied and constraints on the corresponding CC-NSI parameters are obtained with the neutrino flux taken from the Huber-Mueller model with a $5\%$ uncertainty. For the quantum mechanics-based approach (QM-NSI), the constraints on the CC-NSI parameters $ε_{eα}$ and $ε_{eα}^{s}$ are extracted with and without the assumption that the effects of the new physics are the same in the production and detection processes, respectively. The approach based on the weak effective field theory (WEFT-NSI) deals with four types of CC-NSI represented by the parameters $[\varepsilon_{X}]_{eα}$. For both approaches, the results for the CC-NSI parameters are shown for cases with various fixed values of the CC-NSI and the Dirac CP-violating phases, and when they are allowed to vary freely. We find that constraints on the QM-NSI parameters $ε_{eα}$ and $ε_{eα}^{s}$ from the Daya Bay experiment alone can reach the order $\mathcal{O}(0.01)$ for the former and $\mathcal{O}(0.1)$ for the latter, while for WEFT-NSI parameters $[\varepsilon_{X}]_{eα}$, we obtain $\mathcal{O}(0.1)$ for both cases.
△ Less
Submitted 19 March, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
Stress-testing the coupled behavior of hybrid physics-machine learning climate simulations on an unseen, warmer climate
Authors:
Jerry Lin,
Mohamed Aziz Bhouri,
Tom Beucler,
Sungduk Yu,
Michael Pritchard
Abstract:
Accurate and computationally-viable representations of clouds and turbulence are a long-standing challenge for climate model development. Traditional parameterizations that crudely but efficiently approximate these processes are a leading source of uncertainty in long-term projected warming and precipitation patterns. Machine Learning (ML)-based parameterizations have long been hailed as a promisi…
▽ More
Accurate and computationally-viable representations of clouds and turbulence are a long-standing challenge for climate model development. Traditional parameterizations that crudely but efficiently approximate these processes are a leading source of uncertainty in long-term projected warming and precipitation patterns. Machine Learning (ML)-based parameterizations have long been hailed as a promising alternative with the potential to yield higher accuracy at a fraction of the cost of more explicit simulations. However, these ML variants are often unpredictably unstable and inaccurate in \textit{coupled} testing (i.e. in a downstream hybrid simulation task where they are dynamically interacting with the large-scale climate model). These issues are exacerbated in out-of-distribution climates. Certain design decisions such as ``climate-invariant" feature transformation for moisture inputs, input vector expansion, and temporal history incorporation have been shown to improve coupled performance, but they may be insufficient for coupled out-of-distribution generalization. If feature selection and transformations can inoculate hybrid physics-ML climate models from non-physical, out-of-distribution extrapolation in a changing climate, there is far greater potential in extrapolating from observational data. Otherwise, training on multiple simulated climates becomes an inevitable necessity. While our results show generalization benefits from these design decisions, the obtained improvment does not sufficiently preclude the necessity of using multi-climate simulated training data.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.