Search | arXiv e-print repository

Integer programs with nearly totally unimodular matrices: the cographic case

Authors: Manuel Aprile, Samuel Fiorini, Gwenaël Joret, Stefan Kober, Michał T. Seweryn, Stefan Weltge, Yelena Yuditsky

Abstract: It is a notorious open question whether integer programs (IPs), with an integer coefficient matrix $M$ whose subdeterminants are all bounded by a constant $Δ$ in absolute value, can be solved in polynomial time. We answer this question in the affirmative if we further require that, by removing a constant number of rows and columns from $M$, one obtains a submatrix $A$ that is the transpose of a ne… ▽ More It is a notorious open question whether integer programs (IPs), with an integer coefficient matrix $M$ whose subdeterminants are all bounded by a constant $Δ$ in absolute value, can be solved in polynomial time. We answer this question in the affirmative if we further require that, by removing a constant number of rows and columns from $M$, one obtains a submatrix $A$ that is the transpose of a network matrix. Our approach focuses on the case where $A$ arises from $M$ after removing $k$ rows only, where $k$ is a constant. We achieve our result in two main steps, the first related to the theory of IPs and the second related to graph minor theory. First, we derive a strong proximity result for the case where $A$ is a general totally unimodular matrix: Given an optimal solution of the linear programming relaxation, an optimal solution to the IP can be obtained by finding a constant number of augmentations by circuits of $[A\; I]$. Second, for the case where $A$ is transpose of a network matrix, we reformulate the problem as a maximum constrained integer potential problem on a graph $G$. We observe that if $G$ is $2$-connected, then it has no rooted $K_{2,t}$-minor for $t = Ω(k Δ)$. We leverage this to obtain a tree-decomposition of $G$ into highly structured graphs for which we can solve the problem locally. This allows us to solve the global problem via dynamic programming. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09462 [pdf, other]

Spin-filter tunneling detection of antiferromagnetic resonance with electrically-tunable dam**

Authors: Thow Min Jerald Cham, Daniel G. Chica, Kenji Watanabe, Takashi Taniguchi, Xavier Roy, Yunqiu Kelly Luo, Daniel C. Ralph

Abstract: Antiferromagnetic spintronics offers the potential for higher-frequency operations compared to ferromagnetic spintronics and improved insensitivity to magnetic fields. However, previous electrical techniques to detect antiferromagnetic dynamics have required millimeter-scale samples to achieve measurable signals. Here we demonstrate direct electrical detection of antiferromagnetic resonance in dev… ▽ More Antiferromagnetic spintronics offers the potential for higher-frequency operations compared to ferromagnetic spintronics and improved insensitivity to magnetic fields. However, previous electrical techniques to detect antiferromagnetic dynamics have required millimeter-scale samples to achieve measurable signals. Here we demonstrate direct electrical detection of antiferromagnetic resonance in devices 1000 times smaller using spin-filter tunneling in micron-scale PtTe$_2$/bilayer CrSBr/graphite junctions in which the tunnel barrier is the van der Waals antiferromaget CrSBr. This sample geometry allows not only efficient detection, but also electrical control of the antiferromagnetic resonance through spin-orbit torque from the PtTe$_2$ electrode. The ability to efficiently detect and control antiferromagnetic resonance provides the means to make detailed studies of the physics governing these high-frequency dynamics and to pursue applications including radiation sources, modulators, and detectors. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09457 [pdf, other]

How coronal mass ejections are influenced by the morphology and toroidal flux of their source magnetic flux ropes?

Authors: J. H. Guo, L. Linan, S. Poedts, Y. Guo, B. Schmieder, A. Lani, Y. W. Ni, M. Brchnelova, B. Perri, T. Baratashvili, S. T. Li, P. F. Chen

Abstract: Coronal mass ejections (CMEs) stand as intense eruptions of magnetized plasma from the Sun, playing a pivotal role in driving significant changes of the heliospheric environment. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space weather forecasting. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space… ▽ More Coronal mass ejections (CMEs) stand as intense eruptions of magnetized plasma from the Sun, playing a pivotal role in driving significant changes of the heliospheric environment. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space weather forecasting. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space weather forecasting. The primary objective of this paper is to establish a connection between CMEs and their progenitors in solar source regions, enabling us to infer the magnetic structures of CMEs before their full development. To this end, we create a dataset comprising a magnetic flux rope series with varying projection shapes, sizes and toroidal fluxes, using the Regularized Biot-Savart Laws (RBSL). Thereafter, we simulate the propagation of these flux ropes from the solar surface to a distance of 25$R_{\odot}$ with our global coronal MHD model which is named COCONUT. Our parametric survey reveals significant impacts of source flux ropes on the consequent CMEs. We find that the projection shape can influence the magnetic structures of CMEs at 20$R_{\odot}$, albeit with minimal impacts on the propagation speed. However, these impacts diminish as source flux ropes become fat. In terms of toroidal flux, our simulation results demonstrate a pronounced correlation with the propagation speed of CMEs, as well as the successfulness in erupting. This work builds the bridge between the CMEs in the outer corona and their progenitors in solar source regions. Our parametric survey suggests that the projection shape, cross-section radius and toroidal flux of source flux ropes are crucial parameters in predicting magnetic structures and propagation speed of CMEs, providing valuable insights for space weather prediction. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 11 pages, 10 figrues, accepted for publication by A&A

arXiv:2407.09453 [pdf, other]

Weight Block Sparsity: Training, Compilation, and AI Engine Accelerators

Authors: Paolo D'Alberto, Taehee Jeong, Akshai Jain, Shreyas Manjunath, Mrinal Sarmah, Samuel Hsu Yaswanth Raparti, Nitesh Pipralia

Abstract: Nowadays, increasingly larger Deep Neural Networks (DNNs) are being developed, trained, and utilized. These networks require significant computational resources, putting a strain on both advanced and limited devices. Our solution is to implement {\em weight block sparsity}, which is a structured sparsity that is friendly to hardware. By zeroing certain sections of the convolution and fully connect… ▽ More Nowadays, increasingly larger Deep Neural Networks (DNNs) are being developed, trained, and utilized. These networks require significant computational resources, putting a strain on both advanced and limited devices. Our solution is to implement {\em weight block sparsity}, which is a structured sparsity that is friendly to hardware. By zeroing certain sections of the convolution and fully connected layers parameters of pre-trained DNN models, we can efficiently speed up the DNN's inference process. This results in a smaller memory footprint, faster communication, and fewer operations. Our work presents a vertical system that allows for the training of convolution and matrix multiplication weights to exploit 8x8 block sparsity on a single GPU within a reasonable amount of time. Compilers recognize this sparsity and use it for both data compaction and computation splitting into threads. Blocks like these take full advantage of both spatial and temporal locality, paving the way for fast vector operations and memory reuse. By using this system on a Resnet50 model, we were able to reduce the weight by half with minimal accuracy loss, resulting in a two-times faster inference speed. We will present performance estimates using accurate and complete code generation for AIE2 configuration sets (AMD Versal FPGAs) with Resnet50, Inception V3, and VGG16 to demonstrate the necessary synergy between hardware overlay designs and software stacks for compiling and executing machine learning applications. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 12 pages, 10 figures, 1 table

ACM Class: C.5; D.3.4

arXiv:2407.09451 [pdf, other]

Benchmarking Large Neighborhood Search for Multi-Agent Path Finding

Authors: Jiaqi Tan, Yudong Luo, Jiaoyang Li, Hang Ma

Abstract: Multi-Agent Path Finding (MAPF) aims to arrange collision-free goal-reaching paths for a group of agents. Anytime MAPF solvers based on large neighborhood search (LNS) have gained prominence recently due to their flexibility and scalability. Neighborhood selection strategy is crucial to the success of MAPF-LNS and a flurry of methods have been proposed. However, several pitfalls exist and hinder a… ▽ More Multi-Agent Path Finding (MAPF) aims to arrange collision-free goal-reaching paths for a group of agents. Anytime MAPF solvers based on large neighborhood search (LNS) have gained prominence recently due to their flexibility and scalability. Neighborhood selection strategy is crucial to the success of MAPF-LNS and a flurry of methods have been proposed. However, several pitfalls exist and hinder a comprehensive evaluation of these new methods, which mainly include: 1) Lower than actual or incorrect baseline performance; 2) Lack of a unified evaluation setting and criterion; 3) Lack of a codebase or executable model for supervised learning methods. To overcome these challenges, we conduct a fair comparison across prominent methods on the same benchmark and hyperparameter search settings. Additionally, we propose a simple neighborhood selection strategy which marks a clear advancement in terms of runtime efficiency in large maps with large number of agents. Our benchmarking evaluation promotes new challenges for existing learning based methods and presents opportunities for future research when machine learning is integrated with MAPF-LNS. Code and data are available at https://github.com/ChristinaTan0704/mapf-lns-benchmark. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09446 [pdf, other]

Neutron-quark stars: Discerning viable alternatives for the higher-density part of the equation of state of compact stars

Authors: Sudipta Hensh, Yong-Jia Huang, Toru Kojo, Luca Baiotti, Kentaro Takami, Shigehiro Nagataki, Hajime Sotani

Abstract: By taking into account the latest observations and theoretical constraints, we investigate the merger and post-merger of binary neutron stars (NSs) with numerical simulations employing hadronic and hybrid equations of state (EOSs). We name our hybrid stars Neutron-quark stars (NQS), because the transition from hadrons to quarks starts at a density lower than the central density of… ▽ More By taking into account the latest observations and theoretical constraints, we investigate the merger and post-merger of binary neutron stars (NSs) with numerical simulations employing hadronic and hybrid equations of state (EOSs). We name our hybrid stars Neutron-quark stars (NQS), because the transition from hadrons to quarks starts at a density lower than the central density of $\sim 1 M_{\odot}$ stars. The two scenarios of transition to quark matter, a strong first-order phase transition (1PT) or a crossover, feature either a drop to almost zero or a rapid increase (peak) in the square of the sound speed $c_s^2$, implying a softening or stiffening during the transition, respectively. Although the properties of NQSs in equilibrium may not be distinguishable from those of NSs, we find that the post-merger gravitational-wave (GW) main frequency $f_2$ for the crossover scenario is generally lower than that of hadronic models with the same tidal deformability, indicating that a crossover transition is in principle observable when both the inspiral and post-merger signals are detected. Since it is viable according to current multi-messenger constraints, we also consider an EOS with a 1PT taking place at 1.8 times the nuclear saturation density ($n_0$), with a stiff quark EOS ($c_s^2 = 2/3~c^2$) after the transition. It is the first time that such a binary merger is studied numerically in full general relativity. Although its $f_2$ is $\sim 300$ Hz higher than that of its baseline, the relation between $f_2$ and the tidal deformability of inspiralling stars is close to that for hadronic EOSs. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 7+7 pages, 5+9 figures

Report number: RIKEN-iTHEMS-Report-24

arXiv:2407.09432 [pdf, other]

International Astrophysical Consortium for High-energy Calibration: Summary of the 15th IACHEC Workshop

Authors: K. K. Madsen, V. Burwitz, K. Forster, C. E. Grant, M. Guainazzi, V. Kashyap, H. L. Marshall, E. D. Miller, L. Natalucci, P. P. Plucinsky, Y. Terada

Abstract: In this report, we summarize the activities of the International Astronomical Consortium for High Energy Calibration (IACHEC) from the 15th IACHEC Workshop in Pelham, Germany. Sixty scientists directly involved in the calibration of operational and future high-energy missions gathered for 3.5 days to discuss the status of the cross-calibration between the current international complement of X-ray… ▽ More In this report, we summarize the activities of the International Astronomical Consortium for High Energy Calibration (IACHEC) from the 15th IACHEC Workshop in Pelham, Germany. Sixty scientists directly involved in the calibration of operational and future high-energy missions gathered for 3.5 days to discuss the status of the cross-calibration between the current international complement of X-ray observatories and the possibilities to improve it. This summary consists of reports from the Working Groups with topics ranging across the identification and characterization of standard calibration sources, multi-observatory cross-calibration campaigns, appropriate and new statistical techniques, calibration of instruments and characterization of background, preservation of knowledge, and results for the benefit of the astronomical community. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 10 pages, 1 figure. arXiv admin note: text overlap with arXiv:2111.01613

arXiv:2407.09431 [pdf, other]

Rethinking temporal self-similarity for repetitive action counting

Authors: Yanan Luo, **hui Yi, Yazan Abu Farha, Moritz Wolter, Juergen Gall

Abstract: Counting repetitive actions in long untrimmed videos is a challenging task that has many applications such as rehabilitation. State-of-the-art methods predict action counts by first generating a temporal self-similarity matrix (TSM) from the sampled frames and then feeding the matrix to a predictor network. The self-similarity matrix, however, is not an optimal input to a network since it discards… ▽ More Counting repetitive actions in long untrimmed videos is a challenging task that has many applications such as rehabilitation. State-of-the-art methods predict action counts by first generating a temporal self-similarity matrix (TSM) from the sampled frames and then feeding the matrix to a predictor network. The self-similarity matrix, however, is not an optimal input to a network since it discards too much information from the frame-wise embeddings. We thus rethink how a TSM can be utilized for counting repetitive actions and propose a framework that learns embeddings and predicts action start probabilities at full temporal resolution. The number of repeated actions is then inferred from the action start probabilities. In contrast to current approaches that have the TSM as an intermediate representation, we propose a novel loss based on a generated reference TSM, which enforces that the self-similarity of the learned frame-wise embeddings is consistent with the self-similarity of repeated actions. The proposed framework achieves state-of-the-art results on three datasets, i.e., RepCount, UCFRep, and Countix. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: Accepted to ICIP 2024

arXiv:2407.09429 [pdf, other]

Open (Clinical) LLMs are Sensitive to Instruction Phrasings

Authors: Alberto Mario Ceballos Arroyo, Monica Munnangi, Jiuding Sun, Karen Y. C. Zhang, Denis Jered McInerney, Byron C. Wallace, Silvio Amir

Abstract: Instruction-tuned Large Language Models (LLMs) can perform a wide range of tasks given natural language instructions to do so, but they are sensitive to how such instructions are phrased. This issue is especially concerning in healthcare, as clinicians are unlikely to be experienced prompt engineers and the potential consequences of inaccurate outputs are heightened in this domain. This raises a… ▽ More Instruction-tuned Large Language Models (LLMs) can perform a wide range of tasks given natural language instructions to do so, but they are sensitive to how such instructions are phrased. This issue is especially concerning in healthcare, as clinicians are unlikely to be experienced prompt engineers and the potential consequences of inaccurate outputs are heightened in this domain. This raises a practical question: How robust are instruction-tuned LLMs to natural variations in the instructions provided for clinical NLP tasks? We collect prompts from medical doctors across a range of tasks and quantify the sensitivity of seven LLMs -- some general, others specialized -- to natural (i.e., non-adversarial) instruction phrasings. We find that performance varies substantially across all models, and that -- perhaps surprisingly -- domain-specific models explicitly trained on clinical data are especially brittle, compared to their general domain counterparts. Further, arbitrary phrasing differences can affect fairness, e.g., valid but distinct instructions for mortality prediction yield a range both in overall performance, and in terms of differences between demographic groups. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: To appear at BioNLP, ACL 2024

arXiv:2407.09427 [pdf, other]

Flow-Based Generative Emulation of Grids of Stellar Evolutionary Models

Authors: Marc Hon, Yaguang Li, Joel Ong

Abstract: We present a flow-based generative approach to emulate grids of stellar evolutionary models. By interpreting the input parameters and output properties of these models as multi-dimensional probability distributions, we train conditional normalizing flows to learn and predict the complex relationships between grid inputs and outputs in the form of conditional joint distributions. Leveraging the exp… ▽ More We present a flow-based generative approach to emulate grids of stellar evolutionary models. By interpreting the input parameters and output properties of these models as multi-dimensional probability distributions, we train conditional normalizing flows to learn and predict the complex relationships between grid inputs and outputs in the form of conditional joint distributions. Leveraging the expressive power and versatility of these flows, we showcase their ability to emulate a variety of evolutionary tracks and isochrones across a continuous range of input parameters. In addition, we describe a simple Bayesian approach for estimating stellar parameters using these flows and demonstrate its application to asteroseismic datasets of red giants observed by the Kepler mission. By applying this approach to red giants in open clusters NGC 6791 and NGC 6819, we illustrate how large age uncertainties can arise when fitting only to global asteroseismic and spectroscopic parameters without prior information on initial helium abundances and mixing length parameter values. We also conduct inference using the flow at a large scale by determining revised estimates of masses and radii for 15,388 field red giants. These estimates show improved agreement with results from existing grid-based modelling, reveal distinct population-level features in the red clump, and suggest that the masses of Kepler red giants previously determined using the corrected asteroseismic scaling relations have been overestimated by 5-10%. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 27 pages, 18 figures. Accepted for publication in ApJ. Code, animation, and interactive visualizations are available at https://github.com/mtyhon/modelflows/. Table 4 is also available as ancillary file attached to this submission

arXiv:2407.09426 [pdf, ps, other]

Infinitesimal conformal restriction and unitarizing measures for Virasoro algebra

Authors: Maria Gordina, Wei Qian, Yilin Wang

Abstract: We use the SLE$_κ$ loop measure to construct a natural representation of the Virasoro algebra of central charge $c = c(κ) \le 1$. In particular, we introduce a non-degenerate bilinear Hermitian form (not positive-definite) using the SLE loop measure and show that the representation is (indefinite) unitary. Our proof relies on the infinitesimal conformal restriction property of the SLE loop measure… ▽ More We use the SLE$_κ$ loop measure to construct a natural representation of the Virasoro algebra of central charge $c = c(κ) \le 1$. In particular, we introduce a non-degenerate bilinear Hermitian form (not positive-definite) using the SLE loop measure and show that the representation is (indefinite) unitary. Our proof relies on the infinitesimal conformal restriction property of the SLE loop measure. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 21 pages

arXiv:2407.09424 [pdf, other]

TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models

Authors: Hang Zou, Qiyang Zhao, Yu Tian, Lina Bariah, Faouzi Bader, Thierry Lestable, Merouane Debbah

Abstract: Large Language Models (LLMs) have the potential to revolutionize the Sixth Generation (6G) communication networks. However, current mainstream LLMs generally lack the specialized knowledge in telecom domain. In this paper, for the first time, we propose a pipeline to adapt any general purpose LLMs to a telecom-specific LLMs. We collect and build telecom-specific pre-train dataset, instruction data… ▽ More Large Language Models (LLMs) have the potential to revolutionize the Sixth Generation (6G) communication networks. However, current mainstream LLMs generally lack the specialized knowledge in telecom domain. In this paper, for the first time, we propose a pipeline to adapt any general purpose LLMs to a telecom-specific LLMs. We collect and build telecom-specific pre-train dataset, instruction dataset, preference dataset to perform continual pre-training, instruct tuning and alignment tuning respectively. Besides, due to the lack of widely accepted evaluation benchmarks in telecom domain, we extend existing evaluation benchmarks and proposed three new benchmarks, namely, Telecom Math Modeling, Telecom Open QnA and Telecom Code Tasks. These new benchmarks provide a holistic evaluation of the capabilities of LLMs including math modeling, Open-Ended question answering, code generation, infilling, summarization and analysis in telecom domain. Our fine-tuned LLM TelecomGPT outperforms state of the art (SOTA) LLMs including GPT-4, Llama-3 and Mistral in Telecom Math Modeling benchmark significantly and achieve comparable performance in various evaluation benchmarks such as TeleQnA, 3GPP technical documents classification, telecom code summary and generation and infilling. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: arXiv admin note: text overlap with arXiv:1303.2654 by other authors

arXiv:2407.09418 [pdf, other]

Efficient energy-stable parametric finite element methods for surface diffusion flow and applications in solid-state dewetting

Authors: Meng Li, Yihang Guo, **gjiang Bi

Abstract: Currently existing energy-stable parametric finite element methods for surface diffusion flow and other flows are usually limited to first-order accuracy in time. Designing a high-order algorithm for geometric flows that can also be theoretically proven to be energy-stable poses a significant challenge. Motivated by the new scalar auxiliary variable approach [F.Huang, J.Shen, Z.Yang, SIAM J. SCI.… ▽ More Currently existing energy-stable parametric finite element methods for surface diffusion flow and other flows are usually limited to first-order accuracy in time. Designing a high-order algorithm for geometric flows that can also be theoretically proven to be energy-stable poses a significant challenge. Motivated by the new scalar auxiliary variable approach [F.Huang, J.Shen, Z.Yang, SIAM J. SCI. Comput., 42 (2020), pp. A2514-A2536], we propose novel energy-stable parametric finite element approximations for isotropic/anisotropic surface diffusion flows, achieving both first-order and second-order accuracy in time. Additionally, we apply the algorithms to simulate the solid-state dewetting of thin films. Finally, extensive numerical experiments validate the accuracy, energy stability, and efficiency of our developed numerical methods. The designed algorithms in this work exhibit strong versatility, as they can be readily extended to other high-order time discretization methods (e.g., BDFk schemes). Meanwhile, the algorithms achieve remarkable computational efficiency and maintain excellent mesh quality. More importantly, the algorithm can be theoretically proven to possess unconditional energy stability, with the energy nearly equal to the original energy. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09417 [pdf, other]

Mitigating Entity-Level Hallucination in Large Language Models

Authors: Weihang Su, Yichen Tang, Qingyao Ai, Changyue Wang, Zhi**g Wu, Yiqun Liu

Abstract: The emergence of Large Language Models (LLMs) has revolutionized how users access information, shifting from traditional search engines to direct question-and-answer interactions with LLMs. However, the widespread adoption of LLMs has revealed a significant challenge known as hallucination, wherein LLMs generate coherent yet factually inaccurate responses. This hallucination phenomenon has led to… ▽ More The emergence of Large Language Models (LLMs) has revolutionized how users access information, shifting from traditional search engines to direct question-and-answer interactions with LLMs. However, the widespread adoption of LLMs has revealed a significant challenge known as hallucination, wherein LLMs generate coherent yet factually inaccurate responses. This hallucination phenomenon has led to users' distrust in information retrieval systems based on LLMs. To tackle this challenge, this paper proposes Dynamic Retrieval Augmentation based on hallucination Detection (DRAD) as a novel method to detect and mitigate hallucinations in LLMs. DRAD improves upon traditional retrieval augmentation by dynamically adapting the retrieval process based on real-time hallucination detection. It features two main components: Real-time Hallucination Detection (RHD) for identifying potential hallucinations without external models, and Self-correction based on External Knowledge (SEK) for correcting these errors using external knowledge. Experiment results show that DRAD demonstrates superior performance in both detecting and mitigating hallucinations in LLMs. All of our code and data are open-sourced at https://github.com/oneal2000/EntityHallucination. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09410 [pdf, ps, other]

Post-Newtonian Tests of Gravitational Quantum Field Theory with Spin and Scaling Gauge Symmetry

Authors: Ying-jian Chen, Peng Xu, Yue-liang Wu

Abstract: A self-consistent gravitational quantum field theory, with gravitational force treated on the same footing as the other three fundamental interactions, was established recently. The gravidynamics predicted by such a theory could lead to important implications, and the comparisons with experimental results may provide us opportunities to test such new approach of gravity based on the framework of t… ▽ More A self-consistent gravitational quantum field theory, with gravitational force treated on the same footing as the other three fundamental interactions, was established recently. The gravidynamics predicted by such a theory could lead to important implications, and the comparisons with experimental results may provide us opportunities to test such new approach of gravity based on the framework of the quantum field theory of gauge interactions. In this work, we start with the effective field equation of the gravitational quantum field theory, and then solve the perturbative gravigauge field order by order up to the 1st post-Newtonian level under the assumption of a simplified energy-momentum tensor of perfect fluids. Having the constraints on the related post-Newtonian parameters from the most up-to-date observational data, the new bound on the combined coupling in the gravitational quantum field theory $|γ_G(α_G-α_W/2)| \leq (2.4\pm30)\times10^{-6}$ is obtained. Under such bound, we found that the new gravitational quantum field theory successfully passed and found no conflict with the contemporary keynote Solar system experiments of gravity. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 7 pages

arXiv:2407.09403 [pdf, ps, other]

A short proof of the Goldberg-Seymour conjecture

Authors: Guantao Chen, Yanli Hao, Xingxing Yu, Wenan Zang

Abstract: For a multigraph $G$, $χ'(G)$ denotes the chromatic index of $G$, $Δ(G)$ the maximum degree of $G$, and $Γ(G) = \max\left\{\left\lceil \frac{2|E(H)|}{|V(H)|-1} \right\rceil: H \subseteq G \text{ and } |V(H)| \text{ odd}\right\}$. As a generalization of Vizing's classical coloring result for simple graphs, the Goldberg-Seymour conjecture, posed in the 1970s, states that $χ'(G)=\max\{Δ(G), Γ(G)\}$ o… ▽ More For a multigraph $G$, $χ'(G)$ denotes the chromatic index of $G$, $Δ(G)$ the maximum degree of $G$, and $Γ(G) = \max\left\{\left\lceil \frac{2|E(H)|}{|V(H)|-1} \right\rceil: H \subseteq G \text{ and } |V(H)| \text{ odd}\right\}$. As a generalization of Vizing's classical coloring result for simple graphs, the Goldberg-Seymour conjecture, posed in the 1970s, states that $χ'(G)=\max\{Δ(G), Γ(G)\}$ or $χ'(G)=\max\{Δ(G) + 1, Γ(G)\}$. Hochbaum, Nishizeki, and Shmoys further conjectured in 1986 that such a coloring can be found in polynomial time. A long proof of the Goldberg-Seymour conjecture was announced in 2019 by Chen, **g, and Zang, and one case in that proof was eliminated recently by **g (but the proof is still long); and neither proof has been verified. In this paper, we give a proof of the Goldberg-Seymour conjecture that is significantly shorter and confirm the Hochbaum-Nishizeki-Shmoys conjecture by providing an $O(|V|^5|E|^3)$ time algorithm for finding a $\max\{Δ(G) + 1, Γ(G)\}$-edge-coloring of $G$. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09400 [pdf, other]

Cosmic topology. Part IIIa. Microwave background parity violation without parity-violating microphysics

Authors: Amirhossein Samandar, Javier Carrón Duque, Craig J. Copi, Mikel Martin Barandiaran, Deyan P. Mihaylov, Thiago S. Pereira, Glenn D. Starkman, Yashar Akrami, Stefano Anselmi, Fernando Cornet-Gomez, Johannes R. Eskilt, Andrew H. Jaffe, Arthur Kosowsky, Andrius Tamosiunas

Abstract: The standard cosmological model, which assumes statistical isotropy and parity invariance, predicts the absence of correlations between even-parity and odd-parity observables of the cosmic microwave background (CMB). Contrary to these predictions, large-angle CMB temperature anomalies generically involve correlations between even-$\ell$ and odd-$\ell$ angular power spectrum $C_\ell$, while recent… ▽ More The standard cosmological model, which assumes statistical isotropy and parity invariance, predicts the absence of correlations between even-parity and odd-parity observables of the cosmic microwave background (CMB). Contrary to these predictions, large-angle CMB temperature anomalies generically involve correlations between even-$\ell$ and odd-$\ell$ angular power spectrum $C_\ell$, while recent analyses of CMB polarization have revealed non-zero equal-$\ell$ $EB$ correlations. These findings challenge the conventional understanding, suggesting deviations from statistical isotropy, violations of parity, or both. Cosmic topology, which involves changing only the boundary conditions of space relative to standard cosmology, offers a compelling framework to potentially account for such parity-violating observations. Topology inherently breaks statistical isotropy, and can also break homogeneity and parity, providing a natural paradigm for explaining observations of parity-breaking observables without the need to add parity violation to the underlying microphysics. Our investigation delves into the harmonic space implications of topology for CMB correlations, using as an illustrative example $EB$ correlations generated by tensor perturbations under both parity-preserving and parity-violating scenarios. Consequently, these findings not only challenge the foundational assumptions of the standard cosmological model but also open new avenues for exploring the topological structure of the Universe through CMB observations. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 20 pages, 4 figures

Report number: IFT-UAM/CSIC-24-104

arXiv:2407.09399 [pdf, other]

Rich and diverse molecular gas environments of closely-separated dual quasars viewed by ALMA

Authors: Shenli Tang, John D. Silverman, Zhaoxuan Liu, Manda Banerji, Tomoko Suzuki, Seiji Fujimoto, Andy Goulding, Masatoshi Imanishi, Toshihiro Kawaguchi, Connor Bottrell, Tilman Hartwig, Knud Jahnke, Masafusa Onoue, Malte Schramm, Yoshihiro Ueda

Abstract: We present a study of the molecular gas in five closely-spaced ($R_{\perp}<20$ kpc) dual quasars ($L_{\rm bol}\gtrsim10^{44}~\mathrm{erg~s}^{-1}$) at redshifts $0.4<z<0.8$ with the Atacama Large Millimeter/submillimeter Array. The dual quasar phase represents a distinctive stage during the interaction between two galaxies for investigating quasar fueling and feedback effects on the gas reservoir.… ▽ More We present a study of the molecular gas in five closely-spaced ($R_{\perp}<20$ kpc) dual quasars ($L_{\rm bol}\gtrsim10^{44}~\mathrm{erg~s}^{-1}$) at redshifts $0.4<z<0.8$ with the Atacama Large Millimeter/submillimeter Array. The dual quasar phase represents a distinctive stage during the interaction between two galaxies for investigating quasar fueling and feedback effects on the gas reservoir. The dual quasars were selected from the Sloan Digital Sky Survey and Subaru/Hyper Suprime-Cam Subaru Strategic Program, with confirmatory spectroscopic validation. Based on the detection of the CO J=2--1 emission line with Band 4, we derived key properties including CO luminosities, line widths, and molecular gas masses for these systems. Among the ten quasars of the five pairs, eight have line detections exceeding $5σ$. The detected sources prominently harbor substantial molecular gas reservoirs, with molecular gas masses ($M_{\text{molgas}}$) between $10^{9.6-10.5}~\mathrm{M_{\odot}}$, and molecular gas-to-stellar mass ratios ($μ_{\text{molgas}}$) spanning $18-97\%$. The overall $μ_{\text{molgas}}$ of these dual quasars agrees with that of inactive star-forming main-sequence galaxies at comparable redshifts, indicating no clear evidence of quenching. However, intriguing features in each individual system show possible evidence of AGN feedback, matter transfer, and compaction processes. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09392 [pdf, other]

Open-Canopy: A Country-Scale Benchmark for Canopy Height Estimation at Very High Resolution

Authors: Fajwel Fogel, Yohann Perron, Nikola Besic, Laurent Saint-André, Agnès Pellissier-Tanon, Martin Schwartz, Thomas Boudras, Ibrahim Fayad, Alexandre d'Aspremont, Loic Landrieu, Phillipe Ciais

Abstract: Estimating canopy height and canopy height change at meter resolution from satellite imagery has numerous applications, such as monitoring forest health, logging activities, wood resources, and carbon stocks. However, many existing forest datasets are based on commercial or closed data sources, restricting the reproducibility and evaluation of new approaches. To address this gap, we introduce Open… ▽ More Estimating canopy height and canopy height change at meter resolution from satellite imagery has numerous applications, such as monitoring forest health, logging activities, wood resources, and carbon stocks. However, many existing forest datasets are based on commercial or closed data sources, restricting the reproducibility and evaluation of new approaches. To address this gap, we introduce Open-Canopy, the first open-access and country-scale benchmark for very high resolution (1.5 m) canopy height estimation. Covering more than 87,000 km$^2$ across France, Open-Canopy combines SPOT satellite imagery with high resolution aerial LiDAR data. We also propose Open-Canopy-$Δ$, the first benchmark for canopy height change detection between two images taken at different years, a particularly challenging task even for recent models. To establish a robust foundation for these benchmarks, we evaluate a comprehensive list of state-of-the-art computer vision models for canopy height estimation. The dataset and associated codes can be accessed at https://github.com/fajwel/Open-Canopy. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 22 pages, 8 figures, Submitted to NeurIPS 2024 Datasets and Benchmarks Track

arXiv:2407.09376 [pdf, ps, other]

Ground-state properties of the double trillium lattice antiferromagnet KBaCr$_2$(PO$_4$)$_3$

Authors: R. Kolay, Qing-** Ding, Y. Furukawa, A. A. Tsirlin, R. Nath

Abstract: Trillium lattices formed by corner-shared triangular units are the platform for magnetic frustration in three dimensions. Herein, we report structural and magnetic properties of the Cr-based double trillium lattice material KBaCr$_2$(PO$_4$)$_3$ studied by x-ray diffraction, magnetization, heat capacity, thermal conductivity, and $^{31}$P nuclear magnetic resonance (NMR) measurements complemented… ▽ More Trillium lattices formed by corner-shared triangular units are the platform for magnetic frustration in three dimensions. Herein, we report structural and magnetic properties of the Cr-based double trillium lattice material KBaCr$_2$(PO$_4$)$_3$ studied by x-ray diffraction, magnetization, heat capacity, thermal conductivity, and $^{31}$P nuclear magnetic resonance (NMR) measurements complemented by density-functional band-structure calculations. Heat capacity and $^{31}$P NMR measurements reveal the magnetic transition at $T_{\rm N1} \simeq 13.5$ K in zero field followed by another transition at $T_{\rm N2} \simeq 7$ K in weak applied fields. The NMR sublattice magnetization confirms that the transition at $T_{\rm N1}$ is 3D in nature. The $^{31}$P spin-lattice relaxation rate in the ordered state follows the $T^3$ behavior indicative of the two-magnon Raman process. The spin lattice of KBaCr$_2$(PO$_4$)$_3$ comprises two crystallographically nonequivalent ferromagnetic sublattices that are coupled antiferromagnetically, thus eliminating frustration in this trillium network. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 12 pages, 13 figures

arXiv:2407.09371 [pdf, other]

Computationally Efficient Estimation of Large Probit Models

Authors: Patrick Ding, Guido Imbens, Zhaonan Qu, Yinyu Ye

Abstract: Probit models are useful for modeling correlated discrete responses in many disciplines, including discrete choice data in economics. However, the Gaussian latent variable feature of probit models coupled with identification constraints pose significant computational challenges for its estimation and inference, especially when the dimension of the discrete response variable is large. In this paper… ▽ More Probit models are useful for modeling correlated discrete responses in many disciplines, including discrete choice data in economics. However, the Gaussian latent variable feature of probit models coupled with identification constraints pose significant computational challenges for its estimation and inference, especially when the dimension of the discrete response variable is large. In this paper, we propose a computationally efficient Expectation-Maximization (EM) algorithm for estimating large probit models. Our work is distinct from existing methods in two important aspects. First, instead of simulation or sampling methods, we apply and customize expectation propagation (EP), a deterministic method originally proposed for approximate Bayesian inference, to estimate moments of the truncated multivariate normal (TMVN) in the E (expectation) step. Second, we take advantage of a symmetric identification condition to transform the constrained optimization problem in the M (maximization) step into a one-dimensional problem, which is solved efficiently using Newton's method instead of off-the-shelf solvers. Our method enables the analysis of correlated choice data in the presence of more than 100 alternatives, which is a reasonable size in modern applications, such as online shop** and booking platforms, but has been difficult in practice with probit models. We apply our probit estimation method to study ordering effects in hotel search results on Expedia.com. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09367 [pdf, other]

Resha** the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation

Authors: Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma, Weijun Zhuang, Yaohui Ma, Dai Yong, Yaowei Wang

Abstract: Continual Test-Time Adaptation (CTTA) involves adapting a pre-trained source model to continually changing unsupervised target domains. In this paper, we systematically analyze the challenges of this task: online environment, unsupervised nature, and the risks of error accumulation and catastrophic forgetting under continual domain shifts. To address these challenges, we reshape the online data bu… ▽ More Continual Test-Time Adaptation (CTTA) involves adapting a pre-trained source model to continually changing unsupervised target domains. In this paper, we systematically analyze the challenges of this task: online environment, unsupervised nature, and the risks of error accumulation and catastrophic forgetting under continual domain shifts. To address these challenges, we reshape the online data buffering and organizing mechanism for CTTA. We propose an {uncertainty-aware buffering approach} to identify {and aggregate} significant samples with high certainty from the unsupervised, single-pass data stream. {Based on this}, we propose a graph-based class relation preservation constraint to overcome catastrophic forgetting. Furthermore, a pseudo-target replay objective is used to mitigate error accumulation. Extensive experiments demonstrate the superiority of our method in both segmentation and classification CTTA tasks. Code is available at \href{https://github.com/z1358/OBAO}{this https URL}. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: This is the preprint version of our paper and supplemental material to appear in ECCV 2024

arXiv:2407.09361 [pdf, ps, other]

Nonreciprocal phonons in PT-symmetric antiferromagnet

Authors: Yafei Ren, Daniyar Saparov, Qian Niu

Abstract: Phonon nonreciprocity, indicating different transport properties along opposite directions, has been observed in experiments under a magnetic field. We show that nonreciprocal acoustic phonons can also exist without a magnetic field nor net magnetization. We focus on PT symmetric antiferromagnets that break both time-reversal T and inversion symmetry P. We identify crucial contributions in phenome… ▽ More Phonon nonreciprocity, indicating different transport properties along opposite directions, has been observed in experiments under a magnetic field. We show that nonreciprocal acoustic phonons can also exist without a magnetic field nor net magnetization. We focus on PT symmetric antiferromagnets that break both time-reversal T and inversion symmetry P. We identify crucial contributions in phenomenological elastic theory, dubbed flexo-viscosity and flexo-torque, that induce phonon nonreciprocity without changing the phonon polarization. The microscopic origin of these contributions is the molecular Berry curvature, manifested as emergent nonlocal magnetic fields on phonons. The symmetry breaking originated from spin order is transferred to the phonon system through spin-orbit coupling, where the orbital degree of freedom affects the lattice dynamics directly. By electrically modifying the spin-orbit coupling, we show that both the phonon nonreciprocity and helicity can be controlled and enhanced. Importantly, the phonon nonreciprocity is an odd function of the Néel vector, serving as an indicator of the order parameter. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 5 pages, 2 figures

arXiv:2407.09360 [pdf, other]

Novel clustered federated learning based on local loss

Authors: Endong Gu, Yongxin Chen, Hao Wen, Xingju Cai, Deren Han

Abstract: This paper proposes LCFL, a novel clustering metric for evaluating clients' data distributions in federated learning. LCFL aligns with federated learning requirements, accurately assessing client-to-client variations in data distribution. It offers advantages over existing clustered federated learning methods, addressing privacy concerns, improving applicability to non-convex models, and providing… ▽ More This paper proposes LCFL, a novel clustering metric for evaluating clients' data distributions in federated learning. LCFL aligns with federated learning requirements, accurately assessing client-to-client variations in data distribution. It offers advantages over existing clustered federated learning methods, addressing privacy concerns, improving applicability to non-convex models, and providing more accurate classification results. LCFL does not require prior knowledge of clients' data distributions. We provide a rigorous mathematical analysis, demonstrating the correctness and feasibility of our framework. Numerical experiments with neural network instances highlight the superior performance of LCFL over baselines on several clustered federated learning benchmarks. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09357 [pdf, other]

Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees

Authors: Alexia Jolicoeur-Martineau, Aristide Baratin, Kisoo Kwon, Boris Knyazev, Yan Zhang

Abstract: Generating novel molecules is challenging, with most representations leading to generative models producing many invalid molecules. Spanning Tree-based Graph Generation (STGG) is a promising approach to ensure the generation of valid molecules, outperforming state-of-the-art SMILES and graph diffusion models for unconditional generation. In the real world, we want to be able to generate molecules… ▽ More Generating novel molecules is challenging, with most representations leading to generative models producing many invalid molecules. Spanning Tree-based Graph Generation (STGG) is a promising approach to ensure the generation of valid molecules, outperforming state-of-the-art SMILES and graph diffusion models for unconditional generation. In the real world, we want to be able to generate molecules conditional on one or multiple desired properties rather than unconditionally. Thus, in this work, we extend STGG to multi-property-conditional generation. Our approach, STGG+, incorporates a modern Transformer architecture, random masking of properties during training (enabling conditioning on any subset of properties and classifier-free guidance), an auxiliary property-prediction loss (allowing the model to self-criticize molecules and select the best ones), and other improvements. We show that STGG+ achieves state-of-the-art performance on in-distribution and out-of-distribution conditional generation, and reward maximization. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09344 [pdf, other]

Pre-training Point Cloud Compact Model with Partial-aware Reconstruction

Authors: Yaohua Zha, Yanzi Wang, Tao Dai, Shu-Tao Xia

Abstract: The pre-trained point cloud model based on Masked Point Modeling (MPM) has exhibited substantial improvements across various tasks. However, two drawbacks hinder their practical application. Firstly, the positional embedding of masked patches in the decoder results in the leakage of their central coordinates, leading to limited 3D representations. Secondly, the excessive model size of existing MPM… ▽ More The pre-trained point cloud model based on Masked Point Modeling (MPM) has exhibited substantial improvements across various tasks. However, two drawbacks hinder their practical application. Firstly, the positional embedding of masked patches in the decoder results in the leakage of their central coordinates, leading to limited 3D representations. Secondly, the excessive model size of existing MPM methods results in higher demands for devices. To address these, we propose to pre-train Point cloud Compact Model with Partial-aware \textbf{R}econstruction, named Point-CPR. Specifically, in the decoder, we couple the vanilla masked tokens with their positional embeddings as randomly masked queries and introduce a partial-aware prediction module before each decoder layer to predict them from the unmasked partial. It prevents the decoder from creating a shortcut between the central coordinates of masked patches and their reconstructed coordinates, enhancing the robustness of models. We also devise a compact encoder composed of local aggregation and MLPs, reducing the parameters and computational requirements compared to existing Transformer-based encoders. Extensive experiments demonstrate that our model exhibits strong performance across various tasks, especially surpassing the leading MPM-based model PointGPT-B with only 2% of its parameters. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: arXiv admin note: text overlap with arXiv:2405.17149

arXiv:2407.09343 [pdf, other]

Thermodynamics of Giant Molecular Clouds: The Effects of Dust Grain Size

Authors: Nadine H. Soliman, Philip F. Hopkins, Michael Y. Grudić

Abstract: The dust grain size distribution (GSD) likely varies significantly across different star-forming environments in the Universe, but the overall impact of this variation on star formation remains unclear. This ambiguity arises because the GSD interacts non-linearly with processes like heating/cooling, radiation, and chemistry, which have competing effects and different environmental dependencies. In… ▽ More The dust grain size distribution (GSD) likely varies significantly across different star-forming environments in the Universe, but the overall impact of this variation on star formation remains unclear. This ambiguity arises because the GSD interacts non-linearly with processes like heating/cooling, radiation, and chemistry, which have competing effects and different environmental dependencies. In this study, we investigate the effects of GSD variation on the thermochemistry and evolution of giant molecular clouds (GMCs). To achieve this, we conducted radiation-dust-magnetohydrodynamic simulations spanning a range of cloud masses and grain sizes, which explicitly incorporate the dynamics of dust grains within the full-physics framework of the STARFORGE project. We find that differences in grain size significantly alter the thermochemistry of GMCs. Specifically, we show that the leading-order effect is that larger grains, under fixed dust mass and dust-to-gas ratio conditions, result in lower dust opacities. This reduced opacity permits ISRF photons to penetrate more deeply and allows internal radiation field photons to permeate more extensively into the cloud, resulting in rapid gas heating and the inhibition of star formation. We find that star formation efficiency is highly sensitive to grain size, with an order of magnitude reduction in efficiency when grain size increases from 0.1 $\rmμm$ to 10 $\rmμm$. Additionally, we note that warmer gas suppresses the formation of low-mass stars. Moreover, as a consequence of the decreased opacities, we observe a greater proportion of gas residing in diffuse ionized structures. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 13 pages, 5 figures, submitted to ApJ

arXiv:2407.09339 [pdf, other]

Rates of Stellar Tidal Disruption Events Around Intermediate-Mass Black Holes

Authors: Janet N. Y. Chang, Lixin Dai, Hugo Pfister, Rudrani Kar Chowdhury, Priyamvada Natarajan

Abstract: Rates of stellar tidal disruption events (TDEs) around supermassive black holes (SMBHs) have been extensively calculated using the loss cone theory, while theoretical work on TDE rates around intermediate-mass black holes (IMBHs) has been lacking. In this work, we aim to accurately calculate the IMBH TDE rates based on their black hole masses and the stellar profiles of their host galaxies obtaine… ▽ More Rates of stellar tidal disruption events (TDEs) around supermassive black holes (SMBHs) have been extensively calculated using the loss cone theory, while theoretical work on TDE rates around intermediate-mass black holes (IMBHs) has been lacking. In this work, we aim to accurately calculate the IMBH TDE rates based on their black hole masses and the stellar profiles of their host galaxies obtained from the latest observations. We find that IMBH TDEs from the center of small galaxies have an overall rate comparable to SMBH TDEs, while off-nuclei IMBH TDEs from globular clusters have a much lower rate. Very interestingly, we show that the rate of IMBH TDE per galaxy generally increases with the black hole mass, which is opposite to the trend seen in SMBH TDEs. Furthermore, we report that IMBH TDEs typically occur in the pinhole regime, which means that deeply plunging events are more likely for IMBH TDEs compared to SMBH TDEs. We also calculate the volumetric TDE rates for IMBH and SMBH TDEs and compare with observed rates. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 21 pages, 11 figures

arXiv:2407.09334 [pdf, other]

Adynkra Genomes, Adynkrafields, and the 4D, ${\cal N}$ = 1 Supergravity Superfield Prepotential

Authors: S. James Gates, Jr., Yangrui Hu

Abstract: A re-imagining of the supergravity prepotential formulation of 4D, $\cal N$ = 1 supergravity and its Salam-Strathdee superfield superconformal gauge group is presented. A re-imagining of the supergravity prepotential formulation of 4D, $\cal N$ = 1 supergravity and its Salam-Strathdee superfield superconformal gauge group is presented. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09333 [pdf, other]

HETOCompiler: An MLIR-based crypTOgraphic Compilation Framework for HEterogeneous Devices

Authors: Zhiyuan Tan, Liutong Han, Mingjie Xing, Yanjun Wu

Abstract: Hash algorithms are fundamental tools in cryptography, offering irreversible and sensitive transformations of input data for various security purposes. As computing architectures evolve towards heterogeneous systems, efficiently harnessing diverse computing resources for hash encryption algorithms becomes crucial. This paper presents HETOCompiler, a novel cryptography compilation framework designe… ▽ More Hash algorithms are fundamental tools in cryptography, offering irreversible and sensitive transformations of input data for various security purposes. As computing architectures evolve towards heterogeneous systems, efficiently harnessing diverse computing resources for hash encryption algorithms becomes crucial. This paper presents HETOCompiler, a novel cryptography compilation framework designed for heterogeneous systems. Leveraging Multi-Level Intermediate Representation (MLIR), HETOCompiler abstracts syntax and semantics for cryptographic primitives and heterogeneous computing models, facilitating efficient compilation of high-level hash encryption algorithms into executable programs compatible with diverse devices. Experimental results demonstrate significant performance improvements over existing OpenSSL library, with average enhancements of 49.3x, 1.5x, and 23.4x for SHA-1, MD5, and SM3 algorithms respectively. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09331 [pdf, other]

Suppression of quantum dissipation: A cooperative effect of quantum squeezing and quantum measurement

Authors: Yi-Ming Xia, Yi-Fei Wang, Xiao-Yun Zhang, Hai-Chao Li, Wei Xiong

Abstract: The ability to isolate a quantum system from its environment is of fundamental interest and importance in optical quantum science and technology. Here we propose an experimentally feasible scheme for beating environment-induced dissipation in an open two-level system coupled to a parametrically driven cavity. The mechanism relies on a novel cooperation between light-matter coupling enhancement and… ▽ More The ability to isolate a quantum system from its environment is of fundamental interest and importance in optical quantum science and technology. Here we propose an experimentally feasible scheme for beating environment-induced dissipation in an open two-level system coupled to a parametrically driven cavity. The mechanism relies on a novel cooperation between light-matter coupling enhancement and frequent measurements. We demonstrate that, in the presence of the cooperation, the system dynamics can be completely dominated by the effective system-cavity interaction and the dissipative effects from the system-environment coupling can be surprisingly ignored. This work provides a generic method of dissipation suppression in a variety of quantum mechanical platforms, including natural atoms and superconducting circuits. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09328 [pdf]

> 2π Phase Modulation using Exciton-Polaritons in a Two-Dimensional Superlattice

Authors: Jason Lynch, Pawan Kumar, Chen Chen, Nicholas Trainor, Shalini Kumari, Tzu-Yu Peng, Cindy Yueli Chen, Yu-Jung Lu, Joan Redwing, Deep Jariwala

Abstract: Active metamaterials promise to enable arbitrary, temporal control over the propagation of wavefronts of light for applications such as beam steering, optical communication modulators, and holograms. This has been done in the past using patterned silicon photonics to locally control the phase of light such that the metasurface acts as a large number of wavelets. Although phase modulation only requ… ▽ More Active metamaterials promise to enable arbitrary, temporal control over the propagation of wavefronts of light for applications such as beam steering, optical communication modulators, and holograms. This has been done in the past using patterned silicon photonics to locally control the phase of light such that the metasurface acts as a large number of wavelets. Although phase modulation only requires refractive index modulation when the interaction length is on the order of the wavelength, this is not enough to significantly modulate the phase of light in flatland. Instead, phase modulation is achieved using a resonant mode such as a plasmon or high-Q cavity mode that enable light to accumulate a large amount of phase over a short distance and coupling it to an active material that modulates the light-matter interactions. Here, we report that electrostatic do** can modulate the light-matter interaction strength of a two-dimensional WS2 based multi quantum well (MQW) structure going from strongly-coupled, phase-accumulating exciton-polaritons to weakly-coupled exciton-trion-polaritons. As a result of this transition, 2.02π radians of phase modulation is observed using spectroscopic ellipsometry. This result demonstrates the potential of the MQW structure as a compact, lightweight electro-optical modulators for LiDAR and optical communications in the red region of visible spectrum. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09317 [pdf, other]

Bonding states underpinning structural transitions in IrTe$_2$ observed with micro-ARPES

Authors: C. W. Nicholson, M. D. Watson, A. Pulkkinen, M. Rumo, G. Kremer, K. Y. Ma, F. O. von Rohr, C. Cacho, C. Monney

Abstract: Competing interactions in low-dimensional materials can produce nearly degenerate electronic and structural phases. We investigate the staircase of structural phase transitions in layered IrTe$_2$ for which a number of potential transition mechanisms have been postulated. The spatial coexistence of multiple phases on the micron scale has prevented a detailed analysis of the electronic structure. B… ▽ More Competing interactions in low-dimensional materials can produce nearly degenerate electronic and structural phases. We investigate the staircase of structural phase transitions in layered IrTe$_2$ for which a number of potential transition mechanisms have been postulated. The spatial coexistence of multiple phases on the micron scale has prevented a detailed analysis of the electronic structure. By exploiting micro-ARPES obtained with synchrotron radiation we extract the electronic structure of the multiple structural phases in IrTe$_2$ in order to address the mechanism underlying the phase transitions. We find direct evidence of lowered energy states that appear in the low-temperature phases, states previously predicted by \textit{ab initio} calculations and extended here. Our results validate a proposed scenario of bonding and anti-bonding states as the driver of the phase transitions. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: None

arXiv:2407.09315 [pdf, other]

RBMD: A molecular dynamics package enabling to simulate 10 million all-atom particles in a single graphics processing unit

Authors: Weihang Gao, Teng Zhao, Yongfa Guo, Jiuyang Liang, Huan Liu, Maoying Luo, Zedong Luo, Wei Qin, Yichao Wang, Qi Zhou, Shi **, Zhenli Xu

Abstract: This paper introduces a random-batch molecular dynamics (RBMD) package for fast simulations of particle systems at the nano/micro scale. Different from existing packages, the RBMD uses random batch methods for nonbonded interactions of particle systems. The long-range part of Coulomb interactions is calculated in Fourier space by the random batch Ewald algorithm, which achieves linear complexity a… ▽ More This paper introduces a random-batch molecular dynamics (RBMD) package for fast simulations of particle systems at the nano/micro scale. Different from existing packages, the RBMD uses random batch methods for nonbonded interactions of particle systems. The long-range part of Coulomb interactions is calculated in Fourier space by the random batch Ewald algorithm, which achieves linear complexity and superscalability, surpassing classical lattice-based Ewald methods. For the short-range part, the random batch list algorithm is used to construct neighbor lists, significantly reducing both computational and memory costs. The RBMD is implemented on GPU-CPU heterogeneous architectures, with classical force fields for all-atom systems. Benchmark systems are used to validate accuracy and performance of the package. Comparison with the particle-particle particle-mesh method and the Verlet list method in the LAMMPS package is performed on three different NVIDIA GPUs, demonstrating high efficiency of the RBMD on heterogeneous architectures. Our results also show that the RBMD enables simulations on a single GPU with a CPU core up to 10 million particles. Typically, for systems of one million particles, the RBMD allows simulating all-atom systems with a high efficiency of 8.20 ms per step, demonstrating the attractive feature for running large-scale simulations of practical applications on a desktop machine. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 26 pages, 8 figures

arXiv:2407.09312 [pdf, other]

doi 10.1093/mnras/stae1148

Detailed Map** of the Galactic Disk Structure in the Solar Neighborhood through LAMOST K Dwarfs

Authors: Xi-Can Tang, Hao Tian, **g Li, Bing-qiu Chen, Yi-Rong Chen, Chao Liu, Dan Qiu

Abstract: The Galactic disk is one of the main components of the Milky Way, which contributes most of the luminosity. Its structure is essential for understanding the formation and evolution of the Milky Way. Using 174,443 K-type dwarf stars observed by both LAMOST and Gaia DR3, we study the disk density profile in the local volume within 1,200 pc. In the azimuthal dimension, we find strong asymmetric signa… ▽ More The Galactic disk is one of the main components of the Milky Way, which contributes most of the luminosity. Its structure is essential for understanding the formation and evolution of the Milky Way. Using 174,443 K-type dwarf stars observed by both LAMOST and Gaia DR3, we study the disk density profile in the local volume within 1,200 pc. In the azimuthal dimension, we find strong asymmetric signal of the thin disk. The surface density and the scale height of the southern disk significantly change versus the azimuthal angle at the same galactocentric distance $R$. Meanwhile, in the vertical dimension, the scale height of the northern disk has quite different trend than that of the southern one. The scale height of the southern disk shows a decreasing trend with $φ\sim-2.5^\circ$, and change to an increasing one with $φ\sim5.0^°$. Meanwhile, the scale height of the northern disk has a consistently smaller increase. Finally, we divide the entire sample into three subsamples based on metallicity and all three subsamples show significant non-axisymmetric and north-south asymmetric signals in the Galactic disk. Furthermore, we find that the scale height of the metal-poor ([Fe/H] $<$ -0.4 dex) subsample in the northern disk is greater than that of the metal-rich ([Fe/H] $>$ -0.1 dex) subsample. However, in the southern disk, the scale height exhibits varying relationships across different metallicity slices. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 15 pages, 24 figures, 6 tables; accepted for publication in MNRAS

arXiv:2407.09309 [pdf, other]

Dynamic-Mode Decomposition of Geostrophically Balanced and Unbalanced Motions from SWOT

Authors: Takaya Uchida, Yadidya Badarvada, Karl E. Lapo, Xiaobiao Xu, Brian K. Arbic, Dimitris Menemenlis, Luna Hiron, Eric P. Chassignet, Jay F. Shriver

Abstract: The decomposition of oceanic flow into its balanced and unbalanced motions carries theoretical and practical significance for the oceanographic community. These two motions have distinct dynamical characteristics and affect the transport of tracers differently from one another. The launch of Surface Water and Ocean Topography (SWOT) satellite provides a prime opportunity to diagnose the surface ba… ▽ More The decomposition of oceanic flow into its balanced and unbalanced motions carries theoretical and practical significance for the oceanographic community. These two motions have distinct dynamical characteristics and affect the transport of tracers differently from one another. The launch of Surface Water and Ocean Topography (SWOT) satellite provides a prime opportunity to diagnose the surface balanced and unbalanced motions on a global scale at an unprecedented spatial resolution. Here, we apply dynamic-mode decomposition (DMD), a linear-algebraic data-driven method, to a tidally-forced numerical simulation and one-day-repeat SWOT observations of sea-surface height (SSH) in the Gulf Stream extension. DMD is able to separate out the spatial modes associated with sub-inertial periods from super-inertial periods. The sub-inertial modes of DMD can be used to extract geostrophically balanced motions from SSH fields, which have an imprint of internal tides. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09302 [pdf, other]

Excision for Spaces of Admissible Skeins

Authors: Ingo Runkel, Christoph Schweigert, Ying Hong Tham

Abstract: The skein module for a d-dimensional manifold is a vector space spanned by embedded framed graphs decorated by a category A with suitable extra structure depending on the dimension d, modulo local relations which hold inside d-balls. For a full subcategory S of A, an S-admissible skein module is defined analogously, except that local relations for a given ball may only be applied if outside the ba… ▽ More The skein module for a d-dimensional manifold is a vector space spanned by embedded framed graphs decorated by a category A with suitable extra structure depending on the dimension d, modulo local relations which hold inside d-balls. For a full subcategory S of A, an S-admissible skein module is defined analogously, except that local relations for a given ball may only be applied if outside the ball at least one edge is coloured in S. In this paper we prove that admissible skein modules in any dimension satisfy excision, namely that the skein module of a glued manifold is expressed as a coend over boundary values on the boundary components glued together. We furthermore relate skein modules for different choices of S, apply our result to cylinder categories, and recover the relation to modified traces. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 64 pages

arXiv:2407.09299 [pdf, other]

PID: Physics-Informed Diffusion Model for Infrared Image Generation

Authors: Fangyuan Mao, Jilin Mei, Shun Lu, Fuyang Liu, Liang Chen, Fangzhou Zhao, Yu Hu

Abstract: Infrared imaging technology has gained significant attention for its reliable sensing ability in low visibility conditions, prompting many studies to convert the abundant RGB images to infrared images. However, most existing image translation methods treat infrared images as a stylistic variation, neglecting the underlying physical laws, which limits their practical application. To address these i… ▽ More Infrared imaging technology has gained significant attention for its reliable sensing ability in low visibility conditions, prompting many studies to convert the abundant RGB images to infrared images. However, most existing image translation methods treat infrared images as a stylistic variation, neglecting the underlying physical laws, which limits their practical application. To address these issues, we propose a Physics-Informed Diffusion (PID) model for translating RGB images to infrared images that adhere to physical laws. Our method leverages the iterative optimization of the diffusion model and incorporates strong physical constraints based on prior knowledge of infrared laws during training. This approach enhances the similarity between translated infrared images and the real infrared domain without increasing extra training parameters. Experimental results demonstrate that PID significantly outperforms existing state-of-the-art methods. Our code is available at https://github.com/fangyuanmao/PID. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09295 [pdf, other]

Security Matrix for Multimodal Agents on Mobile Devices: A Systematic and Proof of Concept Study

Authors: Yulong Yang, Xinshan Yang, Shuaidong Li, Chenhao Lin, Zhengyu Zhao, Chao Shen, Tianwei Zhang

Abstract: The rapid progress in the reasoning capability of the Multi-modal Large Language Models (MLLMs) has triggered the development of autonomous agent systems on mobile devices. MLLM-based mobile agent systems consist of perception, reasoning, memory, and multi-agent collaboration modules, enabling automatic analysis of user instructions and the design of task pipelines with only natural language and d… ▽ More The rapid progress in the reasoning capability of the Multi-modal Large Language Models (MLLMs) has triggered the development of autonomous agent systems on mobile devices. MLLM-based mobile agent systems consist of perception, reasoning, memory, and multi-agent collaboration modules, enabling automatic analysis of user instructions and the design of task pipelines with only natural language and device screenshots as inputs. Despite the increased human-machine interaction efficiency, the security risks of MLLM-based mobile agent systems have not been systematically studied. Existing security benchmarks for agents mainly focus on Web scenarios, and the attack techniques against MLLMs are also limited in the mobile agent scenario. To close these gaps, this paper proposes a mobile agent security matrix covering 3 functional modules of the agent systems. Based on the security matrix, this paper proposes 4 realistic attack paths and verifies these attack paths through 8 attack methods. By analyzing the attack results, this paper reveals that MLLM-based mobile agent systems are not only vulnerable to multiple traditional attacks, but also raise new security concerns previously unconsidered. This paper highlights the need for security awareness in the design of MLLM-based systems and paves the way for future research on attacks and defense methods. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: Preprint. Work in progress

arXiv:2407.09292 [pdf, other]

CEIPA: Counterfactual Explainable Incremental Prompt Attack Analysis on Large Language Models

Authors: Dong Shu, Mingyu **, Tianle Chen, Chong Zhang, Yongfeng Zhang

Abstract: This study sheds light on the imperative need to bolster safety and privacy measures in large language models (LLMs), such as GPT-4 and LLaMA-2, by identifying and mitigating their vulnerabilities through explainable analysis of prompt attacks. We propose Counterfactual Explainable Incremental Prompt Attack (CEIPA), a novel technique where we guide prompts in a specific manner to quantitatively me… ▽ More This study sheds light on the imperative need to bolster safety and privacy measures in large language models (LLMs), such as GPT-4 and LLaMA-2, by identifying and mitigating their vulnerabilities through explainable analysis of prompt attacks. We propose Counterfactual Explainable Incremental Prompt Attack (CEIPA), a novel technique where we guide prompts in a specific manner to quantitatively measure attack effectiveness and explore the embedded defense mechanisms in these models. Our approach is distinctive for its capacity to elucidate the reasons behind the generation of harmful responses by LLMs through an incremental counterfactual methodology. By organizing the prompt modification process into four incremental levels: (word, sentence, character, and a combination of character and word) we facilitate a thorough examination of the susceptibilities inherent to LLMs. The findings from our study not only provide counterfactual explanation insight but also demonstrate that our framework significantly enhances the effectiveness of attack prompts. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 23 pages, 6 figures

arXiv:2407.09285 [pdf, other]

MetaFood CVPR 2024 Challenge on Physically Informed 3D Food Reconstruction: Methods and Results

Authors: Jiangpeng He, Yuhao Chen, Gautham Vinod, Talha Ibn Mahmud, Fengqing Zhu, Edward Delp, Alexander Wong, Pengcheng Xi, Ahmad AlMughrabi, Umair Haroon, Ricardo Marques, Petia Radeva, Jiadong Tang, Dianyi Yang, Yu Gao, Zhaoxiang Liang, Yawei Jueluo, Chengyu Shi, Pengyu Wang

Abstract: The increasing interest in computer vision applications for nutrition and dietary monitoring has led to the development of advanced 3D reconstruction techniques for food items. However, the scarcity of high-quality data and limited collaboration between industry and academia have constrained progress in this field. Building on recent advancements in 3D reconstruction, we host the MetaFood Workshop… ▽ More The increasing interest in computer vision applications for nutrition and dietary monitoring has led to the development of advanced 3D reconstruction techniques for food items. However, the scarcity of high-quality data and limited collaboration between industry and academia have constrained progress in this field. Building on recent advancements in 3D reconstruction, we host the MetaFood Workshop and its challenge for Physically Informed 3D Food Reconstruction. This challenge focuses on reconstructing volume-accurate 3D models of food items from 2D images, using a visible checkerboard as a size reference. Participants were tasked with reconstructing 3D models for 20 selected food items of varying difficulty levels: easy, medium, and hard. The easy level provides 200 images, the medium level provides 30 images, and the hard level provides only 1 image for reconstruction. In total, 16 teams submitted results in the final testing phase. The solutions developed in this challenge achieved promising results in 3D food reconstruction, with significant potential for improving portion estimation for dietary assessment and nutritional monitoring. More details about this workshop challenge and access to the dataset can be found at https://sites.google.com/view/cvpr-metafood-2024. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: Technical report for MetaFood CVPR 2024 Challenge on Physically Informed 3D Food Reconstruction. arXiv admin note: substantial text overlap with arXiv:2407.01717

arXiv:2407.09276 [pdf, other]

H2O-Danube3 Technical Report

Authors: Pascal Pfeiffer, Philipp Singer, Yauhen Babakhin, Gabor Fodor, Nischay Dhankhar, Sri Satish Ambati

Abstract: We present H2O-Danube3, a series of small language models consisting of H2O-Danube3-4B, trained on 6T tokens and H2O-Danube3-500M, trained on 4T tokens. Our models are pre-trained on high quality Web data consisting of primarily English tokens in three stages with different data mixes before final supervised tuning for chat version. The models exhibit highly competitive metrics across a multitude… ▽ More We present H2O-Danube3, a series of small language models consisting of H2O-Danube3-4B, trained on 6T tokens and H2O-Danube3-500M, trained on 4T tokens. Our models are pre-trained on high quality Web data consisting of primarily English tokens in three stages with different data mixes before final supervised tuning for chat version. The models exhibit highly competitive metrics across a multitude of academic, chat, and fine-tuning benchmarks. Thanks to its compact architecture, H2O-Danube3 can be efficiently run on a modern smartphone, enabling local inference and rapid processing capabilities even on mobile devices. We make all models openly available under Apache 2.0 license further democratizing LLMs to a wider audience economically. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09274 [pdf, other]

Unifying Sequences, Structures, and Descriptions for Any-to-Any Protein Generation with the Large Multimodal Model HelixProtX

Authors: Zhiyuan Chen, Tianhao Chen, Chenggang Xie, Yang Xue, Xiaonan Zhang, **gbo Zhou, Xiaomin Fang

Abstract: Proteins are fundamental components of biological systems and can be represented through various modalities, including sequences, structures, and textual descriptions. Despite the advances in deep learning and scientific large language models (LLMs) for protein research, current methodologies predominantly focus on limited specialized tasks -- often predicting one protein modality from another. Th… ▽ More Proteins are fundamental components of biological systems and can be represented through various modalities, including sequences, structures, and textual descriptions. Despite the advances in deep learning and scientific large language models (LLMs) for protein research, current methodologies predominantly focus on limited specialized tasks -- often predicting one protein modality from another. These approaches restrict the understanding and generation of multimodal protein data. In contrast, large multimodal models have demonstrated potential capabilities in generating any-to-any content like text, images, and videos, thus enriching user interactions across various domains. Integrating these multimodal model technologies into protein research offers significant promise by potentially transforming how proteins are studied. To this end, we introduce HelixProtX, a system built upon the large multimodal model, aiming to offer a comprehensive solution to protein research by supporting any-to-any protein modality generation. Unlike existing methods, it allows for the transformation of any input protein modality into any desired protein modality. The experimental results affirm the advanced capabilities of HelixProtX, not only in generating functional descriptions from amino acid sequences but also in executing critical tasks such as designing protein sequences and structures from textual descriptions. Preliminary findings indicate that HelixProtX consistently achieves superior accuracy across a range of protein-related tasks, outperforming existing state-of-the-art models. By integrating multimodal large models into protein research, HelixProtX opens new avenues for understanding protein biology, thereby promising to accelerate scientific discovery. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09271 [pdf, other]

iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

Authors: Tom Fischer, Yaoyao Liu, Artur Jesslen, Noor Ahmed, Prakhar Kaushik, Angtian Wang, Alan Yuille, Adam Kortylewski, Eddy Ilg

Abstract: Different from human nature, it is still common practice today for vision tasks to train deep learning models only initially and on fixed datasets. A variety of approaches have recently addressed handling continual data streams. However, extending these methods to manage out-of-distribution (OOD) scenarios has not effectively been investigated. On the other hand, it has recently been shown that no… ▽ More Different from human nature, it is still common practice today for vision tasks to train deep learning models only initially and on fixed datasets. A variety of approaches have recently addressed handling continual data streams. However, extending these methods to manage out-of-distribution (OOD) scenarios has not effectively been investigated. On the other hand, it has recently been shown that non-continual neural mesh models exhibit strong performance in generalizing to such OOD scenarios. To leverage this decisive property in a continual learning setting, we propose incremental neural mesh models that can be extended with new meshes over time. In addition, we present a latent space initialization strategy that enables us to allocate feature space for future unseen classes in advance and a positional regularization term that forces the features of the different classes to consistently stay in respective latent space regions. We demonstrate the effectiveness of our method through extensive experiments on the Pascal3D and ObjectNet3D datasets and show that our approach outperforms the baselines for classification by $2-6\%$ in the in-domain and by $6-50\%$ in the OOD setting. Our work also presents the first incremental learning approach for pose estimation. Our code and model can be found at https://github.com/Fischer-Tom/iNeMo. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09270 [pdf, other]

Computational co-design of structure and feedback controller for locomoting soft robots

Authors: Yuki Sato, Changyoung Yuhn, Hiroki Kobayashi, Atsushi Kawamoto, Tsuyoshi Nomura

Abstract: Soft robots have gained significant attention due to their flexibility and safety, particularly in human-centric applications. The co-design of structure and controller in soft robotics has presented a longstanding challenge owing to the complexity of the dynamics involved. Despite some pioneering work dealing with the co-design of soft robot structures and actuation, design freedom has been limit… ▽ More Soft robots have gained significant attention due to their flexibility and safety, particularly in human-centric applications. The co-design of structure and controller in soft robotics has presented a longstanding challenge owing to the complexity of the dynamics involved. Despite some pioneering work dealing with the co-design of soft robot structures and actuation, design freedom has been limited by stochastic design search approaches. This study proposes the simultaneous optimization of structure and controller for soft robots in locomotion tasks, integrating topology optimization-based structural design with neural network-based feedback controller design. Here, the feedback controller receives information about the surrounding terrain and outputs actuation signals that induce the expansion and contraction of the material. We formulate the simultaneous optimization problem under uncertainty in terrains and construct an optimization algorithm that utilizes automatic differentiation within topology optimization and neural networks. We present numerical experiments to demonstrate the validity and effectiveness of our proposed method. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 23 pages, 22 figures

arXiv:2407.09268 [pdf, other]

Region Attention Transformer for Medical Image Restoration

Authors: Zhiwen Yang, Haowei Chen, Ziniu Qian, Yang Zhou, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu

Abstract: Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmen… ▽ More Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmentation of continuous image content. To overcome these challenges, we introduce a novel Region Attention Transformer (RAT) that utilizes a region-based multi-head self-attention mechanism (R-MSA). The R-MSA dynamically partitions the input image into non-overlap** semantic regions using the robust Segment Anything Model (SAM) and then performs self-attention within these regions. This region partitioning is more flexible and interpretable, ensuring that only pixels from similar semantic regions complement each other, thereby eliminating interference from irrelevant regions. Moreover, we introduce a focal region loss to guide our model to adaptively focus on recovering high-difficulty regions. Extensive experiments demonstrate the effectiveness of RAT in various medical image restoration tasks, including PET image synthesis, CT image denoising, and pathological image super-resolution. Code is available at \href{https://github.com/Yaziwel/Region-Attention-Transformer-for-Medical-Image-Restoration.git}{https://github.com/RAT}. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: This paper has been accepted by MICCAI 2024

arXiv:2407.09257 [pdf, other]

A multiscale Consensus-Based algorithm for multi-level optimization

Authors: Michael Herty, Yuyang Huang, Dante Kalise, Hicham Kouhkouh

Abstract: A novel multiscale consensus-based optimization (CBO) algorithm for solving bi- and tri-level optimization problems is introduced. Existing CBO techniques are generalized by the proposed method through the employment of multiple interacting populations of particles, each of which is used to optimize one level of the problem. These particle populations are evolved through multiscale-in-time dynamic… ▽ More A novel multiscale consensus-based optimization (CBO) algorithm for solving bi- and tri-level optimization problems is introduced. Existing CBO techniques are generalized by the proposed method through the employment of multiple interacting populations of particles, each of which is used to optimize one level of the problem. These particle populations are evolved through multiscale-in-time dynamics, which are formulated as a singularly perturbed system of stochastic differential equations. Theoretical convergence analysis for the multiscale CBO model to an averaged effective dynamics as the time-scale separation parameter approaches zero is provided. The resulting algorithm is presented for both bi-level and tri-level optimization problems. The effectiveness of the approach in tackling complex multi-level optimization tasks is demonstrated through numerical experiments on various benchmark functions. Additionally, it is shown that the proposed method performs well on min-max optimization problems, comparing favorably with existing CBO algorithms for saddle point problems. △ Less

Submitted 12 July, 2024; originally announced July 2024.

MSC Class: 65C35; 90C56; 90C26; 49M37; 93C70

arXiv:2407.09254 [pdf, other]

Power Optimization and Deep Learning for Channel Estimation of Active IRS-Aided IoT

Authors: Yan Wang, Wei Gao, Qi Zhang, Jiajia Liu, Feng Shu

Abstract: In this paper, channel estimation of an active intelligent reflecting surface (IRS) aided uplink Internet of Things (IoT) network is investigated. Firstly, the least square (LS) estimators for the direct channel and the cascaded channel are presented, respectively. The corresponding mean square errors (MSE) of channel estimators are derived. Subsequently, in order to evaluate the influence of adju… ▽ More In this paper, channel estimation of an active intelligent reflecting surface (IRS) aided uplink Internet of Things (IoT) network is investigated. Firstly, the least square (LS) estimators for the direct channel and the cascaded channel are presented, respectively. The corresponding mean square errors (MSE) of channel estimators are derived. Subsequently, in order to evaluate the influence of adjusting the transmit power at the IoT devices or the reflected power at the active IRS on Sum-MSE performance, two situations are considered. In the first case, under the total power sum constraint of the IoT devices and active IRS, the closed-form expression of the optimal power allocation factor is derived. In the second case, when the transmit power at the IoT devices is fixed, there exists an optimal reflective power at active IRS. To further improve the estimation performance, the convolutional neural network (CNN)-based direct channel estimation (CDCE) algorithm and the CNN-based cascaded channel estimation (CCCE) algorithm are designed. Finally, simulation results demonstrate the existence of an optimal power allocation strategy that minimizes the Sum-MSE, and further validate the superiority of the proposed CDCE / CCCE algorithms over their respective traditional LS and minimum mean square error (MMSE) baselines. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09251 [pdf, other]

Deep Adversarial Defense Against Multilevel-Lp Attacks

Authors: Ren Wang, Yuxuan Li, Alfred Hero

Abstract: Deep learning models have shown considerable vulnerability to adversarial attacks, particularly as attacker strategies become more sophisticated. While traditional adversarial training (AT) techniques offer some resilience, they often focus on defending against a single type of attack, e.g., the $\ell_\infty$-norm attack, which can fail for other types. This paper introduces a computationally effi… ▽ More Deep learning models have shown considerable vulnerability to adversarial attacks, particularly as attacker strategies become more sophisticated. While traditional adversarial training (AT) techniques offer some resilience, they often focus on defending against a single type of attack, e.g., the $\ell_\infty$-norm attack, which can fail for other types. This paper introduces a computationally efficient multilevel $\ell_p$ defense, called the Efficient Robust Mode Connectivity (EMRC) method, which aims to enhance a deep learning model's resilience against multiple $\ell_p$-norm attacks. Similar to analytical continuation approaches used in continuous optimization, the method blends two $p$-specific adversarially optimal models, the $\ell_1$- and $\ell_\infty$-norm AT solutions, to provide good adversarial robustness for a range of $p$. We present experiments demonstrating that our approach performs better on various attacks as compared to AT-$\ell_\infty$, E-AT, and MSD, for datasets/architectures including: CIFAR-10, CIFAR-100 / PreResNet110, WideResNet, ViT-Base. △ Less

Submitted 12 July, 2024; originally announced July 2024.

arXiv:2407.09239 [pdf, other]

doi 10.1109/VTC2023-Fall60731.2023.10333794

FedVAE: Trajectory privacy preserving based on Federated Variational AutoEncoder

Authors: Yuchen Jiang, Ying Wu, Shiyao Zhang, James J. Q. Yu

Abstract: The use of trajectory data with abundant spatial-temporal information is pivotal in Intelligent Transport Systems (ITS) and various traffic system tasks. Location-Based Services (LBS) capitalize on this trajectory data to offer users personalized services tailored to their location information. However, this trajectory data contains sensitive information about users' movement patterns and habits,… ▽ More The use of trajectory data with abundant spatial-temporal information is pivotal in Intelligent Transport Systems (ITS) and various traffic system tasks. Location-Based Services (LBS) capitalize on this trajectory data to offer users personalized services tailored to their location information. However, this trajectory data contains sensitive information about users' movement patterns and habits, necessitating confidentiality and protection from unknown collectors. To address this challenge, privacy-preserving methods like K-anonymity and Differential Privacy have been proposed to safeguard private information in the dataset. Despite their effectiveness, these methods can impact the original features by introducing perturbations or generating unrealistic trajectory data, leading to suboptimal performance in downstream tasks. To overcome these limitations, we propose a Federated Variational AutoEncoder (FedVAE) approach, which effectively generates a new trajectory dataset while preserving the confidentiality of private information and retaining the structure of the original features. In addition, FedVAE leverages Variational AutoEncoder (VAE) to maintain the original feature space and generate new trajectory data, and incorporates Federated Learning (FL) during the training stage, ensuring that users' data remains locally stored to protect their personal information. The results demonstrate its superior performance compared to other existing methods, affirming FedVAE as a promising solution for enhancing data privacy and utility in location-based applications. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 2023 IEEE 98th Vehicular Technology Conference

Showing 1–50 of 329,772 results for author: Y.