-
Pseudo-magnetic fields in square lattices
Authors:
Junsong Sun,
Xingchuan Zhu,
Tianyu Liu,
Shi** Feng,
Huaiming Guo
Abstract:
We have investigated the effects of strain on two-dimensional square lattices and examined the methods for inducing pseudo-magnetic fields. In both the columnar and staggered $π$-flux square lattices, we have found that strain only modulates Fermi velocities rather than inducing pseudo-magnetic fields. However, spatially non-uniform on-site potentials (anisotropic hop**s) can create pseudo-magne…
▽ More
We have investigated the effects of strain on two-dimensional square lattices and examined the methods for inducing pseudo-magnetic fields. In both the columnar and staggered $π$-flux square lattices, we have found that strain only modulates Fermi velocities rather than inducing pseudo-magnetic fields. However, spatially non-uniform on-site potentials (anisotropic hop**s) can create pseudo-magnetic fields in columnar (staggered) $π$-flux square lattices. On the other hand, we demonstrate that strain does induce pseudo-magnetic fields in staggered zero-flux square lattices. By breaking a quarter of the bonds, we clarify that a staggered zero-flux square lattice is topologically equivalent to a honeycomb lattice and displays pseudo-vector potentials and pseudo-Landau levels at the Dirac points.
△ Less
Submitted 15 October, 2023; v1 submitted 31 August, 2023;
originally announced September 2023.
-
Observation of Flat Band and Van Hove Singularity in Non-superconducting Nitrogen-doped Lutetium Hydride
Authors:
Xin Liang,
Zihan Lin,
Jun Zhang,
Jianfa Zhao,
Shiyu Feng,
Wenlong Lu,
Guodong Wang,
Luchuan Shi,
Ningning Wang,
Pengfei Shan,
Zao Zhang,
Muntaser Naamneh,
Runzhe Liu,
Bastien Michon,
**guang Cheng,
Changqing **,
Yang Ren,
Junzhang Ma
Abstract:
Hydrogen-rich materials offer a compelling avenue towards room temperature superconductivity, albeit under ultra-high pressure. However, the experimental investigation of the electronic band structure remains elusive, due to the inherent instability of most of the hydrogen-rich materials upon pressure release. Very recently, nitrogen-doped lutetium hydride was claimed to host room temperature supe…
▽ More
Hydrogen-rich materials offer a compelling avenue towards room temperature superconductivity, albeit under ultra-high pressure. However, the experimental investigation of the electronic band structure remains elusive, due to the inherent instability of most of the hydrogen-rich materials upon pressure release. Very recently, nitrogen-doped lutetium hydride was claimed to host room temperature superconductivity under near ambient pressure but was disproven by following works. Upon decompression, nitrogen doped lutetium hydride manifests a stable metallic phase with dark blue color. Moreover, high temperature superconductivity has been reported in lutetium hydrides Lu4H23 (~71 K) under around 200 GPa. These properties engender an unprecedented opportunity, allowing for the experimental investigation of the electronic band structure intrinsic to hydrogen-rich material. In this work, using angle resolved photoemission spectroscopy to investigate the non-superconducting nitrogen doped lutetium hydride, we observed significant flat band and Van Hove singularity marginally below the Fermi level. These salient features, identified as critical elements, proffer potential amplifiers for the realization of heightened superconductivity, as evidenced by prior research. Our results not only unveil a confluence of potent strong correlation effects and anisotropy within the Lu-H-N compound, but also provide a prospect for engineering high temperature superconductivity through the strategic manipulation of flat band and the VHS, effectively tailoring their alignment with the Fermi energy.
△ Less
Submitted 8 September, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
IDVT: Interest-aware Denoising and View-guided Tuning for Social Recommendation
Authors:
Dezhao Yang,
Jianghong Ma,
Shanshan Feng,
Haijun Zhang,
Zhao Zhang
Abstract:
In the information age, recommendation systems are vital for efficiently filtering information and identifying user preferences. Online social platforms have enriched these systems by providing valuable auxiliary information. Socially connected users are assumed to share similar preferences, enhancing recommendation accuracy and addressing cold start issues. However, empirical findings challenge t…
▽ More
In the information age, recommendation systems are vital for efficiently filtering information and identifying user preferences. Online social platforms have enriched these systems by providing valuable auxiliary information. Socially connected users are assumed to share similar preferences, enhancing recommendation accuracy and addressing cold start issues. However, empirical findings challenge the assumption, revealing that certain social connections can actually harm system performance. Our statistical analysis indicates a significant amount of noise in the social network, where many socially connected users do not share common interests. To address this issue, we propose an innovative \underline{I}nterest-aware \underline{D}enoising and \underline{V}iew-guided \underline{T}uning (IDVT) method for the social recommendation. The first ID part effectively denoises social connections. Specifically, the denoising process considers both social network structure and user interaction interests in a global view. Moreover, in this global view, we also integrate denoised social information (social domain) into the propagation of the user-item interactions (collaborative domain) and aggregate user representations from two domains using a gating mechanism. To tackle potential user interest loss and enhance model robustness within the global view, our second VT part introduces two additional views (local view and dropout-enhanced view) for fine-tuning user representations in the global view through contrastive learning. Extensive evaluations on real-world datasets with varying noise ratios demonstrate the superiority of IDVT over state-of-the-art social recommendation methods.
△ Less
Submitted 17 June, 2024; v1 submitted 30 August, 2023;
originally announced August 2023.
-
DRGame: Diversified Recommendation for Multi-category Video Games with Balanced Implicit Preferences
Authors:
Kangzhe Liu,
Jianghong Ma,
Shanshan Feng,
Haijun Zhang,
Zhao Zhang
Abstract:
The growing popularity of subscription services in video game consumption has emphasized the importance of offering diversified recommendations. Providing users with a diverse range of games is essential for ensuring continued engagement and fostering long-term subscriptions. However, existing recommendation models face challenges in effectively handling highly imbalanced implicit feedback in gami…
▽ More
The growing popularity of subscription services in video game consumption has emphasized the importance of offering diversified recommendations. Providing users with a diverse range of games is essential for ensuring continued engagement and fostering long-term subscriptions. However, existing recommendation models face challenges in effectively handling highly imbalanced implicit feedback in gaming interactions. Additionally, they struggle to take into account the distinctive characteristics of multiple categories and the latent user interests associated with these categories. In response to these challenges, we propose a novel framework, named DRGame, to obtain diversified recommendation. It is centered on multi-category video games, consisting of two {components}: Balance-driven Implicit Preferences Learning for data pre-processing and Clustering-based Diversified Recommendation {Module} for final prediction. The first module aims to achieve a balanced representation of implicit feedback in game time, thereby discovering a comprehensive view of player interests across different categories. The second module adopts category-aware representation learning to cluster and select players and games based on balanced implicit preferences, and then employs asymmetric neighbor aggregation to achieve diversified recommendations. Experimental results on a real-world dataset demonstrate the superiority of our proposed method over existing approaches in terms of game diversity recommendations.
△ Less
Submitted 30 August, 2023;
originally announced August 2023.
-
PEM: Representing Binary Program Semantics for Similarity Analysis via a Probabilistic Execution Model
Authors:
Xiangzhe Xu,
Zhou Xuan,
Shiwei Feng,
Siyuan Cheng,
Yapeng Ye,
Qingkai Shi,
Guanhong Tao,
Le Yu,
Zhuo Zhang,
Xiangyu Zhang
Abstract:
Binary similarity analysis determines if two binary executables are from the same source program. Existing techniques leverage static and dynamic program features and may utilize advanced Deep Learning techniques. Although they have demonstrated great potential, the community believes that a more effective representation of program semantics can further improve similarity analysis. In this paper,…
▽ More
Binary similarity analysis determines if two binary executables are from the same source program. Existing techniques leverage static and dynamic program features and may utilize advanced Deep Learning techniques. Although they have demonstrated great potential, the community believes that a more effective representation of program semantics can further improve similarity analysis. In this paper, we propose a new method to represent binary program semantics. It is based on a novel probabilistic execution engine that can effectively sample the input space and the program path space of subject binaries. More importantly, it ensures that the collected samples are comparable across binaries, addressing the substantial variations of input specifications. Our evaluation on 9 real-world projects with 35k functions, and comparison with 6 state-of-the-art techniques show that PEM can achieve a precision of 96% with common settings, outperforming the baselines by 10-20%.
△ Less
Submitted 29 August, 2023; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Reinforcement Learning-assisted Evolutionary Algorithm: A Survey and Research Opportunities
Authors:
Yanjie Song,
Yutong Wu,
Yangyang Guo,
Ran Yan,
P. N. Suganthan,
Yue Zhang,
Witold Pedrycz,
Swagatam Das,
Rammohan Mallipeddi,
Oladayo Solomon Ajani. Qiang Feng
Abstract:
Evolutionary algorithms (EA), a class of stochastic search methods based on the principles of natural evolution, have received widespread acclaim for their exceptional performance in various real-world optimization problems. While researchers worldwide have proposed a wide variety of EAs, certain limitations remain, such as slow convergence speed and poor generalization capabilities. Consequently,…
▽ More
Evolutionary algorithms (EA), a class of stochastic search methods based on the principles of natural evolution, have received widespread acclaim for their exceptional performance in various real-world optimization problems. While researchers worldwide have proposed a wide variety of EAs, certain limitations remain, such as slow convergence speed and poor generalization capabilities. Consequently, numerous scholars actively explore improvements to algorithmic structures, operators, search patterns, etc., to enhance their optimization performance. Reinforcement learning (RL) integrated as a component in the EA framework has demonstrated superior performance in recent years. This paper presents a comprehensive survey on integrating reinforcement learning into the evolutionary algorithm, referred to as reinforcement learning-assisted evolutionary algorithm (RL-EA). We begin with the conceptual outlines of reinforcement learning and the evolutionary algorithm. We then provide a taxonomy of RL-EA. Subsequently, we discuss the RL-EA integration method, the RL-assisted strategy adopted by RL-EA, and its applications according to the existing literature. The RL-assisted procedure is divided according to the implemented functions including solution generation, learnable objective function, algorithm/operator/sub-population selection, parameter adaptation, and other strategies. Additionally, different attribute settings of RL in RL-EA are discussed. In the applications of RL-EA section, we also demonstrate the excellent performance of RL-EA on several benchmarks and a range of public datasets to facilitate a quick comparative study. Finally, we analyze potential directions for future research.
△ Less
Submitted 27 January, 2024; v1 submitted 25 August, 2023;
originally announced August 2023.
-
From Chatter to Matter: Addressing Critical Steps of Emotion Recognition Learning in Task-oriented Dialogue
Authors:
Shutong Feng,
Nurul Lubis,
Benjamin Ruppik,
Christian Geishauser,
Michael Heck,
Hsien-chin Lin,
Carel van Niekerk,
Renato Vukovic,
Milica Gašić
Abstract:
Emotion recognition in conversations (ERC) is a crucial task for building human-like conversational agents. While substantial efforts have been devoted to ERC for chit-chat dialogues, the task-oriented counterpart is largely left unattended. Directly applying chit-chat ERC models to task-oriented dialogues (ToDs) results in suboptimal performance as these models overlook key features such as the c…
▽ More
Emotion recognition in conversations (ERC) is a crucial task for building human-like conversational agents. While substantial efforts have been devoted to ERC for chit-chat dialogues, the task-oriented counterpart is largely left unattended. Directly applying chit-chat ERC models to task-oriented dialogues (ToDs) results in suboptimal performance as these models overlook key features such as the correlation between emotions and task completion in ToDs. In this paper, we propose a framework that turns a chit-chat ERC model into a task-oriented one, addressing three critical aspects: data, features and objective. First, we devise two ways of augmenting rare emotions to improve ERC performance. Second, we use dialogue states as auxiliary features to incorporate key information from the goal of the user. Lastly, we leverage a multi-aspect emotion definition in ToDs to devise a multi-task learning objective and a novel emotion-distance weighted loss function. Our framework yields significant improvements for a range of chit-chat ERC models on EmoWOZ, a large-scale dataset for user emotion in ToDs. We further investigate the generalisability of the best resulting model to predict user satisfaction in different ToD datasets. A comparison with supervised baselines shows a strong zero-shot capability, highlighting the potential usage of our framework in wider scenarios.
△ Less
Submitted 24 August, 2023;
originally announced August 2023.
-
Quantized distributed Nash equilibrium seeking under DoS attacks: A quantized consensus based approach
Authors:
Shuai Feng,
Maojiao Ye,
Lihua Xie,
Shengyuan Xu
Abstract:
This paper studies distributed Nash equilibrium (NE) seeking under Denial-of-Service (DoS) attacks and quantization. The players can only exchange information with their own direct neighbors. The transmitted information is subject to quantization and packet losses induced by malicious DoS attacks. We propose a quantized distributed NE seeking strategy based on the approach of dynamic quantized con…
▽ More
This paper studies distributed Nash equilibrium (NE) seeking under Denial-of-Service (DoS) attacks and quantization. The players can only exchange information with their own direct neighbors. The transmitted information is subject to quantization and packet losses induced by malicious DoS attacks. We propose a quantized distributed NE seeking strategy based on the approach of dynamic quantized consensus. To solve the quantizer saturation problem caused by DoS attacks, the quantization mechanism is equipped to have zooming-in and holding capabilities, in which the holding capability is consistent with the results in quantized consensus under DoS. A sufficient condition on the number of quantizer levels is provided, under which the quantizers are free from saturation under DoS attacks. The proposed distributed quantized NE seeking strategy is shown to have the so-called maximum resilience to DoS attacks. Namely, if the bound characterizing the maximum resilience is violated, an attacker can deny all the transmissions and hence distributed NE seeking is impossible.
△ Less
Submitted 24 August, 2023;
originally announced August 2023.
-
Convergence guarantee for consistency models
Authors:
Junlong Lyu,
Zhitang Chen,
Shoubo Feng
Abstract:
We provide the first convergence guarantees for the Consistency Models (CMs), a newly emerging type of one-step generative models that can generate comparable samples to those generated by Diffusion Models. Our main result is that, under the basic assumptions on score-matching errors, consistency errors and smoothness of the data distribution, CMs can efficiently sample from any realistic data dis…
▽ More
We provide the first convergence guarantees for the Consistency Models (CMs), a newly emerging type of one-step generative models that can generate comparable samples to those generated by Diffusion Models. Our main result is that, under the basic assumptions on score-matching errors, consistency errors and smoothness of the data distribution, CMs can efficiently sample from any realistic data distribution in one step with small $W_2$ error. Our results (1) hold for $L^2$-accurate score and consistency assumption (rather than $L^\infty$-accurate); (2) do note require strong assumptions on the data distribution such as log-Sobelev inequality; (3) scale polynomially in all parameters; and (4) match the state-of-the-art convergence guarantee for score-based generative models (SGMs). We also provide the result that the Multistep Consistency Sampling procedure can further reduce the error comparing to one step sampling, which support the original statement of "Consistency Models, Yang Song 2023". Our result further imply a TV error guarantee when take some Langevin-based modifications to the output distributions.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
MetaGCD: Learning to Continually Learn in Generalized Category Discovery
Authors:
Yanan Wu,
Zhixiang Chi,
Yang Wang,
Songhe Feng
Abstract:
In this paper, we consider a real-world scenario where a model that is trained on pre-defined classes continually encounters unlabeled data that contains both known and novel classes. The goal is to continually discover novel classes while maintaining the performance in known classes. We name the setting Continual Generalized Category Discovery (C-GCD). Existing methods for novel class discovery c…
▽ More
In this paper, we consider a real-world scenario where a model that is trained on pre-defined classes continually encounters unlabeled data that contains both known and novel classes. The goal is to continually discover novel classes while maintaining the performance in known classes. We name the setting Continual Generalized Category Discovery (C-GCD). Existing methods for novel class discovery cannot directly handle the C-GCD setting due to some unrealistic assumptions, such as the unlabeled data only containing novel classes. Furthermore, they fail to discover novel classes in a continual fashion. In this work, we lift all these assumptions and propose an approach, called MetaGCD, to learn how to incrementally discover with less forgetting. Our proposed method uses a meta-learning framework and leverages the offline labeled data to simulate the testing incremental learning process. A meta-objective is defined to revolve around two conflicting learning objectives to achieve novel class discovery without forgetting. Furthermore, a soft neighborhood-based contrastive network is proposed to discriminate uncorrelated images while attracting correlated images. We build strong baselines and conduct extensive experiments on three widely used benchmarks to demonstrate the superiority of our method.
△ Less
Submitted 17 October, 2023; v1 submitted 21 August, 2023;
originally announced August 2023.
-
DocPrompt: Large-scale continue pretrain for zero-shot and few-shot document question answering
Authors:
Si** Wu,
Dan Zhang,
Teng Hu,
Shikun Feng
Abstract:
In this paper, we propose Docprompt for document question answering tasks with powerful zero-shot and few-shot performance. We proposed a novel weakly supervised data generation method, a novel multl-stage training method and a novel understanding model \& generation model ensemble method. We achieved state-of-the-art performance on 4 document question answering tasks. This method greatly improves…
▽ More
In this paper, we propose Docprompt for document question answering tasks with powerful zero-shot and few-shot performance. We proposed a novel weakly supervised data generation method, a novel multl-stage training method and a novel understanding model \& generation model ensemble method. We achieved state-of-the-art performance on 4 document question answering tasks. This method greatly improves the delivery efficiency and model performance of document question answering customer projects, reducing annotation costs and labor costs. Our demo can be found at https://huggingface.co/spaces/PaddlePaddle/ERNIE-Layout.
△ Less
Submitted 31 August, 2023; v1 submitted 21 August, 2023;
originally announced August 2023.
-
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games
Authors:
Songtao Feng,
Ming Yin,
Yu-Xiang Wang,
**g Yang,
Yingbin Liang
Abstract:
The problem of two-player zero-sum Markov games has recently attracted increasing interests in theoretical studies of multi-agent reinforcement learning (RL). In particular, for finite-horizon episodic Markov decision processes (MDPs), it has been shown that model-based algorithms can find an $ε$-optimal Nash Equilibrium (NE) with the sample complexity of $O(H^3SAB/ε^2)$, which is optimal in the d…
▽ More
The problem of two-player zero-sum Markov games has recently attracted increasing interests in theoretical studies of multi-agent reinforcement learning (RL). In particular, for finite-horizon episodic Markov decision processes (MDPs), it has been shown that model-based algorithms can find an $ε$-optimal Nash Equilibrium (NE) with the sample complexity of $O(H^3SAB/ε^2)$, which is optimal in the dependence of the horizon $H$ and the number of states $S$ (where $A$ and $B$ denote the number of actions of the two players, respectively). However, none of the existing model-free algorithms can achieve such an optimality. In this work, we propose a model-free stage-based Q-learning algorithm and show that it achieves the same sample complexity as the best model-based algorithm, and hence for the first time demonstrate that model-free algorithms can enjoy the same optimality in the $H$ dependence as model-based algorithms. The main improvement of the dependency on $H$ arises by leveraging the popular variance reduction technique based on the reference-advantage decomposition previously used only for single-agent RL. However, such a technique relies on a critical monotonicity property of the value function, which does not hold in Markov games due to the update of the policy via the coarse correlated equilibrium (CCE) oracle. Thus, to extend such a technique to Markov games, our algorithm features a key novel design of updating the reference value functions as the pair of optimistic and pessimistic value functions whose value difference is the smallest in the history in order to achieve the desired improvement in the sample efficiency.
△ Less
Submitted 5 June, 2024; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Dimensional reduction of Kitaev spin liquid at quantum criticality
Authors:
Shi Feng,
Adhip Agarwala,
Nandini Trivedi
Abstract:
We investigate the fate of the Kitaev spin liquid (KSL) under the influence of an external magnetic field $h$ in the [001] direction and upon tuning bond anisotropy of the Kitaev coupling $K_z$ kee** $K_x = K_y = K$. Guided by density matrix renormalization group, exact diagonalization, and with insights from parton mean field theory, we uncover a field-induced gapless-to-gapless Lifshitz transi…
▽ More
We investigate the fate of the Kitaev spin liquid (KSL) under the influence of an external magnetic field $h$ in the [001] direction and upon tuning bond anisotropy of the Kitaev coupling $K_z$ kee** $K_x = K_y = K$. Guided by density matrix renormalization group, exact diagonalization, and with insights from parton mean field theory, we uncover a field-induced gapless-to-gapless Lifshitz transition from the nodal KSL to an intermediate gapless phase. The intermediate phase sandwiched between $h_{c1}$ and $h_{c2}$, which persists for a wide range of anisotropy $K_z/K > 0$, is composed of weakly coupled one-dimensional quantum critical chains. This intermediate phase is a dimensional crossover which asymptotically leads to the one-dimensional quantum Ising criticality characterized by the (1+1)D conformal field theory as the field reaches the phase transition at $h_{c2}$. Beyond $h_{c2}$ the system enters a partially polarized phase describable as effectively decoupled bosonic chains in which spin waves propagate along the one-dimensional zigzag direction. Our findings provide a comprehensive phase diagram and offer insights into the unusual physics of dimensional reduction generated by a uniform magnetic field in an otherwise two-dimensional quantum spin liquid.
△ Less
Submitted 18 March, 2024; v1 submitted 15 August, 2023;
originally announced August 2023.
-
Towards Efficient Record and Replay: A Case Study in WeChat
Authors:
Sidong Feng,
Haochuan Lu,
Ting Xiong,
Yuetang Deng,
Chunyang Chen
Abstract:
WeChat, a widely-used messenger app boasting over 1 billion monthly active users, requires effective app quality assurance for its complex features. Record-and-replay tools are crucial in achieving this goal. Despite the extensive development of these tools, the impact of waiting time between replay events has been largely overlooked. On one hand, a long waiting time for executing replay events on…
▽ More
WeChat, a widely-used messenger app boasting over 1 billion monthly active users, requires effective app quality assurance for its complex features. Record-and-replay tools are crucial in achieving this goal. Despite the extensive development of these tools, the impact of waiting time between replay events has been largely overlooked. On one hand, a long waiting time for executing replay events on fully-rendered GUIs slows down the process. On the other hand, a short waiting time can lead to events executing on partially-rendered GUIs, negatively affecting replay effectiveness. An optimal waiting time should strike a balance between effectiveness and efficiency. We introduce WeReplay, a lightweight image-based approach that dynamically adjusts inter-event time based on the GUI rendering state. Given the real-time streaming on the GUI, WeReplay employs a deep learning model to infer the rendering state and synchronize with the replaying tool, scheduling the next event when the GUI is fully rendered. Our evaluation shows that our model achieves 92.1% precision and 93.3% recall in discerning GUI rendering states in the WeChat app. Through assessing the performance in replaying 23 common WeChat usage scenarios, WeReplay successfully replays all scenarios on the same and different devices more efficiently than the state-of-the-practice baselines.
△ Less
Submitted 25 August, 2023; v1 submitted 12 August, 2023;
originally announced August 2023.
-
Unveiling the Tricks: Automated Detection of Dark Patterns in Mobile Applications
Authors:
Jieshan Chen,
Jiamou Sun,
Sidong Feng,
Zhenchang Xing,
Qinghua Lu,
Xiwei Xu,
Chunyang Chen
Abstract:
Mobile apps bring us many conveniences, such as online shop** and communication, but some use malicious designs called dark patterns to trick users into doing things that are not in their best interest. Many works have been done to summarize the taxonomy of these patterns and some have tried to mitigate the problems through various techniques. However, these techniques are either time-consuming,…
▽ More
Mobile apps bring us many conveniences, such as online shop** and communication, but some use malicious designs called dark patterns to trick users into doing things that are not in their best interest. Many works have been done to summarize the taxonomy of these patterns and some have tried to mitigate the problems through various techniques. However, these techniques are either time-consuming, not generalisable or limited to specific patterns. To address these issues, we propose UIGuard, a knowledge-driven system that utilizes computer vision and natural language pattern matching to automatically detect a wide range of dark patterns in mobile UIs. Our system relieves the need for manually creating rules for each new UI/app and covers more types with superior performance. In detail, we integrated existing taxonomies into a consistent one, conducted a characteristic analysis and distilled knowledge from real-world examples and the taxonomy. Our UIGuard consists of two components, Property Extraction and Knowledge-Driven Dark Pattern Checker. We collected the first dark pattern dataset, which contains 4,999 benign UIs and 1,353 malicious UIs of 1,660 instances spanning 1,023 mobile apps. Our system achieves a superior performance in detecting dark patterns (micro averages: 0.82 in precision, 0.77 in recall, 0.79 in F1 score). A user study involving 58 participants further shows that \tool{} significantly increases users' knowledge of dark patterns.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
Video2Action: Reducing Human Interactions in Action Annotation of App Tutorial Videos
Authors:
Sidong Feng,
Chunyang Chen,
Zhenchang Xing
Abstract:
Tutorial videos of mobile apps have become a popular and compelling way for users to learn unfamiliar app features. To make the video accessible to the users, video creators always need to annotate the actions in the video, including what actions are performed and where to tap. However, this process can be time-consuming and labor-intensive. In this paper, we introduce a lightweight approach Video…
▽ More
Tutorial videos of mobile apps have become a popular and compelling way for users to learn unfamiliar app features. To make the video accessible to the users, video creators always need to annotate the actions in the video, including what actions are performed and where to tap. However, this process can be time-consuming and labor-intensive. In this paper, we introduce a lightweight approach Video2Action, to automatically generate the action scenes and predict the action locations from the video by using image-processing and deep-learning methods. The automated experiments demonstrate the good performance of Video2Action in acquiring actions from the videos, and a user study shows the usefulness of our generated action cues in assisting video creators with action annotation.
△ Less
Submitted 6 August, 2023;
originally announced August 2023.
-
Universal approach to deterministic spatial search via alternating quantum walks
Authors:
Qingwen Wang,
Ying Jiang,
Shiguang Feng,
Lvzhou Li
Abstract:
Spatial search is an important problem in quantum computation, which aims to find a marked vertex on a graph. We propose a novel approach for designing deterministic quantum search algorithms on a variety of graphs via alternating quantum walks. Our approach is universal because it does not require an instance-specific analysis for different graphs. We highlight the flexibility of our approach by…
▽ More
Spatial search is an important problem in quantum computation, which aims to find a marked vertex on a graph. We propose a novel approach for designing deterministic quantum search algorithms on a variety of graphs via alternating quantum walks. Our approach is universal because it does not require an instance-specific analysis for different graphs. We highlight the flexibility of our approach by proving that for Johnson graphs, rook graphs, complete-square graphs and complete bipartite graphs, our quantum algorithms can find the marked vertex with $100\%$ success probability and achieve quadratic speedups over classical algorithms. This not only gives an alternative succinct way to prove the existing results, but also leads to new interesting findings on more general graphs.
△ Less
Submitted 23 August, 2023; v1 submitted 30 July, 2023;
originally announced July 2023.
-
Non-Equilibrium Nature of Fracture Determines the Crack Paths
Authors:
Pengjie Shi,
Shizhe Feng,
Zhi** Xu
Abstract:
A high-fidelity neural network-based force field, NN-F$^{3}$, is developed to cover the strain states up to material failure and the non-equilibrium, intermediate nature of fracture. Simulations of fracture in 2D crystals using NN-F$^{3}$ reveal spatial complexities from lattice-scale kinks to sample-scale patterns. We find that the fracture resistance cannot be quantified by the energy densities…
▽ More
A high-fidelity neural network-based force field, NN-F$^{3}$, is developed to cover the strain states up to material failure and the non-equilibrium, intermediate nature of fracture. Simulations of fracture in 2D crystals using NN-F$^{3}$ reveal spatial complexities from lattice-scale kinks to sample-scale patterns. We find that the fracture resistance cannot be quantified by the energy densities of relaxed edges as in the literature. Instead, the fracture patterns, critical stress intensity factors at the kinks, and energy densities of edges in the intermediate, unrelaxed states offer reasonable measures for the fracture toughness and its anisotropy.
△ Less
Submitted 30 July, 2023;
originally announced July 2023.
-
RoCar: A Relationship Network-based Evaluation Method to Large Language Models
Authors:
Ming Wang,
Wenfang Wu,
Chongyun Gao,
Daling Wang,
Shi Feng,
Yifei Zhang
Abstract:
Large language models (LLMs) have received increasing attention. However, due to the complexity of its capabilities, how to rationally evaluate the capabilities of LLMs is still a task to be solved. We propose the RoCar method, which utilizes the defined basic schemas to randomly construct a task graph and generates natural language evaluation tasks based on the task graph to evaluate the reasonin…
▽ More
Large language models (LLMs) have received increasing attention. However, due to the complexity of its capabilities, how to rationally evaluate the capabilities of LLMs is still a task to be solved. We propose the RoCar method, which utilizes the defined basic schemas to randomly construct a task graph and generates natural language evaluation tasks based on the task graph to evaluate the reasoning and memory abilities of LLMs respectively. Due to the very large randomness of the task construction process, it is possible to ensure that none of the LLMs to be tested has directly learned the evaluation tasks, guaranteeing the fairness of the evaluation method.
△ Less
Submitted 29 July, 2023;
originally announced July 2023.
-
An Empirical Study of Large-Scale Data-Driven Full Waveform Inversion
Authors:
Peng **,
Yinan Feng,
Shihang Feng,
Hanchen Wang,
Yinpeng Chen,
Benjamin Consolvo,
Zicheng Liu,
Youzuo Lin
Abstract:
This paper investigates the impact of big data on deep learning models to help solve the full waveform inversion (FWI) problem. While it is well known that big data can boost the performance of deep learning models in many tasks, its effectiveness has not been validated for FWI. To address this gap, we present an empirical study that investigates how deep learning models in FWI behave when trained…
▽ More
This paper investigates the impact of big data on deep learning models to help solve the full waveform inversion (FWI) problem. While it is well known that big data can boost the performance of deep learning models in many tasks, its effectiveness has not been validated for FWI. To address this gap, we present an empirical study that investigates how deep learning models in FWI behave when trained on OpenFWI, a collection of large-scale, multi-structural, synthetic datasets published recently. In particular, we train and evaluate the FWI models on a combination of 10 2D subsets in OpenFWI that contain 470K pairs of seismic data and velocity maps in total. Our experiments demonstrate that training on the combined dataset yields an average improvement of 13.03% in MAE, 7.19% in MSE and 1.87% in SSIM compared to each split dataset, and an average improvement of 28.60%, 21.55% and 8.22% in the leave-one-out generalization test. We further demonstrate that model capacity needs to scale in accordance with data size for optimal improvement, where our largest model yields an average improvement of 20.06%, 13.39% and 0.72% compared to the smallest one.
△ Less
Submitted 24 April, 2024; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Shear viscosity coefficient of magnetized QCD medium with anomalous magnetic moments near chiral phase transition
Authors:
Yi-Wei Qiu,
Sheng-Qin Feng,
Xue-Qiang Zhu
Abstract:
We study the properties of the shear viscosity coefficient of quark matter near the chiral phase transition at finite temperature and chemical potential, and the kinds of high temperature, high density and strong magnetic field background. The strong magnetic field induces anisotropy, that is, the quantization of Landau energy levels in phase space. If the magnetic field is strong enough, it will…
▽ More
We study the properties of the shear viscosity coefficient of quark matter near the chiral phase transition at finite temperature and chemical potential, and the kinds of high temperature, high density and strong magnetic field background. The strong magnetic field induces anisotropy, that is, the quantization of Landau energy levels in phase space. If the magnetic field is strong enough, it will interfere with significant QCD phenomena, such as the generation of dynamic quark mass, which may affect the transport properties of quark matter. The inclusion of the anomalous magnetic moments of the quarks at finite density into the Nambu-Jona-Lasinio model gives rise to additional spin polarization magnetic effects. It is found that both the ratio $η/s$ of shear viscosity coefficient to entropy and the collision relaxation time $τ$ show similar trend with temperature, both of which reach minima around the critical temperature. The shear viscosity coefficient of the dissipative fluid system can be decomposed into five different components as the strong magnetic field exists. The influences of the order of chiral phase transition and the critical end point on dissipative phenomena in such a magnetized medium are quantitatively investigated. It is found that $η_{1}$, $η_{2}$, $η_{3}$, and $η_{4}$ all increase with temperature. For first-order phase transitions, $η_{1}$, $η_{2}$, $η_{3}$, and $η_{4}$ exhibit discontinuous characteristics.
△ Less
Submitted 28 December, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Effects of Coronal Magnetic Field Configuration on Particle Acceleration and Release during the Ground Level Enhancement Events in Solar Cycle 24
Authors:
Wenlong Liu,
Xiangliang Kong,
Fan Guo,
Lulu Zhao,
Shiwei Feng,
Feiyu Yu,
Zelong Jiang,
Yao Chen,
Joe Giacalone
Abstract:
Ground level enhancements (GLEs) are extreme solar energetic particle (SEP) events that are of particular importance in space weather. In solar cycle 24, two GLEs were recorded on 2012 May 17 (GLE 71) and 2017 September 10 (GLE 72), respectively, by a range of advanced modern instruments. Here we conduct a comparative analysis of the two events by focusing on the effects of large-scale magnetic fi…
▽ More
Ground level enhancements (GLEs) are extreme solar energetic particle (SEP) events that are of particular importance in space weather. In solar cycle 24, two GLEs were recorded on 2012 May 17 (GLE 71) and 2017 September 10 (GLE 72), respectively, by a range of advanced modern instruments. Here we conduct a comparative analysis of the two events by focusing on the effects of large-scale magnetic field configuration near active regions on particle acceleration and release. Although the active regions both located near the western limb, temporal variations of SEP intensities and energy spectra measured in-situ display different behaviors at early stages. By combining a potential field model, we find the CME in GLE 71 originated below the streamer belt, while in GLE 72 near the edge of the streamer belt. We reconstruct the CME shock fronts with an ellipsoid model based on nearly simultaneous coronagraph images from multi-viewpoints, and further derive the 3D shock geometry at the GLE onset. The highest-energy particles are primarily accelerated in the shock-streamer interaction regions, i.e., likely at the nose of the shock in GLE 71 and the eastern flank in GLE 72, due to quasi-perpendicular shock geometry and confinement of closed fields. Subsequently, they are released to the field lines connecting to near-Earth spacecraft when the shocks move through the streamer cusp region. This suggests that magnetic structures in the corona, especially shock-streamer interactions, may have played an important role in the acceleration and release of the highest-energy particles in the two events.
△ Less
Submitted 22 July, 2023;
originally announced July 2023.
-
Co-Design with Myself: A Brain-Computer Interface Design Tool that Predicts Live Emotion to Enhance Metacognitive Monitoring of Designers
Authors:
Qi Yang,
Shuo Feng,
Tianlin Zhao,
Saleh Kalantari
Abstract:
Intuition, metacognition, and subjective uncertainty interact in complex ways to shape the creative design process. Design intuition, a designer's innate ability to generate creative ideas and solutions based on implicit knowledge and experience, is often evaluated and refined through metacognitive monitoring. This self-awareness and management of cognitive processes can be triggered by subjective…
▽ More
Intuition, metacognition, and subjective uncertainty interact in complex ways to shape the creative design process. Design intuition, a designer's innate ability to generate creative ideas and solutions based on implicit knowledge and experience, is often evaluated and refined through metacognitive monitoring. This self-awareness and management of cognitive processes can be triggered by subjective uncertainty, reflecting the designer's self-assessed confidence in their decisions. Despite their significance, few creativity support tools have targeted the enhancement of these intertwined components using biofeedback, particularly the affect associated with these processes. In this study, we introduce "Multi-Self," a BCI-VR design tool designed to amplify metacognitive monitoring in architectural design. Multi-Self evaluates designers' affect (valence and arousal) to their work, providing real-time, visual biofeedback. A proof-of-concept pilot study with 24 participants assessed its feasibility. While feedback accuracy responses were mixed, most participants found the tool useful, reporting that it sparked metacognitive monitoring, encouraged exploration of the design space, and helped modulate subjective uncertainty.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Quantum Gravity Induced Entanglement of Masses With Extra Dimensions
Authors:
Shuai Feng,
Bao-Min Gu,
Fu-Wen Shu
Abstract:
It is believed that gravity can be considered as a quantum coherent mediator. In this study, we propose a plan to test the existence of extra dimensions using the Quantum Gravity Induced Entanglement of Masses (QGEM) experiment. This experiment involves two freely falling test masses passing through a Stern-Gerlach-like device. We investigate the entanglement witness between these masses within th…
▽ More
It is believed that gravity can be considered as a quantum coherent mediator. In this study, we propose a plan to test the existence of extra dimensions using the Quantum Gravity Induced Entanglement of Masses (QGEM) experiment. This experiment involves two freely falling test masses passing through a Stern-Gerlach-like device. We investigate the entanglement witness between these masses within the framework of the Randall-Sundrum II model (RS-II). Our findings indicate that the system reaches entanglement more rapidly in the presence of extra dimensions, particularly when the radius of the extra dimension is large.
△ Less
Submitted 17 March, 2024; v1 submitted 21 July, 2023;
originally announced July 2023.
-
Fractional Denoising for 3D Molecular Pre-training
Authors:
Shikun Feng,
Yuyan Ni,
Yanyan Lan,
Zhi-Ming Ma,
Wei-Ying Ma
Abstract:
Coordinate denoising is a promising 3D molecular pre-training method, which has achieved remarkable performance in various downstream drug discovery tasks. Theoretically, the objective is equivalent to learning the force field, which is revealed helpful for downstream tasks. Nevertheless, there are two challenges for coordinate denoising to learn an effective force field, i.e. low coverage samples…
▽ More
Coordinate denoising is a promising 3D molecular pre-training method, which has achieved remarkable performance in various downstream drug discovery tasks. Theoretically, the objective is equivalent to learning the force field, which is revealed helpful for downstream tasks. Nevertheless, there are two challenges for coordinate denoising to learn an effective force field, i.e. low coverage samples and isotropic force field. The underlying reason is that molecular distributions assumed by existing denoising methods fail to capture the anisotropic characteristic of molecules. To tackle these challenges, we propose a novel hybrid noise strategy, including noises on both dihedral angel and coordinate. However, denoising such hybrid noise in a traditional way is no more equivalent to learning the force field. Through theoretical deductions, we find that the problem is caused by the dependency of the input conformation for covariance. To this end, we propose to decouple the two types of noise and design a novel fractional denoising method (Frad), which only denoises the latter coordinate part. In this way, Frad enjoys both the merits of sampling more low-energy structures and the force field equivalence. Extensive experiments show the effectiveness of Frad in molecular representation, with a new state-of-the-art on 9 out of 12 tasks of QM9 and on 7 out of 8 targets of MD17.
△ Less
Submitted 26 February, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification
Authors:
Tengfei Liang,
Yi **,
Wu Liu,
Tao Wang,
Songhe Feng,
Yidong Li
Abstract:
Visible-Infrared person Re-IDentification (VI-ReID) is a challenging cross-modality image retrieval task that aims to match pedestrians' images across visible and infrared cameras. To solve the modality gap, existing mainstream methods adopt a learning paradigm converting the image retrieval task into an image classification task with cross-entropy loss and auxiliary metric learning losses. These…
▽ More
Visible-Infrared person Re-IDentification (VI-ReID) is a challenging cross-modality image retrieval task that aims to match pedestrians' images across visible and infrared cameras. To solve the modality gap, existing mainstream methods adopt a learning paradigm converting the image retrieval task into an image classification task with cross-entropy loss and auxiliary metric learning losses. These losses follow the strategy of adjusting the distribution of extracted embeddings to reduce the intra-class distance and increase the inter-class distance. However, such objectives do not precisely correspond to the final test setting of the retrieval task, resulting in a new gap at the optimization level. By rethinking these keys of VI-ReID, we propose a simple and effective method, the Multi-level Cross-modality Joint Alignment (MCJA), bridging both modality and objective-level gap. For the former, we design the Modality Alignment Augmentation, which consists of three novel strategies, the weighted grayscale, cross-channel cutmix, and spectrum jitter augmentation, effectively reducing modality discrepancy in the image space. For the latter, we introduce a new Cross-Modality Retrieval loss. It is the first work to constrain from the perspective of the ranking list, aligning with the goal of the testing stage. Moreover, based on the global feature only, our method exhibits good performance and can serve as a strong baseline method for the VI-ReID community.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Multimodal Molecular Pretraining via Modality Blending
Authors:
Qiying Yu,
Yudi Zhang,
Yuyan Ni,
Shikun Feng,
Yanyan Lan,
Hao Zhou,
**g**g Liu
Abstract:
Self-supervised learning has recently gained growing interest in molecular modeling for scientific tasks such as AI-assisted drug discovery. Current studies consider leveraging both 2D and 3D molecular structures for representation learning. However, relying on straightforward alignment strategies that treat each modality separately, these methods fail to exploit the intrinsic correlation between…
▽ More
Self-supervised learning has recently gained growing interest in molecular modeling for scientific tasks such as AI-assisted drug discovery. Current studies consider leveraging both 2D and 3D molecular structures for representation learning. However, relying on straightforward alignment strategies that treat each modality separately, these methods fail to exploit the intrinsic correlation between 2D and 3D representations that reflect the underlying structural characteristics of molecules, and only perform coarse-grained molecule-level alignment. To derive fine-grained alignment and promote structural molecule understanding, we introduce an atomic-relation level "blend-then-predict" self-supervised learning approach, MoleBLEND, which first blends atom relations represented by different modalities into one unified relation matrix for joint encoding, then recovers modality-specific information for 2D and 3D structures individually. By treating atom relationships as anchors, MoleBLEND organically aligns and integrates visually dissimilar 2D and 3D modalities of the same molecule at fine-grained atomic level, painting a more comprehensive depiction of each molecule. Extensive experiments show that MoleBLEND achieves state-of-the-art performance across major 2D/3D molecular benchmarks. We further provide theoretical insights from the perspective of mutual-information maximization, demonstrating that our method unifies contrastive, generative (cross-modality prediction) and mask-then-predict (single-modality prediction) objectives into one single cohesive framework.
△ Less
Submitted 8 October, 2023; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Microwave conductivity due to impurity scattering in cuprate superconductors
Authors:
Minghuan Zeng,
Xiang Li,
Yongjun Wang,
Shi** Feng
Abstract:
The microwave surface impedance measurements on cuprate superconductors provide the crucial information of the effect of the impurity scattering on the quasiparticle transport, however, the full understanding of the effect of the impurity scattering on the quasiparticle transport is still a challenging issue. Here based on the microscopic octet scattering model, the effect of the impurity scatteri…
▽ More
The microwave surface impedance measurements on cuprate superconductors provide the crucial information of the effect of the impurity scattering on the quasiparticle transport, however, the full understanding of the effect of the impurity scattering on the quasiparticle transport is still a challenging issue. Here based on the microscopic octet scattering model, the effect of the impurity scattering on the low-temperature microwave conductivity in cuprate superconductors is investigated in the self-consistent $T$-matrix approach. The impurity-dressed electron propagator obtained in the Fermi-arc-tip approximation of the quasiparticle excitations and scattering processes is employed to derive the electron current-current correlation function by taking into account the impurity-induced vertex correction. It is shown that the microwave conductivity spectrum is a non-Drude-like, with a sharp cusp-like peak extending to zero-energy and a high-energy tail falling slowly with energy. Moreover, the microwave conductivity decreases with the increase of the impurity concentration or with the increase of the strength of the impurity scattering potential. In a striking contrast to the dome-like shape of the do** dependence of the superconducting transition temperature, the microwave conductivity exhibits a reverse dome-like shape of the do** dependence. The theory also show that the highly unconventional features of the microwave conductivity are generated by both the strong electron correlation and impurity-scattering effects.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
Unleashing the Potential of Li-Metal Batteries A Breakthrough Ultra-High Room-Temperature Ionic Conductivity Composite Solid-State Electrolyte
Authors:
Xiong Xiong Liu,
Shengfa Feng,
Pengcheng Yuan,
Ya** Wang,
Long Pan,
ZhengMing Sun
Abstract:
The solid-state electrolyte is critical for achieving next-generation high energy density and high-safety batteries. Solid polymer electrolytes (SPEs) possess great potential for commercial application owing to their compatibility with the existing manufacturing systems. However, unsatisfactory room-temperature ionic conductivity severely limits its application. Herein, an ultra-high room-temperat…
▽ More
The solid-state electrolyte is critical for achieving next-generation high energy density and high-safety batteries. Solid polymer electrolytes (SPEs) possess great potential for commercial application owing to their compatibility with the existing manufacturing systems. However, unsatisfactory room-temperature ionic conductivity severely limits its application. Herein, an ultra-high room-temperature ionic conductivity composite solid-state electrolyte (CSE) is prepared by introducing an appropriate amount of SiO2 nanosphere to the PVDF-HFP matrix. By doing this, the polymer particles are divided and surrounded by SiO2. And the interface amount is maximized resulting in the high ionic conductivity of 1.35 mS cm-1 under room temperature. In addition, the CSE shows a wide electrochemical window of 4.95 V and a moderate Li+ transference number of 0.44. The CSE demonstrates good stability with Li anode, with Li symmetric cells that could cycle 1000 h at a current density of 0.2 mA cm-2. The full cell assembled with LiFePO4 (LFP) and Li metal displays a high reversible specific capacity of 157.8 mAh g-1 at 0.1C, and it could maintain 92.9% of initial capacity after 300 cycles at 3C. Moreover, the strategy is applied in solid-state sodium/potassium batteries and displays excellent performance.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
Optimal and Stable Multi-Layer Object Rearrangement on a Tabletop
Authors:
Andy Xu,
Kai Gao,
Si Wei Feng,
**g** Yu
Abstract:
Object rearrangement is a fundamental sub-task in accomplishing a great many physical tasks. As such, effectively executing rearrangement is an important skill for intelligent robots to master. In this study, we conduct the first algorithmic study on optimally solving the problem of Multi-layer Object Rearrangement on a Tabletop (MORT), in which one object may be relocated at a time, and an object…
▽ More
Object rearrangement is a fundamental sub-task in accomplishing a great many physical tasks. As such, effectively executing rearrangement is an important skill for intelligent robots to master. In this study, we conduct the first algorithmic study on optimally solving the problem of Multi-layer Object Rearrangement on a Tabletop (MORT), in which one object may be relocated at a time, and an object can only be moved if other objects do not block its top surface. In addition, any intermediate structure during the reconfiguration process must be physically stable, i.e., it should stand without external support. To tackle the dual challenges of untangling the dependencies between objects and ensuring structural stability, we develop an algorithm that interleaves the computation of the optimal rearrangement plan and structural stability checking. Using a carefully constructed integer linear programming (ILP) model, our algorithm, Stability-aware Integer Programming-based Planner (SIPP), readily scales to optimally solve complex rearrangement problems of 3D structures with over 60 building blocks, with solution quality significantly outperforming natural greedy best-first approaches.
Upon the publication of the manuscript, source code and data will be available at https://github.com/arc-l/mort/
△ Less
Submitted 30 June, 2023; v1 submitted 25 June, 2023;
originally announced June 2023.
-
$\mathbf{\mathbb{E}^{FWI}}$: Multi-parameter Benchmark Datasets for Elastic Full Waveform Inversion of Geophysical Properties
Authors:
Shihang Feng,
Hanchen Wang,
Chengyuan Deng,
Yinan Feng,
Yanhua Liu,
Min Zhu,
Peng **,
Yinpeng Chen,
Youzuo Lin
Abstract:
Elastic geophysical properties (such as P- and S-wave velocities) are of great importance to various subsurface applications like CO$_2$ sequestration and energy exploration (e.g., hydrogen and geothermal). Elastic full waveform inversion (FWI) is widely applied for characterizing reservoir properties. In this paper, we introduce $\mathbf{\mathbb{E}^{FWI}}$, a comprehensive benchmark dataset that…
▽ More
Elastic geophysical properties (such as P- and S-wave velocities) are of great importance to various subsurface applications like CO$_2$ sequestration and energy exploration (e.g., hydrogen and geothermal). Elastic full waveform inversion (FWI) is widely applied for characterizing reservoir properties. In this paper, we introduce $\mathbf{\mathbb{E}^{FWI}}$, a comprehensive benchmark dataset that is specifically designed for elastic FWI. $\mathbf{\mathbb{E}^{FWI}}$ encompasses 8 distinct datasets that cover diverse subsurface geologic structures (flat, curve, faults, etc). The benchmark results produced by three different deep learning methods are provided. In contrast to our previously presented dataset (pressure recordings) for acoustic FWI (referred to as OpenFWI), the seismic dataset in $\mathbf{\mathbb{E}^{FWI}}$ has both vertical and horizontal components. Moreover, the velocity maps in $\mathbf{\mathbb{E}^{FWI}}$ incorporate both P- and S-wave velocities. While the multicomponent data and the added S-wave velocity make the data more realistic, more challenges are introduced regarding the convergence and computational cost of the inversion. We conduct comprehensive numerical experiments to explore the relationship between P-wave and S-wave velocities in seismic data. The relation between P- and S-wave velocities provides crucial insights into the subsurface properties such as lithology, porosity, fluid content, etc. We anticipate that $\mathbf{\mathbb{E}^{FWI}}$ will facilitate future research on multiparameter inversions and stimulate endeavors in several critical research topics of carbon-zero and new energy exploration. All datasets, codes and relevant information can be accessed through our website at https://efwi-lanl.github.io/
△ Less
Submitted 7 September, 2023; v1 submitted 21 June, 2023;
originally announced June 2023.
-
Quantum and classical query complexities for determining connectedness of matroids
Authors:
Xiaowei Huang,
Shiguang Feng,
Lvzhou Li
Abstract:
Connectivity is a fundamental structural property of matroids, and has been studied algorithmically over 50 years. In 1974, Cunningham proposed a deterministic algorithm consuming $O(n^{2})$ queries to the independence oracle to determine whether a matroid is connected. Since then, no algorithm, not even a random one, has worked better. To the best of our knowledge, the classical query complexity…
▽ More
Connectivity is a fundamental structural property of matroids, and has been studied algorithmically over 50 years. In 1974, Cunningham proposed a deterministic algorithm consuming $O(n^{2})$ queries to the independence oracle to determine whether a matroid is connected. Since then, no algorithm, not even a random one, has worked better. To the best of our knowledge, the classical query complexity lower bound and the quantum complexity for this problem have not been considered. Thus, in this paper we are devoted to addressing these issues, and our contributions are threefold as follows: (i) First, we prove that the randomized query complexity of determining whether a matroid is connected is $Ω(n^2)$ and thus the algorithm proposed by Cunningham is optimal in classical computing. (ii) Second, we present a quantum algorithm with $O(n^{3/2})$ queries, which exhibits provable quantum speedups over classical ones. (iii) Third, we prove that any quantum algorithm requires $Ω(n)$ queries, which indicates that quantum algorithms can achieve at most a quadratic speedup over classical ones. Therefore, we have a relatively comprehensive understanding of the potential of quantum computing in determining the connectedness of matroids.\
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
HOSSnet: an Efficient Physics-Guided Neural Network for Simulating Crack Propagation
Authors:
Shengyu Chen,
Shihang Feng,
Yao Huang,
Zhou Lei,
Xiaowei Jia,
Youzuo Lin,
Estaben Rougier
Abstract:
Hybrid Optimization Software Suite (HOSS), which is a combined finite-discrete element method (FDEM), is one of the advanced approaches to simulating high-fidelity fracture and fragmentation processes but the application of pure HOSS simulation is computationally expensive. At the same time, machine learning methods, shown tremendous success in several scientific problems, are increasingly being c…
▽ More
Hybrid Optimization Software Suite (HOSS), which is a combined finite-discrete element method (FDEM), is one of the advanced approaches to simulating high-fidelity fracture and fragmentation processes but the application of pure HOSS simulation is computationally expensive. At the same time, machine learning methods, shown tremendous success in several scientific problems, are increasingly being considered promising alternatives to physics-based models in the scientific domains. Thus, our goal in this work is to build a new data-driven methodology to reconstruct the crack fracture accurately in the spatial and temporal fields. We leverage physical constraints to regularize the fracture propagation in the long-term reconstruction. In addition, we introduce perceptual loss and several extra pure machine learning optimization approaches to improve the reconstruction performance of fracture data further. We demonstrate the effectiveness of our proposed method through both extrapolation and interpolation experiments. The results confirm that our proposed method can reconstruct high-fidelity fracture data over space and time in terms of pixel-wise reconstruction error and structural similarity. Visual comparisons also show promising results in long-term
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Authors:
Wenwen Yu,
Chengquan Zhang,
Haoyu Cao,
Wei Hua,
Bohan Li,
Huang Chen,
Mingyu Liu,
Mingrui Chen,
Jianfeng Kuang,
Mengjun Cheng,
Yuning Du,
Shikun Feng,
Xiaoguang Hu,
Pengyuan Lyu,
Kun Yao,
Yuechen Yu,
Yuliang Liu,
Wanxiang Che,
Errui Ding,
Cheng-Lin Liu,
Jiebo Luo,
Shuicheng Yan,
Min Zhang,
Dimosthenis Karatzas,
Xing Sun
, et al. (2 additional authors not shown)
Abstract:
Structured text extraction is one of the most valuable and challenging application directions in the field of Document AI. However, the scenarios of past benchmarks are limited, and the corresponding evaluation protocols usually focus on the submodules of the structured text extraction scheme. In order to eliminate these problems, we organized the ICDAR 2023 competition on Structured text extracti…
▽ More
Structured text extraction is one of the most valuable and challenging application directions in the field of Document AI. However, the scenarios of past benchmarks are limited, and the corresponding evaluation protocols usually focus on the submodules of the structured text extraction scheme. In order to eliminate these problems, we organized the ICDAR 2023 competition on Structured text extraction from Visually-Rich Document images (SVRD). We set up two tracks for SVRD including Track 1: HUST-CELL and Track 2: Baidu-FEST, where HUST-CELL aims to evaluate the end-to-end performance of Complex Entity Linking and Labeling, and Baidu-FEST focuses on evaluating the performance and generalization of Zero-shot / Few-shot Structured Text extraction from an end-to-end perspective. Compared to the current document benchmarks, our two tracks of competition benchmark enriches the scenarios greatly and contains more than 50 types of visually-rich document images (mainly from the actual enterprise applications). The competition opened on 30th December, 2022 and closed on 24th March, 2023. There are 35 participants and 91 valid submissions received for Track 1, and 15 participants and 26 valid submissions received for Track 2. In this report we will presents the motivation, competition datasets, task definition, evaluation protocol, and submission summaries. According to the performance of the submissions, we believe there is still a large gap on the expected information extraction performance for complex and zero-shot scenarios. It is hoped that this competition will attract many researchers in the field of CV and NLP, and bring some new thoughts to the field of Document AI.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Machine learning reveals features of spinon Fermi surface
Authors:
Kevin Zhang,
Shi Feng,
Yuri D. Lensky,
Nandini Trivedi,
Eun-Ah Kim
Abstract:
With rapid progress in simulation of strongly interacting quantum Hamiltonians, the challenge in characterizing unknown phases becomes a bottleneck for scientific progress. We demonstrate that a Quantum-Classical hybrid approach (QuCl) of mining sampled projective snapshots with interpretable classical machine learning can unveil signatures of seemingly featureless quantum states. The Kitaev-Heise…
▽ More
With rapid progress in simulation of strongly interacting quantum Hamiltonians, the challenge in characterizing unknown phases becomes a bottleneck for scientific progress. We demonstrate that a Quantum-Classical hybrid approach (QuCl) of mining sampled projective snapshots with interpretable classical machine learning can unveil signatures of seemingly featureless quantum states. The Kitaev-Heisenberg model on a honeycomb lattice under external magnetic field presents an ideal system to test QuCl, where simulations have found an intermediate gapless phase (IGP) sandwiched between known phases, launching a debate over its elusive nature. We use the correlator convolutional neural network, trained on labeled projective snapshots, in conjunction with regularization path analysis to identify signatures of phases. We show that QuCl reproduces known features of established phases. Significantly, we also identify a signature of the IGP in the spin channel perpendicular to the field direction, which we interpret as a signature of Friedel oscillations of gapless spinons forming a Fermi surface. Our predictions can guide future experimental searches for spin liquids.
△ Less
Submitted 11 March, 2024; v1 submitted 5 June, 2023;
originally announced June 2023.
-
PolyVoice: Language Models for Speech to Speech Translation
Authors:
Qianqian Dong,
Zhiying Huang,
Qiao Tian,
Chen Xu,
Tom Ko,
Yunlong Zhao,
Siyuan Feng,
Tang Li,
Kexin Wang,
Xuxin Cheng,
Fengpeng Yue,
Ye Bai,
Xi Chen,
Lu Lu,
Zejun Ma,
Yu** Wang,
Mingxuan Wang,
Yuxuan Wang
Abstract:
We propose PolyVoice, a language model-based framework for speech-to-speech translation (S2ST) system. Our framework consists of two language models: a translation language model and a speech synthesis language model. We use discretized speech units, which are generated in a fully unsupervised way, and thus our framework can be used for unwritten languages. For the speech synthesis part, we adopt…
▽ More
We propose PolyVoice, a language model-based framework for speech-to-speech translation (S2ST) system. Our framework consists of two language models: a translation language model and a speech synthesis language model. We use discretized speech units, which are generated in a fully unsupervised way, and thus our framework can be used for unwritten languages. For the speech synthesis part, we adopt the existing VALL-E X approach and build a unit-based audio language model. This grants our framework the ability to preserve the voice characteristics and the speaking style of the original speech. We examine our system on Chinese $\rightarrow$ English and English $\rightarrow$ Spanish pairs. Experimental results show that our system can generate speech with high translation quality and audio quality. Speech samples are available at https://speechtranslation.github.io/polyvoice.
△ Less
Submitted 13 June, 2023; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Leveraging Generative Models to Recover Variable Names from Stripped Binary
Authors:
Xiangzhe Xu,
Zhuo Zhang,
Zian Su,
Ziyang Huang,
Shiwei Feng,
Yapeng Ye,
Nan Jiang,
Danning Xie,
Siyuan Cheng,
Lin Tan,
Xiangyu Zhang
Abstract:
Decompilation aims to recover the source code form of a binary executable. It has many security applications such as malware analysis, vulnerability detection and code hardening. A prominent challenge in decompilation is to recover variable names. We propose a novel technique that leverages the strengths of generative models while suppressing potential hallucinations and overcoming the input token…
▽ More
Decompilation aims to recover the source code form of a binary executable. It has many security applications such as malware analysis, vulnerability detection and code hardening. A prominent challenge in decompilation is to recover variable names. We propose a novel technique that leverages the strengths of generative models while suppressing potential hallucinations and overcoming the input token limitation. We build a prototype, GenNm, from a pre-trained generative model Code-Llama. We fine-tune GenNm on decompiled functions, and leverage program analysis to validate the results produced by the generative model. GenNm includes names from callers and callees while querying a function, providing rich contextual information within the model's input token limitation. Our results show that GenNm improves the state-of-the-art from 48.1% to 57.9% in the most challenging setup where a query function is not seen in the training dataset.
△ Less
Submitted 30 April, 2024; v1 submitted 4 June, 2023;
originally announced June 2023.
-
Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Authors:
Yiji Cheng,
Fei Yin,
Xiaoke Huang,
Xintong Yu,
Jiaxiang Liu,
Shikun Feng,
Yujiu Yang,
Yansong Tang
Abstract:
Text-to-3D is an emerging task that allows users to create 3D content with infinite possibilities. Existing works tackle the problem by optimizing a 3D representation with guidance from pre-trained diffusion models. An apparent drawback is that they need to optimize from scratch for each prompt, which is computationally expensive and often yields poor visual fidelity. In this paper, we propose Dre…
▽ More
Text-to-3D is an emerging task that allows users to create 3D content with infinite possibilities. Existing works tackle the problem by optimizing a 3D representation with guidance from pre-trained diffusion models. An apparent drawback is that they need to optimize from scratch for each prompt, which is computationally expensive and often yields poor visual fidelity. In this paper, we propose DreamPortrait, which aims to generate text-guided 3D-aware portraits in a single-forward pass for efficiency. To achieve this, we extend Score Distillation Sampling from datapoint to distribution formulation, which injects semantic prior into a 3D distribution. However, the direct extension will lead to the mode collapse problem since the objective only pursues semantic alignment. Hence, we propose to optimize a distribution with hierarchical condition adapters and GAN loss regularization. For better 3D modeling, we further design a 3D-aware gated cross-attention mechanism to explicitly let the model perceive the correspondence between the text and the 3D-aware space. These elaborated designs enable our model to generate portraits with robust multi-view semantic consistency, eliminating the need for optimization-based methods. Extensive experiments demonstrate our model's highly competitive performance and significant speed boost against existing methods.
△ Less
Submitted 3 June, 2023;
originally announced June 2023.
-
Prompting Is All You Need: Automated Android Bug Replay with Large Language Models
Authors:
Sidong Feng,
Chunyang Chen
Abstract:
Bug reports are vital for software maintenance that allow users to inform developers of the problems encountered while using the software. As such, researchers have committed considerable resources toward automating bug replay to expedite the process of software maintenance. Nonetheless, the success of current automated approaches is largely dictated by the characteristics and quality of bug repor…
▽ More
Bug reports are vital for software maintenance that allow users to inform developers of the problems encountered while using the software. As such, researchers have committed considerable resources toward automating bug replay to expedite the process of software maintenance. Nonetheless, the success of current automated approaches is largely dictated by the characteristics and quality of bug reports, as they are constrained by the limitations of manually-crafted patterns and pre-defined vocabulary lists. Inspired by the success of Large Language Models (LLMs) in natural language understanding, we propose AdbGPT, a new lightweight approach to automatically reproduce the bugs from bug reports through prompt engineering, without any training and hard-coding effort. AdbGPT leverages few-shot learning and chain-of-thought reasoning to elicit human knowledge and logical reasoning from LLMs to accomplish the bug replay in a manner similar to a developer. Our evaluations demonstrate the effectiveness and efficiency of our AdbGPT to reproduce 81.3% of bug reports in 253.6 seconds, outperforming the state-of-the-art baselines and ablation studies. We also conduct a small-scale user study to confirm the usefulness of AdbGPT in enhancing developers' bug replay capabilities.
△ Less
Submitted 8 May, 2024; v1 submitted 2 June, 2023;
originally announced June 2023.
-
EmoUS: Simulating User Emotions in Task-Oriented Dialogues
Authors:
Hsien-Chin Lin,
Shutong Feng,
Christian Geishauser,
Nurul Lubis,
Carel van Niekerk,
Michael Heck,
Benjamin Ruppik,
Renato Vukovic,
Milica Gašić
Abstract:
Existing user simulators (USs) for task-oriented dialogue systems only model user behaviour on semantic and natural language levels without considering the user persona and emotions. Optimising dialogue systems with generic user policies, which cannot model diverse user behaviour driven by different emotional states, may result in a high drop-off rate when deployed in the real world. Thus, we pres…
▽ More
Existing user simulators (USs) for task-oriented dialogue systems only model user behaviour on semantic and natural language levels without considering the user persona and emotions. Optimising dialogue systems with generic user policies, which cannot model diverse user behaviour driven by different emotional states, may result in a high drop-off rate when deployed in the real world. Thus, we present EmoUS, a user simulator that learns to simulate user emotions alongside user behaviour. EmoUS generates user emotions, semantic actions, and natural language responses based on the user goal, the dialogue history, and the user persona. By analysing what kind of system behaviour elicits what kind of user emotions, we show that EmoUS can be used as a probe to evaluate a variety of dialogue systems and in particular their effect on the user's emotional state. Develo** such methods is important in the age of large language model chat-bots and rising ethical concerns.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
ChatGPT for Zero-shot Dialogue State Tracking: A Solution or an Opportunity?
Authors:
Michael Heck,
Nurul Lubis,
Benjamin Ruppik,
Renato Vukovic,
Shutong Feng,
Christian Geishauser,
Hsien-Chin Lin,
Carel van Niekerk,
Milica Gašić
Abstract:
Recent research on dialogue state tracking (DST) focuses on methods that allow few- and zero-shot transfer to new domains or schemas. However, performance gains heavily depend on aggressive data augmentation and fine-tuning of ever larger language model based architectures. In contrast, general purpose language models, trained on large amounts of diverse data, hold the promise of solving any kind…
▽ More
Recent research on dialogue state tracking (DST) focuses on methods that allow few- and zero-shot transfer to new domains or schemas. However, performance gains heavily depend on aggressive data augmentation and fine-tuning of ever larger language model based architectures. In contrast, general purpose language models, trained on large amounts of diverse data, hold the promise of solving any kind of task without task-specific training. We present preliminary experimental results on the ChatGPT research preview, showing that ChatGPT achieves state-of-the-art performance in zero-shot DST. Despite our findings, we argue that properties inherent to general purpose models limit their ability to replace specialized systems. We further theorize that the in-context learning capabilities of such models will likely become powerful tools to support the development of dedicated and dynamic dialogue state trackers.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
Non-stationary Reinforcement Learning under General Function Approximation
Authors:
Songtao Feng,
Ming Yin,
Ruiquan Huang,
Yu-Xiang Wang,
**g Yang,
Yingbin Liang
Abstract:
General function approximation is a powerful tool to handle large state and action spaces in a broad range of reinforcement learning (RL) scenarios. However, theoretical understanding of non-stationary MDPs with general function approximation is still limited. In this paper, we make the first such an attempt. We first propose a new complexity metric called dynamic Bellman Eluder (DBE) dimension fo…
▽ More
General function approximation is a powerful tool to handle large state and action spaces in a broad range of reinforcement learning (RL) scenarios. However, theoretical understanding of non-stationary MDPs with general function approximation is still limited. In this paper, we make the first such an attempt. We first propose a new complexity metric called dynamic Bellman Eluder (DBE) dimension for non-stationary MDPs, which subsumes majority of existing tractable RL problems in static MDPs as well as non-stationary MDPs. Based on the proposed complexity metric, we propose a novel confidence-set based model-free algorithm called SW-OPEA, which features a sliding window mechanism and a new confidence set design for non-stationary MDPs. We then establish an upper bound on the dynamic regret for the proposed algorithm, and show that SW-OPEA is provably efficient as long as the variation budget is not significantly large. We further demonstrate via examples of non-stationary linear and tabular MDPs that our algorithm performs better in small variation budget scenario than the existing UCB-type algorithms. To the best of our knowledge, this is the first dynamic regret analysis in non-stationary MDPs with general function approximation.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
Dynamic quantized consensus under DoS attacks: Towards a tight zooming-out factor
Authors:
Shuai Feng,
Maopeng Ran,
Hideaki Ishii,
Shengyuan Xu
Abstract:
This paper deals with dynamic quantized consensus of dynamical agents in a general form under packet losses induced by Denial-of-Service (DoS) attacks. The communication channel has limited bandwidth and hence the transmitted signals over the network are subject to quantization. To deal with agent's output, an observer is implemented at each node. The state of the observer is quantized by a finite…
▽ More
This paper deals with dynamic quantized consensus of dynamical agents in a general form under packet losses induced by Denial-of-Service (DoS) attacks. The communication channel has limited bandwidth and hence the transmitted signals over the network are subject to quantization. To deal with agent's output, an observer is implemented at each node. The state of the observer is quantized by a finite-level quantizer and then transmitted over the network. To solve the problem of quantizer overflow under malicious packet losses, a zooming-in and out dynamic quantization mechanism is designed. By the new quantized controller proposed in the paper, the zooming-out factor is lower bounded by the spectral radius of the agent's dynamic matrix. A sufficient condition of quantization range is provided under which the finite-level quantizer is free of overflow. A sufficient condition of tolerable DoS attacks for achieving consensus is also provided. At last, we study scalar dynamical agents as a special case and further tighten the zooming-out factor to a value smaller than the agent's dynamic parameter. Under such a zooming-out factor, it is possible to recover the level of tolerable DoS attacks to that of unquantized consensus, and the quantizer is free of overflow.
△ Less
Submitted 31 May, 2023;
originally announced June 2023.
-
Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials
Authors:
Mingguo He,
Zhewei Wei,
Shikun Feng,
Zhengjie Huang,
Weibin Li,
Yu Sun,
Dianhai Yu
Abstract:
Heterogeneous Graph Neural Networks (HGNNs) have gained significant popularity in various heterogeneous graph learning tasks. However, most existing HGNNs rely on spatial domain-based methods to aggregate information, i.e., manually selected meta-paths or some heuristic modules, lacking theoretical guarantees. Furthermore, these methods cannot learn arbitrary valid heterogeneous graph filters with…
▽ More
Heterogeneous Graph Neural Networks (HGNNs) have gained significant popularity in various heterogeneous graph learning tasks. However, most existing HGNNs rely on spatial domain-based methods to aggregate information, i.e., manually selected meta-paths or some heuristic modules, lacking theoretical guarantees. Furthermore, these methods cannot learn arbitrary valid heterogeneous graph filters within the spectral domain, which have limited expressiveness. To tackle these issues, we present a positive spectral heterogeneous graph convolution via positive noncommutative polynomials. Then, using this convolution, we propose PSHGCN, a novel Positive Spectral Heterogeneous Graph Convolutional Network. PSHGCN offers a simple yet effective method for learning valid heterogeneous graph filters. Moreover, we demonstrate the rationale of PSHGCN in the graph optimization framework. We conducted an extensive experimental study to show that PSHGCN can learn diverse heterogeneous graph filters and outperform all baselines on open benchmarks. Notably, PSHGCN exhibits remarkable scalability, efficiently handling large real-world graphs comprising millions of nodes and edges. Our codes are available at https://github.com/ivam-he/PSHGCN.
△ Less
Submitted 6 May, 2024; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Hybrid higher-order skin-topological effect in hyperbolic lattices
Authors:
Junsong Sun,
Chang-An Li,
Shi** Feng,
Huaiming Guo
Abstract:
We investigate the non-Hermitian Haldane model on hyperbolic $\{8, 3\}$ and $\{12, 3\}$ lattices, and showcase its intriguing topological properties in the simultaneous presence of non-Hermitian effect and hyperbolic geometry. From bulk descriptions of the system, we calculate the real space non-Hermitian Chern numbers by generalizing the method from its Hermitian counterpart and present correspon…
▽ More
We investigate the non-Hermitian Haldane model on hyperbolic $\{8, 3\}$ and $\{12, 3\}$ lattices, and showcase its intriguing topological properties in the simultaneous presence of non-Hermitian effect and hyperbolic geometry. From bulk descriptions of the system, we calculate the real space non-Hermitian Chern numbers by generalizing the method from its Hermitian counterpart and present corresponding phase diagram of the model. For boundaries, we find that skin-topological modes appear in the range of the bulk energy gap under certain boundary conditions, which can be explained by an effective one-dimensional zigzag chain model mapped from hyperbolic lattice boundary. Remarkably, these skin-topological modes are localized at specific corners of the boundary, constituting a hybrid higher-order skin-topological effect on hyperbolic lattices.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
The geometry of $Φ_{(3)}$-harmonic maps
Authors:
Shuxiang Feng,
Yingbo Han,
Kaige Jiang,
Shihshu Walter Wei
Abstract:
In this paper, we motivate and extend the study of harmonic maps or $Φ_{(1)}$-harmonic maps (cf [15], Remark 1.3 (iii)), $Φ$-harmonic maps or $Φ_{(2)}$-harmonic maps (cf. [24], Remark 1.3 (v)), and explore geometric properties of $Φ_{(3)}$-harmonic maps by unified geometric analytic methods. We define the notion of $Φ_{(3)}$-harmonic maps and obtain the first variation formula and the second varia…
▽ More
In this paper, we motivate and extend the study of harmonic maps or $Φ_{(1)}$-harmonic maps (cf [15], Remark 1.3 (iii)), $Φ$-harmonic maps or $Φ_{(2)}$-harmonic maps (cf. [24], Remark 1.3 (v)), and explore geometric properties of $Φ_{(3)}$-harmonic maps by unified geometric analytic methods. We define the notion of $Φ_{(3)}$-harmonic maps and obtain the first variation formula and the second variation formula of the $Φ_{(3)}$-energy functional $E_{Φ_{(3)}}$. By using a stress-energy tensor, the $Φ_{(3)}$-conservation law, a monotonicity formula, and the asymptotic assumption of maps at infinity, we prove Liouville type results for $Φ_{(3)}$-harmonic maps. We introduce the notion of $Φ_{(3)}$-Superstrongly Unstable ($Φ_{(3)}$-SSU) manifold and provide many interesting examples. By using an extrinsic average variational method in the calculus of variations (cf. [51, 49]), we find $Φ_{(3)}$-SSU manifold and prove that for $i=1,2,3$, every compact $Φ_{(i)}$-$\operatorname{SSU}$ manifold is $Φ_{(i)}$-$\operatorname{SU}$, and hence is $Φ_{(i)}$-$\operatorname{U}$ (cf. Theorem 9.3). As consequences, we obtain topological vanishing theorems and sphere theorems by employing a $Φ_{(3)}$-harmoic map as a catalyst. This is in contrast to the approaches of utilizing a geodesic ([45]), minimal surface, stable rectifiable current ([34, 29, 50]), $p$-harmonic map (cf. [53]), etc., as catalysts. These mysterious phenomena are analogs of harmonic maps or $Φ_{(1)}$-harmonic maps, $p$-harmonic maps, $Φ_{S}$-harmonic maps, $Φ_{S,p}$-harmonic maps, $Φ_{(2)}$-harmonic maps, etc., (cf. [21, 40, 42, 41, 12, 13]).
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Enhanced sum-frequency generation from etchless lithium niobate empowered by dual quasi-bound states in the continuum
Authors:
Siqi Feng,
Tingting Liu,
Wenya Chen,
Feng Wu,
Shuyuan Xiao
Abstract:
The miniaturization of nonlinear light sources is central to the integrated photonic platform, driving a quest for high-efficiency frequency generation and mixing at the nanoscale. In this quest, the high-quality ($Q$) resonant dielectric nanostructures hold great promise, as they enhance nonlinear effects through the resonantly local electromagnetic fields overlap** the chosen nonlinear materia…
▽ More
The miniaturization of nonlinear light sources is central to the integrated photonic platform, driving a quest for high-efficiency frequency generation and mixing at the nanoscale. In this quest, the high-quality ($Q$) resonant dielectric nanostructures hold great promise, as they enhance nonlinear effects through the resonantly local electromagnetic fields overlap** the chosen nonlinear materials. Here, we propose a method for the enhanced sum-frequency generation (SFG) from etcheless lithium niobate (LiNbO$_{3}$) by utilizing the dual quasi-bound states in the continuum (quasi-BICs) in a one-dimensional resonant grating waveguide structure. Two high-$Q$ guided mode resonances corresponding to the dual quasi-BICs are respectively excited by two near-infrared input beams, generating a strong visible SFG signal with a remarkably high conversion efficiency of $3.66\times10^{-2}$ (five orders of magnitude higher than that of LiNbO$_{3}$ films of the same thickness) and a small full-width at half-maximum less than 0.2 nm. The SFG efficiency can be tuned via adjusting the grating geometry parameter or choosing the input beam polarization combination. Furthermore, the generated SFG signal can be maintained at a fixed wavelength without the appreciable loss of efficiency by selectively exciting the angle-dependent quasi-BICs, even if the wavelengths of input beams are tuned within a broad spectral range. Our results provide a simple but robust paradigm of high-efficiency frequency conversion on an easy-fabricated platform, which may find applications in nonlinear light sources and quantum photonics.
△ Less
Submitted 5 June, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
-
Fourier-DeepONet: Fourier-enhanced deep operator networks for full waveform inversion with improved accuracy, generalizability, and robustness
Authors:
Min Zhu,
Shihang Feng,
Youzuo Lin,
Lu Lu
Abstract:
Full waveform inversion (FWI) infers the subsurface structure information from seismic waveform data by solving a non-convex optimization problem. Data-driven FWI has been increasingly studied with various neural network architectures to improve accuracy and computational efficiency. Nevertheless, the applicability of pre-trained neural networks is severely restricted by potential discrepancies be…
▽ More
Full waveform inversion (FWI) infers the subsurface structure information from seismic waveform data by solving a non-convex optimization problem. Data-driven FWI has been increasingly studied with various neural network architectures to improve accuracy and computational efficiency. Nevertheless, the applicability of pre-trained neural networks is severely restricted by potential discrepancies between the source function used in the field survey and the one utilized during training. Here, we develop a Fourier-enhanced deep operator network (Fourier-DeepONet) for FWI with the generalization of seismic sources, including the frequencies and locations of sources. Specifically, we employ the Fourier neural operator as the decoder of DeepONet, and we utilize source parameters as one input of Fourier-DeepONet, facilitating the resolution of FWI with variable sources. To test Fourier-DeepONet, we develop three new and realistic FWI benchmark datasets (FWI-F, FWI-L, and FWI-FL) with varying source frequencies, locations, or both. Our experiments demonstrate that compared with existing data-driven FWI methods, Fourier-DeepONet obtains more accurate predictions of subsurface structures in a wide range of source parameters. Moreover, the proposed Fourier-DeepONet exhibits superior robustness when handling data with Gaussian noise or missing traces and sources with Gaussian noise, paving the way for more reliable and accurate subsurface imaging across diverse real conditions.
△ Less
Submitted 24 July, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
The First LHAASO Catalog of Gamma-Ray Sources
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (255 additional authors not shown)
Abstract:
We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022.…
▽ More
We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022. This catalog represents the main result from the most sensitive large coverage gamma-ray survey of the sky above 1 TeV, covering declination from $-$20$^{\circ}$ to 80$^{\circ}$. In total, the catalog contains 90 sources with an extended size smaller than $2^\circ$ and a significance of detection at $> 5σ$. Based on our source association criteria, 32 new TeV sources are proposed in this study. Among the 90 sources, 43 sources are detected with ultra-high energy ($E > 100$ TeV) emission at $> 4σ$ significance level. We provide the position, extension, and spectral characteristics of all the sources in this catalog.
△ Less
Submitted 27 November, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
Efficient Neural Music Generation
Authors:
Max W. Y. Lam,
Qiao Tian,
Tang Li,
Zongyu Yin,
Siyuan Feng,
Ming Tu,
Yuliang Ji,
Rui Xia,
Mingbo Ma,
Xuchen Song,
Jitong Chen,
Yu** Wang,
Yuxuan Wang
Abstract:
Recent progress in music generation has been remarkably advanced by the state-of-the-art MusicLM, which comprises a hierarchy of three LMs, respectively, for semantic, coarse acoustic, and fine acoustic modelings. Yet, sampling with the MusicLM requires processing through these LMs one by one to obtain the fine-grained acoustic tokens, making it computationally expensive and prohibitive for a real…
▽ More
Recent progress in music generation has been remarkably advanced by the state-of-the-art MusicLM, which comprises a hierarchy of three LMs, respectively, for semantic, coarse acoustic, and fine acoustic modelings. Yet, sampling with the MusicLM requires processing through these LMs one by one to obtain the fine-grained acoustic tokens, making it computationally expensive and prohibitive for a real-time generation. Efficient music generation with a quality on par with MusicLM remains a significant challenge. In this paper, we present MeLoDy (M for music; L for LM; D for diffusion), an LM-guided diffusion model that generates music audios of state-of-the-art quality meanwhile reducing 95.7% or 99.6% forward passes in MusicLM, respectively, for sampling 10s or 30s music. MeLoDy inherits the highest-level LM from MusicLM for semantic modeling, and applies a novel dual-path diffusion (DPD) model and an audio VAE-GAN to efficiently decode the conditioning semantic tokens into waveform. DPD is proposed to simultaneously model the coarse and fine acoustics by incorporating the semantic information into segments of latents effectively via cross-attention at each denoising step. Our experimental results suggest the superiority of MeLoDy, not only in its practical advantages on sampling speed and infinitely continuable generation, but also in its state-of-the-art musicality, audio quality, and text correlation.
Our samples are available at https://Efficient-MeLoDy.github.io/.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.