Search | arXiv e-print repository

Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning

Authors: Jun Zhao, **gqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuan**g Huang

Abstract: Human cognition exhibits systematic compositionality, the algebraic ability to generate infinite novel combinations from finite learned components, which is the key to understanding and reasoning about complex logic. In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a new dataset \textsc{MathTrap}\footnotemark[3]… ▽ More Human cognition exhibits systematic compositionality, the algebraic ability to generate infinite novel combinations from finite learned components, which is the key to understanding and reasoning about complex logic. In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a new dataset \textsc{MathTrap}\footnotemark[3] by introducing carefully designed logical traps into the problem descriptions of MATH and GSM8k. Since problems with logical flaws are quite rare in the real world, these represent ``unseen'' cases to LLMs. Solving these requires the models to systematically compose (1) the mathematical knowledge involved in the original problems with (2) knowledge related to the introduced traps. Our experiments show that while LLMs possess both components of requisite knowledge, they do not \textbf{spontaneously} combine them to handle these novel cases. We explore several methods to mitigate this deficiency, such as natural language prompts, few-shot demonstrations, and fine-tuning. We find that LLMs' performance can be \textbf{passively} improved through the above external intervention. Overall, systematic compositionality remains an open challenge for large language models. △ Less

Submitted 5 May, 2024; originally announced May 2024.

arXiv:2405.03966 [pdf]

Memristive switching in the surface of a charge-density-wave topological semimetal

Authors: Jianwen Ma, Xianghao Meng, Binhua Zhang, Yuxiang Wang, Yicheng Mou, Wenting Lin, Yannan Dai, Luqiu Chen, Haonan Wang, Haoqi Wu, Jiaming Gu, Jiayu Wang, Yuhan Du, Chunsen Liu, Wu Shi, Zhenzhong Yang, Bobo Tian, Lin Miao, Peng Zhou, Chun-Gang Duan, Changsong Xu, Xiang Yuan, Cheng Zhang

Abstract: Owing to the outstanding properties provided by nontrivial band topology, topological phases of matter are considered as a promising platform towards low-dissipation electronics, efficient spin-charge conversion, and topological quantum computation. Achieving ferroelectricity in topological materials enables the non-volatile control of the quantum states, which could greatly facilitate topological… ▽ More Owing to the outstanding properties provided by nontrivial band topology, topological phases of matter are considered as a promising platform towards low-dissipation electronics, efficient spin-charge conversion, and topological quantum computation. Achieving ferroelectricity in topological materials enables the non-volatile control of the quantum states, which could greatly facilitate topological electronic research. However, ferroelectricity is generally incompatible with systems featuring metallicity due to the screening effect of free carriers. In this study, we report the observation of memristive switching based on the ferroelectric surface state of a topological semimetal (TaSe4)2I. We find that the surface state of (TaSe4)2I presents out-of-plane ferroelectric polarization due to surface reconstruction. With the combination of ferroelectric surface and charge-density-wave-gapped bulk states, an electric switchable barrier height can be achieved in (TaSe4)2I-metal contact. By employing a multi-terminal grounding design, we manage to construct a prototype ferroelectric memristor based on (TaSe4)2I with on/off ratio up to 10^3, endurance over 10^3 cycles, and good retention characteristics. The origin of the ferroelectric surface state is further investigated by first-principles calculations, which reveals an interplay between ferroelectricity and band topology. The emergence of ferroelectricity in (TaSe4)2I not only demonstrates it as a rare but essential case of ferroelectric topological materials, but also opens new routes towards the implementation of topological materials in functional electronic devices. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 14 pages, 5 figures

arXiv:2403.20248 [pdf]

Gate-tunable quantum acoustoelectric transport in graphene

Authors: Yicheng Mou, Haonan Chen, Jiaqi Liu, Qing Lan, Jiayu Wang, Chuanxin Zhang, Yuxiang Wang, Jiaming Gu, Tuoyu Zhao, Xue Jiang, Wu Shi, Cheng Zhang

Abstract: Transport probes the motion of quasiparticles in response to external excitations. Apart from the well-known electric and thermoelectric transport, acoustoelectric transport induced by traveling acoustic waves has been rarely explored. Here, by adopting a hybrid nanodevices integrated with piezoelectric substrates, we establish a simple design of acoustoelectric transport with gate tunability. We… ▽ More Transport probes the motion of quasiparticles in response to external excitations. Apart from the well-known electric and thermoelectric transport, acoustoelectric transport induced by traveling acoustic waves has been rarely explored. Here, by adopting a hybrid nanodevices integrated with piezoelectric substrates, we establish a simple design of acoustoelectric transport with gate tunability. We fabricate dual-gated acoustoelectric devices based on BN-encapsuled graphene on LiNbO3. Longitudinal and transverse acoustoelectric voltages are generated by launching pulsed surface acoustic wave. The gate dependence of zero-field longitudinal acoustoelectric signal presents strikingly similar profiles as that of Hall resistivity, providing a valid approach for extracting carrier density without magnetic field. In magnetic fields, acoustoelectric quantum oscillations appear due to Landau quantization, which are more robust and pronounced than Shubnikov-de Haas oscillations. Our work demonstrates a feasible acoustoelectric setup with gate tunability, which can be extended to the broad scope of various Van der Waals materials. △ Less

Submitted 29 March, 2024; originally announced March 2024.

Comments: 16 pages, 5 figures

arXiv:2403.09283 [pdf]

Observation of quantum oscillations near the Mott-Ioffe-Regel limit in CaAs3

Authors: Yuxiang Wang, Minhao Zhao, **glei Zhang, Wenbin Wu, Shichao Li, Yong Zhang, Wenxiang Jiang, Nesta Benno Joseph, Liangcai Xu, Yicheng Mou, Yunkun Yang, Pengliang Leng, Yong Zhang, Li Pi, Alexey Suslov, Mykhaylo Ozerov, Jan Wyzula, Milan Orlita, Fengfeng Zhu, Yi Zhang, Xufeng Kou, Zengwei Zhu, Awadhesh Narayan, Dong Qian, **sheng Wen , et al. (3 additional authors not shown)

Abstract: The Mott-Ioffe-Regel limit sets the lower bound of carrier mean free path for coherent quasiparticle transport. Metallicity beyond this limit is of great interest because it is often closely related to quantum criticality and unconventional superconductivity. Progress along this direction mainly focuses on the strange-metal behaviors originating from the evolution of quasiparticle scattering rate… ▽ More The Mott-Ioffe-Regel limit sets the lower bound of carrier mean free path for coherent quasiparticle transport. Metallicity beyond this limit is of great interest because it is often closely related to quantum criticality and unconventional superconductivity. Progress along this direction mainly focuses on the strange-metal behaviors originating from the evolution of quasiparticle scattering rate such as linear-in-temperature resistivity, while the quasiparticle coherence phenomena in this regime are much less explored due to the short mean free path at the diffusive bound. Here we report the observation of quantum oscillations from Landau quantization near the Mott-Ioffe-Regel limit in CaAs3. Despite the insulator-like temperature dependence of resistivity, CaAs3 presents giant magnetoresistance and prominent Shubnikov-de Haas oscillations from Fermi surfaces, indicating highly coherent band transport. In contrast, the quantum oscillation is absent in the magnetic torque. The quasiparticle effective mass increases systematically with magnetic fields, manifesting a much larger value than the expectation given by magneto-infrared spectroscopy. It suggests a strong many-body renormalization effect near Fermi surface. We find that these unconventional behaviors may be explained by the interplay between the mobility edge and the van Hove singularity, which results in the formation of coherent cyclotron orbits emerging at the diffusive bound. Our results call for further study on the electron correlation effect of the van Hove singularity. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: 18 pages, 5 figures

arXiv:2402.17426 [pdf]

Skyrmion Generation in a Plasmonic Nanoantenna through the Inverse Faraday Effect

Authors: Xingyu Yang, Ye Mou, Bruno Gallas, Sébastien Bidault, Mathieu Mivelle

Abstract: Skyrmions are topological structures characterized by a winding vectorial configuration that provides a quantized topological charge. In magnetic materials, skyrmions are localized spin textures that exhibit unique stability and mobility properties, making them highly relevant to the burgeoning field of spintronics. In optics, these structures open new frontiers in manipulating and controlling lig… ▽ More Skyrmions are topological structures characterized by a winding vectorial configuration that provides a quantized topological charge. In magnetic materials, skyrmions are localized spin textures that exhibit unique stability and mobility properties, making them highly relevant to the burgeoning field of spintronics. In optics, these structures open new frontiers in manipulating and controlling light at the nanoscale. The convergence of optics and magnetics holds therefore immense potential for manipulating magnetic processes at ultrafast timescales. Here, we explore the possibility of generating skyrmionic topological structures within the magnetic field induced by the inverse Faraday effect in a plasmonic nanostructure. Our investigation reveals that a gold nanoring, featuring a dark mode, can generate counter-propagating photocurrents between its inner and outer segments, thereby enabling the magnetization of gold and supporting a skyrmionic vectorial distribution. We elucidate that these photocurrents arise from the localized control of light polarization, facilitating their counter-propagative motion. The generation of skyrmions through the inverse Faraday effect at the nanoscale presents a pathway towards directly integrating this topology into magnetic layers. This advancement holds promise for ultrafast timescales, offering direct applications in ultrafast data writing and processing. △ Less

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.17256 [pdf, other]

Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection

Authors: Pei Wang, Keqing He, Yejie Wang, Xiaoshuai Song, Yutao Mou, **gang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu

Abstract: Out-of-domain (OOD) intent detection aims to examine whether the user's query falls outside the predefined domain of the system, which is crucial for the proper functioning of task-oriented dialogue (TOD) systems. Previous methods address it by fine-tuning discriminative models. Recently, some studies have been exploring the application of large language models (LLMs) represented by ChatGPT to var… ▽ More Out-of-domain (OOD) intent detection aims to examine whether the user's query falls outside the predefined domain of the system, which is crucial for the proper functioning of task-oriented dialogue (TOD) systems. Previous methods address it by fine-tuning discriminative models. Recently, some studies have been exploring the application of large language models (LLMs) represented by ChatGPT to various downstream tasks, but it is still unclear for their ability on OOD detection task.This paper conducts a comprehensive evaluation of LLMs under various experimental settings, and then outline the strengths and weaknesses of LLMs. We find that LLMs exhibit strong zero-shot and few-shot capabilities, but is still at a disadvantage compared to models fine-tuned with full resource. More deeply, through a series of additional analysis experiments, we discuss and summarize the challenges faced by LLMs and provide guidance for future work including injecting domain knowledge, strengthening knowledge transfer from IND(In-domain) to OOD, and understanding long instructions. △ Less

Submitted 4 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

Journal ref: LREC-COLING 2024

arXiv:2402.09136 [pdf, other]

DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning

Authors: Yejie Wang, Keqing He, Guanting Dong, Pei Wang, Weihao Zeng, Muxi Diao, Yutao Mou, Mengdi Zhang, **gang Wang, Xunliang Cai, Weiran Xu

Abstract: Code Large Language Models (Code LLMs) have demonstrated outstanding performance in code-related tasks. Several instruction tuning approaches have been proposed to boost the code generation performance of pre-trained Code LLMs. In this paper, we introduce a diverse instruction model (DolphCoder) with self-evaluating for code generation. It learns diverse instruction targets and combines a code eva… ▽ More Code Large Language Models (Code LLMs) have demonstrated outstanding performance in code-related tasks. Several instruction tuning approaches have been proposed to boost the code generation performance of pre-trained Code LLMs. In this paper, we introduce a diverse instruction model (DolphCoder) with self-evaluating for code generation. It learns diverse instruction targets and combines a code evaluation objective to enhance its code generation ability. Our model achieves superior performance on the HumanEval and MBPP benchmarks, demonstrating new insights for future code instruction tuning work. Our key findings are: (1) Augmenting more diverse responses with distinct reasoning paths increases the code capability of LLMs. (2) Improving one's ability to evaluate the correctness of code solutions also enhances their ability to create it. △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: 14 pages, 6 figures

arXiv:2402.08631 [pdf, other]

Knowledge Editing on Black-box Large Language Models

Authors: Xiaoshuai Song, Zhengyang Wang, Keqing He, Guanting Dong, Yutao Mou, **xu Zhao, Weiran Xu

Abstract: Knowledge editing (KE) aims to efficiently and precisely modify the behavior of large language models (LLMs) to update specific knowledge without negatively influencing other knowledge. Current research primarily focuses on white-box LLMs editing, overlooking an important scenario: black-box LLMs editing, where LLMs are accessed through interfaces and only textual output is available. In this pape… ▽ More Knowledge editing (KE) aims to efficiently and precisely modify the behavior of large language models (LLMs) to update specific knowledge without negatively influencing other knowledge. Current research primarily focuses on white-box LLMs editing, overlooking an important scenario: black-box LLMs editing, where LLMs are accessed through interfaces and only textual output is available. In this paper, we first officially introduce KE on black-box LLMs and then propose a comprehensive evaluation framework to overcome the limitations of existing evaluations that are not applicable to black-box LLMs editing and lack comprehensiveness. To tackle privacy leaks of editing data and style over-editing in current methods, we introduce a novel postEdit framework, resolving privacy concerns through downstream post-processing and maintaining textual style consistency via fine-grained editing to original responses. Experiments and analysis on two benchmarks demonstrate that postEdit outperforms all baselines and achieves strong generalization, especially with huge improvements on style retention (average $+20.82\%\uparrow$). △ Less

Submitted 17 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: Work in progress

arXiv:2402.00655 [pdf]

Femtosecond drift photocurrents generated by an inversely designed plasmonic antenna

Authors: Ye Mou, Xingyu Yang, Marlo Vega, Bruno Gallas, Jean-Francois Bryche, Alexandre Bouhelier, Mathieu Mivelle

Abstract: Photocurrents play a crucial role in various applications, including light detection, photovoltaics, and THz radiation generation. Despite the abundance of methods and materials for converting light into electrical signals, the use of metals in this context has been relatively limited. Nanostructures supporting surface plasmons in metals offer precise light manipulation and induce light-driven ele… ▽ More Photocurrents play a crucial role in various applications, including light detection, photovoltaics, and THz radiation generation. Despite the abundance of methods and materials for converting light into electrical signals, the use of metals in this context has been relatively limited. Nanostructures supporting surface plasmons in metals offer precise light manipulation and induce light-driven electron motion. Through inverse design optimization of a gold nanostructure, we demonstrate enhanced volumetric, unidirectional, intense, and ultrafast photocurrents via a magneto-optical process derived from the inverse Faraday effect. This is achieved through fine-tuning the amplitude, polarization, and their gradients in the local light field. The virtually instantaneous process allows dynamic photocurrent modulation by varying optical pulse duration, potentially yielding nanosources of intense, ultrafast, planar magnetic fields, and frequency-tunable THz emission. These findings opens avenues for ultrafast magnetic material manipulation and holds promise for nanoscale THz spectroscopy. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: 5 figures

arXiv:2401.15071 [pdf, other]

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Authors: Chaochao Lu, Chen Qian, Guodong Zheng, Hongxing Fan, Hongzhi Gao, Jie Zhang, **g Shao, **gyi Deng, **lan Fu, Kexin Huang, Kunchang Li, Lijun Li, Limin Wang, Lu Sheng, Meiqi Chen, Ming Zhang, Qibing Ren, Sirui Chen, Tao Gui, Wanli Ouyang, Yali Wang, Yan Teng, Yaru Wang, Yi Wang, Yinan He , et al. (11 additional authors not shown)

Abstract: Multi-modal Large Language Models (MLLMs) have shown impressive abilities in generating reasonable responses with respect to multi-modal contents. However, there is still a wide gap between the performance of recent MLLM-based applications and the expectation of the broad public, even though the most powerful OpenAI's GPT-4 and Google's Gemini have been deployed. This paper strives to enhance unde… ▽ More Multi-modal Large Language Models (MLLMs) have shown impressive abilities in generating reasonable responses with respect to multi-modal contents. However, there is still a wide gap between the performance of recent MLLM-based applications and the expectation of the broad public, even though the most powerful OpenAI's GPT-4 and Google's Gemini have been deployed. This paper strives to enhance understanding of the gap through the lens of a qualitative study on the generalizability, trustworthiness, and causal reasoning capabilities of recent proprietary and open-source MLLMs across four modalities: ie, text, code, image, and video, ultimately aiming to improve the transparency of MLLMs. We believe these properties are several representative factors that define the reliability of MLLMs, in supporting various downstream applications. To be specific, we evaluate the closed-source GPT-4 and Gemini and 6 open-source LLMs and MLLMs. Overall we evaluate 230 manually designed cases, where the qualitative results are then summarized into 12 scores (ie, 4 modalities times 3 properties). In total, we uncover 14 empirical findings that are useful to understand the capabilities and limitations of both proprietary and open-source MLLMs, towards more reliable downstream multi-modal applications. △ Less

Submitted 29 January, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

arXiv:2312.15939 [pdf]

Discovery of acousto-drag photovoltaic effect

Authors: Jiaming Gu, Yicheng Mou, Jianwen Ma, Haonan Chen, Chuanxin Zhang, Yuxiang Wang, Jiayu Wang, Hangwen Guo, Wu Shi, Xiang Yuan, Xue Jiang, Dean Ta, Jian Shen, Cheng Zhang

Abstract: As a key ingredient in energy harvesting and photodetection, light-to-electricity conversion requires efficient separation of photoexcited electron-hole pairs before recombination. Traditional junction-based mechanisms mainly use build-in electric fields to achieve pair separation and generate photovoltaic effect, which fail to collect photoexcited pairs away from local barrier region. The ability… ▽ More As a key ingredient in energy harvesting and photodetection, light-to-electricity conversion requires efficient separation of photoexcited electron-hole pairs before recombination. Traditional junction-based mechanisms mainly use build-in electric fields to achieve pair separation and generate photovoltaic effect, which fail to collect photoexcited pairs away from local barrier region. The ability to harvest photovoltaic effect in a homogeneous material upon uniform illumination is appealing, but has only been realized in very few cases such as non-centrosymmetric systems through bulk photovoltaic effect. Here we realize a new type of photovoltaic effect, termed as acousto-drag photovoltaic effect, by travelling surface acoustic waves (t-SAW) in a conventional layered semiconductor MoSe2. Instead of immediately driving the electron-hole pairs to opposite directions after generation, t-SAW induces periodic modulation to electronic bands and drags the photoexcited pairs toward the same travelling direction. The photocurrent can then be extracted by a local barrier, e.g. the metal-semiconductor contact as we used here. By spatially separating the electron-hole generation and extraction processes, the acousto-drag mechanism strongly suppresses charge recombination and yields large nonlocal photoresponse outside the barrier region. We show that when t-SAW is applied, the photoresponse can be enhanced by over two orders of magnitude with exceptionally high external quantum efficiency above 60%. The discovery of acousto-drag photovoltaic effect establishes a new approach towards efficient light-to-electricity conversion without the restriction of crystal symmetry. △ Less

Submitted 26 December, 2023; originally announced December 2023.

arXiv:2311.11515 [pdf]

doi 10.1007/s11433-023-2283-0

Absence of metallicity and bias-dependent resistivity in low-carrier-density EuCd2As2

Authors: Yuxiang Wang, Jianwen Ma, Jian Yuan, Wenbin Wu, Yong Zhang, Yicheng Mou, Jiaming Gu, Peihong Cheng, Wu Shi, Xiang Yuan, **glei Zhang, Yanfeng Guo, Cheng Zhang

Abstract: EuCd2As2 was theoretically predicted to be a minimal model of Weyl semimetals with a single pair of Weyl points in the ferromagnet state. However, the heavily p-doped EuCd2As2 crystals in previous experiments prevent direct identification of the semimetal hypothesis. Here we present a comprehensive magneto-transport study of high-quality EuCd2As2 crystals with ultralow bulk carrier density (10^13… ▽ More EuCd2As2 was theoretically predicted to be a minimal model of Weyl semimetals with a single pair of Weyl points in the ferromagnet state. However, the heavily p-doped EuCd2As2 crystals in previous experiments prevent direct identification of the semimetal hypothesis. Here we present a comprehensive magneto-transport study of high-quality EuCd2As2 crystals with ultralow bulk carrier density (10^13 cm-3). In contrast to the general expectation of a Weyl semimetal phase, EuCd2As2 shows insulating behavior in both antiferromagnetic and ferromagnetic states as well as surface-dominated conduction from band bending. Moreover, the application of a dc bias current can dramatically modulate the resistance by over one order of magnitude, and induce a periodic resistance oscillation due to the geometric resonance. Such nonlinear transport results from the highly nonequilibrium state induced by electrical field near the band edge. Our results suggest an insulating phase in EuCd2As2 and put a strong constraint on the underlying mechanism of anomalous transport properties in this system. △ Less

Submitted 19 November, 2023; originally announced November 2023.

Comments: 13 pages, 4 figures

Journal ref: SCIENCE CHINA Physics, Mechanics & Astronomy, 67(4) 247311 (2024)

arXiv:2310.13380 [pdf, other]

doi 10.18653/v1/2023.findings-emnlp.258

APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection

Authors: Pei Wang, Keqing He, Yutao Mou, Xiaoshuai Song, Yanan Wu, **gang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu

Abstract: Detecting out-of-domain (OOD) intents from user queries is essential for a task-oriented dialogue system. Previous OOD detection studies generally work on the assumption that plenty of labeled IND intents exist. In this paper, we focus on a more practical few-shot OOD setting where there are only a few labeled IND data and massive unlabeled mixed data that may belong to IND or OOD. The new scenari… ▽ More Detecting out-of-domain (OOD) intents from user queries is essential for a task-oriented dialogue system. Previous OOD detection studies generally work on the assumption that plenty of labeled IND intents exist. In this paper, we focus on a more practical few-shot OOD setting where there are only a few labeled IND data and massive unlabeled mixed data that may belong to IND or OOD. The new scenario carries two key challenges: learning discriminative representations using limited IND data and leveraging unlabeled mixed data. Therefore, we propose an adaptive prototypical pseudo-labeling (APP) method for few-shot OOD detection, including a prototypical OOD detection framework (ProtoOOD) to facilitate low-resource OOD detection using limited IND data, and an adaptive pseudo-labeling method to produce high-quality pseudo OOD\&IND labels. Extensive experiments and analysis demonstrate the effectiveness of our method for few-shot OOD detection. △ Less

Submitted 20 October, 2023; originally announced October 2023.

Journal ref: EMNLP2023, Findings

arXiv:2310.10374 [pdf, other]

Multi-Factor Spatio-Temporal Prediction based on Graph Decomposition Learning

Authors: Jiahao Ji, **gyuan Wang, Yu Mou, Cheng Long

Abstract: Spatio-temporal (ST) prediction is an important and widely used technique in data mining and analytics, especially for ST data in urban systems such as transportation data. In practice, the ST data generation is usually influenced by various latent factors tied to natural phenomena or human socioeconomic activities, impacting specific spatial areas selectively. However, existing ST prediction meth… ▽ More Spatio-temporal (ST) prediction is an important and widely used technique in data mining and analytics, especially for ST data in urban systems such as transportation data. In practice, the ST data generation is usually influenced by various latent factors tied to natural phenomena or human socioeconomic activities, impacting specific spatial areas selectively. However, existing ST prediction methods usually do not refine the impacts of different factors, but directly model the entangled impacts of multiple factors. This amplifies the modeling complexity of ST data and compromises model interpretability. To this end, we propose a multi-factor ST prediction task that predicts partial ST data evolution under different factors, and combines them for a final prediction. We make two contributions to this task: an effective theoretical solution and a portable instantiation framework. Specifically, we first propose a theoretical solution called decomposed prediction strategy and prove its effectiveness from the perspective of information entropy theory. On top of that, we instantiate a novel model-agnostic framework, named spatio-temporal graph decomposition learning (STGDL), for multi-factor ST prediction. The framework consists of two main components: an automatic graph decomposition module that decomposes the original graph structure inherent in ST data into subgraphs corresponding to different factors, and a decomposed learning network that learns the partial ST data on each subgraph separately and integrates them for the final prediction. We conduct extensive experiments on four real-world ST datasets of two types of graphs, i.e., grid graph and network graph. Results show that our framework significantly reduces prediction errors of various ST models by 9.41% on average (35.36% at most). Furthermore, a case study reveals the interpretability potential of our framework. △ Less

Submitted 7 March, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

arXiv:2310.10184 [pdf, other]

Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition

Authors: Xiaoshuai Song, Yutao Mou, Keqing He, Yueyan Qiu, Pei Wang, Weiran Xu

Abstract: In a practical dialogue system, users may input out-of-domain (OOD) queries. The Generalized Intent Discovery (GID) task aims to discover OOD intents from OOD queries and extend them to the in-domain (IND) classifier. However, GID only considers one stage of OOD learning, and needs to utilize the data in all previous stages for joint training, which limits its wide application in reality. In this… ▽ More In a practical dialogue system, users may input out-of-domain (OOD) queries. The Generalized Intent Discovery (GID) task aims to discover OOD intents from OOD queries and extend them to the in-domain (IND) classifier. However, GID only considers one stage of OOD learning, and needs to utilize the data in all previous stages for joint training, which limits its wide application in reality. In this paper, we introduce a new task, Continual Generalized Intent Discovery (CGID), which aims to continuously and automatically discover OOD intents from dynamic OOD data streams and then incrementally add them to the classifier with almost no previous data, thus moving towards dynamic intent recognition in an open world. Next, we propose a method called Prototype-guided Learning with Replay and Distillation (PLRD) for CGID, which bootstraps new intent discovery through class prototypes and balances new and old intents through data replay and feature distillation. Finally, we conduct detailed experiments and analysis to verify the effectiveness of PLRD and understand the key challenges of CGID for future research. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: Accpeted to EMNLP 2023 (Findings)

arXiv:2310.10176 [pdf, other]

Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

Authors: Xiaoshuai Song, Keqing He, Pei Wang, Guanting Dong, Yutao Mou, **gang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu

Abstract: The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Recently, although some studies have been exploring the application of large language models (LLMs) represented by Ch… ▽ More The tasks of out-of-domain (OOD) intent discovery and generalized intent discovery (GID) aim to extend a closed intent classifier to open-world intent sets, which is crucial to task-oriented dialogue (TOD) systems. Previous methods address them by fine-tuning discriminative models. Recently, although some studies have been exploring the application of large language models (LLMs) represented by ChatGPT to various downstream tasks, it is still unclear for the ability of ChatGPT to discover and incrementally extent OOD intents. In this paper, we comprehensively evaluate ChatGPT on OOD intent discovery and GID, and then outline the strengths and weaknesses of ChatGPT. Overall, ChatGPT exhibits consistent advantages under zero-shot settings, but is still at a disadvantage compared to fine-tuned models. More deeply, through a series of analytical experiments, we summarize and discuss the challenges faced by LLMs including clustering, domain-specific understanding, and cross-domain in-context learning scenarios. Finally, we provide empirical guidance for future directions to address these challenges. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: Accpeted to EMNLP 2023 (Main Conference)

arXiv:2309.06171 [pdf, other]

Privacy-Preserving Linkage of Distributed Datasets using the Personal Health Train

Authors: Maximilian Jugl, Sascha Welten, Yongli Mou, Yeliz Ucer Yediel, Oya Deniz Beyan, Ulrich Sax, Toralf Kirsten

Abstract: With the generation of personal and medical data at several locations, medical data science faces unique challenges when working on distributed datasets. Growing data protection requirements in recent years drastically limit the use of personally identifiable information. Distributed data analysis aims to provide solutions for securely working on highly sensitive data while minimizing the risk of… ▽ More With the generation of personal and medical data at several locations, medical data science faces unique challenges when working on distributed datasets. Growing data protection requirements in recent years drastically limit the use of personally identifiable information. Distributed data analysis aims to provide solutions for securely working on highly sensitive data while minimizing the risk of information leaks, which would not be possible to the same degree in a centralized approach. A novel concept in this field is the Personal Health Train (PHT), which encapsulates the idea of bringing the analysis to the data, not vice versa. Data sources are represented as train stations. Trains containing analysis tasks move between stations and aggregate results. Train executions are coordinated by a central station which data analysts can interact with. Data remains at their respective stations and analysis results are only stored inside the train, providing a safe and secure environment for distributed data analysis. Duplicate records across multiple locations can skew results in a distributed data analysis. On the other hand, merging information from several datasets referring to the same real-world entities may improve data completeness and therefore data quality. In this paper, we present an approach for record linkage on distributed datasets using the Personal Health Train. We verify this approach and evaluate its effectiveness by applying it to two datasets based on real-world data and outline its possible applications in the context of distributed data analysis tasks. △ Less

Submitted 12 September, 2023; originally announced September 2023.

Comments: 12 pages, 4 figures, 3 tables

ACM Class: C.2.4

arXiv:2305.17699 [pdf, other]

Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery

Authors: Yutao Mou, Xiaoshuai Song, Keqing He, Chen Zeng, Pei Wang, **gang Wang, Yunsen Xian, Weiran Xu

Abstract: Generalized intent discovery aims to extend a closed-set in-domain intent classifier to an open-world intent set including in-domain and out-of-domain intents. The key challenges lie in pseudo label disambiguation and representation learning. Previous methods suffer from a coupling of pseudo label disambiguation and representation learning, that is, the reliability of pseudo labels relies on repre… ▽ More Generalized intent discovery aims to extend a closed-set in-domain intent classifier to an open-world intent set including in-domain and out-of-domain intents. The key challenges lie in pseudo label disambiguation and representation learning. Previous methods suffer from a coupling of pseudo label disambiguation and representation learning, that is, the reliability of pseudo labels relies on representation learning, and representation learning is restricted by pseudo labels in turn. In this paper, we propose a decoupled prototype learning framework (DPL) to decouple pseudo label disambiguation and representation learning. Specifically, we firstly introduce prototypical contrastive representation learning (PCL) to get discriminative representations. And then we adopt a prototype-based label disambiguation method (PLD) to obtain pseudo labels. We theoretically prove that PCL and PLD work in a collaborative fashion and facilitate pseudo label disambiguation. Experiments and analysis on three benchmark datasets show the effectiveness of our method. △ Less

Submitted 28 May, 2023; originally announced May 2023.

Comments: Accepted at ACL2023 main conference

arXiv:2305.14469 [pdf]

A Reversed Inverse Faraday Effect

Authors: Ye Mou, Xingyu Yang, Bruno Gallas, Mathieu Mivelle

Abstract: The inverse Faraday effect is a magneto-optical process allowing the magnetization of matter by an optical excitation carrying a non-zero spin of light. In particular, a right circular polarization generates a magnetization in the direction of light propagation and a left circular polarization in the opposite direction to this propagation. We demonstrate here that by manipulating the spin density… ▽ More The inverse Faraday effect is a magneto-optical process allowing the magnetization of matter by an optical excitation carrying a non-zero spin of light. In particular, a right circular polarization generates a magnetization in the direction of light propagation and a left circular polarization in the opposite direction to this propagation. We demonstrate here that by manipulating the spin density of light, i.e., its polarization, in a plasmonic nanostructure, we generate a reversed inverse Faraday effect. A right circular polarization will generate a magnetization in the opposite direction of the light propagation, a left circular polarization in the direction of propagation. Also, we demonstrate that this new physical phenomenon is chiral, generating a strong magnetic field only for one helicity of the light, the opposite helicity producing this effect only for the mirror structure. This new optical concept opens the way to the generation of magnetic fields with unpolarized light, finding application in the ultrafast manipulation of magnetic domains and processes, such as spin precession, spin currents, and waves, magnetic skyrmion or magnetic circular dichroism, with direct applications in data storage and processing technologies. △ Less

Submitted 23 May, 2023; originally announced May 2023.

Comments: arXiv admin note: text overlap with arXiv:2301.05971

arXiv:2304.08792 [pdf]

doi 10.1103/PhysRevB.107.134513

Precise dd excitations and commensurate intersite Coulomb interactions in the dissimilar cuprate YBa_2Cu_3O_(7-x) and La_(2-x)Sr_xCuO_4

Authors: Shih-Wen Huang, L. Andrew Wray, Yu-Cheng Shao, Cheng-Yau Wu, Shun-Hung Wang, Jenn-Min Lee, Y-J. Chen, R. W. Schoenlein, C. Y. Mou, Yi-De Chuang, J. -Y. Lin

Abstract: Using high-resolution extreme ultraviolet resonant inelastic X-ray scattering (EUVRIXS) spectroscopy at Cu M-edge, we observed the do** dependent spectral shifts of inter-orbital (dd) excitations of YBa_2Cu_3O_(7-x) and La_(2-x)Sr_xCuO_4. With increasing hole do** level from undoped to optimally doped superconducting compositions, the leading edge of dd excitations is found to shift towards lo… ▽ More Using high-resolution extreme ultraviolet resonant inelastic X-ray scattering (EUVRIXS) spectroscopy at Cu M-edge, we observed the do** dependent spectral shifts of inter-orbital (dd) excitations of YBa_2Cu_3O_(7-x) and La_(2-x)Sr_xCuO_4. With increasing hole do** level from undoped to optimally doped superconducting compositions, the leading edge of dd excitations is found to shift towards lower energy loss in a roughly linear trend that is irrespective to the cuprate species. The magnitude of energy shift can be explained by including a 0.15 eV Coulomb attraction between Cu 3d_(x^2-y^2) electrons and the doped holes on the surrounding oxygens in the atomic multiplet calculations. The consistent energy shift between distinct cuprate families suggests that this inter-site Coulomb interaction energy scale is relatively material-independent, and provides an important reference point for understanding charge density wave phenomena in the cuprate phase diagram. △ Less

Submitted 18 April, 2023; originally announced April 2023.

Comments: 29 pages; 8 figures. Physical Review B, in press. This paper reveals a Cu 3d-O 2p intersite interaction energy for the first time experimentally. It also explains why Tc of YBCO is higher than that of LSCO

arXiv:2303.07248 [pdf, ps, other]

Channel Estimation for Underwater Visible Light Communication: A Sparse Learning Perspective

Authors: Younan Mou, Sicong Liu

Abstract: The underwater propagation environment for visible light signals is affected by complex factors such as absorption, shadowing, and reflection, making it very challengeable to achieve effective underwater visible light communication (UVLC) channel estimation. It is difficult for the UVLC channel to be sparse represented in the time and frequency domains, which limits the chance of using sparse sign… ▽ More The underwater propagation environment for visible light signals is affected by complex factors such as absorption, shadowing, and reflection, making it very challengeable to achieve effective underwater visible light communication (UVLC) channel estimation. It is difficult for the UVLC channel to be sparse represented in the time and frequency domains, which limits the chance of using sparse signal processing techniques to achieve better performance of channel estimation. To this end, a compressed sensing (CS) based framework is established in this paper by fully exploiting the sparsity of the underwater visible light channel in the distance domain of the propagation links. In order to solve the sparse recovery problem and achieve more accurate UVLC channel estimation, a sparse learning based underwater visible light channel estimation (SL-UVCE) scheme is proposed. Specifically, a deep-unfolding neural network mimicking the classical iterative sparse recovery algorithm of approximate message passing (AMP) is employed, which decomposes the iterations of AMP into a series of layers with different learnable parameters. Compared with the existing non-CS-based and CS-based schemes, the proposed scheme shows better performance of accuracy in channel estimation, especially in severe conditions such as insufficient measurement pilots and large number of multipath components. △ Less

Submitted 13 March, 2023; originally announced March 2023.

Comments: This paper has been accepted by and is to appear in Proc. 2023 IEEE International Conference on Communications (ICC)

arXiv:2301.05971 [pdf]

A Chiral Inverse Faraday Effect Mediated by an Inversely Designed Plasmonic Antenna

Authors: Ye Mou, Xingyu Yang, Bruno Gallas, Mathieu Mivelle

Abstract: The inverse Faraday effect is a magneto-optical process allowing the magnetization of matter by an optical excitation carrying a non-zero spin or orbital moment of light. This phenomenon was considered until now as symmetric; right or left circular polarizations generate magnetic fields oriented in the direction of light propagation or in the counter-propagating direction. Here, we demonstrate tha… ▽ More The inverse Faraday effect is a magneto-optical process allowing the magnetization of matter by an optical excitation carrying a non-zero spin or orbital moment of light. This phenomenon was considered until now as symmetric; right or left circular polarizations generate magnetic fields oriented in the direction of light propagation or in the counter-propagating direction. Here, we demonstrate that by manipulating the spin density of light in a plasmonic nanostructure, we generate a chiral inverse Faraday effect, creating a strong magnetic field of 500 mT only for one helicity of the light, the opposite helicity producing this effect only for the mirror structure. This new optical concept opens the way to the generation of magnetic fields with unpolarized light, finding application in the ultrafast manipulation of magnetic domains and processes, such as spin precession, spin currents and waves, magnetic skyrmion or magnetic circular dichroism, with direct applications in data storage and data processing technologies. △ Less

Submitted 14 January, 2023; originally announced January 2023.

arXiv:2210.14427 [pdf, other]

ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select

Authors: Yuchen Zhuang, Yinghao Li, Jerry Junyang Cheung, Yue Yu, Yingjun Mou, Xiang Chen, Le Song, Chao Zhang

Abstract: We study the problem of extracting N-ary relation tuples from scientific articles. This task is challenging because the target knowledge tuples can reside in multiple parts and modalities of the document. Our proposed method ReSel decomposes this task into a two-stage procedure that first retrieves the most relevant paragraph/table and then selects the target entity from the retrieved component. F… ▽ More We study the problem of extracting N-ary relation tuples from scientific articles. This task is challenging because the target knowledge tuples can reside in multiple parts and modalities of the document. Our proposed method ReSel decomposes this task into a two-stage procedure that first retrieves the most relevant paragraph/table and then selects the target entity from the retrieved component. For the high-level retrieval stage, ReSel designs a simple and effective feature set, which captures multi-level lexical and semantic similarities between the query and components. For the low-level selection stage, ReSel designs a cross-modal entity correlation graph along with a multi-view architecture, which models both semantic and document-structural relations between entities. Our experiments on three scientific information extraction datasets show that ReSel outperforms state-of-the-art baselines significantly. △ Less

Submitted 25 October, 2022; originally announced October 2022.

Comments: Accepted to EMNLP 2022

arXiv:2210.10722 [pdf, other]

UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning

Authors: Yutao Mou, Pei Wang, Keqing He, Yanan Wu, **gang Wang, Wei Wu, Weiran Xu

Abstract: Detecting out-of-domain (OOD) intents from user queries is essential for avoiding wrong operations in task-oriented dialogue systems. The key challenge is how to distinguish in-domain (IND) and OOD intents. Previous methods ignore the alignment between representation learning and scoring function, limiting the OOD detection performance. In this paper, we propose a unified neighborhood learning fra… ▽ More Detecting out-of-domain (OOD) intents from user queries is essential for avoiding wrong operations in task-oriented dialogue systems. The key challenge is how to distinguish in-domain (IND) and OOD intents. Previous methods ignore the alignment between representation learning and scoring function, limiting the OOD detection performance. In this paper, we propose a unified neighborhood learning framework (UniNL) to detect OOD intents. Specifically, we design a K-nearest neighbor contrastive learning (KNCL) objective for representation learning and introduce a KNN-based scoring function for OOD detection. We aim to align representation learning with scoring function. Experiments and analysis on two benchmark datasets show the effectiveness of our method. △ Less

Submitted 19 October, 2022; originally announced October 2022.

Comments: Accepted at EMNLP2022 main conference

arXiv:2210.08909 [pdf, other]

Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery

Authors: Yutao Mou, Keqing He, Pei Wang, Yanan Wu, **gang Wang, Wei Wu, Weiran Xu

Abstract: Discovering out-of-domain (OOD) intent is important for develo** new skills in task-oriented dialogue systems. The key challenges lie in how to transfer prior in-domain (IND) knowledge to OOD clustering, as well as jointly learn OOD representations and cluster assignments. Previous methods suffer from in-domain overfitting problem, and there is a natural gap between representation learning and c… ▽ More Discovering out-of-domain (OOD) intent is important for develo** new skills in task-oriented dialogue systems. The key challenges lie in how to transfer prior in-domain (IND) knowledge to OOD clustering, as well as jointly learn OOD representations and cluster assignments. Previous methods suffer from in-domain overfitting problem, and there is a natural gap between representation learning and clustering objectives. In this paper, we propose a unified K-nearest neighbor contrastive learning framework to discover OOD intents. Specifically, for IND pre-training stage, we propose a KCL objective to learn inter-class discriminative features, while maintaining intra-class diversity, which alleviates the in-domain overfitting problem. For OOD clustering stage, we propose a KCC method to form compact clusters by mining true hard negative samples, which bridges the gap between clustering and representation learning. Extensive experiments on three benchmark datasets show that our method achieves substantial improvements over the state-of-the-art methods. △ Less

Submitted 17 October, 2022; originally announced October 2022.

Comments: Accepted at EMNLP2022 main conference

arXiv:2210.08830 [pdf, other]

Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning

Authors: Yanan Wu, Zhiyuan Zeng, Keqing He, Yutao Mou, Pei Wang, Yuanmeng Yan, Weiran Xu

Abstract: Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. Traditional softmax-based confidence scores are susceptible to the overconfidence issue. In this paper, we propose a simple but strong energy-based score function to detect OOD where the energy scores of OOD samples are higher than IND samples. Further, given a small set of labeled OOD… ▽ More Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. Traditional softmax-based confidence scores are susceptible to the overconfidence issue. In this paper, we propose a simple but strong energy-based score function to detect OOD where the energy scores of OOD samples are higher than IND samples. Further, given a small set of labeled OOD samples, we introduce an energy-based margin objective for supervised OOD detection to explicitly distinguish OOD samples from INDs. Comprehensive experiments and analysis prove our method helps disentangle confidence score distributions of IND and OOD data.\footnote{Our code is available at \url{https://github.com/pris-nlp/EMNLP2022-energy_for_OOD/}.} △ Less

Submitted 17 October, 2022; originally announced October 2022.

Comments: accepted by the EMNLP2022 SereTOD workshop

arXiv:2209.06612 [pdf, other]

Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation

Authors: Yanan Wu, Zhiyuan Zeng, Keqing He, Yutao Mou, Pei Wang, Weiran Xu

Abstract: Out-of-Domain (OOD) detection is a key component in a task-oriented dialog system, which aims to identify whether a query falls outside the predefined supported intent set. Previous softmax-based detection algorithms are proved to be overconfident for OOD samples. In this paper, we analyze overconfident OOD comes from distribution uncertainty due to the mismatch between the training and test distr… ▽ More Out-of-Domain (OOD) detection is a key component in a task-oriented dialog system, which aims to identify whether a query falls outside the predefined supported intent set. Previous softmax-based detection algorithms are proved to be overconfident for OOD samples. In this paper, we analyze overconfident OOD comes from distribution uncertainty due to the mismatch between the training and test distributions, which makes the model can't confidently make predictions thus probably causing abnormal softmax scores. We propose a Bayesian OOD detection framework to calibrate distribution uncertainty using Monte-Carlo Dropout. Our method is flexible and easily pluggable into existing softmax-based baselines and gains 33.33\% OOD F1 improvements with increasing only 0.41\% inference time compared to MSP. Further analyses show the effectiveness of Bayesian learning for OOD detection. △ Less

Submitted 14 September, 2022; originally announced September 2022.

Comments: Accepted in COLING2022

arXiv:2209.06030 [pdf, other]

Generalized Intent Discovery: Learning from Open World Dialogue System

Authors: Yutao Mou, Keqing He, Yanan Wu, Pei Wang, **gang Wang, Wei Wu, Yi Huang, Junlan Feng, Weiran Xu

Abstract: Traditional intent classification models are based on a pre-defined intent set and only recognize limited in-domain (IND) intent classes. But users may input out-of-domain (OOD) queries in a practical dialogue system. Such OOD queries can provide directions for future improvement. In this paper, we define a new task, Generalized Intent Discovery (GID), which aims to extend an IND intent classifier… ▽ More Traditional intent classification models are based on a pre-defined intent set and only recognize limited in-domain (IND) intent classes. But users may input out-of-domain (OOD) queries in a practical dialogue system. Such OOD queries can provide directions for future improvement. In this paper, we define a new task, Generalized Intent Discovery (GID), which aims to extend an IND intent classifier to an open-world intent set including IND and OOD intents. We hope to simultaneously classify a set of labeled IND intent classes while discovering and recognizing new unlabeled OOD types incrementally. We construct three public datasets for different application scenarios and propose two kinds of frameworks, pipeline-based and end-to-end for future work. Further, we conduct exhaustive experiments and qualitative analysis to comprehend key challenges and provide new guidance for future GID research. △ Less

Submitted 13 September, 2022; originally announced September 2022.

Comments: This paper has been accepted at COLING2022

arXiv:2206.00954 [pdf]

An inverse Faraday effect through linear polarized light

Authors: Xingyu Yang, Ye Mou, Homero Zapata, Benoît Reynier, Bruno Gallas, Mathieu Mivelle

Abstract: The inverse Faraday effect (IFE) allows the generation of magnetic fields by optical excitation only. Since its discovery in the 60s, it was believed that only an elliptical or circular polarization could magnetize matter by this magneto-optical phenomenon. Here, we demonstrate the generation of an IFE via a linear polarization of light. This new physical concept results from the local manipulatio… ▽ More The inverse Faraday effect (IFE) allows the generation of magnetic fields by optical excitation only. Since its discovery in the 60s, it was believed that only an elliptical or circular polarization could magnetize matter by this magneto-optical phenomenon. Here, we demonstrate the generation of an IFE via a linear polarization of light. This new physical concept results from the local manipulation of light by a plasmonic nano-antenna. We demonstrate that a gold nanorod excited by a linear polarization generates a non-zero magnetic field by IFE when the incident polarization of the light is not parallel to the long axis of the rod. We show that this dissymmetry generates hot spots of local non-vanishing spin densities (local elliptical polarization state), introducing the concept of super circular light, allowing this magnetization. Moreover, by varying the angle of the incident linear polarization with respect to the nano-antenna, we demonstrate the on-demand flip** of the magnetic field orientation. Finally, this linear IFE generates a stationary magnetic field 25 times stronger than what a gold nanoparticle produces when excited by a circular polarization and via a classical IFE. The creation of stationary magnetic fields by IFE in a plasmonic nanostructure is nowadays the only technique allowing the creation of ultra-short, intense magnetic field pulses at the nanoscale. Thus, it finds applications in the ultrafast control of magnetic domains with applications not only in data storage technologies but also in research fields such as magnetic trap**, magnetic skyrmion, magnetic circular dichroism, to spin control, spin precession, spin currents, and spin waves, among others. △ Less

Submitted 2 June, 2022; originally announced June 2022.

Comments: 13 pages, 6 figures, article

arXiv:2205.02377 [pdf, other]

doi 10.1103/PhysRevB.106.125116

Bilayer Hubbard model: Analysis based on the fermionic sign problem

Authors: Ying** Mou, Rubem Mondaini, Richard T. Scalettar

Abstract: The bilayer Hubbard model describes the antiferromagnet to spin singlet transition and, potentially, aspects of the physics of unconventional superconductors. Despite these important applications, significant aspects of its `phase diagram' in the interplane hop** $t_\perp$ versus on-site interaction $U$ parameter space, at half filling, are largely in disagreement. Here we provide an analysis ma… ▽ More The bilayer Hubbard model describes the antiferromagnet to spin singlet transition and, potentially, aspects of the physics of unconventional superconductors. Despite these important applications, significant aspects of its `phase diagram' in the interplane hop** $t_\perp$ versus on-site interaction $U$ parameter space, at half filling, are largely in disagreement. Here we provide an analysis making use of the average sign of weights over the course of the importance sampling in quantum Monte Carlo simulations to resolve several central open questions. Specifically, this metric of the weights clarifies the finite-sized metallic regimes at small $U$. Furthermore, at strong interactions, it points to the existence of a crossover from a correlated to uncorrelated band insulator not yet explored in a variety of existing, unbiased numerical methods. Our work demonstrates the versatility of using properties of the weights in quantum Monte Carlo simulations to reveal important physical characteristics of the models under study. △ Less

Submitted 16 September, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

Comments: 6+7 pages, 3+9 figures, update the figures

Journal ref: Phys. Rev. B 106, 125116 (2022)

arXiv:2112.04078 [pdf, ps, other]

doi 10.1007/s10909-022-02737-5

Renormalization of dispersion in electron-doped bilayer cuprate superconductors

Authors: Shuning Tan, Yiqun Liu, Ying** Mou, Huaiming Guo, Shi** Feng

Abstract: The renormalization of the electrons in cuprate superconductors is characterized by the kink in the quasiparticle dispersion. Here the bilayer coupling effect on the quasiparticle dispersion kink in the electron-doped bilayer cuprate superconductors is studied based on the kinetic-energy-driven superconductivity. It is shown that the kink in the quasiparticle dispersion is present all around the e… ▽ More The renormalization of the electrons in cuprate superconductors is characterized by the kink in the quasiparticle dispersion. Here the bilayer coupling effect on the quasiparticle dispersion kink in the electron-doped bilayer cuprate superconductors is studied based on the kinetic-energy-driven superconductivity. It is shown that the kink in the quasiparticle dispersion is present all around the electron Fermi surface, as the quasiparticle dispersion kink in the single-layer case. However, in comparison with the corresponding single-layer case, the kink effect in the quasiparticle dispersion at around the antinodal region becomes the most pronounced, indicating that the kink effect in the quasiparticle dispersion at around the antinodal region is enhanced by the bilayer coupling. △ Less

Submitted 7 December, 2021; originally announced December 2021.

Comments: 6 pages, 3 figures

Journal ref: Journal of Low Temperature Physics 207, 250-263 (2022)

arXiv:2111.12830 [pdf, other]

TSO-DSOs Stable Cost Allocation for the Joint Procurement of Flexibility: A Cooperative Game Approach

Authors: Anibal Sanjab, Hélène Le Cadre, Yuting Mou

Abstract: In this paper, a transmission-distribution systems flexibility market is introduced, in which system operators (SOs) jointly procure flexibility from different systems to meet their needs (balancing and congestion management) using a common market. This common market is, then, formulated as a cooperative game aiming at identifying a stable and efficient split of costs of the jointly procured flexi… ▽ More In this paper, a transmission-distribution systems flexibility market is introduced, in which system operators (SOs) jointly procure flexibility from different systems to meet their needs (balancing and congestion management) using a common market. This common market is, then, formulated as a cooperative game aiming at identifying a stable and efficient split of costs of the jointly procured flexibility among the participating SOs to incentivize their cooperation. The non-emptiness of the core of this game is then mathematically proven, implying the stability of the game and the naturally-arising incentive for cooperation among the SOs. Several cost allocation mechanisms are then introduced, while characterizing their mathematical properties. Numerical results focusing on an interconnected system (composed of the IEEE 14-bus transmission system and the Matpower 18-bus, 69-bus, and 141-bus distributions systems) showcase the cooperation-induced reduction in system-wide flexibility procurement costs, and identifies the varying costs borne by different SOs under various cost allocations methods. △ Less

Submitted 24 November, 2021; originally announced November 2021.

arXiv:2111.02328 [pdf, other]

A Linear Model for Distributed Flexibility Markets and DLMPs: A Comparison with the SOCP Formulation

Authors: Anibal Sanjab, Yuting Mou, Ana Virag, Kris Kessels

Abstract: This paper examines the performance trade-offs between an introduced linear flexibility market model for congestion management and a benchmark second-order cone programming (SOCP) formulation. The linear market model incorporates voltage magnitudes and reactive powers, while providing a simpler formulation than the SOCP model, which enables its practical implementation. The paper provides a struct… ▽ More This paper examines the performance trade-offs between an introduced linear flexibility market model for congestion management and a benchmark second-order cone programming (SOCP) formulation. The linear market model incorporates voltage magnitudes and reactive powers, while providing a simpler formulation than the SOCP model, which enables its practical implementation. The paper provides a structured comparison of the two formulations relying on developed deterministic and statistical Monte Carlo case analyses using two distribution test systems (the Matpower 69-bus and 141-bus systems). The case analyses show that with the increasing spread of offered flexibility throughout the system, the linear formulation increasingly preserves the reliability of the computed system variables as compared to the SOCP formulation, while more lenient imposed voltage limits can improve the approximation of prices and power flows at the expense of a less accurate computation of voltage magnitudes. △ Less

Submitted 3 November, 2021; originally announced November 2021.

Comments: CIRED'21

arXiv:2110.14310 [pdf, ps, other]

doi 10.1088/1674-1137/ac3411

Gravitational leptogenesis in teleparallel and symmetric teleparallel gravities

Authors: Mingzhe Li, Yicen Mou, Haomin Rao, Dehao Zhao

Abstract: In this paper, we consider the possibilities of generating baryon number asymmetry in thermal equilibrium within the frameworks of teleparallel and symmetric teleparallel gravities. Through the derivative couplings of the torsion scalar or the non-metricity scalar to baryons, the baryon number asymmetry is indeed produced in the radiation dominated epoch. For gravitational baryogenesis mechanisms… ▽ More In this paper, we consider the possibilities of generating baryon number asymmetry in thermal equilibrium within the frameworks of teleparallel and symmetric teleparallel gravities. Through the derivative couplings of the torsion scalar or the non-metricity scalar to baryons, the baryon number asymmetry is indeed produced in the radiation dominated epoch. For gravitational baryogenesis mechanisms in these two frameworks, the produced baryon-to-entropy ratio is too small to be consistent with observations. But the gravitational leptogenesis models within both frameworks have the possibilities to interpret the observed baryon-antibaryon asymmetry. △ Less

Submitted 27 October, 2021; originally announced October 2021.

Comments: 7 pages, to be published in Chinese Physics C

Report number: USTC-ICTS/PCFT-21-38

arXiv:2110.09074 [pdf, other]

Towards General Deep Leakage in Federated Learning

Authors: Jiahui Geng, Yongli Mou, Feifei Li, Qing Li, Oya Beyan, Stefan Decker, Chunming Rong

Abstract: Unlike traditional central training, federated learning (FL) improves the performance of the global model by sharing and aggregating local models rather than local data to protect the users' privacy. Although this training approach appears secure, some research has demonstrated that an attacker can still recover private data based on the shared gradient information. This on-the-fly reconstruction… ▽ More Unlike traditional central training, federated learning (FL) improves the performance of the global model by sharing and aggregating local models rather than local data to protect the users' privacy. Although this training approach appears secure, some research has demonstrated that an attacker can still recover private data based on the shared gradient information. This on-the-fly reconstruction attack deserves to be studied in depth because it can occur at any stage of training, whether at the beginning or at the end of model training; no relevant dataset is required and no additional models need to be trained. We break through some unrealistic assumptions and limitations to apply this reconstruction attack in a broader range of scenarios. We propose methods that can reconstruct the training data from shared gradients or weights, corresponding to the FedSGD and FedAvg usage scenarios, respectively. We propose a zero-shot approach to restore labels even if there are duplicate labels in the batch. We study the relationship between the label and image restoration. We find that image restoration fails even if there is only one incorrectly inferred label in the batch; we also find that when batch images have the same label, the corresponding image is restored as a fusion of that class of images. Our approaches are evaluated on classic image benchmarks, including CIFAR-10 and ImageNet. The batch size, image quality, and the adaptability of the label distribution of our approach exceed those of GradInversion, the state-of-the-art. △ Less

Submitted 25 January, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

arXiv:2109.00213 [pdf, other]

doi 10.1103/PhysRevB.105.155154

Enhancement of $d$-wave pairing in the striped phase with the nearest neighbour attraction

Authors: Lufeng Zhang, Ting Guo, Ying** Mou, Qiaoni Chen, Tianxing Ma

Abstract: Recently, the experimental results by the angle-resolved photoemission spectroscopy suggested that an additional strong nearest neighbor attraction in the Hubbard model might be significant to describe the properties of doped cuprates more accurately. The stripe-ordered patterns, formed by the inhomogeneous distribution of spin, charge and pairing correlations in the CuO$_{2}$ planes, is a known f… ▽ More Recently, the experimental results by the angle-resolved photoemission spectroscopy suggested that an additional strong nearest neighbor attraction in the Hubbard model might be significant to describe the properties of doped cuprates more accurately. The stripe-ordered patterns, formed by the inhomogeneous distribution of spin, charge and pairing correlations in the CuO$_{2}$ planes, is a known feature of doped cuprates. In this work, the effect of the nearest neighbor attraction and the stripe phase are examined by using the constrained path quantum Monte Carlo method within the repulsive Hubbard model on two-dimensional square lattice. The ground state spin correlations along and cross the stripe regions, and the $d$-wave pairing correlation are calculated. It is found that the spin-spin correlation is the highest when the interstripe region is fairly close to half-filling, and $d$-wave superconducting correlation on neighboring sites could be enhanced in the presence of stripe pattern and strong nearest neighbor attraction, which reveals their crucial roles on superconductivity in the doped cuprates. △ Less

Submitted 20 April, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

Comments: Accepted for publication in Physical Review B

Journal ref: Phys. Rev. B, 105 155154(2022)

arXiv:2108.11425 [pdf, other]

doi 10.1103/PhysRevX.11.041038

Quantum fluctuations of charge order induce phonon softening in a superconducting cuprate

Authors: H. Y. Huang, A. Singh, C. Y. Mou, S. Johnston, A. F. Kemper, J. van den Brink, P. J. Chen, T. K. Lee, J. Okamoto, Y. Y. Chu, J. H. Li, S. Komiya, A. C. Komarek, A. Fujimori, C. T. Chen, D. J. Huang

Abstract: Quantum phase transitions play an important role in sha** the phase diagram of high-temperature cuprate superconductors. These cuprates possess intertwined orders which interact strongly with superconductivity. However, the evidence for the quantum critical point associated with the charge order in the superconducting phase remains elusive. Here we show the short-range charge orders and the spec… ▽ More Quantum phase transitions play an important role in sha** the phase diagram of high-temperature cuprate superconductors. These cuprates possess intertwined orders which interact strongly with superconductivity. However, the evidence for the quantum critical point associated with the charge order in the superconducting phase remains elusive. Here we show the short-range charge orders and the spectral signature of the quantum fluctuations in La$_{2-x}$Sr$_x$CuO$_4$ (LSCO) near the optimal do** using high-resolution resonant inelastic X-ray scattering. On performing calculations through a diagrammatic framework, we discovered that the charge correlations significantly soften several branches of phonons. These results elucidate the role of charge order in the LSCO compound, providing evidence for quantum critical scaling and discommensurations associated with charge order. △ Less

Submitted 26 September, 2021; v1 submitted 25 August, 2021; originally announced August 2021.

Comments: 9 pages, 4 figures

Journal ref: Phys. Rev. X 11, 041038 (2021)

arXiv:2108.01606 [pdf, other]

Optical vortex coronagraph imaging of a laser-induced plasma filament

Authors: Qingqing Liang, Xia Huang, Yanfei Mou, Shaodong Zhou, Wenxing Zhang, Jieyu Gui, Grover A. Swartzlander, JR., Qingqing Cheng, Yi Liu

Abstract: A high contrast imaging technique based on an optical vortex coronagraph (OVC) is used to measure the spatial phase profile induced by an air plasma generated by a femtosecond laser pulse. The sensitivity of the OVC method significantly surpassed both in-line holographic and direct imaging methods based on air plasma fluorescence. The estimated phase sensitivity of 0.046 waves provides opportuniti… ▽ More A high contrast imaging technique based on an optical vortex coronagraph (OVC) is used to measure the spatial phase profile induced by an air plasma generated by a femtosecond laser pulse. The sensitivity of the OVC method significantly surpassed both in-line holographic and direct imaging methods based on air plasma fluorescence. The estimated phase sensitivity of 0.046 waves provides opportunities for OVC applications in areas such as bioimaging, material characterization, as well as plasma diagnostics. △ Less

Submitted 3 August, 2021; originally announced August 2021.

arXiv:2104.10422 [pdf, other]

doi 10.1103/PhysRevC.104.024903

Investigations on mixed harmonic cumulants in heavy-ion collisions at the LHC

Authors: Ming Li, You Zhou, Wenbin Zhao, Baochi Fu, Yawen Mou, Huichao Song

Abstract: A series of new flow observables mixed harmonic multi-particle cumulants (MHC), which allow for the first time to quantify the correlations strength between different order of flow coefficients with various moments, was investigated using hydrodynamic model. These new observables are constructed based on multi-particle cumulants, and thus by design will be less sensitive to the non-flow contaminat… ▽ More A series of new flow observables mixed harmonic multi-particle cumulants (MHC), which allow for the first time to quantify the correlations strength between different order of flow coefficients with various moments, was investigated using hydrodynamic model. These new observables are constructed based on multi-particle cumulants, and thus by design will be less sensitive to the non-flow contaminations. In addition to the previous study of correlation involving two flow coefficients with their second moments, both correlations of three flow coefficients and the correlations of higher order moments of $v_2$ and $v_3$ are systematically investigated using iEBE-VISHNU hybrid model with two different initial conditions, AMPT and TRENTo, respectively. These systematic studies using hydrodynamic models will significantly improve the understanding on the correlations between different orders (and moments) of flow coefficients. The hydrodynamic predictions shown in this paper and the future comparisons to experimental measurements will provide more constraints on theoretical models and extract more information about the transport properties of the quark-gluon plasma created in heavy-ion collisions. △ Less

Submitted 21 April, 2021; originally announced April 2021.

Comments: 10 pages, 5 figures

Journal ref: Phys. Rev. C 104, 024903 (2021)

arXiv:2103.13226 [pdf, other]

Distributed Learning for Melanoma Classification using Personal Health Train

Authors: Yongli Mou, Sascha Welten, Yeliz Ucer Yediel, Toralf Kirsten, Oya Deniz Beyan

Abstract: Skin cancer is the most common cancer type. Usually, patients with suspicion of cancer are treated by doctors without any aided visual inspection. At this point, dermoscopy has become a suitable tool to support physicians in their decision-making. However, clinicians need years of expertise to classify possibly malicious skin lesions correctly. Therefore, research has applied image processing and… ▽ More Skin cancer is the most common cancer type. Usually, patients with suspicion of cancer are treated by doctors without any aided visual inspection. At this point, dermoscopy has become a suitable tool to support physicians in their decision-making. However, clinicians need years of expertise to classify possibly malicious skin lesions correctly. Therefore, research has applied image processing and analysis tools to improve the treatment process. In order to perform image analysis and train a model on dermoscopic images data needs to be centralized. Nevertheless, data centralization does not often comply with local data protection regulations due to its sensitive nature and due to the loss of sovereignty if data providers allow unlimited access to the data. A method to circumvent all privacy-related challenges of data centralization is Distributed Analytics (DA) approaches, which bring the analysis to the data instead of vice versa. This paradigm shift enables data analyses - in our case, image analysis - with data remaining inside institutional borders, i.e., the origin. In this documentation, we describe a straightforward use case including a model training for skin lesion classification based on decentralised data. △ Less

Submitted 24 March, 2021; originally announced March 2021.

Comments: 11 pages, 7 figures

arXiv:2010.06781 [pdf, other]

doi 10.1103/PhysRevB.103.014503

Anisotropic dressing of electrons in electron-doped cuprate superconductors

Authors: Shuning Tan, Yiqun Liu, Ying** Mou, Shi** Feng

Abstract: The recent experiments revealed a remarkable possibility for the absence of the disparity between the phase diagrams of the electron- and hole-doped cuprate superconductors, while such an aspect should be also reflected in the dressing of the electrons. Here the phase diagram of the electron-doped cuprate superconductors and the related exotic features of the anisotropic dressing of the electrons… ▽ More The recent experiments revealed a remarkable possibility for the absence of the disparity between the phase diagrams of the electron- and hole-doped cuprate superconductors, while such an aspect should be also reflected in the dressing of the electrons. Here the phase diagram of the electron-doped cuprate superconductors and the related exotic features of the anisotropic dressing of the electrons are studied based on the kinetic-energy driven superconductivity. It is shown that although the optimized Tc in the electron-doped side is much smaller than that in the hole-doped case, the electron- and hole-doped cuprate superconductors rather resemble each other in the do** range of the superconducting dome, indicating an absence of the disparity between the phase diagrams of the electron- and hole-doped cuprate superconductors. In particular, the anisotropic dressing of the electrons due to the strong electron's coupling to a strongly dispersive spin excitation leads to that the electron Fermi surface is truncated to form the disconnected Fermi arcs centered around the nodal region. Concomitantly, the dip in the peak-dip-hump structure of the quasiparticle excitation spectrum is directly associated with the corresponding peak in the quasiparticle scattering rate, while the dispersion kink is always accompanied by the corresponding inflection point in the total self-energy, as the dip in the peak-dip-hump structure and dispersion kink in the hole-doped counterparts. The theory also predicts that both the normal and anomalous self-energies exhibit the well-pronounced low-energy peak-structures. △ Less

Submitted 8 January, 2021; v1 submitted 13 October, 2020; originally announced October 2020.

Comments: 13 pages, 9 figures,added figure, references, and discussions; accepted for publication in Physical Review B

Journal ref: Phys. Rev. B 103, 014503 (2021)

arXiv:2009.12093 [pdf, other]

Designing Day-Ahead Multi-carrier Markets for Flexibility: Models and Clearing Algorithms

Authors: Shahab Shariat Torbaghan, Mehdi Madani, Peter Sels, Ana Virag, Hélène Le Cadre, Kris Kessels, Yuting Mou

Abstract: There is an intrinsic value in higher integration of multi-carrier energy systems (especially gas and electricity), to increase operational flexibility in the electricity system and to improve allocation of resources in gas and electricity networks. The integration of different energy carrier markets is challenging due to the existence of physical and economic dependencies between the different en… ▽ More There is an intrinsic value in higher integration of multi-carrier energy systems (especially gas and electricity), to increase operational flexibility in the electricity system and to improve allocation of resources in gas and electricity networks. The integration of different energy carrier markets is challenging due to the existence of physical and economic dependencies between the different energy carriers. We propose in this paper an integrated day-ahead multi-carrier gas, electricity and heat market clearing which includes new types of orders and constraints on these orders to represent techno-economic constraints of conversion and storage technologies. We prove that the proposed market clearing gives rise to competitive equilibria. In addition, we propose two decentralised clearing algorithms which differ in how the decomposition of the underlying centralised clearing optimisation problem is performed. This has implications in terms of the involved agents and their mutual information exchange. It is proven that they yield solutions equivalent to the centralised market clearing under a mild assumption of sufficient number of iterations. We argue that such an integrated multi-carrier energy market mitigates (spot) market risks faced by market participants and enables better spot pricing of the different energy carriers. The results show that conversion/storage technology owners would suffer from losses and/or opportunity costs, if they were obliged to only use elementary orders. For the test cases considered in this article, sum of losses and opportunity costs could reach up to 13,000 EUR/day and 9,000 EUR/day respectively, compared with the case where conversion and storage orders are used. △ Less

Submitted 3 January, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

arXiv:2001.06748 [pdf, other]

doi 10.1007/s10948-019-05369-1

ARPES autocorrelation in electron-doped cuprate superconductors

Authors: Shuning Tan, Ying** Mou, Yiqun Liu, Shi** Feng

Abstract: The angle-resolved photoemission spectroscopy (ARPES) autocorrelation in the electron-doped cuprate superconductors is studied based on the kinetic-energy driven superconducting (SC) mechanism. It is shown that the strong electron correlation induces the electron Fermi surface (EFS) reconstruction, where the most of the quasiparticles locate at around the hot spots on EFS, and then these hot spots… ▽ More The angle-resolved photoemission spectroscopy (ARPES) autocorrelation in the electron-doped cuprate superconductors is studied based on the kinetic-energy driven superconducting (SC) mechanism. It is shown that the strong electron correlation induces the electron Fermi surface (EFS) reconstruction, where the most of the quasiparticles locate at around the hot spots on EFS, and then these hot spots connected by the scattering wave vectors ${\bf q}_{i}$ construct an {\it octet} scattering model. In a striking analogy to the hole-doped case, the sharp ARPES autocorrelation peaks are directly correlated with the scattering wave vectors ${\bf q}_{i}$, and are weakly dispersive in momentum space. However, in a clear contrast to the hole-doped counterparts, the position of the ARPES autocorrelation peaks move toward to the opposite direction with the increase of do**. The theory also indicates that there is an intrinsic connection between the ARPES autocorrelation and quasiparticle scattering interference (QSI) in the electron-doped cuprate superconductors. △ Less

Submitted 18 January, 2020; originally announced January 2020.

Comments: 6 pages, 6 figures, to be published in Journal of Superconductivity and Novel Magnetism

Journal ref: Journal of Superconductivity and Novel Magnetism 33, 2305 (2020)

arXiv:2001.01054 [pdf, other]

doi 10.1016/j.physc.2020.1353661

Renormalization of electrons in bilayer cuprate superconductors

Authors: Yiqun Liu, Yu Lan, Ying** Mou, Shi** Feng

Abstract: The characteristic features of the renormalization of the electrons in the bilayer cuprate superconductors are investigated within the kinetic-energy driven superconductivity. It is shown that the quasiparticle excitation spectrum is split into its bonding and antibonding components due to the presence of the bilayer coupling, with each component that is independent. However, in the underdoped and… ▽ More The characteristic features of the renormalization of the electrons in the bilayer cuprate superconductors are investigated within the kinetic-energy driven superconductivity. It is shown that the quasiparticle excitation spectrum is split into its bonding and antibonding components due to the presence of the bilayer coupling, with each component that is independent. However, in the underdoped and optimally doped regimes, although the bonding and antibonding electron Fermi surface (EFS) contours deriving from the bonding and antibonding layers are truncated to form the bonding and antibonding Fermi arcs, almost all spectral weights in the bonding and antibonding Fermi arcs are reduced to the tips of the bonding and antibonding Fermi arcs, which in this case coincide with the bonding and antibonding hot spots. These hot spots connected by the scattering wave vectors ${\bf q}_{i} $ construct an octet scattering model, and then the enhancement of the quasiparticle scattering processes with the scattering wave vectors ${\bf q}_{i}$ is confirmed via the result of the autocorrelation of the ARPES spectral intensities. Moreover, the peak-dip-hump (PDH) structure developed in each component of the quasiparticle excitation spectrum along the corresponding EFS is directly related with the peak structure in the quasiparticle scattering rate except for at around the hot spots, where the PDH structure is caused mainly by the bilayer coupling. Although the kink in the quasiparticle dispersion is present all around EFS, when the momentum moves away from the node to the antinode, the kink energy smoothly decreases, while the dispersion kink becomes more pronounced, and in particular, near the cut close to the antinode, develops into a break separating of the fasting dispersing high-energy part of the quasiparticle excitation spectrum from the slower dispersing low-energy part. △ Less

Submitted 1 April, 2020; v1 submitted 4 January, 2020; originally announced January 2020.

Comments: 26 pages, 16 figures, added discussions and updated the references

Journal ref: Physica C 576, 1353661 (2020)

arXiv:1909.04355 [pdf, ps, other]

Minimization of Sum Inverse Energy Efficiency for Multiple Base Station Systems

Authors: Zijian Wang, Luc Vandendorpe, Mateen Ashraf, Yuting Mou, Nafiseh Janatian

Abstract: A sum inverse energy efficiency (SIEE) minimization problem is solved. Compared with conventional sum energy efficiency (EE) maximization problems, minimizing SIEE achieves a better fairness. The paper begins by proposing a framework for solving sum-fraction minimization (SFMin) problems, then uses a novel transform to solve the SIEE minimization problem in a multiple base station (BS) system. Aft… ▽ More A sum inverse energy efficiency (SIEE) minimization problem is solved. Compared with conventional sum energy efficiency (EE) maximization problems, minimizing SIEE achieves a better fairness. The paper begins by proposing a framework for solving sum-fraction minimization (SFMin) problems, then uses a novel transform to solve the SIEE minimization problem in a multiple base station (BS) system. After the reformulation into a multi-convex problem, the alternating direction method of multipliers (ADMM) is used to further simplify the problem. Numerical results confirm the efficiency of the transform and the fairness improvement of the SIEE minimization. Simulation results show that the algorithm convergences fast and the ADMM method is efficient. △ Less

Submitted 10 September, 2019; originally announced September 2019.

arXiv:1907.06882 [pdf, other]

Learning Depth from Monocular Videos Using Synthetic Data: A Temporally-Consistent Domain Adaptation Approach

Authors: Yipeng Mou, Mingming Gong, Huan Fu, Kayhan Batmanghelich, Kun Zhang, Dacheng Tao

Abstract: Majority of state-of-the-art monocular depth estimation methods are supervised learning approaches. The success of such approaches heavily depends on the high-quality depth labels which are expensive to obtain. Some recent methods try to learn depth networks by leveraging unsupervised cues from monocular videos which are easier to acquire but less reliable. In this paper, we propose to resolve thi… ▽ More Majority of state-of-the-art monocular depth estimation methods are supervised learning approaches. The success of such approaches heavily depends on the high-quality depth labels which are expensive to obtain. Some recent methods try to learn depth networks by leveraging unsupervised cues from monocular videos which are easier to acquire but less reliable. In this paper, we propose to resolve this dilemma by transferring knowledge from synthetic videos with easily obtainable ground-truth depth labels. Due to the stylish difference between synthetic and real images, we propose a temporally-consistent domain adaptation (TCDA) approach that simultaneously explores labels in the synthetic domain and temporal constraints in the videos to improve style transfer and depth prediction. Furthermore, we make use of the ground-truth optical flow and pose information in the synthetic data to learn moving mask and pose prediction networks. The learned moving masks can filter out moving regions that produces erroneous temporal constraints and the estimated poses provide better initializations for estimating temporal constraints. Experimental results demonstrate the effectiveness of our method and comparable performance against state-of-the-art. △ Less

Submitted 26 November, 2019; v1 submitted 16 July, 2019; originally announced July 2019.

arXiv:1907.00663 [pdf, other]

doi 10.1007/s10948-019-05279-2

Do** dependence of electromagnetic response in cuprate superconductors

Authors: Yiqun Liu, Ying** Mou, Shi** Feng

Abstract: The study of the electromagnetic response in cuprate superconductors plays a crucial role in the understanding of the essential physics of these materials. Here the do** dependence of the electromagnetic response in cuprate superconductors is studied within the kinetic-energy driven superconducting mechanism. The kernel of the response function is evaluated based on the linear response approxima… ▽ More The study of the electromagnetic response in cuprate superconductors plays a crucial role in the understanding of the essential physics of these materials. Here the do** dependence of the electromagnetic response in cuprate superconductors is studied within the kinetic-energy driven superconducting mechanism. The kernel of the response function is evaluated based on the linear response approximation for a purely transverse vector potential, and can be broken up into its diamagnetic and paramagnetic parts. In particular, this paramagnetic part exactly cancels the corresponding diamagnetic part in the normal-state, and then the Meissner effect is obtained within the entire superconducting phase. Following this kernel of the response function, the electromagnetic response calculation in terms of the specular reflection model qualitatively reproduces many of the striking features observed in the experiments. In particular, the local magnetic-field profile follows an exponential law, while the superfluid density exhibits the nonlinear temperature behavior at the lowest temperatures, followed by the linear temperature dependence extending over the most of the superconducting temperature range. Moreover, the maximal value of the superfluid density occurs at around the critical do** $δ_{\rm critical}\sim 0.16$, and then decreases in both lower doped and higher doped regimes. The theory also shows that the nonlinear temperature dependence of the superfluid density at the lowest temperatures can be attributed to the nonlocal effects induced by the d-wave gap nodes on the electron Fermi surface. △ Less

Submitted 4 September, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

Comments: 11 pages, 5 figures, typos corrected. To be published in a special issue of Journal of Superconductivity and Novel Magnetism in honor of Ted Geballe on the occasion of his 100th birthday. arXiv admin note: text overlap with arXiv:1501.02420, arXiv:1008.1472

Journal ref: J. Supercond. Nov. Magn. 33, 69 (2020)

arXiv:1904.09435 [pdf, other]

Estimating Emotional Intensity from Body Poses for Human-Robot Interaction

Authors: Mingfei Sun, Yiqing Mou, Hongwen Xie, Meng Xia, Michelle Wong, Xiaojuan Ma

Abstract: Equip** social and service robots with the ability to perceive human emotional intensities during an interaction is in increasing demand. Most of existing work focuses on determining which emotion(s) participants are expressing from facial expressions but largely overlooks the emotional intensities spontaneously revealed by other social cues, especially body languages. In this paper, we present… ▽ More Equip** social and service robots with the ability to perceive human emotional intensities during an interaction is in increasing demand. Most of existing work focuses on determining which emotion(s) participants are expressing from facial expressions but largely overlooks the emotional intensities spontaneously revealed by other social cues, especially body languages. In this paper, we present a real-time method for robots to capture fluctuations of participants' emotional intensities from their body poses. Unlike conventional joint-position-based approaches, our method adopts local joint transformations as pose descriptors which are invariant to subject body differences as well as the pose sensor positions. In addition, we use a Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) architecture to take the specific emotion context into account when estimating emotional intensities from body poses. The dataset evaluation suggests that the proposed method is effective and performs better than baseline method on the test dataset. Also, a series of succeeding field tests on a physical robot demonstrates that the proposed method effectively estimates subjects emotional intensities in real-time. Furthermore, the robot equipped with our method is perceived to be more emotion-sensitive and more emotionally intelligent. △ Less

Submitted 20 April, 2019; originally announced April 2019.

Comments: 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC2018)

arXiv:1903.00803 [pdf, other]

doi 10.1080/14786435.2019.1635722

Do** and momentum dependence of coupling strength in cuprate superconductors

Authors: Ying** Mou, Yiqun Liu, Shuning Tan, Shi** Feng

Abstract: Superconductivity is caused by the interaction between electrons by the exchange of collective bosonic excitations, however, this bosonic glue forming electron pairs is manifested itself by the coupling strength of the electrons to collective bosonic excitations. Here the do** and momentum dependence of the coupling strength of the electrons to spin excitations in cuprate superconductors is stud… ▽ More Superconductivity is caused by the interaction between electrons by the exchange of collective bosonic excitations, however, this bosonic glue forming electron pairs is manifested itself by the coupling strength of the electrons to collective bosonic excitations. Here the do** and momentum dependence of the coupling strength of the electrons to spin excitations in cuprate superconductors is studied within the framework of the kinetic-energy-driven superconducting mechanism. The normal self-energy in the particle-hole channel and pairing self-energy in the particle-pariticle channel generated by the interaction between electrons by the exchange of spin excitation are employed to extract the coupling strengths of the electrons to spin excitations in the particle-hole and particle-particle channels, respectively. It is shown that below the superconducting transition temperature, both the coupling strengths in the particle-hole and particle-particle channels around the antinodes consist of two peaks, with a sharp low-energy peak located at around 5 meV in the optimally doped regime, and a broad band with a weak peak centered at around 40 meV. In particular, this two-peak structure in the coupling strength in the particle-hole channel can persist into the normal-state, while as a consequence of the d-wave type symmetry of the superconducting gap, the coupling strength in the particle-particle channel vanishes at the nodes. However, the positions of the peaks in the coupling strengths in the underdoped regime shift towards to higher energies with the increase of do**. More specifically, although the positions of the peaks in the coupling strengths move to lower energies from the antinode to the hot spot on the electron Fermi surface, the weights of the peaks decrease smoothly with the move of the momentum from the antinode to the hot spot, and fade away at the hot spots. △ Less

Submitted 21 June, 2019; v1 submitted 2 March, 2019; originally announced March 2019.

Comments: 10 pages, 6 figures, added discussions, accepted for publication in Philosophical Magazine

Journal ref: Philosophical Magazine 99, 2718-2735 (2019)

arXiv:1810.08919 [pdf, other]

doi 10.1080/14786435.2018.1551635

Autocorrelation of quasiparticle spectral intensities and its connection with quasiparticle scattering interference in cuprate superconductors

Authors: Deheng Gao, Ying** Mou, Yiqun Liu, Shuning Tan, Shi** Feng

Abstract: The quasiparticle excitation is one of the most fundamental and ubiquitous physical observables in cuprate superconductors, carrying information about the bosonic glue forming electron pairs. Here the autocorrelation of the quasiparticle excitation spectral intensities in cuprate superconductors and its connection with the quasiparticle scattering interference are investigated based on the framewo… ▽ More The quasiparticle excitation is one of the most fundamental and ubiquitous physical observables in cuprate superconductors, carrying information about the bosonic glue forming electron pairs. Here the autocorrelation of the quasiparticle excitation spectral intensities in cuprate superconductors and its connection with the quasiparticle scattering interference are investigated based on the framework of the kinetic-energy driven superconducting mechanism by taking into account the pseudogap effect. It is shown that the octet scattering model of the quasiparticle scattering processes with the scattering wave vectors ${\bf q}_{i}$ connecting the hot spots on the constant energy contours is intrinsically related to the emergence of the highly anisotropic momentum-dependence of the pseudogap. Concomitantly, the sharp peaks in the autocorrelation of the quasiparticle excitation spectral intensities with the wave vectors ${\bf q}_{i}$ are directly correlated to the regions of the highest joint density of states. Moreover, the momentum-space structure of the autocorrelation patterns of the quasiparticle excitation spectral intensities is well consistent with the momentum-space structure of the quasiparticle scattering interference patterns observed from Fourier-transform scanning tunneling spectroscopy experiments. The theory therefore confirms an intimate connection between the angle-resolved photoemission spectroscopy autocorrelation and quasiparticle scattering interference in cuprate superconductors. △ Less

Submitted 29 November, 2018; v1 submitted 21 October, 2018; originally announced October 2018.

Comments: 10 pages, 7 figures,typos corrected, accepted for publication in Philosophical Magazine

Journal ref: Philosophical Magazine 99, 752 (2019)

Showing 1–50 of 61 results for author: Mou, Y