Search | arXiv e-print repository

Orbital cluster-based network modelling

Authors: Antonio Colanera, Nan Deng, Matteo Chiatto, Luigi de Luca, Bernd R. Noack

Abstract: We propose a novel reduced-order methodology to describe complex multi-frequency fluid dynamics from time-resolved snapshot data. Starting point is the Cluster-based Network Model (CNM) thanks to its fully automatable development and human interpretability. Our key innovation is to model the transitions from cluster to cluster much more accurately by replacing snapshot states with short-term traje… ▽ More We propose a novel reduced-order methodology to describe complex multi-frequency fluid dynamics from time-resolved snapshot data. Starting point is the Cluster-based Network Model (CNM) thanks to its fully automatable development and human interpretability. Our key innovation is to model the transitions from cluster to cluster much more accurately by replacing snapshot states with short-term trajectories ("orbits") over multiple clusters, thus avoiding nonphysical intra-cluster diffusion in the dynamic reconstruction. The proposed orbital CNM (oCNM) employs functional clustering to coarse-grain the short-term trajectories. Specifically, different filtering techniques, resulting in different temporal basis expansions, demonstrate the versatility and capability of the oCNM to adapt to diverse flow phenomena. The oCNM is illustrated on the Stuart-Landau oscillator and its post-transient solution with time-varying parameters to test its ability to capture the amplitude selection mechanism and multi-frequency behaviours. Then, the oCNM is applied to the fluidic pinball across varying flow regimes at different Reynolds numbers, including the periodic, quasi-periodic, and chaotic dynamics. This orbital-focused perspective enhances the understanding of complex temporal behaviours by incorporating high-frequency behaviour into the kinematics of short-time trajectories while modelling the dynamics of the lower frequencies. In analogy to Spectral Proper Orthogonal Decomposition, which marked the transition from spatial-only modes to spatio-temporal ones, this work advances from analysing temporal local states to examining piecewise short-term trajectories, or orbits. By merging advanced analytical methods, such as the functional representation of short-time trajectories with CNM, this study paves the way for new approaches to dissect the complex dynamics characterising turbulent systems. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.12754 [pdf, other]

Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba

Authors: Ruiqi He, Yushu He, Longju Bai, Jiarui Liu, Zhenjie Sun, Zenghao Tang, He Wang, Hanchen Xia, Naihao Deng

Abstract: Existing humor datasets and evaluations predominantly focus on English, lacking resources for culturally nuanced humor in non-English languages like Chinese. To address this gap, we construct Chumor, a dataset sourced from Ruo Zhi Ba (RZB), a Chinese Reddit-like platform dedicated to sharing intellectually challenging and culturally specific jokes. We annotate explanations for each joke and evalua… ▽ More Existing humor datasets and evaluations predominantly focus on English, lacking resources for culturally nuanced humor in non-English languages like Chinese. To address this gap, we construct Chumor, a dataset sourced from Ruo Zhi Ba (RZB), a Chinese Reddit-like platform dedicated to sharing intellectually challenging and culturally specific jokes. We annotate explanations for each joke and evaluate human explanations against two state-of-the-art LLMs, GPT-4o and ERNIE Bot, through A/B testing by native Chinese speakers. Our evaluation shows that Chumor is challenging even for SOTA LLMs, and the human explanations for Chumor jokes are significantly better than explanations generated by the LLMs. △ Less

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2405.03520 [pdf, other]

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

Authors: Zheng Zhu, Xiaofeng Wang, Wangbo Zhao, Chen Min, Nianchen Deng, Min Dou, Yuqi Wang, Botian Shi, Kai Wang, Chi Zhang, Yang You, Zhaoxiang Zhang, Dawei Zhao, Liang Xiao, Jian Zhao, Jiwen Lu, Guan Huang

Abstract: General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems. Recently, the emergence of the Sora model has attained significant attention due to its remarkable simulation capabilities, which exhibits an incipient comprehension of physical law… ▽ More General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems. Recently, the emergence of the Sora model has attained significant attention due to its remarkable simulation capabilities, which exhibits an incipient comprehension of physical laws. In this survey, we embark on a comprehensive exploration of the latest advancements in world models. Our analysis navigates through the forefront of generative methodologies in video generation, where world models stand as pivotal constructs facilitating the synthesis of highly realistic visual content. Additionally, we scrutinize the burgeoning field of autonomous-driving world models, meticulously delineating their indispensable role in resha** transportation and urban mobility. Furthermore, we delve into the intricacies inherent in world models deployed within autonomous agents, shedding light on their profound significance in enabling intelligent interactions within dynamic environmental contexts. At last, we examine challenges and limitations of world models, and discuss their potential future directions. We hope this survey can serve as a foundational reference for the research community and inspire continued innovation. This survey will be regularly updated at: https://github.com/GigaAI-research/General-World-Models-Survey. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: This survey will be regularly updated at: https://github.com/GigaAI-research/General-World-Models-Survey

arXiv:2402.14851 [pdf, other]

SQL-CRAFT: Text-to-SQL through Interactive Refinement and Enhanced Reasoning

Authors: Hanchen Xia, Feng Jiang, Naihao Deng, Cunxiang Wang, Guojiang Zhao, Rada Mihalcea, Yue Zhang

Abstract: Modern LLMs have become increasingly powerful, but they are still facing challenges in specialized tasks such as Text-to-SQL. We propose SQL-CRAFT, a framework to advance LLMs' SQL generation Capabilities through inteRActive reFinemenT and enhanced reasoning. We leverage an Interactive Correction Loop (IC-Loop) for LLMs to interact with databases automatically, as well as Python-enhanced reasoning… ▽ More Modern LLMs have become increasingly powerful, but they are still facing challenges in specialized tasks such as Text-to-SQL. We propose SQL-CRAFT, a framework to advance LLMs' SQL generation Capabilities through inteRActive reFinemenT and enhanced reasoning. We leverage an Interactive Correction Loop (IC-Loop) for LLMs to interact with databases automatically, as well as Python-enhanced reasoning. We conduct experiments on two Text-to-SQL datasets, Spider and Bird, with performance improvements of up to 5.7% compared to the naive prompting method. Moreover, our method surpasses the current state-of-the-art on the Spider Leaderboard, demonstrating the effectiveness of our framework. △ Less

Submitted 19 February, 2024; originally announced February 2024.

Comments: 11 pages, 3 figures, 6 tables

arXiv:2402.12424 [pdf, other]

Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMs

Authors: Naihao Deng, Zhenjie Sun, Ruiqi He, Aman Sikka, Yulong Chen, Lin Ma, Yue Zhang, Rada Mihalcea

Abstract: In this paper, we investigate the effectiveness of various LLMs in interpreting tabular data through different prompting strategies and data formats. Our analyses extend across six benchmarks for table-related tasks such as question-answering and fact-checking. We introduce for the first time the assessment of LLMs' performance on image-based table representations. Specifically, we compare five te… ▽ More In this paper, we investigate the effectiveness of various LLMs in interpreting tabular data through different prompting strategies and data formats. Our analyses extend across six benchmarks for table-related tasks such as question-answering and fact-checking. We introduce for the first time the assessment of LLMs' performance on image-based table representations. Specifically, we compare five text-based and three image-based table representations, demonstrating the role of representation and prompting on LLM performance. Our study provides insights into the effective use of LLMs on table-related tasks. △ Less

Submitted 5 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: Accepted to ACL 2024 Findings

arXiv:2402.03830 [pdf, other]

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

Authors: Guohang Yan, Jiahao Pi, Jianfei Guo, Zhaotong Luo, Min Dou, Nianchen Deng, Qiusheng Huang, Daocheng Fu, Licheng Wen, Pinlong Cai, Xing Gao, Xinyu Cai, Bo Zhang, Xuemeng Yang, Yeqi Bai, Hongbin Zhou, Botian Shi

Abstract: With deep learning and computer vision technology development, autonomous driving provides new solutions to improve traffic safety and efficiency. The importance of building high-quality datasets is self-evident, especially with the rise of end-to-end autonomous driving algorithms in recent years. Data plays a core role in the algorithm closed-loop system. However, collecting real-world data is ex… ▽ More With deep learning and computer vision technology development, autonomous driving provides new solutions to improve traffic safety and efficiency. The importance of building high-quality datasets is self-evident, especially with the rise of end-to-end autonomous driving algorithms in recent years. Data plays a core role in the algorithm closed-loop system. However, collecting real-world data is expensive, time-consuming, and unsafe. With the development of implicit rendering technology and in-depth research on using generative models to produce data at scale, we propose OASim, an open and adaptive simulator and autonomous driving data generator based on implicit neural rendering. It has the following characteristics: (1) High-quality scene reconstruction through neural implicit surface reconstruction technology. (2) Trajectory editing of the ego vehicle and participating vehicles. (3) Rich vehicle model library that can be freely selected and inserted into the scene. (4) Rich sensors model library where you can select specified sensors to generate data. (5) A highly customizable data generation system can generate data according to user needs. We demonstrate the high quality and fidelity of the generated data through perception performance evaluation on the Carla simulator and real-world data acquisition. Code is available at https://github.com/PJLab-ADG/OASim. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: 10 pages, 9 figures

arXiv:2311.16636 [pdf, ps, other]

On well-posedness of the space-time fractional nonlinear Schrödinger equation

Authors: Mingxuan He, Na Deng, Lu Zhang

Abstract: We study the Cauhcy problem for space-time fractional nonlinear Schrödinger equation with a general nonlinearity. We prove the local well-posedness of it in fractional Sobolev spaces based on the decay estimates and Hölder type estimates. Due to the lack of the semigroup structure of the solution operators, we deduce the decay estimates and Hölder type estimates via the asymptotic expansion of the… ▽ More We study the Cauhcy problem for space-time fractional nonlinear Schrödinger equation with a general nonlinearity. We prove the local well-posedness of it in fractional Sobolev spaces based on the decay estimates and Hölder type estimates. Due to the lack of the semigroup structure of the solution operators, we deduce the decay estimates and Hölder type estimates via the asymptotic expansion of the Mittag-Leffler functions and Bessel functions. In particular, these results also show the dispersion of the solutions. △ Less

Submitted 1 July, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

arXiv:2310.16755 [pdf, other]

HI-TOM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models

Authors: Yinghui He, Yufan Wu, Yilin Jia, Rada Mihalcea, Yulong Chen, Naihao Deng

Abstract: Theory of Mind (ToM) is the ability to reason about one's own and others' mental states. ToM plays a critical role in the development of intelligence, language understanding, and cognitive processes. While previous work has primarily focused on first and second-order ToM, we explore higher-order ToM, which involves recursive reasoning on others' beliefs. We introduce HI-TOM, a Higher Order Theory… ▽ More Theory of Mind (ToM) is the ability to reason about one's own and others' mental states. ToM plays a critical role in the development of intelligence, language understanding, and cognitive processes. While previous work has primarily focused on first and second-order ToM, we explore higher-order ToM, which involves recursive reasoning on others' beliefs. We introduce HI-TOM, a Higher Order Theory of Mind benchmark. Our experimental evaluation using various Large Language Models (LLMs) indicates a decline in performance on higher-order ToM tasks, demonstrating the limitations of current LLMs. We conduct a thorough analysis of different failure cases of LLMs, and share our thoughts on the implications of our findings on the future of NLP. △ Less

Submitted 25 October, 2023; originally announced October 2023.

Comments: Accepted at Findings of EMNLP 2023

Journal ref: Findings of EMNLP 2023

arXiv:2310.11266 [pdf]

Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models

Authors: Khushboo Verma, Marina Moore, Stephanie Wottrich, Karla Robles López, Nishant Aggarwal, Zeel Bhatt, Aagamjit Singh, Bradford Unroe, Salah Basheer, Nitish Sachdeva, Prinka Arora, Harmanjeet Kaur, Tanupreet Kaur, Tevon Hood, Anahi Marquez, Tushar Varshney, Nanfu Deng, Azaan Ramani, Pawanraj Ishwara, Maimoona Saeed, Tatiana López Velarde Peña, Bryan Barksdale, Sushovan Guha, Satwant Kumar

Abstract: In response to the pressing need for advanced clinical problem-solving tools in healthcare, we introduce BooksMed, a novel framework based on a Large Language Model (LLM). BooksMed uniquely emulates human cognitive processes to deliver evidence-based and reliable responses, utilizing the GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) framework to effectively quantify… ▽ More In response to the pressing need for advanced clinical problem-solving tools in healthcare, we introduce BooksMed, a novel framework based on a Large Language Model (LLM). BooksMed uniquely emulates human cognitive processes to deliver evidence-based and reliable responses, utilizing the GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) framework to effectively quantify evidence strength. For clinical decision-making to be appropriately assessed, an evaluation metric that is clinically aligned and validated is required. As a solution, we present ExpertMedQA, a multispecialty clinical benchmark comprised of open-ended, expert-level clinical questions, and validated by a diverse group of medical professionals. By demanding an in-depth understanding and critical appraisal of up-to-date clinical literature, ExpertMedQA rigorously evaluates LLM performance. BooksMed outperforms existing state-of-the-art models Med-PaLM 2, Almanac, and ChatGPT in a variety of medical scenarios. Therefore, a framework that mimics human cognitive stages could be a useful tool for providing reliable and evidence-based responses to clinical inquiries. △ Less

Submitted 17 October, 2023; originally announced October 2023.

arXiv:2310.10311 [pdf, other]

Dynamics-augmented cluster-based network model

Authors: Chang Hou, Nan Deng, Bernd R. Noack

Abstract: In this study, we propose a novel data-driven reduced-order model for complex dynamics, including nonlinear, multi-attractor, multi-frequency, and multiscale behaviours. The starting point is a fully automatable cluster-based network model (CNM) (Li et al. J. Fluid Mech. vol.906, 2021, A21) which kinematically coarse-grains the state with clusters and dynamically predicts the transitions in a netw… ▽ More In this study, we propose a novel data-driven reduced-order model for complex dynamics, including nonlinear, multi-attractor, multi-frequency, and multiscale behaviours. The starting point is a fully automatable cluster-based network model (CNM) (Li et al. J. Fluid Mech. vol.906, 2021, A21) which kinematically coarse-grains the state with clusters and dynamically predicts the transitions in a network model. In the proposed dynamics-augmented CNM (dCNM), the prediction error is reduced with trajectory-based clustering using the same number of centroids. The dCNM is first exemplified for the Lorenz system and then implemented for the three-dimensional sphere wake featuring periodic, quasi-periodic and chaotic flow regimes. For both plants, the dCNM significantly outperforms the CNM in resolving the multi-frequency and multiscale dynamics. This increased prediction accuracy is obtained by stratification of the state space aligned with the direction of the trajectories. Thus, the dCNM has numerous potential applications to a large spectrum of shear flows, even for complex dynamics. △ Less

Submitted 1 March, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

arXiv:2310.06145 [pdf, other]

Revisit the intrinsic features of flip-flop** flow behind side-by-side circular cylinders

Authors: Wailing Chen, Yuhao Yan, Chunning Ji, Md. Mahbub Alam, Narakron Srinil, Bernd R. Noack, Nan Deng

Abstract: As one of the most intriguing wake patterns of two side-by-side circular cylinders at an intermediate gap spacing, the flip-flop** (FF) flow has attracted great attention of fundamental research interest. This FF flow is featured by the intermittently and randomly switching gap flow with correspondingly changing forces of the two cylinders. In this paper, we first present a partition map of the… ▽ More As one of the most intriguing wake patterns of two side-by-side circular cylinders at an intermediate gap spacing, the flip-flop** (FF) flow has attracted great attention of fundamental research interest. This FF flow is featured by the intermittently and randomly switching gap flow with correspondingly changing forces of the two cylinders. In this paper, we first present a partition map of the wake patterns behind two side-by-side circular cylinders and briefly introduce intrinsic features of each flow pattern. We focus on the FF flow aiming to explain: (i) the origin of the FF flow between laminar and turbulent regimes, (ii) their connections in different flow regimes, and (iii) mechanisms of the significantly varying flip-over time scale of the FF flows. In the laminar regime, we further divide the FF flow into the sub-classed I (FF1) and II (FF2), based on their different origins from the in-phase and anti-phase synchronized vortex shedding instabilities, respectively. By exploring the vortex interactions, we show that the FF flow in the turbulent regime has the same origin and similar vortex dynamics as the FF2 wake in the laminar regime, despite some minor disparities. Thus, a connection is established between the FF2 pattern in the laminar flow and the FF pattern in the turbulent flow. For the FF flow in the laminar regime (Re < 150-200), the mildly decreasing switching time, is several vortex shedding periods. However, for the FF flow in the weak turbulence regime (150-200 < Re < 1000-1700), the switching time scale increases significantly with Re owing to the increased vortex formation length. The FF in the strong turbulence regime (Re > 1000-1700) has a switching time scale of several orders of magnitude longer than the vortex shedding period, where the switching scale decreases gradually with Re due to the stronger Kelvin-Helmholtz vortices. △ Less

Submitted 9 October, 2023; originally announced October 2023.

arXiv:2310.05317 [pdf, other]

Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond

Authors: Siyang Liu, Naihao Deng, Sahand Sabour, Yilin Jia, Minlie Huang, Rada Mihalcea

Abstract: We propose task-adaptive tokenization as a way to adapt the generation pipeline to the specifics of a downstream task and enhance long-form generation in mental health. Inspired by insights from cognitive science, our task-adaptive tokenizer samples variable segmentations from multiple outcomes, with sampling probabilities optimized based on task-specific data. We introduce a strategy for building… ▽ More We propose task-adaptive tokenization as a way to adapt the generation pipeline to the specifics of a downstream task and enhance long-form generation in mental health. Inspired by insights from cognitive science, our task-adaptive tokenizer samples variable segmentations from multiple outcomes, with sampling probabilities optimized based on task-specific data. We introduce a strategy for building a specialized vocabulary and introduce a vocabulary merging protocol that allows for the integration of task-specific tokens into the pre-trained model's tokenization step. Through extensive experiments on psychological question-answering tasks in both Chinese and English, we find that our task-adaptive tokenization approach brings a significant improvement in generation performance while using up to 60% fewer tokens. Preliminary experiments point to promising results when using our tokenization approach with very large language models. △ Less

Submitted 13 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

Comments: Accepted at the main conference of The 2023 Conference on Empirical Methods in Natural Language Processing; 8 pages

MSC Class: 68 ACM Class: I.2.7

Journal ref: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

arXiv:2309.08278 [pdf, ps, other]

On the fractional abstract Schrodinger-type evolution equations on the Hilbert space and its applications to the fractional dispersive equations

Authors: Mingxuan He, Na Deng

Abstract: In this paper we prove the local and global well-posedness of the time fractional abstract Schrödinger type evolution equation on the Hilbert space and as an application, we prove the local and global well-posedness of the fractional dispersive equation with static potential under the only assumption that the symbol of P(D) behaves like a polynomial of highest degree m at infinity. In appendix, we… ▽ More In this paper we prove the local and global well-posedness of the time fractional abstract Schrödinger type evolution equation on the Hilbert space and as an application, we prove the local and global well-posedness of the fractional dispersive equation with static potential under the only assumption that the symbol of P(D) behaves like a polynomial of highest degree m at infinity. In appendix, we also give the Holder regularities and the asymptotic behaviors of the mild solution to the linear time fractional abstract Schrödinger type equation. Because of the lack of the semigroup properties of the solution operators, we employ a strategy of proof based on the spectral theorem of the self-adjoint operators and the asymptotic behaviors of the Mittag-Leffler functions. △ Less

Submitted 28 November, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: 44 pages, 0 figures

arXiv:2306.04988 [pdf, other]

StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views

Authors: Jianfei Guo, Nianchen Deng, Xinyang Li, Yeqi Bai, Botian Shi, Chiyu Wang, Chen**g Ding, Dongliang Wang, Yikang Li

Abstract: We present a novel multi-view implicit surface reconstruction technique, termed StreetSurf, that is readily applicable to street view images in widely-used autonomous driving datasets, such as Waymo-perception sequences, without necessarily requiring LiDAR data. As neural rendering research expands rapidly, its integration into street views has started to draw interests. Existing approaches on str… ▽ More We present a novel multi-view implicit surface reconstruction technique, termed StreetSurf, that is readily applicable to street view images in widely-used autonomous driving datasets, such as Waymo-perception sequences, without necessarily requiring LiDAR data. As neural rendering research expands rapidly, its integration into street views has started to draw interests. Existing approaches on street views either mainly focus on novel view synthesis with little exploration of the scene geometry, or rely heavily on dense LiDAR data when investigating reconstruction. Neither of them investigates multi-view implicit surface reconstruction, especially under settings without LiDAR data. Our method extends prior object-centric neural surface reconstruction techniques to address the unique challenges posed by the unbounded street views that are captured with non-object-centric, long and narrow camera trajectories. We delimit the unbounded space into three parts, close-range, distant-view and sky, with aligned cuboid boundaries, and adapt cuboid/hyper-cuboid hash-grids along with road-surface initialization scheme for finer and disentangled representation. To further address the geometric errors arising from textureless regions and insufficient viewing angles, we adopt geometric priors that are estimated using general purpose monocular models. Coupled with our implementation of efficient and fine-grained multi-stage ray marching strategy, we achieve state of the art reconstruction quality in both geometry and appearance within only one to two hours of training time with a single RTX3090 GPU for each street view sequence. Furthermore, we demonstrate that the reconstructed implicit surfaces have rich potential for various downstream tasks, including ray tracing and LiDAR simulation. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: https://ventusff.github.io/streetsurf_web/

arXiv:2305.14663 [pdf, other]

You Are What You Annotate: Towards Better Models through Annotator Representations

Authors: Naihao Deng, Xinliang Frederick Zhang, Siyang Liu, Winston Wu, Lu Wang, Rada Mihalcea

Abstract: Annotator disagreement is ubiquitous in natural language processing (NLP) tasks. There are multiple reasons for such disagreements, including the subjectivity of the task, difficult cases, unclear guidelines, and so on. Rather than simply aggregating labels to obtain data annotations, we instead try to directly model the diverse perspectives of the annotators, and explicitly account for annotators… ▽ More Annotator disagreement is ubiquitous in natural language processing (NLP) tasks. There are multiple reasons for such disagreements, including the subjectivity of the task, difficult cases, unclear guidelines, and so on. Rather than simply aggregating labels to obtain data annotations, we instead try to directly model the diverse perspectives of the annotators, and explicitly account for annotators' idiosyncrasies in the modeling process by creating representations for each annotator (annotator embeddings) and also their annotations (annotation embeddings). In addition, we propose TID-8, The Inherent Disagreement - 8 dataset, a benchmark that consists of eight existing language understanding datasets that have inherent annotator disagreement. We test our approach on TID-8 and show that our approach helps models learn significantly better from disagreements on six different datasets in TID-8 while increasing model size by fewer than 1% parameters. By capturing the unique tendencies and subjectivity of individual annotators through embeddings, our representations prime AI models to be inclusive of diverse viewpoints. △ Less

Submitted 22 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: Accepted to Findings of EMNLP 2023

arXiv:2305.14169 [pdf, other]

EASE: An Easily-Customized Annotation System Powered by Efficiency Enhancement Mechanisms

Authors: Naihao Deng, Yikai Liu, Mingye Chen, Winston Wu, Siyang Liu, Yulong Chen, Yue Zhang, Rada Mihalcea

Abstract: The performance of current supervised AI systems is tightly connected to the availability of annotated datasets. Annotations are usually collected through annotation tools, which are often designed for specific tasks and are difficult to customize. Moreover, existing annotation tools with an active learning mechanism often only support limited use cases. To address these limitations, we present EA… ▽ More The performance of current supervised AI systems is tightly connected to the availability of annotated datasets. Annotations are usually collected through annotation tools, which are often designed for specific tasks and are difficult to customize. Moreover, existing annotation tools with an active learning mechanism often only support limited use cases. To address these limitations, we present EASE, an Easily-Customized Annotation System Powered by Efficiency Enhancement Mechanisms. \sysname provides modular annotation units for building customized annotation interfaces and also provides multiple back-end options that suggest annotations using (1) multi-task active learning; (2) demographic feature based active learning; (3) a prompt system that can query the API of large language models. We conduct multiple experiments and user studies to evaluate our system's flexibility and effectiveness. Our results show that our system can meet the diverse needs of NLP researchers and significantly accelerate the annotation process. △ Less

Submitted 23 May, 2023; originally announced May 2023.

Comments: 20 pages

arXiv:2305.12544 [pdf, other]

Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models

Authors: Oana Ignat, Zhi**g **, Artem Abzaliev, Laura Biester, Santiago Castro, Naihao Deng, Xinyi Gao, Aylin Gunal, Jacky He, Ashkan Kazemi, Muhammad Khalifa, Namho Koh, Andrew Lee, Siyang Liu, Do June Min, Shinka Mori, Joan Nwatu, Veronica Perez-Rosas, Siqi Shen, Zekun Wang, Winston Wu, Rada Mihalcea

Abstract: Recent progress in large language models (LLMs) has enabled the deployment of many generative NLP applications. At the same time, it has also led to a misleading public discourse that ``it's all been solved.'' Not surprisingly, this has, in turn, made many NLP researchers -- especially those at the beginning of their careers -- worry about what NLP research area they should focus on. Has it all be… ▽ More Recent progress in large language models (LLMs) has enabled the deployment of many generative NLP applications. At the same time, it has also led to a misleading public discourse that ``it's all been solved.'' Not surprisingly, this has, in turn, made many NLP researchers -- especially those at the beginning of their careers -- worry about what NLP research area they should focus on. Has it all been solved, or what remaining questions can we work on regardless of LLMs? To address this question, this paper compiles NLP research directions rich for exploration. We identify fourteen different research areas encompassing 45 research directions that require new research and are not directly solvable by LLMs. While we identify many research areas, many others exist; we do not cover areas currently addressed by LLMs, but where LLMs lag behind in performance or those focused on LLM development. We welcome suggestions for other research directions to include: https://bit.ly/nlp-era-llm △ Less

Submitted 15 March, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

Comments: Accepted at COLING 2024

arXiv:2305.07497 [pdf, other]

Dynamically Conservative Self-Driving Planner for Long-Tail Cases

Authors: Weitao Zhou, Zhong Cao, Nanshan Deng, Xiaoyu Liu, Kun Jiang, Diange Yang

Abstract: Self-driving vehicles (SDVs) are becoming reality but still suffer from "long-tail" challenges during natural driving: the SDVs will continually encounter rare, safety-critical cases that may not be included in the dataset they were trained. Some safety-assurance planners solve this problem by being conservative in all possible cases, which may significantly affect driving mobility. To this end, t… ▽ More Self-driving vehicles (SDVs) are becoming reality but still suffer from "long-tail" challenges during natural driving: the SDVs will continually encounter rare, safety-critical cases that may not be included in the dataset they were trained. Some safety-assurance planners solve this problem by being conservative in all possible cases, which may significantly affect driving mobility. To this end, this work proposes a method to automatically adjust the conservative level according to each case's "long-tail" rate, named dynamically conservative planner (DCP). We first define the "long-tail" rate as an SDV's confidence to pass a driving case. The rate indicates the probability of safe-critical events and is estimated using the statistics bootstrapped method with historical data. Then, a reinforcement learning-based planner is designed to contain candidate policies with different conservative levels. The final policy is optimized based on the estimated "long-tail" rate. In this way, the DCP is designed to automatically adjust to be more conservative in low-confidence "long-tail" cases while kee** efficient otherwise. The DCP is evaluated in the CARLA simulator using driving cases with "long-tail" distributed training data. The results show that the DCP can accurately estimate the "long-tail" rate to identify potential risks. Based on the rate, the DCP automatically avoids potential collisions in "long-tail" cases using conservative decisions while not affecting the average velocity in other typical cases. Thus, the DCP is safer and more efficient than the baselines with fixed conservative levels, e.g., an always conservative planner. This work provides a technique to guarantee SDV's performance in unexpected driving cases without resorting to a global conservative setting, which contributes to solving the "long-tail" problem practically. △ Less

Submitted 12 May, 2023; originally announced May 2023.

arXiv:2305.07487 [pdf, other]

Identify, Estimate and Bound the Uncertainty of Reinforcement Learning for Autonomous Driving

Authors: Weitao Zhou, Zhong Cao, Nanshan Deng, Kun Jiang, Diange Yang

Abstract: Deep reinforcement learning (DRL) has emerged as a promising approach for develo** more intelligent autonomous vehicles (AVs). A typical DRL application on AVs is to train a neural network-based driving policy. However, the black-box nature of neural networks can result in unpredictable decision failures, making such AVs unreliable. To this end, this work proposes a method to identify and protec… ▽ More Deep reinforcement learning (DRL) has emerged as a promising approach for develo** more intelligent autonomous vehicles (AVs). A typical DRL application on AVs is to train a neural network-based driving policy. However, the black-box nature of neural networks can result in unpredictable decision failures, making such AVs unreliable. To this end, this work proposes a method to identify and protect unreliable decisions of a DRL driving policy. The basic idea is to estimate and constrain the policy's performance uncertainty, which quantifies potential performance drop due to insufficient training data or network fitting errors. By constraining the uncertainty, the DRL model's performance is always greater than that of a baseline policy. The uncertainty caused by insufficient data is estimated by the bootstrapped method. Then, the uncertainty caused by the network fitting error is estimated using an ensemble network. Finally, a baseline policy is added as the performance lower bound to avoid potential decision failures. The overall framework is called uncertainty-bound reinforcement learning (UBRL). The proposed UBRL is evaluated on DRL policies with different amounts of training data, taking an unprotected left-turn driving case as an example. The result shows that the UBRL method can identify potentially unreliable decisions of DRL policy. The UBRL guarantees to outperform baseline policy even when the DRL policy is not well-trained and has high uncertainty. Meanwhile, the performance of UBRL improves with more training data. Such a method is valuable for the DRL application on real-road driving and provides a metric to evaluate a DRL policy. △ Less

Submitted 12 May, 2023; originally announced May 2023.

arXiv:2210.07467 [pdf, other]

Query Rewriting for Effective Misinformation Discovery

Authors: Ashkan Kazemi, Artem Abzaliev, Naihao Deng, Rui Hou, Scott A. Hale, Verónica Pérez-Rosas, Rada Mihalcea

Abstract: We propose a novel system to help fact-checkers formulate search queries for known misinformation claims and effectively search across multiple social media platforms. We introduce an adaptable rewriting strategy, where editing actions for queries containing claims (e.g., swap a word with its synonym; change verb tense into present simple) are automatically learned through offline reinforcement le… ▽ More We propose a novel system to help fact-checkers formulate search queries for known misinformation claims and effectively search across multiple social media platforms. We introduce an adaptable rewriting strategy, where editing actions for queries containing claims (e.g., swap a word with its synonym; change verb tense into present simple) are automatically learned through offline reinforcement learning. Our model uses a decision transformer to learn a sequence of editing actions that maximizes query retrieval metrics such as mean average precision. We conduct a series of experiments showing that our query rewriting system achieves a relative increase in the effectiveness of the queries of up to 42%, while producing editing action sequences that are human interpretable. △ Less

Submitted 2 October, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

Comments: AACL 2023 (long paper)

arXiv:2209.06650 [pdf, other]

WildQA: In-the-Wild Video Question Answering

Authors: Santiago Castro, Naihao Deng, **xuan Huang, Mihai Burzo, Rada Mihalcea

Abstract: Existing video understanding datasets mostly focus on human interactions, with little attention being paid to the "in the wild" settings, where the videos are recorded outdoors. We propose WILDQA, a video understanding dataset of videos recorded in outside settings. In addition to video question answering (Video QA), we also introduce the new task of identifying visual support for a given question… ▽ More Existing video understanding datasets mostly focus on human interactions, with little attention being paid to the "in the wild" settings, where the videos are recorded outdoors. We propose WILDQA, a video understanding dataset of videos recorded in outside settings. In addition to video question answering (Video QA), we also introduce the new task of identifying visual support for a given question and answer (Video Evidence Selection). Through evaluations using a wide range of baseline models, we show that WILDQA poses new challenges to the vision and language research communities. The dataset is available at https://lit.eecs.umich.edu/wildqa/. △ Less

Submitted 14 September, 2022; originally announced September 2022.

Comments: *: Equal contribution; COLING 2022 oral; project webpage: https://lit.eecs.umich.edu/wildqa/

arXiv:2208.10933 [pdf]

doi 10.1021/acsnano.2c06432

Large-Scale Integrated Flexible Tactile Sensor Array for Sensitive Smart Robotic Touch

Authors: Zhenxuan Zhao, Jianshi Tang, Jian Yuan, Yijun Li, Yuan Dai, Jian Yao, Qingtian Zhang, Sanchuan Ding, Tingyu Li, Ruirui Zhang, Yu Zheng, Zhengyou Zhang, Song Qiu, Qingwen Li, Bin Gao, Ning Deng, He Qian, Fei Xing, Zheng You, Huaqiang Wu

Abstract: In the long pursuit of smart robotics, it has been envisioned to empower robots with human-like senses, especially vision and touch. While tremendous progress has been made in image sensors and computer vision over the past decades, the tactile sense abilities are lagging behind due to the lack of large-scale flexible tactile sensor array with high sensitivity, high spatial resolution, and fast re… ▽ More In the long pursuit of smart robotics, it has been envisioned to empower robots with human-like senses, especially vision and touch. While tremendous progress has been made in image sensors and computer vision over the past decades, the tactile sense abilities are lagging behind due to the lack of large-scale flexible tactile sensor array with high sensitivity, high spatial resolution, and fast response. In this work, we have demonstrated a 64x64 flexible tactile sensor array with a record-high spatial resolution of 0.9 mm (equivalently 28.2 pixels per inch), by integrating a high-performance piezoresistive film (PRF) with a large-area active matrix of carbon nanotube thin-film transistors. PRF with self-formed microstructures exhibited high pressure-sensitivity of ~385 kPa-1 for MWCNTs concentration of 6%, while the 14% one exhibited fast response time of ~3 ms, good linearity, broad detection range beyond 1400 kPa, and excellent cyclability over 3000 cycles. Using this fully integrated tactile sensor array, the footprint maps of an artificial honeybee were clearly identified. Furthermore, we hardware-implemented a smart tactile system by integrating the PRF-based sensor array with a memristor-based computing-in-memory chip to record and recognize handwritten digits and Chinese calligraphy, achieving high classification accuracies of 98.8% and 97.3% in hardware, respectively. The integration of sensor networks with deep learning hardware may enable edge or near-sensor computing with significantly reduced power consumption and latency. Our work could pave the road to building large-scale intelligent sensor networks for next-generation smart robotics. △ Less

Submitted 3 November, 2022; v1 submitted 23 August, 2022; originally announced August 2022.

Comments: Correction in Methods: The weight ratio of TPU:DMF was set to be 1:5

Journal ref: ACS Nano 2022, 16, 16784

arXiv:2208.10099 [pdf, other]

Recent Advances in Text-to-SQL: A Survey of What We Have and What We Expect

Authors: Naihao Deng, Yulong Chen, Yue Zhang

Abstract: Text-to-SQL has attracted attention from both the natural language processing and database communities because of its ability to convert the semantics in natural language into SQL queries and its practical application in building natural language interfaces to database systems. The major challenges in text-to-SQL lie in encoding the meaning of natural utterances, decoding to SQL queries, and trans… ▽ More Text-to-SQL has attracted attention from both the natural language processing and database communities because of its ability to convert the semantics in natural language into SQL queries and its practical application in building natural language interfaces to database systems. The major challenges in text-to-SQL lie in encoding the meaning of natural utterances, decoding to SQL queries, and translating the semantics between these two forms. These challenges have been addressed to different extents by the recent advances. However, there is still a lack of comprehensive surveys for this task. To this end, we review recent progress on text-to-SQL for datasets, methods, and evaluation and provide this systematic survey, addressing the aforementioned challenges and discussing potential future directions. We hope that this survey can serve as quick access to existing work and motivate future research. △ Less

Submitted 22 August, 2022; originally announced August 2022.

Comments: COLING 2022 oral. Github page: https://github.com/text-to-sql-survey-coling22/text-to-sql-survey-coling22.github.io

arXiv:2208.03898 [pdf, other]

DialogSum Challenge: Results of the Dialogue Summarization Shared Task

Authors: Yulong Chen, Naihao Deng, Yang Liu, Yue Zhang

Abstract: We report the results of DialogSum Challenge, the shared task on summarizing real-life scenario dialogues at INLG 2022. Four teams participate in this shared task and three submit their system reports, exploring different methods to improve the performance of dialogue summarization. Although there is a great improvement over the baseline models regarding automatic evaluation metrics, such as Rouge… ▽ More We report the results of DialogSum Challenge, the shared task on summarizing real-life scenario dialogues at INLG 2022. Four teams participate in this shared task and three submit their system reports, exploring different methods to improve the performance of dialogue summarization. Although there is a great improvement over the baseline models regarding automatic evaluation metrics, such as Rouge scores, we find that there is a salient gap between model generated outputs and human annotated summaries by human evaluation from multiple aspects. These findings demonstrate the difficulty of dialogue summarization and suggest that more fine-grained evaluatuion metrics are in need. △ Less

Submitted 3 September, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

Comments: INLG 2021 DialogSum Challenge Result Report. Link to this report on INLG website: https://inlgmeeting.github.io/poster_paper43.html. Presentation video on YouTube: https://www.youtube.com/watch?v=5Bk3tNr_1ng; on Bilibili: https://www.bilibili.com/video/BV13e4y1h7eB?spm_id_from=444.41.list.card_archive.click

arXiv:2207.06151 [pdf, ps, other]

doi 10.1088/1748-0221/17/11/T11003

Study on SiPM performance at low temperatures between $-60^{\circ}$C and $-20^{\circ}$C

Authors: C. Zhong, F. J. Luo, B. Zheng, X. D. Wang, M. Y. Bu, J. Zou, M. N. Deng

Abstract: Radon is the main background source of dark matter and neutrino experiments. Radon concentration ($\rm mBq/m^3$) measurement by liquid scintillation detector is a highly sensitive method at low temperatures using silicon photomultipliers (SiPMs) arrays. The SiPM performance characteristics are closely related to the lower detection limit of the detector. In this study, we built an automatic and ac… ▽ More Radon is the main background source of dark matter and neutrino experiments. Radon concentration ($\rm mBq/m^3$) measurement by liquid scintillation detector is a highly sensitive method at low temperatures using silicon photomultipliers (SiPMs) arrays. The SiPM performance characteristics are closely related to the lower detection limit of the detector. In this study, we built an automatic and accurate low-temperature measurement system to study the single photoelectron spectrum, SPE resolution, optical crosstalk, and after-pulse of the SiPM at different temperatures. As a result, we obtained the variation trend of the SiPM parameters at different temperatures, and the SiPM optimal working conditions were obtained, which can improve the detector's sensitivity △ Less

Submitted 26 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

arXiv:2207.00788 [pdf, other]

Long-Tail Prediction Uncertainty Aware Trajectory Planning for Self-driving Vehicles

Authors: Weitao Zhou, Zhong Cao, Yunkang Xu, Nanshan Deng, Xiaoyu Liu, Kun Jiang, Diange Yang

Abstract: A typical trajectory planner of autonomous driving commonly relies on predicting the future behavior of surrounding obstacles. Recently, deep learning technology has been widely adopted to design prediction models due to their impressive performance. However, such models may fail in the "long-tail" driving cases where the training data is sparse or unavailable, leading to planner failures. To this… ▽ More A typical trajectory planner of autonomous driving commonly relies on predicting the future behavior of surrounding obstacles. Recently, deep learning technology has been widely adopted to design prediction models due to their impressive performance. However, such models may fail in the "long-tail" driving cases where the training data is sparse or unavailable, leading to planner failures. To this end, this work proposes a trajectory planner to consider the prediction model uncertainty arising from insufficient data for safer performance. Firstly, an ensemble network structure estimates the prediction model's uncertainty due to insufficient training data. Then a trajectory planner is designed to consider the worst-case arising from prediction uncertainty. The results show that the proposed method can improve the safety of trajectory planning under the prediction uncertainty caused by insufficient data. At the same time, with sufficient data, the framework will not lead to overly conservative results. This technology helps to improve the safety and reliability of autonomous vehicles under the long-tail data distribution of the real world. △ Less

Submitted 28 July, 2022; v1 submitted 2 July, 2022; originally announced July 2022.

arXiv:2205.05999 [pdf, ps, other]

doi 10.1063/5.0098655

Trajectory-optimized cluster-based network model for the sphere wake

Authors: Chang Hou, Nan Deng, Bernd R. Noack

Abstract: We propose a novel trajectory-optimized Cluster-based Network Model (tCNM) for nonlinear model order reduction from time-resolved data following Li et al. ["Cluster-based network model, " J. Fluid Mech. 906, A21 (2021)] and improving the accuracy for a given number of centroids. The starting point is k-means++ clustering which minimizes the representation error of the snapshots by their closest ce… ▽ More We propose a novel trajectory-optimized Cluster-based Network Model (tCNM) for nonlinear model order reduction from time-resolved data following Li et al. ["Cluster-based network model, " J. Fluid Mech. 906, A21 (2021)] and improving the accuracy for a given number of centroids. The starting point is k-means++ clustering which minimizes the representation error of the snapshots by their closest centroids. The dynamics is presented by 'flights' between the centroids. The proposed trajectory-optimized clustering aims to reduce the kinematic representation error further by shifting the centroids closer to the snapshot trajectory and refining state propagation with trajectory support points. Thus, curved trajectories are better resolved. The resulting tCNM is demonstrated for the sphere wake for three flow regimes, including the periodic, quasi-periodic, and chaotic dynamics. The representation error of tCNM is 5 times smaller as compared to the approximation by the closest centroid. Thus, the error at the same level as Proper Orthogonal Decomposition (POD) of same order. Yet, tCNM has distinct advantages over POD modeling: it is human interpretable by representing dynamics by a handful of coherent structures and their transitions; it shows robust dynamics by design, i.e., stable long-time behavior; and its development is fully automatable, i.e., it does not require tuneable auxiliary closure and other models. △ Less

Submitted 30 June, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

arXiv:2205.00379 [pdf, other]

The Cross-lingual Conversation Summarization Challenge

Authors: Yulong Chen, Ming Zhong, Xuefeng Bai, Naihao Deng, **g Li, Xianchao Zhu, Yue Zhang

Abstract: We propose the shared task of cross-lingual conversation summarization, \emph{ConvSumX Challenge}, opening new avenues for researchers to investigate solutions that integrate conversation summarization and machine translation. This task can be particularly useful due to the emergence of online meetings and conferences. We construct a new benchmark, covering 2 real-world scenarios and 3 language di… ▽ More We propose the shared task of cross-lingual conversation summarization, \emph{ConvSumX Challenge}, opening new avenues for researchers to investigate solutions that integrate conversation summarization and machine translation. This task can be particularly useful due to the emergence of online meetings and conferences. We construct a new benchmark, covering 2 real-world scenarios and 3 language directions, including a low-resource language. We hope that \emph{ConvSumX} can motivate researches to go beyond English and break the barrier for non-English speakers to benefit from recent advances of conversation summarization. △ Less

Submitted 3 May, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

arXiv:2203.14781 [pdf, other]

doi 10.1017/jfm.2022.1039

From snapshots to manifolds - A tale of shear flows

Authors: Ehsan Farzamnik, Andrea Ianiro, Stefano Discetti, Nan Deng, Kilian Oberleithner, Bernd R. Noack, Vanesa Guerrero

Abstract: We propose a novel non-linear manifold learning from snapshot data and demonstrate its superiority over Proper Orthogonal Decomposition (POD) for shedding-dominated shear flows. Key enablers are isometric feature map**, Isomap (Tenenbaum et al., 2000), as encoder and K-nearest neighbours (KNN) algorithm as decoder. The proposed technique is applied to numerical and experimental datasets includin… ▽ More We propose a novel non-linear manifold learning from snapshot data and demonstrate its superiority over Proper Orthogonal Decomposition (POD) for shedding-dominated shear flows. Key enablers are isometric feature map**, Isomap (Tenenbaum et al., 2000), as encoder and K-nearest neighbours (KNN) algorithm as decoder. The proposed technique is applied to numerical and experimental datasets including the fluidic pinball, a swirling jet, and the wake behind a couple of tandem cylinders. Analyzing the fluidic pinball, the manifold is able to describe the pitchfork bifurcation and the chaotic regime with only three feature coordinates. These coordinates are linked to vortex-shedding phases and the force coefficients. The manifold coordinates of the swirling jet are comparable to the POD mode amplitudes, yet allow for a more distinct manifold identification which is less sensitive to measurement noise. As similar observation is made for the wake of two tandem cylinders (Raiola et al., 2016). The tandem cylinders are aligned in streamwise distance which corresponds to the transition between the single bluff body and the reattachment regimes of vortex shedding. Isomap unveils these two shedding regimes while the Lissajous plots of first two POD mode amplitudes feature a single circle. The reconstruction error of the manifold model is small compared to the fluctuation level, indicating that the low embedding dimensions contains the coherent structure dynamics. The proposed Isomap-KNN manifold learner is expected to be of large importance in estimation, dynamic modeling and control for large range of configurations with dominant coherent structures. △ Less

Submitted 28 March, 2022; originally announced March 2022.

arXiv:2111.13028 [pdf, other]

doi 10.1017/jfm.2021.1105

Cluster-based hierarchical network model of the fluidic pinball -- Cartographing transient and post-transient, multi-frequency, multi-attractor behaviour

Authors: Nan Deng, Bernd R. Noack, Marek Morzyński, Luc R. Pastur

Abstract: We propose a self-supervised cluster-based hierarchical reduced-order modelling methodology to model and analyse the complex dynamics arising from a sequence of bifurcations for a two-dimensional incompressible flow of the unforced fluidic pinball. The hierarchy is guided by a triple decomposition separating a slowly varying base flow, dominant shedding and secondary flow structures. All these flo… ▽ More We propose a self-supervised cluster-based hierarchical reduced-order modelling methodology to model and analyse the complex dynamics arising from a sequence of bifurcations for a two-dimensional incompressible flow of the unforced fluidic pinball. The hierarchy is guided by a triple decomposition separating a slowly varying base flow, dominant shedding and secondary flow structures. All these flow components are kinematically resolved by a hierarchy of clusters, starting with the base flow in the first layer, resolving the vortex shedding in the second layer and distilling the secondary flow structures in the third layer. The transition dynamics between these clusters is described by a directed network, called the cluster-based hierarchical network model (HiCNM) in the sequel. Three consecutive Reynolds number regimes for different dynamics are considered: (i) periodic shedding at $Re=80$, (ii) quasi-periodic shedding at $Re=105$, and (iii) chaotic shedding at $Re=130$, involving three unstable fixed points, three limit cycles, two quasi-periodic attractors and a chaotic attractor. The HiCNM enables identifying the transient and post-transient dynamics between multiple invariant sets in a self-supervised manner. Both the global trends and the local structures during the transition are well resolved by a moderate number of hierarchical clusters. The proposed reduced-order modelling provides a visual representation of transient and post-transient, multi-frequency, multi-attractor behaviour and may automate the identification and analysis of complex dynamics with multiple scales and multiple invariant sets. △ Less

Submitted 25 November, 2021; originally announced November 2021.

arXiv:2109.13066 [pdf, other]

Prefix-to-SQL: Text-to-SQL Generation from Incomplete User Questions

Authors: Naihao Deng, Shuaichen Chang, Peng Shi, Tao Yu, Rui Zhang

Abstract: Existing text-to-SQL research only considers complete questions as the input, but lay-users might strive to formulate a complete question. To build a smarter natural language interface to database systems (NLIDB) that also processes incomplete questions, we propose a new task, prefix-to-SQL which takes question prefix from users as the input and predicts the intended SQL. We construct a new benchm… ▽ More Existing text-to-SQL research only considers complete questions as the input, but lay-users might strive to formulate a complete question. To build a smarter natural language interface to database systems (NLIDB) that also processes incomplete questions, we propose a new task, prefix-to-SQL which takes question prefix from users as the input and predicts the intended SQL. We construct a new benchmark called PAGSAS that contains 124K user question prefixes and the intended SQL for 5 sub-tasks Advising, GeoQuery, Scholar, ATIS, and Spider. Additionally, we propose a new metric SAVE to measure how much effort can be saved by users. Experimental results show that PAGSAS is challenging even for strong baseline models such as T5. As we observe the difficulty of prefix-to-SQL is related to the number of omitted tokens, we incorporate curriculum learning of feeding examples with an increasing number of omitted tokens. This improves scores on various sub-tasks by as much as 9% recall scores on sub-task GeoQuery in PAGSAS. △ Less

Submitted 29 September, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

Comments: 17 pages, 12 figures, 9 tables

arXiv:2108.10464 [pdf, other]

The Case for Task Sampling based Learning for Cluster Job Scheduling

Authors: Akshay Jajoo, Y. Charlie Hu, Xiaojun Lin, Nan Deng

Abstract: The ability to accurately estimate job runtime properties allows a scheduler to effectively schedule jobs. State-of-the-art online cluster job schedulers use history-based learning, which uses past job execution information to estimate the runtime properties of newly arrived jobs. However, with fast-paced development in cluster technology (in both hardware and software) and changing user inputs, j… ▽ More The ability to accurately estimate job runtime properties allows a scheduler to effectively schedule jobs. State-of-the-art online cluster job schedulers use history-based learning, which uses past job execution information to estimate the runtime properties of newly arrived jobs. However, with fast-paced development in cluster technology (in both hardware and software) and changing user inputs, job runtime properties can change over time, which lead to inaccurate predictions. In this paper, we explore the potential and limitation of real-time learning of job runtime properties, by proactively sampling and scheduling a small fraction of the tasks of each job. Such a task-sampling-based approach exploits the similarity among runtime properties of the tasks of the same job and is inherently immune to changing job behavior. Our study focuses on two key questions in comparing task-sampling-based learning (learning in space) and history-based learning (learning in time): (1) Can learning in space be more accurate than learning in time? (2) If so, can delaying scheduling the remaining tasks of a job till the completion of sampled tasks be more than compensated by the improved accuracy and result in improved job performance? Our analytical and experimental analysis of 3 production traces with different skew and job distribution shows that learning in space can be substantially more accurate. Our simulation and testbed evaluation on Azure of the two learning approaches anchored in a generic job scheduler using 3 production cluster job traces shows that despite its online overhead, learning in space reduces the average Job Completion Time (JCT) by 1.28x, 1.56x, and 1.32x compared to the prior-art history-based predictor. △ Less

Submitted 16 November, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

arXiv:2107.05155 [pdf, other]

doi 10.1007/s10822-021-00437-y

Application of the Alchemical Transfer and Potential of Mean Force Methods to the SAMPL8 Host-Guest Blinded Challenge

Authors: Solmaz Azimi, Joe Z. Wu, Sheenam Khuttan, Tom Kurtzman, Nanjie Deng, Emilio Gallicchio

Abstract: We report the results of our participation in the SAMPL8 GDCC Blind Challenge for host-guest binding affinity predictions. Absolute binding affinity prediction is of central importance to the biophysics of molecular association and pharmaceutical discovery. The blinded SAMPL series have provided an important forum for assessing the reliability of binding free energy methods in an objective way. In… ▽ More We report the results of our participation in the SAMPL8 GDCC Blind Challenge for host-guest binding affinity predictions. Absolute binding affinity prediction is of central importance to the biophysics of molecular association and pharmaceutical discovery. The blinded SAMPL series have provided an important forum for assessing the reliability of binding free energy methods in an objective way. In this blinded challenge, we employed two binding free energy methods, the newly developed alchemical transfer method (ATM) and the well-established potential of mean force (PMF) physical pathway method, using the same setup and force field model. The calculated binding free energies from the two methods are in excellent quantitative agreement. Importantly, the results from the two methods were also found to agree well with the experimental binding affinities released subsequently, with an $R^2$ of 0.89 (ATM) and 0.83 (PMF). Given that the two free energy methods are based on entirely different thermodynamic pathways, the close agreement between the results from the two methods and their general agreement with the experimental binding free energies are a testament to the high quality achieved by theory and methods. The study provides further validation of the novel ATM binding free energy estimation protocol and it paves the way to further extensions of the method to more complex systems. △ Less

Submitted 11 July, 2021; originally announced July 2021.

arXiv:2104.05709 [pdf, other]

doi 10.1115/fedsm2018-83359

Route to Chaos in the Fluidic Pinball

Authors: Nan Deng, Luc R. Pastur, Marek Morzyński, Bernd R. Noack

Abstract: The fluidic pinball has been recently proposed as an attractive and effective flow configuration for exploring machine learning fluid flow control. In this contribution, we focus on the route to chaos in this system without actuation, as the Reynolds number is smoothly increased. It was found to be of the Newhouse-Ruelle-Takens kind, with a secondary pitchfork bifurcation that breaks the symmetry… ▽ More The fluidic pinball has been recently proposed as an attractive and effective flow configuration for exploring machine learning fluid flow control. In this contribution, we focus on the route to chaos in this system without actuation, as the Reynolds number is smoothly increased. It was found to be of the Newhouse-Ruelle-Takens kind, with a secondary pitchfork bifurcation that breaks the symmetry of the mean flow field on the route to quasi-periodicity. △ Less

Submitted 11 April, 2021; originally announced April 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2104.05104

Journal ref: Proceedings of the ASME 2018 5th Joint US-European Fluids Engineering Summer

arXiv:2104.05106 [pdf]

Low-dimensional Flow Models from high-dimensional Flow data with Machine Learning and First Principles

Authors: Nan Deng, Luc R. Pastur, Bernd R. Noack

Abstract: Reduced-order modelling and system identification can help us figure out the elementary degrees of freedom and the underlying mechanisms from the high-dimensional and nonlinear dynamics of fluid flow. Machine learning has brought new opportunities to these two processes and is revolutionising traditional methods. We show a framework to obtain a sparse human-interpretable model from complex high-di… ▽ More Reduced-order modelling and system identification can help us figure out the elementary degrees of freedom and the underlying mechanisms from the high-dimensional and nonlinear dynamics of fluid flow. Machine learning has brought new opportunities to these two processes and is revolutionising traditional methods. We show a framework to obtain a sparse human-interpretable model from complex high-dimensional data using machine learning and first principles. △ Less

Submitted 11 April, 2021; originally announced April 2021.

Journal ref: ERCIM News 122: Solving Engineering Problems with Machine Learning. July 2020

arXiv:2104.05105 [pdf, other]

Dynamiques transitoires de sillage dans le pinball fluidique

Authors: Nan Deng, Luc R. Pastur, Bernd R. Noack Bernd R., Guy Cornejo-Maceda, François Lusseyran, Jean-Christophe Loiseau Jean-Christophe, Marek Morzyński

Abstract: In this work, we are interested in the transient dynamics of a fluid configuration consisting of three fixed cylinders whose axes distribute over an equilateral triangle in transverse flow << fluidic pinball >>. As the Reynolds number is increased on the route to chaos, its transient dynamics tell us about the contribution of the elementary degrees of freedom of the system to the lift and drag coe… ▽ More In this work, we are interested in the transient dynamics of a fluid configuration consisting of three fixed cylinders whose axes distribute over an equilateral triangle in transverse flow << fluidic pinball >>. As the Reynolds number is increased on the route to chaos, its transient dynamics tell us about the contribution of the elementary degrees of freedom of the system to the lift and drag coefficients. △ Less

Submitted 11 April, 2021; originally announced April 2021.

Comments: in French

Journal ref: 22e Rencontre du Non-Lineaire, Mar 2019, Paris, France

arXiv:2104.05104 [pdf, other]

doi 10.1007/978-3-030-15297-0_19

Reduced-order modeling of the fluidic pinball

Authors: Luc R. Pastur, Nan Deng, Marek Morzyński, Bernd R. Noack

Abstract: The fluidic pinball is a geometrically simple flow configuration with three rotating cylinders on the vertex of an equilateral triangle. Yet, it remains physically rich enough to host a range of interacting frequencies and to allow testing of control laws within minutes on a laptop. The system has multiple inputs (the three cylinders can independently rotate around their axis) and multiple outputs… ▽ More The fluidic pinball is a geometrically simple flow configuration with three rotating cylinders on the vertex of an equilateral triangle. Yet, it remains physically rich enough to host a range of interacting frequencies and to allow testing of control laws within minutes on a laptop. The system has multiple inputs (the three cylinders can independently rotate around their axis) and multiple outputs (downstream velocity sensors). Investigating the natural flow dynamics, we found that the first unsteady transition undergone by the wake flow, when increasing the Reynolds number, is a Hopf bifurcation leading to the usual time-periodic vortex shedding phenomenon, typical of cylinder wake flows, in which the mean flow field preserves axial symmetry. We extract dynamically consistent modes from the flow data in order to built a reduced-order model (ROM) of this flow regime. We show that the main dynamical features of the primary Hopf bifurcation can be described by a non-trivial lowest-order model made of three degrees of freedom. △ Less

Submitted 11 April, 2021; originally announced April 2021.

Journal ref: 11th Chaotic Modeling and Simulation International Conference. CHAOS 2018. Springer Proceedings in Complexity. Springer, Cham

arXiv:2103.16365 [pdf, other]

FoV-NeRF: Foveated Neural Radiance Fields for Virtual Reality

Authors: Nianchen Deng, Zhenyi He, Jiannan Ye, Budmonde Duinkharjav, Praneeth Chakravarthula, Xubo Yang, Qi Sun

Abstract: Virtual Reality (VR) is becoming ubiquitous with the rise of consumer displays and commercial VR platforms. Such displays require low latency and high quality rendering of synthetic imagery with reduced compute overheads. Recent advances in neural rendering showed promise of unlocking new possibilities in 3D computer graphics via image-based representations of virtual or physical environments. Spe… ▽ More Virtual Reality (VR) is becoming ubiquitous with the rise of consumer displays and commercial VR platforms. Such displays require low latency and high quality rendering of synthetic imagery with reduced compute overheads. Recent advances in neural rendering showed promise of unlocking new possibilities in 3D computer graphics via image-based representations of virtual or physical environments. Specifically, the neural radiance fields (NeRF) demonstrated that photo-realistic quality and continuous view changes of 3D scenes can be achieved without loss of view-dependent effects. While NeRF can significantly benefit rendering for VR applications, it faces unique challenges posed by high field-of-view, high resolution, and stereoscopic/egocentric viewing, typically causing low quality and high latency of the rendered images. In VR, this not only harms the interaction experience but may also cause sickness. To tackle these problems toward six-degrees-of-freedom, egocentric, and stereo NeRF in VR, we present the first gaze-contingent 3D neural representation and view synthesis method. We incorporate the human psychophysics of visual- and stereo-acuity into an egocentric neural representation of 3D scenery. We then jointly optimize the latency/performance and visual quality while mutually bridging human perception and neural scene synthesis to achieve perceptually high-quality immersive interaction. We conducted both objective analysis and subjective studies to evaluate the effectiveness of our approach. We find that our method significantly reduces latency (up to 99% time reduction compared with NeRF) without loss of high-fidelity rendering (perceptually identical to full-resolution ground truth). The presented approach may serve as the first step toward future VR/AR systems that capture, teleport, and visualize remote environments in real-time. △ Less

Submitted 22 July, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

Comments: 9 pages

ACM Class: I.3

arXiv:2102.11368 [pdf, other]

doi 10.1103/PhysRevB.102.115303

Stability of multielectron bubbles in high Landau levels

Authors: Dohyung Ro, S. A. Myers, N. Deng, J. D. Watson, M. J. Manfra, L. N Pfeiffer, K. W. West, G. A. Csáthy

Abstract: We study multielectron bubble phases in the $N=2$ and $N=3$ Landau levels in a high mobility GaAs/AlGaAs sample. We found that the longitudinal magnetoresistance versus temperature curves in the multielectron bubble region exhibit sharp peaks, irrespective of the Landau level index. We associate these peaks with an enhanced scattering caused by thermally fluctuating domains of a bubble phase and a… ▽ More We study multielectron bubble phases in the $N=2$ and $N=3$ Landau levels in a high mobility GaAs/AlGaAs sample. We found that the longitudinal magnetoresistance versus temperature curves in the multielectron bubble region exhibit sharp peaks, irrespective of the Landau level index. We associate these peaks with an enhanced scattering caused by thermally fluctuating domains of a bubble phase and a uniform uncorrelated electron liquid at the onset of the bubble phases. Within the $N=3$ Landau level, onset temperatures of three-electron and two-electron bubbles exhibit linear trends with respect to the filling factor; the onset temperatures of three-electron bubbles are systematically higher than those of two-electron bubbles. Furthermore, onset temperatures of the two-electron bubble phases across $N=2$ and $N=3$ Landau levels are similar, but exhibit an offset. This offset and the dominant nature of the three-electron bubbles in the $N=3$ Landau level reveals the role of the short-range part of the electron-electron interaction in the formation of the bubbles. △ Less

Submitted 22 February, 2021; originally announced February 2021.

Journal ref: prb 102, 115303 (2020)

arXiv:2101.07894 [pdf, other]

Alchemical Transfer Approach to Absolute Binding Free Energy Estimation

Authors: Joe Z. Wu, Solmaz Azimi, Sheenam Khuttan, Nanjie Deng, Emilio Gallicchio

Abstract: The Alchemical Transfer Method (ATM) for the calculation of standard binding free energies of non-covalent molecular complexes is presented. The method is based on a coordinate displacement perturbation of the ligand between the receptor binding site and the explicit solvent bulk, and a thermodynamic cycle connected by a symmetric intermediate in which the ligand interacts with the receptor and so… ▽ More The Alchemical Transfer Method (ATM) for the calculation of standard binding free energies of non-covalent molecular complexes is presented. The method is based on a coordinate displacement perturbation of the ligand between the receptor binding site and the explicit solvent bulk, and a thermodynamic cycle connected by a symmetric intermediate in which the ligand interacts with the receptor and solvent environments with equal strength. While the approach is alchemical, the implementation of ATM is as straightforward as for physical pathway methods of binding. The method is applicable in principle with any force field, it does not require splitting the alchemical transformations into electrostatic and non-electrostatic steps, and it does not require soft-core pair potentials. We have implemented ATM as a freely available and open-source plugin of the OpenMM molecular dynamics library. The method and its implementation are validated on the SAMPL6 SAMPLing host-guest benchmark set. The work paves the way to streamlined alchemical relative and absolute binding free energy implementations on many molecular simulation packages and with arbitrary energy functions including polarizable, quantum-mechanical, and artificial neural network potentials. △ Less

Submitted 19 January, 2021; originally announced January 2021.

arXiv:2011.06254 [pdf, other]

doi 10.1017/jfm.2021.299

Galerkin force model for transient and post-transient dynamics of the fluidic pinbal

Authors: Nan Deng, Bernd R. Noack, Marek Morzyński, Luc R. Pastur

Abstract: We propose an aerodynamic force model associated with a Galerkin model for the unforced fluidic pinball, the two-dimensional flow around three equal cylinders with one radius distance to each other. The starting point is a Galerkin model of a bluff-body flow. The force on this body is derived as a constant-linear-quadratic function of the mode amplitudes from first principles following the pioneer… ▽ More We propose an aerodynamic force model associated with a Galerkin model for the unforced fluidic pinball, the two-dimensional flow around three equal cylinders with one radius distance to each other. The starting point is a Galerkin model of a bluff-body flow. The force on this body is derived as a constant-linear-quadratic function of the mode amplitudes from first principles following the pioneering work of Noca (1997, 1999) and Liang & Dong (2014). The force model is simplified for the mean-field model of the unforced fluidic pinball (Deng et al. 2020) using symmetry properties and sparse calibration. The model is successfully applied to transient and post-transient dynamics in different Reynolds number regimes: the periodic vortex shedding after the Hopf-bifurcation and the asymmetric vortex shedding after the pitchfork bifurcation comprising six different Navier-Stokes solutions. We foresee many applications of the Galerkin force model for other bluff bodies and flow control. △ Less

Submitted 22 March, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

arXiv:2009.01873 [pdf, other]

doi 10.1103/PhysRevB.102.035140

Disorder broadening of even denominator fractional quantum Hall states in the presence of a short-range alloy potential

Authors: E. Kleinbaum, Hongxi Li, N. Deng, G. C. Gardner, M. J. Manfra, G. A. Csáthy

Abstract: We study energy gaps of the $ν=7/2$ and $ν=5/2$ fractional quantum Hall states in a series of two-dimensional electron gases containing alloy disorder. We found that gaps at these two filling factors have the same suppression rate with alloy disorder. The dimensionless intrinsic gaps in our alloy samples obtained from the model proposed by Morf and d'Ambrumenil are consistent with numerical result… ▽ More We study energy gaps of the $ν=7/2$ and $ν=5/2$ fractional quantum Hall states in a series of two-dimensional electron gases containing alloy disorder. We found that gaps at these two filling factors have the same suppression rate with alloy disorder. The dimensionless intrinsic gaps in our alloy samples obtained from the model proposed by Morf and d'Ambrumenil are consistent with numerical results, but are larger than those obtained from experiments on pristine samples published in the literature. The disorder broadening parameter has large uncertainties. However, a modified analysis relying on shared intrinsic gaps yields consistent results for both the $ν=5/2$ and $7/2$ fractional quantum Hall states and establishes a linear relationship between the disorder broadening parameter and alloy concentration. Furthermore, we find that we can separate contributions to the disorder broadening of the long-range and short-range scattering. △ Less

Submitted 3 September, 2020; originally announced September 2020.

Journal ref: Physical Review B 102, 035140 (2020)

arXiv:2005.12263 [pdf, ps, other]

Principal Component Analysis Based on T$\ell_1$-norm Maximization

Authors: Xiang-Fei Yang, Yuan-Hai Shao, Chun-Na Li, Li-Ming Liu, Nai-Yang Deng

Abstract: Classical principal component analysis (PCA) may suffer from the sensitivity to outliers and noise. Therefore PCA based on $\ell_1$-norm and $\ell_p$-norm ($0 < p < 1$) have been studied. Among them, the ones based on $\ell_p$-norm seem to be most interesting from the robustness point of view. However, their numerical performance is not satisfactory. Note that, although T$\ell_1$-norm is similar t… ▽ More Classical principal component analysis (PCA) may suffer from the sensitivity to outliers and noise. Therefore PCA based on $\ell_1$-norm and $\ell_p$-norm ($0 < p < 1$) have been studied. Among them, the ones based on $\ell_p$-norm seem to be most interesting from the robustness point of view. However, their numerical performance is not satisfactory. Note that, although T$\ell_1$-norm is similar to $\ell_p$-norm ($0 < p < 1$) in some sense, it has the stronger suppression effect to outliers and better continuity. So PCA based on T$\ell_1$-norm is proposed in this paper. Our numerical experiments have shown that its performance is superior than PCA-$\ell_p$ and $\ell_p$SPCA as well as PCA, PCA-$\ell_1$ obviously. △ Less

Submitted 23 May, 2020; originally announced May 2020.

arXiv:2002.06739 [pdf, ps, other]

doi 10.1109/TCYB.2021.3050487

Multiple Flat Projections for Cross-manifold Clustering

Authors: Lan Bai, Yuan-Hai Shao, Wei-Jie Chen, Zhen Wang, Nai-Yang Deng

Abstract: Cross-manifold clustering is a hard topic and many traditional clustering methods fail because of the cross-manifold structures. In this paper, we propose a Multiple Flat Projections Clustering (MFPC) to deal with cross-manifold clustering problems. In our MFPC, the given samples are projected into multiple subspaces to discover the global structures of the implicit manifolds. Thus, the cross-mani… ▽ More Cross-manifold clustering is a hard topic and many traditional clustering methods fail because of the cross-manifold structures. In this paper, we propose a Multiple Flat Projections Clustering (MFPC) to deal with cross-manifold clustering problems. In our MFPC, the given samples are projected into multiple subspaces to discover the global structures of the implicit manifolds. Thus, the cross-manifold clusters are distinguished from the various projections. Further, our MFPC is extended to nonlinear manifold clustering via kernel tricks to deal with more complex cross-manifold clustering. A series of non-convex matrix optimization problems in MFPC are solved by a proposed recursive algorithm. The synthetic tests show that our MFPC works on the cross-manifold structures well. Moreover, experimental results on the benchmark datasets show the excellent performance of our MFPC compared with some state-of-the-art clustering methods. △ Less

Submitted 16 February, 2020; originally announced February 2020.

Comments: 12 pages, 58 figures

Journal ref: IEEE Transactions on Cybernetics, 2021

arXiv:1910.09734 [pdf, other]

Single and Union Non-parallel Support Vector Machine Frameworks

Authors: Chun-Na Li, Yuan-Hai Shao, Huajun Wang, Yu-Ting Zhao, Ling-Wei Huang, Naihua Xiu, Nai-Yang Deng

Abstract: Considering the classification problem, we summarize the nonparallel support vector machines with the nonparallel hyperplanes to two types of frameworks. The first type constructs the hyperplanes separately. It solves a series of small optimization problems to obtain a series of hyperplanes, but is hard to measure the loss of each sample. The other type constructs all the hyperplanes simultaneousl… ▽ More Considering the classification problem, we summarize the nonparallel support vector machines with the nonparallel hyperplanes to two types of frameworks. The first type constructs the hyperplanes separately. It solves a series of small optimization problems to obtain a series of hyperplanes, but is hard to measure the loss of each sample. The other type constructs all the hyperplanes simultaneously, and it solves one big optimization problem with the ascertained loss of each sample. We give the characteristics of each framework and compare them carefully. In addition, based on the second framework, we construct a max-min distance-based nonparallel support vector machine for multiclass classification problem, called NSVM. It constructs hyperplanes with large distance margin by solving an optimization problem. Experimental results on benchmark data sets show the advantages of our NSVM. △ Less

Submitted 25 June, 2021; v1 submitted 21 October, 2019; originally announced October 2019.

arXiv:1906.04035 [pdf, other]

doi 10.1103/PhysRevB.99.201111

Electron Bubbles and the Structure of the Orbital Wavefunction

Authors: Dohyung Ro, N. Deng, J. D. Watson, M. J. Manfra, L. N Pfeiffer, K. W. West, G. A. Csáthy

Abstract: Stripe-like and bubble-like patterns spontaneously form in numerous physical, chemical, and biological systems when competing long-range and short-range interactions banish uniformity. Stripe-like and the related nematic morphology are also under intense scrutiny in various strongly correlated electron systems. In contrast, the electronic bubble morphology is rare. Some of the most intriguing elec… ▽ More Stripe-like and bubble-like patterns spontaneously form in numerous physical, chemical, and biological systems when competing long-range and short-range interactions banish uniformity. Stripe-like and the related nematic morphology are also under intense scrutiny in various strongly correlated electron systems. In contrast, the electronic bubble morphology is rare. Some of the most intriguing electron bubbles develop in the two-dimensional electron gas subjected to a perpendicular magnetic field. However, in contrast to bubbles forming in classical systems such as the Turing activator-inhibitor reaction or Langmuir films, bubbles in electron gases owe their existence to elementary quantum mechanics: they are stabilized as wavefunctions of individual electrons overlap. Here we report a rich pattern of multi-electron bubble phases in a high Landau level and we conclude that this richness is due to the nodal structure of the orbital component of the electronic wavefunction. △ Less

Submitted 10 June, 2019; originally announced June 2019.

Journal ref: prb 99, 201111(R) (2019)

arXiv:1812.08529 [pdf, other]

doi 10.1017/jfm.2019.959

Low-order model for successive bifurcations of the fluidic pinball

Authors: Nan Deng, Bernd R. Noack, Marek Morzynski, Luc R. Pastur

Abstract: We propose the first least-order Galerkin model of an incompressible flow undergoing two successive supercritical bifurcations of Hopf and pitchfork type. A key enabler is a mean-field consideration exploiting the symmetry of the mean flow and the asymmetry of the fluctuation. These symmetries generalize mean-field theory, e.g. no assumption of slow growth-rate is needed. The resulting 5-dimension… ▽ More We propose the first least-order Galerkin model of an incompressible flow undergoing two successive supercritical bifurcations of Hopf and pitchfork type. A key enabler is a mean-field consideration exploiting the symmetry of the mean flow and the asymmetry of the fluctuation. These symmetries generalize mean-field theory, e.g. no assumption of slow growth-rate is needed. The resulting 5-dimensional Galerkin model successfully describes the phenomenogram of the fluidic pinball, a two-dimensional wake flow around a cluster of three equidistantly spaced cylinders. The corresponding transition scenario is shown to undergo two successive supercritical bifurcations, namely a Hopf and a pitchfork bifurcations on the way to chaos. The generalized mean-field Galerkin methodology may be employed to describe other transition scenarios. △ Less

Submitted 11 April, 2021; v1 submitted 20 December, 2018; originally announced December 2018.

Journal ref: J. Fluid Mech. 884 (2020) A37

arXiv:1811.02384 [pdf, ps, other]

Robust Bhattacharyya bound linear discriminant analysis through adaptive algorithm

Authors: Chun-Na Li, Yuan-Hai Shao, Zhen Wang, Nai-Yang Deng

Abstract: In this paper, we propose a novel linear discriminant analysis criterion via the Bhattacharyya error bound estimation based on a novel L1-norm (L1BLDA) and L2-norm (L2BLDA). Both L1BLDA and L2BLDA maximize the between-class scatters which are measured by the weighted pairwise distances of class means and meanwhile minimize the within-class scatters under the L1-norm and L2-norm, respectively. The… ▽ More In this paper, we propose a novel linear discriminant analysis criterion via the Bhattacharyya error bound estimation based on a novel L1-norm (L1BLDA) and L2-norm (L2BLDA). Both L1BLDA and L2BLDA maximize the between-class scatters which are measured by the weighted pairwise distances of class means and meanwhile minimize the within-class scatters under the L1-norm and L2-norm, respectively. The proposed models can avoid the small sample size (SSS) problem and have no rank limit that may encounter in LDA. It is worth mentioning that, the employment of L1-norm gives a robust performance of L1BLDA, and L1BLDA is solved through an effective non-greedy alternating direction method of multipliers (ADMM), where all the projection vectors can be obtained once for all. In addition, the weighting constants of L1BLDA and L2BLDA between the between-class and within-class terms are determined by the involved data set, which makes our L1BLDA and L2BLDA adaptive. The experimental results on both benchmark data sets as well as the handwritten digit databases demonstrate the effectiveness of the proposed methods. △ Less

Submitted 6 November, 2018; originally announced November 2018.

arXiv:1810.11733 [pdf, ps, other]

doi 10.3847/1538-4357/aaecd0

Evolution of Photospheric Vector Magnetic Field Associated with Moving Flare Ribbons As Seen By GST

Authors: Chang Liu, Wenda Cao, Jongchul Chae, Kwangsu Ahn, Debi Prasad Choudhary, Jeongwoo Lee, Rui Liu, Na Deng, Jiasheng Wang, Haimin Wang

Abstract: The photospheric response to solar flares, also known as coronal back reaction, is often observed as sudden flare-induced changes in vector magnetic field and sunspot motions. However, it remains obscure whether evolving flare ribbons, the flare signature closest to the photosphere, are accompanied by changes in vector magnetic field therein. Here we explore the relationship between the dynamics o… ▽ More The photospheric response to solar flares, also known as coronal back reaction, is often observed as sudden flare-induced changes in vector magnetic field and sunspot motions. However, it remains obscure whether evolving flare ribbons, the flare signature closest to the photosphere, are accompanied by changes in vector magnetic field therein. Here we explore the relationship between the dynamics of flare ribbons in the chromosphere and variations of magnetic fields in the underlying photosphere, using high-resolution off-band H-alpha images and near-infrared vector magnetograms of the M6.5 flare on 2015 June 22 observed with the 1.6 m Goode Solar Telescope. We find that changes of photospheric fields occur at the arrival of the flare ribbon front, thus propagating analogously to flare ribbons. In general, the horizontal field increases and the field lines become more inclined to the surface. When ribbons sweep through regions that undergo a rotational motion, the fields transiently turn more vertical with decreased horizontal field and inclination angle, and then restore and/or become more horizontal than before the ribbon arrival. The ribbon propagation decelerates near the sunspot rotation center, where the vertical field becomes permanently enhanced. Similar magnetic field changes are discernible in magnetograms from the Helioseismic and Magnetic Imager (HMI), and an inward collapse of coronal magnetic fields is inferred from the time sequence of non-linear force-free field models extrapolated from HMI magnetograms. We conclude that photospheric fields respond nearly instantaneously to magnetic reconnection in the corona. △ Less

Submitted 27 October, 2018; originally announced October 2018.

Comments: 15 pages, 9 figures, accepted to ApJ

arXiv:1808.09104 [pdf]

Combining Alchemical Transformation with Physical Pathway to Accurately Compute Absolute Binding Free Energy

Authors: Nanjie Deng, Lauren Wickstrom, Emilio Gallicchio

Abstract: We present a new method that combines alchemical transformation with physical pathway to accurately and efficiently compute the absolute binding free energy of receptor-ligand complex. Currently, the double decoupling method (DDM) and the potential of mean force approach (PMF) methods are widely used to compute the absolute binding free energy of biomolecules. The DDM relies on alchemically decoup… ▽ More We present a new method that combines alchemical transformation with physical pathway to accurately and efficiently compute the absolute binding free energy of receptor-ligand complex. Currently, the double decoupling method (DDM) and the potential of mean force approach (PMF) methods are widely used to compute the absolute binding free energy of biomolecules. The DDM relies on alchemically decoupling the ligand from its environments, which can be computationally challenging for large ligands and charged ligands because of the large magnitude of the decoupling free energies involved. On the other hand, the PMF approach uses physical pathway to extract the ligand out of the binding site, thus avoids the alchemical decoupling of the ligand. However, the PMF method has its own drawback because of the reliance on a ligand binding/unbinding pathway free of steric obstruction from the receptor atoms. Therefore, in the presence of deeply buried ligand functional groups the convergence of the PMF calculation can be very slow leading to large errors in the computed binding free energy. Here we develop a new method called AlchemPMF by combining alchemical transformation with physical pathway to overcome the major drawback in the PMF method. We have tested the new approach on the binding of a charged ligand to an allosteric site on HIV-1 Integrase. After 20 ns of simulation per umbrella sampling window, the new method yields absolute binding free energies within ~1 kcal/mol from the experimental result, whereas the standard PMF approach and the DDM calculations result in errors of ~5 kcal/mol and > 2 kcal/mol, respectively. Furthermore, the binding free energy computed using the new method is associated with smaller statistical error compared with those obtained from the existing methods. △ Less

Submitted 27 August, 2018; originally announced August 2018.

Comments: 16 pages, 7 figures

Showing 1–50 of 97 results for author: Deng, N