-
Existence and regularity results for anisotropic parabolic equations with degenerate coercivity
Authors:
Weilin Zou,
Yuanchun Ren,
Wei Wang
Abstract:
This paper deals with a class of nonlinear anisotropic parabolic equations with degenerate coercivity. Using the anisotropic Gagliardo-Nirenberg-type inequality, we prove some existence and regularity results for the solutions under the framework of anisotropic Sobolev spaces, which generalize the previous results of [10,18,23].
Submitted 16 March, 2023;
originally announced March 2023.
-
Solvable Dynamics of Coupled High-Dimensional Generalized Limit-Cycle Oscillators
Authors:
Wei Zou,
Sujuan He,
D. V. Senthilkumar,
Juergen Kurths
Abstract:
We introduce a new model consisting of globally coupled high-dimensional generalized limit-cycle oscillators, which explicitly incorporates the role of amplitude dynamics of individual units in the collective dynamics. In the limit of weak coupling, our model reduces to the $D$-dimensional Kuramoto phase model, akin to a similar classic construction of the well-known Kuramoto phase model from weakly coupled two-dimensional limit-cycle oscillators. For the practically important case of $D=3$, the incoherence of the model is rigorously proved to be stable for negative coupling $(K<0)$ but unstable for positive coupling $(K>0)$; the locked states are shown to exist if $K>0$; in particular, the onset of amplitude death is theoretically predicted. For $D\geq2$, the discrete and continuous spectra for both locked states and amplitude death are governed by two general formulas. Our proposed $D$-dimensional model is physically more reasonable, because it is no longer constrained by fixed amplitude dynamics, which puts the recent studies of the $D$-dimensional Kuramoto phase model on a stronger footing by providing a more general framework for $D$-dimensional limit-cycle oscillators.
Submitted 10 February, 2023;
originally announced February 2023.
-
Low-loss interconnects for modular superconducting quantum processors
Authors:
Jingjing Niu,
Libo Zhang,
Yang Liu,
Jiawei Qiu,
Wenhui Huang,
Jiaxiang Huang,
Hao Jia,
Jiawei Liu,
Ziyu Tao,
Weiwei Wei,
Yuxuan Zhou,
Wanjing Zou,
Yuanzhen Chen,
Xiaowei Deng,
Xiuhao Deng,
Changkang Hu,
Ling Hu,
Jian Li,
Dian Tan,
Yuan Xu,
Fei Yan,
Tongxing Yan,
Song Liu,
Youpeng Zhong,
Andrew N. Cleland
, et al. (1 additional authors not shown)
Abstract:
Scaling is now a key challenge in superconducting quantum computing. One solution is to build modular systems in which smaller-scale quantum modules are individually constructed and calibrated, and then assembled into a larger architecture. This, however, requires the development of suitable interconnects. Here, we report low-loss interconnects based on pure aluminium coaxial cables and on-chip impedance transformers featuring quality factors up to $8.1 \times 10^5$, which is comparable to the performance of our transmon qubits fabricated on single-crystal sapphire substrate. We use these interconnects to link five quantum modules with inter-module quantum state transfer and Bell state fidelities up to 99\%. To benchmark the overall performance of the processor, we create maximally-entangled, multi-qubit Greenberger-Horne-Zeilinger (GHZ) states. The generated inter-module four-qubit GHZ state exhibits 92.0\% fidelity. We also entangle up to 12 qubits in a GHZ state with $55.8 \pm 1.8\%$ fidelity, which is above the genuine multipartite entanglement threshold of 1/2. These results represent a viable modular approach for large-scale superconducting quantum processors.
Submitted 6 February, 2023;
originally announced February 2023.
-
A Class-wise Non-salient Region Generalized Framework for Video Semantic Segmentation
Authors:
Yuhang Zhang,
Shishun Tian,
Muxin Liao,
Zhengyu Zhang,
Wenbin Zou,
Chen Xu
Abstract:
Video semantic segmentation (VSS) is beneficial for dealing with dynamic scenes due to the continuous property of the real-world environment. On the one hand, some methods alleviate the problem of inconsistent predictions between consecutive frames. On the other hand, other methods employ the previous frame as prior information to assist in segmenting the current frame. Although the previous methods achieve superior performance on independent and identically distributed (i.i.d.) data, they cannot generalize well to unseen domains. Thus, we explore a new task, the video generalizable semantic segmentation (VGSS) task, which considers both continuous frames and domain generalization. In this paper, we propose a class-wise non-salient region generalized (CNSG) framework for the VGSS task. Concretely, we first define the class-wise non-salient feature, which describes features of the class-wise non-salient region that carry more generalizable information. Then, we propose a class-wise non-salient feature reasoning strategy to select and enhance the most generalized channels adaptively. Finally, we propose an inter-frame non-salient centroid alignment loss to alleviate the problem of inconsistent predictions in the VGSS task. We also extend our video-based framework to the image-based generalizable semantic segmentation (IGSS) task. Experiments demonstrate that our CNSG framework yields significant improvement in the VGSS and IGSS tasks.
Submitted 28 December, 2022;
originally announced December 2022.
-
Technical Report -- Competition Solution for Prompt Tuning using Pretrained Language Model
Authors:
Jiang-Long Song,
Wu-He Zou,
Feng Li,
Xiao-Lei Qin,
Wei-Dong Zhang
Abstract:
Prompt tuning has recently become a hot spot in the application of large pretrained language models to specific downstream tasks. In the Language Model as a Service (LMaaS) setting, black-box tuning using derivative-free optimization (DFO) provides a novel approach to expand the practical scenarios of pretrained models and enrich research on few-shot learning. In this report, we present our solution for this competition, which is based on the LMaaS scenario. Our solution consists of several modifications to BBTv2, including multiple label words, selection of P0, a rolling update strategy, a multi-task loss from an MLP classifier, and finally an ensemble method to further improve generalization ability. We also share some strategies that we tried but did not use in the final submission, for further discussion. Finally, we raise a question about the SNLI dataset and its impact on the results, as well as our concerns about the competition.
Submitted 20 December, 2022; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Statistical mechanics of continual learning: variational principle and mean-field potential
Authors:
Chan Li,
Zhenye Huang,
Wenxuan Zou,
Haiping Huang
Abstract:
An obstacle to artificial general intelligence is set by continual learning of multiple tasks of different nature. Recently, various heuristic tricks, both from machine learning and from neuroscience angles, were proposed, but they lack a unified theory ground. Here, we focus on continual learning in single-layered and multi-layered neural networks of binary weights. A variational Bayesian learning setting is thus proposed, where the neural networks are trained in a field-space, rather than gradient-ill-defined discrete-weight space, and furthermore, weight uncertainty is naturally incorporated, and modulates synaptic resources among tasks. From a physics perspective, we translate the variational continual learning into Franz-Parisi thermodynamic potential framework, where previous task knowledge acts as a prior and a reference as well. We thus interpret the continual learning of the binary perceptron in a teacher-student setting as a Franz-Parisi potential computation. The learning performance can then be analytically studied with mean-field order parameters, whose predictions coincide with numerical experiments using stochastic gradient descent methods. Based on the variational principle and Gaussian field approximation of internal preactivations in hidden layers, we also derive the learning algorithm considering weight uncertainty, which solves the continual learning with binary weights using multi-layered neural networks, and performs better than the currently available metaplasticity algorithm. Our proposed principled frameworks also connect to elastic weight consolidation, weight-uncertainty modulated learning, and neuroscience inspired metaplasticity, providing a theory-grounded method for the real-world multi-task learning with deep networks.
Submitted 20 June, 2023; v1 submitted 6 December, 2022;
originally announced December 2022.
-
High-resolution and reliable automatic target recognition based on photonic ISAR imaging system with explainable deep learning
Authors:
Xiuting Zou,
Anyi Deng,
Yiheng Hu,
Shiyu Hua,
Linbo Zhang,
Shaofu Xu,
Weiwen Zou
Abstract:
Automatic target recognition (ATR) based on inverse synthetic aperture radar (ISAR) images, which is extensively utilized to surveil environment in military and civil fields, must be high-precision and reliable. Photonic technologies' advantage of broad bandwidth enables ISAR systems to realize high-resolution imaging, which is in favor of achieving high-performance ATR. Deep learning (DL) algorithms have achieved excellent recognition accuracies. However, the lack of interpretability of DL algorithms causes the head-scratching problem of credibility. In this paper, we exploit the inner relationship between a photonic ISAR imaging system and behaviors of a convolutional neural network (CNN) to deeply comprehend the intelligent recognition. Specifically, we manipulate imaging physical process and analyze network outputs, the relevance between the ISAR image and network output, and the visualization of features in the network output layer. Consequently, the broader imaging bandwidths and appropriate imaging angles lead to more detailed structural and contour features and the bigger discrepancy among ISAR images of different targets, which contributes to the CNN recognizing and distinguishing objects according to physical laws. Then, based on the photonic ISAR imaging system and the explainable CNN, we accomplish a high-accuracy and reliable ATR. To the best of our knowledge, there is no precedent of explaining the DL algorithms by exploring the influence of the physical process of data generation on network behaviors. It is anticipated that this work can not only inspire the accomplishment of a high-performance ATR but also bring new insights to explore network behaviors and thus achieve better intelligent abilities.
Submitted 3 December, 2022;
originally announced December 2022.
-
Emerging Threats in Deep Learning-Based Autonomous Driving: A Comprehensive Survey
Authors:
Hui Cao,
Wenlong Zou,
Yinkun Wang,
Ting Song,
Mengjun Liu
Abstract:
Since the 2004 DARPA Grand Challenge, autonomous driving technology has witnessed nearly two decades of rapid development. In recent years in particular, with the application of new sensors and deep learning technologies to the autonomous driving field, the technology has continued to make breakthroughs. As a result, many carmakers and high-tech giants have dedicated themselves to research and system development of autonomous driving. However, as the foundation of autonomous driving, deep learning technology faces many new security risks. The academic community has proposed deep learning countermeasures against adversarial examples and AI backdoors, and has introduced them into the autonomous driving field for verification. Deep learning security matters to autonomous driving system security, and in turn to personal safety, which makes it an issue that deserves attention and research. This paper provides a summary of the concepts, developments and recent research in deep learning security technologies in autonomous driving. First, we briefly introduce the deep learning framework and pipeline in the autonomous driving system, which mainly include the deep learning technologies and algorithms commonly used in this field. Next, we focus on the potential security threats to the deep learning based autonomous driving system in each functional layer in turn. We review the development of deep learning attack technologies against autonomous driving, investigate the state-of-the-art algorithms, and reveal the potential risks. Finally, we provide an outlook on deep learning security in the autonomous driving field and propose recommendations for building a safe and trustworthy autonomous driving system.
Submitted 19 October, 2022;
originally announced October 2022.
-
Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate
Authors:
Dongjie Yu,
Wenjun Zou,
Yujie Yang,
Haitong Ma,
Shengbo Eben Li,
Jingliang Duan,
Jianyu Chen
Abstract:
Safe reinforcement learning (RL) that solves constraint-satisfactory policies provides a promising way to the broader safety-critical applications of RL in real-world problems such as robotics. Among all safe RL approaches, model-based methods reduce training time violations further due to their high sample efficiency. However, lacking safety robustness against the model uncertainties remains an issue in safe model-based RL, especially in training time safety. In this paper, we propose a distributional reachability certificate (DRC) and its Bellman equation to address model uncertainties and characterize robust persistently safe states. Furthermore, we build a safe RL framework to resolve constraints required by the DRC and its corresponding shield policy. We also devise a line search method to maintain safety and reach higher returns simultaneously while leveraging the shield policy. Comprehensive experiments on classical benchmarks such as constrained tracking and navigation indicate that the proposed algorithm achieves comparable returns with much fewer constraint violations during training.
Submitted 14 October, 2022;
originally announced October 2022.
-
Downlink Compression Improves TopK Sparsification
Authors:
William Zou,
Hans De Sterck,
Jun Liu
Abstract:
Training large neural networks is time consuming. To speed up the process, distributed training is often used. One of the largest bottlenecks in distributed training is communicating gradients across different nodes. Different gradient compression techniques have been proposed to alleviate the communication bottleneck, including topK gradient sparsification, which truncates the gradient to the largest K components before sending it to other nodes. While some authors have investigated topK gradient sparsification in the parameter-server framework by applying topK compression in both the worker-to-server (uplink) and server-to-worker (downlink) direction, the currently accepted belief says that adding extra compression degrades the convergence of the model. We demonstrate, on the contrary, that adding downlink compression can potentially improve the performance of topK sparsification: not only does it reduce the amount of communication per step, but also, counter-intuitively, can improve the upper bound in the convergence analysis. To show this, we revisit non-convex convergence analysis of topK stochastic gradient descent (SGD) and extend it from the unidirectional to the bidirectional setting. We also remove a restriction of the previous analysis that requires unrealistically large values of K. We experimentally evaluate bidirectional topK SGD against unidirectional topK SGD and show that models trained with bidirectional topK SGD will perform as well as models trained with unidirectional topK SGD while yielding significant communication benefits for large numbers of workers.
Submitted 29 September, 2022;
originally announced September 2022.
-
An efficient dosimetry method with a Faraday cup for small animal, small-field proton irradiation under conventional and ultra-high dose rates
Authors:
Abbas Husain,
Jeremiah Ryser,
Julia Pakela,
Menggui Huang,
Khayrullo Shoniyozov,
Francois Vander Stappen,
Costas Koumenis,
Lei Dong,
Yi Fan,
Eric Diffenderfer,
Wei Zou
Abstract:
Introduction: We developed and evaluated a method for dose calibration and monitoring under conventional and ultra-high dose rates for small animal experiments with small-field proton beams using a Faraday cup.
Methods: We determined a relationship between dose and optical density (OD) of EBT-XD Gafchromic film using scanned 10x10 cm2 proton pencil beams delivered at clinical dose rates; the dose was measured with an Advanced Markus chamber. On a small animal proton irradiation platform, double-scattered pencil beams with 5 or 8 mm diameter brass collimation at conventional and ultra-high dose rates were delivered to the EBT-XD films. The proton fluence charges were collected by a Faraday cup placed downstream from the film. The average of the irradiated film ODs was related to the Faraday cup charges. A conversion from the Faraday cup charge to the average dose of the small-field proton beam was then obtained.
Results: The relationship between the small-field average profile dose and Faraday cup charge was established for 10 and 15 Gy mice FLASH experiments. The film OD was found to be independent of dose rate. At small-animal treatments, the Faraday cup readings were conveniently used to QA and monitor the delivered dose and dose rates to the mice under conventional and ultra-high dose rates.
Conclusion: The dose calibration and monitoring method with Faraday cup for small animal proton FLASH experiments is time-efficient and cost-effective and can be used for irradiations of various small field sizes. The same approach can also be adopted for clinical proton dosimetry for small-field irradiations.
Submitted 15 September, 2022;
originally announced September 2022.
-
AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results
Authors:
Ren Yang,
Radu Timofte,
Xin Li,
Qi Zhang,
Lin Zhang,
Fanglong Liu,
Dongliang He,
Fu li,
He Zheng,
Weihang Yuan,
Pavel Ostyakov,
Dmitry Vyal,
Magauiya Zhussip,
Xueyi Zou,
Youliang Yan,
Lei Li,
Jingzhu Tang,
Ming Chen,
Shijie Zhao,
Yu Zhu,
Xiaoran Qin,
Chenghua Li,
Cong Leng,
Jian Cheng,
Claudio Rota
, et al. (28 additional authors not shown)
Abstract:
This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track 2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 365 videos, including the LDV 2.0 dataset (335 videos) and 30 additional videos. In this challenge, there are 12 teams and 2 teams that submitted the final results to Track 1 and Track 2, respectively. The proposed methods and solutions gauge the state-of-the-art of super-resolution on compressed image and video. The proposed LDV 3.0 dataset is available at https://github.com/RenYang-home/LDV_dataset. The homepage of this challenge is at https://github.com/RenYang-home/AIM22_CompressSR.
Submitted 25 August, 2022; v1 submitted 23 August, 2022;
originally announced August 2022.
-
Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition
Authors:
Goutham Rajendran,
Wei Zou
Abstract:
We investigate robustness properties of pre-trained neural models for automatic speech recognition. Real life data in machine learning is usually very noisy and almost never clean, which can be attributed to various factors depending on the domain, e.g. outliers, random noise and adversarial noise. Therefore, the models we develop for various tasks should be robust to such kinds of noisy data, which led to the thriving field of robust machine learning. We consider this important issue in the setting of automatic speech recognition. With the increasing popularity of pre-trained models, it's an important question to analyze and understand the robustness of such models to noise. In this work, we perform a robustness analysis of the pre-trained neural models wav2vec2, HuBERT and DistilHuBERT on the LibriSpeech and TIMIT datasets. We use different kinds of noising mechanisms and measure the model performances as quantified by the inference time and the standard Word Error Rate metric. We also do an in-depth layer-wise analysis of the wav2vec2 model when injecting noise in between layers, enabling us to predict at a high level what each layer learns. Finally, for this model, we visualize the propagation of errors across the layers and compare how it behaves on clean versus noisy data. Our experiments conform to the predictions of Pasad et al. [2021] and also raise interesting directions for future work.
Submitted 17 August, 2022;
originally announced August 2022.
-
DSLA: Dynamic smooth label assignment for efficient anchor-free object detection
Authors:
Hu Su,
Yonghao He,
Rui Jiang,
Jiabin Zhang,
Wei Zou,
Bin Fan
Abstract:
Anchor-free detectors basically formulate object detection as dense classification and regression. For popular anchor-free detectors, it is common to introduce an individual prediction branch to estimate the quality of localization. The following inconsistencies are observed when we delve into the practices of classification and quality estimation. Firstly, for some adjacent samples which are assigned completely different labels, the trained model would produce similar classification scores. This violates the training objective and leads to performance degradation. Secondly, it is found that detected bounding boxes with higher confidences contrarily have smaller overlaps with the corresponding ground-truth. Accurately localized bounding boxes would be suppressed by less accurate ones in the Non-Maximum Suppression (NMS) procedure. To address the inconsistency problems, the Dynamic Smooth Label Assignment (DSLA) method is proposed. Based on the concept of centerness originally developed in FCOS, a smooth assignment strategy is proposed. The label is smoothed to a continuous value in [0, 1] to make a steady transition between positive and negative samples. Intersection-over-Union (IoU) is predicted dynamically during training and is coupled with the smoothed label. The dynamic smooth label is assigned to supervise the classification branch. Under such supervision, the quality estimation branch is naturally merged into the classification branch, which simplifies the architecture of the anchor-free detector. Comprehensive experiments are conducted on the MS COCO benchmark. It is demonstrated that DSLA can significantly boost the detection accuracy by alleviating the above inconsistencies for anchor-free detectors. Our codes are released at https://github.com/YonghaoHe/DSLA.
Submitted 29 September, 2022; v1 submitted 1 August, 2022;
originally announced August 2022.
-
Prior-Guided One-shot Neural Architecture Search
Authors:
Peijie Dong,
Xin Niu,
Lujun Li,
Linzhen Xie,
Wenbin Zou,
Tian Ye,
Zimian Wei,
Hengyue Pan
Abstract:
Neural architecture search methods seek optimal candidates with efficient weight-sharing supernet training. However, recent studies indicate poor ranking consistency in performance between stand-alone architectures and shared-weight networks. In this paper, we present Prior-Guided One-shot NAS (PGONAS) to strengthen the ranking correlation of supernets. Specifically, we first explore the effect of activation functions and propose a balanced sampling strategy based on the Sandwich Rule to alleviate weight coupling in the supernet. Then, FLOPs and Zen-Score are adopted to guide the training of the supernet with a ranking correlation loss. Our PGONAS ranks 3rd in the supernet track of the CVPR2022 Second lightweight NAS challenge. Code is available at https://github.com/pprp/CVPR2022-NAS?competition-Track1-3th-solution.
Submitted 27 June, 2022;
originally announced June 2022.
-
Sharp estimates, uniqueness and nondegeneracy of positive solutions of the Lane-Emden system in planar domains
Authors:
Zhijie Chen,
Houwang Li,
Wenming Zou
Abstract:
We study the Lane-Emden system $$\begin{cases} -\Delta u=v^p,\quad u>0,\quad\text{in}~\Omega,\\ -\Delta v=u^q,\quad v>0,\quad\text{in}~\Omega,\\ u=v=0,\quad\text{on}~\partial\Omega, \end{cases}$$ where $\Omega\subset\mathbb{R}^2$ is a smooth bounded domain. In a recent work, we studied the concentration phenomena of positive solutions as $p,q\to+\infty$ and $|q-p|\leq \Lambda$. In this paper, we obtain sharp estimates of such multi-bubble solutions, including sharp convergence rates of local maxima and scaling parameters, and accurate approximations of solutions. As an application of these sharp estimates, we show that when $\Omega$ is convex, the solution of this system is unique and nondegenerate for large $p, q$.
Submitted 24 July, 2022; v1 submitted 30 May, 2022;
originally announced May 2022.
-
R2D2: Robust Data-to-Text with Replacement Detection
Authors:
Linyong Nan,
Lorenzo Jaime Yu Flores,
Yilun Zhao,
Yixin Liu,
Luke Benson,
Weijin Zou,
Dragomir Radev
Abstract:
Unfaithful text generation is a common problem for text generation systems. In the case of Data-to-Text (D2T) systems, the factuality of the generated text is particularly crucial for any real-world application. We introduce R2D2, a training framework that addresses unfaithful Data-to-Text generation by training a system both as a generator and a faithfulness discriminator with additional replacement detection and unlikelihood learning tasks. To facilitate such training, we propose two methods for sampling unfaithful sentences. We argue that the poor entity retrieval capability of D2T systems is one of the primary sources of unfaithfulness, so in addition to the existing metrics, we further propose NER-based metrics to evaluate the fidelity of D2T generations. Our experimental results show that R2D2 systems can effectively mitigate unfaithful text generation, and they achieve new state-of-the-art results on FeTaQA, LogicNLG, and ToTTo, all with significant improvements.
Submitted 24 May, 2022;
originally announced May 2022.
-
NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results
Authors:
Yawei Li,
Kai Zhang,
Radu Timofte,
Luc Van Gool,
Fangyuan Kong,
Mingxi Li,
Songwei Liu,
Zongcai Du,
Ding Liu,
Chenhui Zhou,
**gyi Chen,
Qingrui Han,
Zheyuan Li,
Yingqi Liu,
Xiangyu Chen,
Haoming Cai,
Yu Qiao,
Chao Dong,
Long Sun,
**shan Pan,
Yi Zhu,
Zhikai Zong,
Xiaoxiao Liu,
Zheng Hui,
Tao Yang
, et al. (86 additional authors not shown)
Abstract:
This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with a focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that improved efficiency, measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption, while at least maintaining a PSNR of 29.00dB on the DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks: the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The ranks of the teams were determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered, and the individual rankings of the two metrics were summed to determine a final ranking in this track. In sub-track two, all five metrics mentioned in the description of the challenge (runtime, parameter count, FLOPs, activations, and memory consumption) were considered. Similar to sub-track one, the rankings of the five metrics were summed to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. These submissions gauge the state of the art in efficient single image super-resolution.
Submitted 11 May, 2022;
originally announced May 2022.
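The rank-sum scoring described in the NTIRE 2022 abstract above (rank the teams per metric, then sum the ranks) can be sketched as follows. This is a minimal illustration, not the organizers' scoring script: the function name `rank_sum`, the team names, and all metric values are hypothetical, lower values are assumed to be better for every metric, and ties are not treated specially.

```python
def rank_sum(teams, metrics):
    """Rank teams per metric (1 = best, lower value is better) and sum the ranks.

    teams:   dict mapping team name -> dict mapping metric name -> value
    metrics: list of metric names to include in the final ranking
    Returns a list of (team, total_rank) pairs, best total first.
    """
    totals = {name: 0 for name in teams}
    for metric in metrics:
        # Sort team names by this metric; the position in the order is the rank.
        ordered = sorted(teams, key=lambda name: teams[name][metric])
        for position, name in enumerate(ordered, start=1):
            totals[name] += position
    return sorted(totals.items(), key=lambda item: item[1])


# Hypothetical values, for illustration only (not challenge results).
teams = {
    "A": {"params": 0.30, "flops": 20.0},
    "B": {"params": 0.25, "flops": 30.0},
    "C": {"params": 0.40, "flops": 25.0},
}
final = rank_sum(teams, ["params", "flops"])
print(final)
```

With two metrics this mirrors the sub-track one rule; passing five metric names would mirror sub-track two under the same assumptions.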
-
Existence and asymptotic behavior of normalized ground states for Sobolev critical Schrödinger systems
Authors:
Thomas Bartsch,
Houwang Li,
Wenming Zou
Abstract:
The paper is concerned with the existence and asymptotic properties of normalized ground states of the following nonlinear Schrödinger system with critical exponent: \begin{equation*}
\left\{\begin{aligned}
&-Δu+λ_1 u=|u|^{2^*-2}u+{να} |u|^{α-2}|v|^βu,\quad \text{in }\mathbb{R}^N,\\
&-Δv+λ_2 v=|v|^{2^*-2}v+{νβ} |u|^α|v|^{β-2}v,\quad \text{in }\mathbb{R}^N,\\
&\int u^2=a^2,\;\;\; \int v^2=b^2,
\end{aligned} \right. \end{equation*} where $N=3,4$, $α,β>1$, $2<α+β<2^*=\frac{2N}{N-2}$. We prove that a normalized ground state does not exist for $ν<0$. When $ν>0$ and $α+β\le 2+\frac{4}{N}$, we show that the system has a normalized ground state solution for $0<ν<ν_0$, where the constant $ν_0$ is given explicitly. In the case $α+β>2+\frac{4}{N}$ we prove the existence of a threshold $ν_1\ge 0$ such that a normalized ground state solution exists for $ν>ν_1$, and does not exist for $ν<ν_1$. We also give conditions for $ν_1=0$. Finally we obtain the asymptotic behavior of the minimizers as $ν\to0^+$ or $ν\to+\infty$.
Submitted 22 April, 2022;
originally announced April 2022.
-
Self-Calibrated Efficient Transformer for Lightweight Super-Resolution
Authors:
Wenbin Zou,
Tian Ye,
Weixin Zheng,
Yunchen Zhang,
Liang Chen,
Yi Wu
Abstract:
Recently, deep learning has been successfully applied to single-image super-resolution (SISR) with remarkable performance. However, most existing methods focus on building a more complex network with a large number of layers, which can entail heavy computational costs and memory storage. To address this problem, we present a lightweight Self-Calibrated Efficient Transformer (SCET) network. The architecture of SCET mainly consists of the self-calibrated module and efficient transformer block, where the self-calibrated module adopts the pixel attention mechanism to extract image features effectively. To further exploit the contextual information from features, we employ an efficient transformer to help the network obtain similar features over long distances and thus recover sufficient texture details. We provide comprehensive results on different settings of the overall network. Our proposed method achieves better performance than baseline methods. The source code and pre-trained models are available at https://github.com/AlexZou14/SCET.
Submitted 19 April, 2022;
originally announced April 2022.
-
Audio Deep Fake Detection System with Neural Stitching for ADD 2022
Authors:
Rui Yan,
Cheng Wen,
Shuran Zhou,
Tingwei Guo,
Wei Zou,
Xiangang Li
Abstract:
This paper describes our best system and methodology for ADD 2022: The First Audio Deep Synthesis Detection Challenge \cite{Yi2022ADD}. The same system was used for both rounds of evaluation in Track 3.2 with a similar training methodology. The first round of Track 3.2 data is generated from text-to-speech (TTS) or voice conversion (VC) algorithms, while the second round of data consists of fake audio generated by other participants in Track 3.1, aiming to spoof our systems. Our systems use a standard 34-layer ResNet with multi-head attention pooling \cite{india2019self} to learn the discriminative embedding for fake audio and spoof detection. We further utilize neural stitching to boost the model's generalization capability so that it performs equally well in different tasks; more details are explained in the following sections. The experiments show that our proposed method outperforms all other systems with a 10.1% equal error rate (EER) in Track 3.2.
Submitted 19 April, 2022; v1 submitted 19 April, 2022;
originally announced April 2022.
-
Time Domain Adversarial Voice Conversion for ADD 2022
Authors:
Cheng Wen,
Tingwei Guo,
Xingjun Tan,
Rui Yan,
Shuran Zhou,
Chuandong Xie,
Wei Zou,
Xiangang Li
Abstract:
In this paper, we describe our speech generation system for the first Audio Deep Synthesis Detection Challenge (ADD 2022). First, we build an any-to-many voice conversion (VC) system to convert source speech with arbitrary language content into the target speaker's fake speech. Then the converted speech generated from VC is post-processed in the time domain to improve the deception ability. The experimental results show that our system has adversarial ability against anti-spoofing detectors with a small compromise in audio quality and speaker similarity. This system ranks top in Track 3.1 of ADD 2022, showing that our method also generalizes well against different detectors.
Submitted 19 April, 2022; v1 submitted 19 April, 2022;
originally announced April 2022.
-
Audio-Visual Wake Word Spotting System For MISP Challenge 2021
Authors:
Yanguang Xu,
Jianwei Sun,
Yang Han,
Shuaijiang Zhao,
Chaoyang Mei,
Tingwei Guo,
Shuran Zhou,
Chuandong Xie,
Wei Zou,
Xiangang Li
Abstract:
This paper presents the details of our system designed for Task 1 of the Multimodal Information Based Speech Processing (MISP) Challenge 2021. The purpose of Task 1 is to leverage both audio and video information to improve the environmental robustness of far-field wake word spotting. In the proposed system, we first take advantage of speech enhancement algorithms such as beamforming and weighted prediction error (WPE) to process the multi-microphone conversational audio. Second, several data augmentation techniques are applied to simulate a more realistic far-field scenario. For the video information, the provided region of interest (ROI) is used to obtain the visual representation. Then a multi-layer CNN is proposed to learn audio and visual representations, and these representations are fed into our two-branch attention-based network, such as a transformer or conformer, which is employed for fusion. The focal loss is used to fine-tune the model and improves the performance significantly. Finally, multiple trained models are integrated by voting to achieve our final score of 0.091.
Submitted 19 April, 2022; v1 submitted 19 April, 2022;
originally announced April 2022.
-
Lensless coherent diffraction imaging based on spatial light modulator with unknown modulation curve
Authors:
Hao Sha,
Chao He,
Shaowei Jiang,
Pengming Song,
Shuai Liu,
Wenzhen Zou,
Peiwu Qin,
Haoqian Wang,
Yongbing Zhang
Abstract:
Lensless imaging has become a popular research field in recent years owing to its advantages of small size, wide field-of-view and low aberration. However, some traditional lensless imaging methods suffer from slow convergence, mechanical errors and conjugate solution interference, which limit their further application and development. In this work, we propose a lensless imaging method based on a spatial light modulator (SLM) with an unknown modulation curve. In our imaging system, we use the SLM to modulate the wavefront of the object, and introduce a ptychographic scanning algorithm that is able to recover the complex amplitude information even when the SLM modulation curve is inaccurate or unknown. In addition, we design a split-beam interference experiment to calibrate the modulation curve of the SLM; using the calibrated modulation function as the initial value of the extended ptychographical iterative engine (ePIE) algorithm improves the convergence speed. We further analyze the effect of the modulation function, algorithm parameters and the characteristics of the coherent light source on the quality of the reconstructed image. The simulated and real experiments show that the proposed method is superior to traditional mechanical scanning methods in terms of recovery speed and accuracy, with a recovered resolution of up to 14 μm.
Submitted 8 April, 2022;
originally announced April 2022.
-
Least energy positive solutions for $d$-coupled Schrödinger systems with critical exponent in dimension three
Authors:
Tianhao Liu,
Song You,
Wenming Zou
Abstract:
In the present paper, we consider the coupled Schrödinger systems with critical exponent: \begin{equation*} \begin{cases} -Δu_i+λ_{i}u_i=\sum\limits_{j=1}^{d} β_{ij}|u_j|^{3}|u_i|u_i \quad ~\text{ in } Ω,\\ u_i \in H_0^1(Ω) ,\quad i= 1,2,...,d. \end{cases} \end{equation*} Here, $Ω\subset \mathbb{R}^{3}$ is a smooth bounded domain, $d \geq 2$, $β_{ii}>0$ for every $i$, and $β_{ij}=β_{ji}$ for $i \neq j$. We study a Brézis-Nirenberg type problem: $-λ_{1}(Ω)<λ_{1},\cdots,λ_{d}<-λ^*(Ω)$, where $λ_{1}(Ω)$ is the first eigenvalue of $-Δ$ with Dirichlet boundary conditions and $λ^*(Ω)\in (0, λ_1(Ω))$. We obtain the existence of least energy positive solutions to this system for the weakly cooperative case ($β_{ij}>0$ small) and for the purely competitive case ($β_{ij}\leq 0$) by variational arguments. The proof is performed by mathematical induction on the number of equations, and requires more refined energy estimates for this system. Besides, we present a new nonexistence result, revealing some different phenomena compared with the higher-dimensional case $N\geq 5$. It seems that this is the first paper to give a rather complete picture of the existence of least energy positive solutions to critical Schrödinger systems in dimension three.
Submitted 1 April, 2022;
originally announced April 2022.
-
Graph Flow: Cross-layer Graph Flow Distillation for Dual Efficient Medical Image Segmentation
Authors:
Wenxuan Zou,
Muyi Sun
Abstract:
With the development of deep convolutional neural networks, medical image segmentation has achieved a series of breakthroughs in recent years. However, high-performance convolutional neural networks usually come with numerous parameters and high computation costs, which hinder their application in clinical scenarios. Meanwhile, the scarcity of large-scale annotated medical image datasets further impedes the application of high-performance networks. To tackle these problems, we propose Graph Flow, a comprehensive knowledge distillation framework, for both network-efficient and annotation-efficient medical image segmentation. Specifically, our core Graph Flow Distillation transfers the essence of cross-layer variations from a well-trained cumbersome teacher network to a non-trained compact student network. In addition, an unsupervised Paraphraser Module is integrated to purify the knowledge of the teacher network, which is also beneficial for stabilizing the training procedure. Furthermore, we build a unified distillation framework by integrating adversarial distillation and vanilla logits distillation, which can further refine the final predictions of the compact network. With different teacher networks (conventional convolutional architecture or prevalent transformer architecture) and student networks, we conduct extensive experiments on four medical image datasets with different modalities (Gastric Cancer, Synapse, BUSI, and CVC-ClinicDB). We demonstrate that our method achieves competitive performance on these datasets. Moreover, we demonstrate the effectiveness of our Graph Flow through a novel semi-supervised paradigm for dual efficient medical image segmentation. Our code will be available at Graph Flow.
Submitted 29 August, 2022; v1 submitted 16 March, 2022;
originally announced March 2022.
-
Role of limiting dispersal on metacommunity stability and persistence
Authors:
Snehasish Roy Chowdhury,
Ramesh Arumugam,
Wei Zou,
V. K. Chandrasekar,
D. V. Senthilkumar
Abstract:
The role of dispersal in the stability and synchrony of a metacommunity is a topic of considerable interest in theoretical ecology. Dispersal is known to promote both synchrony, which enhances the likelihood of extinction, and spatial heterogeneity, which favors the persistence of the population. Several efforts have been made to understand the effect of diverse variants of dispersal in spatially distributed ecological communities. Although environmental change strongly affects dispersal, the effects of controlled dispersal on metacommunity stability and persistence remain unknown. We study the influence of limiting immigration using a two-patch prey-predator metacommunity at both local and spatial scales. We find that the spread of the inhomogeneous stable steady states (asynchronous states) decreases monotonically upon limiting the predator dispersal. Nevertheless, at the local scale, the spread of the inhomogeneous steady states increases up to a critical value of the limiting factor, favoring metacommunity persistence, and then starts decreasing for further decrease in the limiting factor with varying local interaction. Interestingly, limiting the prey dispersal promotes inhomogeneous steady states in a large region of the parameter space, thereby increasing metacommunity persistence at both spatial and local scales. Further, we show similar qualitative dynamics in an entire class of complex networks consisting of a large number of patches. We also deduce various bifurcation curves and stability conditions for the inhomogeneous steady states, which we find to agree well with the simulation results. Thus, our findings on the effect of limiting dispersal can help to develop conservation measures for ecological communities.
Submitted 8 March, 2022;
originally announced March 2022.
-
Scalable algorithm simplification using quantum AND logic
Authors:
Ji Chu,
Xiaoyu He,
Yuxuan Zhou,
Jiahao Yuan,
Libo Zhang,
Qihao Guo,
Yongju Hai,
Zhikun Han,
Chang-Kang Hu,
Wenhui Huang,
Hao Jia,
Dawei Jiao,
Yang Liu,
Zhongchu Ni,
Xianchuang Pan,
Jiawei Qiu,
Weiwei Wei,
Zusheng Yang,
Jiajian Zhang,
Zhida Zhang,
Wan**g Zou,
Yuanzhen Chen,
Xiaowei Deng,
Xiuhao Deng,
Ling Hu
, et al. (7 additional authors not shown)
Abstract:
Implementing quantum algorithms on realistic hardware requires translating high-level global operations into sequences of native elementary gates, a process known as quantum compiling. Physical limitations, such as constraints in connectivity and gate alphabets, often result in unacceptable implementation costs. To enable successful near-term applications, it is crucial to optimize compilation by exploiting the potential capabilities of existing hardware. Here, we implement a resource-efficient construction for a quantum version of AND logic that can reduce the cost, enabling the execution of key quantum circuits. On a high-scalability superconducting quantum processor, we demonstrate low-depth synthesis of high-fidelity generalized Toffoli gates with up to 8 qubits and Grover's search algorithm in a search space of up to 64 entries; both are the largest such implementations in scale to date. Our experimental demonstration illustrates a scalable implementation of simplifying quantum algorithms, paving the way for larger, more meaningful quantum applications on noisy devices.
Submitted 29 December, 2021;
originally announced December 2021.
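For context on the 64-entry Grover search mentioned in the abstract above, the textbook iteration count and ideal success probability for an unstructured search can be computed as below. This is a sketch of the standard noiseless formulas only; the abstract does not state how many iterations the experiment used, and the function name `grover_iterations` is hypothetical.

```python
import math


def grover_iterations(n_entries, n_marked=1):
    """Return (iterations, ideal success probability) for Grover's search.

    Uses the standard relations for a noiseless search:
        theta = asin(sqrt(n_marked / n_entries))
        k     = floor(pi / (4 * theta))
        p     = sin((2k + 1) * theta) ** 2
    """
    theta = math.asin(math.sqrt(n_marked / n_entries))
    k = math.floor(math.pi / (4 * theta))
    p = math.sin((2 * k + 1) * theta) ** 2
    return k, p


# A 64-entry search space corresponds to 6 qubits of search register.
k, p = grover_iterations(64)
print(k, round(p, 4))
```

On noisy hardware the achievable success probability is lower, and fewer iterations may be preferable because each iteration adds circuit depth.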
-
High-order tensor flow processing using integrated photonic circuits
Authors:
Shaofu Xu,
**g Wang,
Sicheng Yi,
Weiwen Zou
Abstract:
Tensor analytics lays the mathematical basis for the advancement of multiway signal processing. To increase computing throughput, mainstream processors transform tensor convolutions into matrix multiplications to enhance the parallelism of computing. However, such an order-reducing transformation produces data duplicates and consumes additional memory. Here, we demonstrate an integrated photonic tensor flow processor without tensor-matrix transformation, which outputs the convolved tensor as the input tensor 'flows' through the processor. The hybrid manipulation of the optical dimensions of wavelength, time, and space enables the direct representation and processing of high-order tensors in the optical domain. In the proof-of-concept experiment, processing of multi-channel images and videos is accomplished at a frequency of 20 GHz. A convolutional neural network is demonstrated on the processor, which achieves an accuracy of 97.9 percent on action recognition.
Submitted 22 December, 2021;
originally announced December 2021.
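The data duplication that the abstract above attributes to convolution-as-matrix-multiplication can be seen in a small 2-D example. The sketch below uses the common im2col unfolding in pure Python; it illustrates the conventional electronic approach the paper contrasts itself with, not the photonic processor, and the helper names `im2col` and `conv2d_as_matmul` are hypothetical. The convolution is the unflipped (cross-correlation) form used in neural networks, with stride 1 and no padding.

```python
def im2col(image, k):
    """Unfold a 2-D list into rows, each a flattened k x k patch (stride 1, no padding)."""
    h, w = len(image), len(image[0])
    return [
        [image[i + di][j + dj] for di in range(k) for dj in range(k)]
        for i in range(h - k + 1)
        for j in range(w - k + 1)
    ]


def conv2d_as_matmul(image, kernel):
    """Compute a 2-D convolution as (patch matrix) x (flattened kernel)."""
    k = len(kernel)
    flat_kernel = [v for row in kernel for v in row]
    patches = im2col(image, k)
    out_w = len(image[0]) - k + 1
    # One dot product per patch row is the matrix-vector product.
    flat_out = [sum(p * q for p, q in zip(patch, flat_kernel)) for patch in patches]
    return [flat_out[r * out_w:(r + 1) * out_w] for r in range(len(flat_out) // out_w)]


image = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
kernel = [[1, 0], [0, 1]]

patches = im2col(image, 2)
# Ratio of stored values after unfolding to values in the original image.
duplication = sum(len(p) for p in patches) / (len(image) * len(image[0]))
result = conv2d_as_matmul(image, kernel)
print(duplication, result)
```

Even for this 4 x 4 input and 2 x 2 kernel the unfolded matrix holds more than twice as many values as the image, and the overhead grows with kernel size.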
-
On universally optimal lattice phase transitions and energy minimizers of completely monotone potentials
Authors:
Sen** Luo,
Juncheng Wei,
Wenming Zou
Abstract:
We consider the minimization problem for energy functionals with two types of competing particles and a completely monotone potential on a lattice. We prove that the minimum of the sum of two completely monotone functions among lattices is located exactly on a special curve which is part of the boundary of the fundamental region. We also establish a universal result for the square lattice being optimal in a certain interval, which is surprising. Our result establishes the hexagonal-rhombic-square-rectangular transition of lattice shapes in many physical and biological systems (such as Bose-Einstein condensates and two-component Ginzburg-Landau systems). It turns out that our results also apply to locating the minimizers of the sum of two Eisenstein series, which is new in number theory.
Submitted 17 October, 2021;
originally announced October 2021.
-
SDWNet: A Straight Dilated Network with Wavelet Transformation for Image Deblurring
Authors:
Wenbin Zou,
Mingchao Jiang,
Yunchen Zhang,
Liang Chen,
Zhiyong Lu,
Yi Wu
Abstract:
Image deblurring is a classical computer vision problem that aims to recover a sharp image from a blurred image. To solve this problem, existing methods design complex networks based on the Encode-Decode architecture to achieve good performance. However, most of these methods use repeated up-sampling and down-sampling structures to expand the receptive field, which results in texture information loss during the sampling process, and some of them use multiple stages, which leads to difficulties with convergence. Therefore, our model uses dilated convolution to obtain a large receptive field with high spatial resolution. By making full use of the different receptive fields, our method can achieve better performance. On this basis, we reduce the number of up-sampling and down-sampling operations and design a simple network structure. Besides, we propose a novel module using the wavelet transform, which effectively helps the network to recover clear high-frequency texture details. Qualitative and quantitative evaluations on real and synthetic datasets show that our deblurring method is comparable to existing algorithms in terms of performance with much lower training requirements. The source code and pre-trained models are available at https://github.com/FlyEgle/SDWNet.
Submitted 12 October, 2021;
originally announced October 2021.
-
Least energy positive solutions of critical Schrödinger systems with mixed competition and cooperation terms: the higher dimensional case
Authors:
Hugo Tavares,
Song You,
Wenming Zou
Abstract:
Let $Ω\subset \mathbb{R}^{N}$ be a smooth bounded domain. In this paper we investigate the existence of least energy positive solutions to the following Schrödinger system with $d\geq 2$ equations \begin{equation*} -Δu_{i}+λ_{i}u_{i}=|u_{i}|^{p-2}u_{i}\sum_{j = 1}^{d}β_{ij}|u_{j}|^{p} \text{ in } Ω, \quad u_i=0 \text{ on } \partial Ω, \qquad i=1,...,d, \end{equation*} in the case of a critical exponent $2p=2^*=\frac{2N}{N-2}$ in high dimensions $N\geq 5$. We treat the focusing case ($β_{ii}>0$ for every $i$) in the variational setting $β_{ij}=β_{ji}$ for every $i\neq j$, dealing with a Brézis-Nirenberg type problem: $-λ_{1}(Ω)<λ_{i}<0$, where $λ_{1}(Ω)$ is the first eigenvalue of $(-Δ,H^1_0(Ω))$. We provide several sufficient conditions on the coefficients $β_{ij}$ that ensure the existence of least energy positive solutions; these include the situations of pure cooperation ($β_{ij}> 0$ for every $i\neq j$), pure competition ($β_{ij}\leq 0$ for every $i\neq j$) and coexistence of both cooperation and competition coefficients. Some proofs depend heavily on the fact that $1<p<2$, revealing some different phenomena compared to the special case $N=4$.
Our results provide a rather complete picture in the particular situation where the components are divided into two groups. Besides, based on the results about a phase separation phenomenon, we prove the existence of a least energy sign-changing solution to the Brézis-Nirenberg problem \[ -Δu+λu=μ|u|^{2^*-2}u,\quad u\in H^1_0(Ω), \] for $μ>0$, $-λ_1(Ω)<λ<0$ for all $N\geq 4$, a result which is new in dimensions $N=4,5$.
Submitted 29 September, 2021;
originally announced September 2021.
-
Comb-based photonic neural population for parallel and nonlinear processing
Authors:
Bowen Ma,
Junfeng Zhang,
Weiwen Zou
Abstract:
It is believed that neural information representation and processing rely on the neural population instead of a single neuron. In neuromorphic photonics, photonic neurons in the form of nonlinear responses have been extensively studied in single devices and temporal nodes. However, to construct a photonic neural population (PNP), scaling up and massive interconnection remain challenging considering the physical complexity and response latency. Here, we propose a comb-based PNP interconnected by carrier coupling with superior scalability. Two unique properties of a neural population are theoretically and experimentally demonstrated in the comb-based PNP, including nonlinear response curves and population activity coding. A classification task of three input patterns with dual radio-frequency (RF) tones is successfully implemented in real time, which demonstrates that the comb-based PNP can make effective use of the ultra-broad bandwidth of photonics for parallel and nonlinear processing.
Submitted 25 September, 2021;
originally announced September 2021.
-
Characterization of the frequency response of channel-interleaved photonic ADCs based on the optical time-division demultiplexer
Authors:
Na Qian,
Linbo Zhang,
Jian** Chen,
Weiwen Zou
Abstract:
We characterize the frequency response of channel-interleaved photonic analog-to-digital converters (CI-PADCs) theoretically and experimentally. The CI-PADC is composed of a photonic frontend for photonic sampling and an electronic backend for quantization. The photonic frontend includes a photonic sampling pulse generator for directly high-speed sampling and an optical time-division demultiplexer (OTDM) for channel demultiplexing. It is found that the frequency response of the CI-PADC is influenced by both the photonic sampling pulses and the OTDM, of which the combined impact can be characterized through demultiplexed pulse trains. First, the frequency response can be divided into multiple frequency intervals and the range of the frequency interval equals the repetition rate of demultiplexed pulse trains. Second, the analog bandwidth of the CI-PADC is determined by the optical spectral bandwidth of demultiplexed pulse trains which is broadened in the OTDM. Further, the effect of the OTDM is essential for enlarging the analog bandwidth of the CI-PADC employing the photonic sampling pulses with a limited optical spectral bandwidth.
△ Less
Submitted 3 September, 2021;
originally announced September 2021.
-
BLESER: Bug Localization Based on Enhanced Semantic Retrieval
Authors:
Weiqin Zou,
Enming Li,
Chunrong Fang
Abstract:
Static bug localization techniques that locate bugs at method granularity have gained much attention from both researchers and practitioners. For a static method-level bug localization technique, a key but challenging step is to fully retrieve the semantics of methods and bug reports. Currently, existing studies mainly use the same bag-of-words space to represent the semantics of methods and bug reports without considering the structural information of methods and the textual contexts of bug reports, which substantially degrades bug localization performance.
To address this problem, we develop BLESER, a new bug localization technique based on enhanced semantic retrieval. Specifically, we use an AST-based code embedding model (capturing code structure better) to retrieve the semantics of methods, and word embedding models (capturing textual contexts better) to represent the semantics of bug reports. Then, a deep learning model is built on the enhanced semantic representations. During model building, we compare five typical word embedding models in representing bug reports and try to explore the usefulness of re-sampling strategies and cost-sensitive strategies in handling class imbalance problems. We evaluate our BLESER on five Java projects from the Defects4J dataset. We find that: (1) On the whole, the word embedding model ELMo outperformed the other four models (including word2vec, BERT, etc.) in facilitating bug localization techniques. (2) Among four strategies aiming at solving class imbalance problems, the strategy ROS (random over-sampling) performed much better than the other three strategies (including random under-sampling, Focal Loss, etc.). (3) By integrating ELMo and ROS into BLESER, at method-level bug localization, we could achieve MAP of 0.108-0.504, MRR of 0.134-0.510, and Accuracy@1 of 0.125-0.5 on five Defects4J projects.
Submitted 8 September, 2021;
originally announced September 2021.
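The ranking metrics named in the abstract above (MAP, MRR and Accuracy@1) can be illustrated with a minimal sketch. It assumes that each bug report produces a ranked list of candidate methods together with a set of truly buggy methods; the function names and the toy data are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Sequence, Set, Tuple

# One query = (ranked candidate methods, set of truly buggy methods).
Query = Tuple[Sequence[str], Set[str]]


def reciprocal_rank(ranked: Sequence[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant item, or 0.0 if none is ranked."""
    for position, item in enumerate(ranked, start=1):
        if item in relevant:
            return 1.0 / position
    return 0.0


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Mean of precision@k taken at each rank k that holds a relevant item."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for position, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            precision_sum += hits / position
    return precision_sum / len(relevant)


def evaluate(queries: List[Query]) -> Dict[str, float]:
    """Average the per-query scores over all bug reports."""
    n = len(queries)
    return {
        "MAP": sum(average_precision(r, g) for r, g in queries) / n,
        "MRR": sum(reciprocal_rank(r, g) for r, g in queries) / n,
        "Accuracy@1": sum(1.0 for r, g in queries if r and r[0] in g) / n,
    }


# Hypothetical toy data: two bug reports with three candidate methods each.
toy_queries: List[Query] = [
    (["a", "b", "c"], {"b"}),       # first hit at rank 2
    (["x", "y", "z"], {"x", "z"}),  # hits at ranks 1 and 3
]
scores = evaluate(toy_queries)
```

On the toy data the first report contributes a reciprocal rank of 1/2 and the second a reciprocal rank of 1, so the sketch yields an MRR of 0.75 and an Accuracy@1 of 0.5.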
-
Metrics to find a surrogate endpoint of OS in metastatic oncology trials: a simulation study
Authors:
Wei Zou
Abstract:
A surrogate endpoint (SE) for overall survival in cancer patients is essential to improving the efficiency of oncology drug development. In practice, we may discover a new patient-level association with survival, based on one or more clinical or biological features, in a discovery cohort, and then measure the trial-level association across studies in a meta-analysis to validate the SE. To understand how well various patient-level metrics would indicate the eventual trial-level association, we considered causal biological trajectories based on bi-exponential functions, modeled the strength of their impact on survival hazards via a parameter α, and simulated the trajectories and survival times in randomized trials simultaneously. We set an early time point in the trials at which the trajectory measurement became the SE value. From simulated discovery cohorts, we compared patient-level metrics including the C index, the integrated Brier score, and the log hazard ratio between SE values and survival times. We assembled multiple simulated studies to enable meta-analyses to estimate the trial-level association. Across all the simulation scenarios considered here, we found tight correlations among the three patient-level metrics and similar correlations between any of them and the trial-level metric. Despite the continual increase in α, both patient-level and trial-level metrics often plateaued together; their association always decreased quickly as α increased. This suggests that incorporating additional biological factors into a composite SE is likely to have diminishing returns on improving both patient-level and trial-level association.
Submitted 6 November, 2022; v1 submitted 8 September, 2021;
originally announced September 2021.
-
Thermoelectric and stress distributions around a smooth cavity in thermoelectric material
Authors:
Zhaohang Lee,
Yu Tang,
Wennan Zou
Abstract:
Thermoelectric materials have attracted more and more attention since they are environmentally friendly and have potential for sustainable and renewable energy applications. Because they are typically brittle semiconductors with low mechanical strength and are always subject to defects and damage, clarifying the stress concentration is very important in the design and implementation of thermoelectric devices. The two-dimensional thermoelectric coupling problem due to a cavity embedded in an infinite isotropic homogeneous thermoelectric material, subjected to uniform electric current density or uniform energy flux, is studied, where the shape of the cavity is characterized by a Laurent polynomial, and electrically insulated and adiabatic boundary conditions around the cavity are considered. The explicit analytic solutions for the Kolosov-Muskhelishvili (K-M) potentials and the rigid-body translation are obtained through a novel approach. Compared with previously reported results, the new solutions are completely exact and possess a finite form. Some results for three typical cavities are presented to analyze the electric current densities (energy fluxes) and stresses around the tips. The main conclusions include: the distribution of the thermoelectric field and stress at the tip clearly depends on the curvature of the contour and the loading direction; for the triangle and the square with symmetrical tips, the thermoelectric and stress concentrations reach their maximum or minimum when the loading direction is parallel or perpendicular to the symmetry axis of the tip, which is distinct from the extremum characteristics of the pentagram, whose curvature is bimodal around the tip; the maximum thermoelectric and stress concentrations appear near the maximum curvature point for most loading directions, but not at the maximum curvature point itself.
Submitted 3 September, 2021;
originally announced September 2021.
-
CoCo DistillNet: a Cross-layer Correlation Distillation Network for Pathological Gastric Cancer Segmentation
Authors:
Wenxuan Zou,
Muyi Sun
Abstract:
In recent years, deep convolutional neural networks have made significant advances in pathology image segmentation. However, pathology image segmentation faces a dilemma in which higher-performance networks generally require more computational resources and storage. This limits the deployment of high-accuracy networks in real scenarios because of the inherently high resolution of pathological images. To tackle this problem, we propose CoCo DistillNet, a novel Cross-layer Correlation (CoCo) knowledge distillation network for pathological gastric cancer segmentation. Knowledge distillation is a general technique that aims to improve the performance of a compact network through knowledge transfer from a cumbersome network. Concretely, our CoCo DistillNet models the correlations of channel-mixed spatial similarity between different layers and then transfers this knowledge from a pre-trained cumbersome teacher network to an untrained compact student network. In addition, we utilize an adversarial learning strategy, called Adversarial Distillation (AD), to further promote the distillation procedure. Furthermore, to stabilize the training procedure, we make use of the unsupervised Paraphraser Module (PM) to boost the knowledge paraphrase in the teacher network. Extensive experiments conducted on the Gastric Cancer Segmentation Dataset demonstrate the prominent ability of CoCo DistillNet, which achieves state-of-the-art performance.
Submitted 12 November, 2021; v1 submitted 27 August, 2021;
originally announced August 2021.
-
PAENet: A Progressive Attention-Enhanced Network for 3D to 2D Retinal Vessel Segmentation
Authors:
Zhuojie Wu,
Zijian Wang,
Wenxuan Zou,
Fan Ji,
Hao Dang,
Wanting Zhou,
Muyi Sun
Abstract:
3D to 2D retinal vessel segmentation is a challenging problem in Optical Coherence Tomography Angiography (OCTA) images. Accurate retinal vessel segmentation is important for the diagnosis and prevention of ophthalmic diseases, and making full use of the 3D data of OCTA volumes is a vital factor for obtaining satisfactory segmentation results. In this paper, we propose a Progressive Attention-Enhanced Network (PAENet) based on attention mechanisms to extract rich feature representations. Specifically, the framework consists of two main parts, the three-dimensional feature learning path and the two-dimensional segmentation path. In the three-dimensional feature learning path, we design a novel Adaptive Pooling Module (APM) and propose a new Quadruple Attention Module (QAM). The APM captures dependencies along the projection direction of volumes and learns a series of pooling coefficients for feature fusion, which efficiently reduces the feature dimension. In addition, the QAM reweights the features by capturing four-group cross-dimension dependencies, which makes maximum use of 4D feature tensors. In the two-dimensional segmentation path, to acquire more detailed information, we propose a Feature Fusion Module (FFM) to inject 3D information into the 2D path. Meanwhile, we adopt the Polarized Self-Attention (PSA) block to model the semantic interdependencies in the spatial and channel dimensions respectively. Extensive experiments on the OCTA-500 dataset show that our proposed algorithm achieves state-of-the-art performance compared with previous methods.
Submitted 16 December, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
Slip topology of steady flows around a critical point: Taking the linear velocity field as an example
Authors:
Wennan Zou,
Jian He
Abstract:
The flow of viscous fluids is considered as the aggregation of the motion of fluid particles when the fluid is conceived to be made up of an infinite number of particles. As an alternative to this conventional model, fluid motion could be understood as the slip of fluid layers of molecular scale over each other, where the slip structures of the fluid and their associated small-scale motion are characterized by an axial-vector-valued differential 1-form, called the vortex field. In this paper, for steady flows, we define the swirling degree of the velocity field at a point, and further the swirl field of the steady flow, to study the slip topology of the fluid, that is, the local streamline pattern around the critical point. The linear velocity field in the right real Schur form is used to carry out detailed analyses around the isolated critical point. Theoretical deduction and numerical tests unveil the connection between the swirling degree and the swirl field, and greatly clarify the topological property of the slip structures of fluid in steady flows, especially in three-dimensional space.
Submitted 5 February, 2024; v1 submitted 18 August, 2021;
originally announced August 2021.
-
Normalized solutions for nonlinear Schrödinger systems with special mass-mixed terms: The linear couple case
Authors:
Zhen Chen,
Xuexiu Zhong,
Wenming Zou
Abstract:
In this paper, we prove the existence of positive solutions $(λ_1,λ_2, u,v)\in \R^2\times H^1(\R^N, \R^2)$ to the following coupled Schrödinger system $$\begin{cases} -Δu + λ_1 u= μ_1|u|^{p-2}u+βv \quad &\hbox{in}\;\RN, \\ -Δv + λ_2 v= μ_2|v|^{q-2}v+βu \quad &\hbox{in}\;\RN, \end{cases}$$ satisfying the normalization constraints $\displaystyle\int_{\RN}u^2 =a, ~ \int_{\RN}v^2 =b$. The parameters $μ_1,μ_2,β>0$ and the masses $a,b>0$ are prescribed.
Here $2+\frac{4}{N}<p,q\leq 2^*$, where $2^* = \frac{2N}{N-2} $ if $N \geq 3$ and $2^* =+ \infty $ if $N=2$. Thus the terms $μ_1|u|^{p-2}u$, $μ_2|v|^{q-2}v$ are mass supercritical, while the linear coupling terms $βv, βu$ are mass subcritical. An essential novelty is that this is the first attempt to deal with linear couplings in the normalized solution framework with mass-mixed terms, which are a major obstacle due to the lack of compactness of the embedding $H^1(\R^N)\hookrightarrow L^2(\R^N)$, even when working in the radial subspace. For the Sobolev subcritical case, we obtain the existence of a positive ground state solution for any given $a,b>0$ and $β>0$, provided $2\leqslant N\leqslant 4$.
For the Sobolev critical case with $N=3,4$, it can be viewed as a counterpart of the Brezis-Nirenberg critical semilinear elliptic problem for the system case in the context of normalized solutions. Under some suitable assumptions, we obtain the existence or non-existence of positive normalized ground state solution.
Submitted 1 August, 2021; v1 submitted 26 July, 2021;
originally announced July 2021.
-
A new deduce of the strict binding inequality and its application: Ground state normalized solution to Schrödinger equations with potential
Authors:
Xuexiu Zhong,
Wenming Zou
Abstract:
In the present paper, we prove the existence of solutions $(λ, u)\in \R\times H^1(\R^N)$ to the following elliptic equation with potential $\displaystyle -Δu+(V(x)+λ)u=g(u)\;\hbox{in}\;\R^N, $ satisfying the normalization constraint $\displaystyle \int_{\R^N}u^2=a>0,$ which arises when searching for solitary wave solutions to the time-dependent nonlinear Schrödinger equation. Besides their importance in applications, our interest in such problems with potential $V(x)$ is also motivated by their stimulating and challenging mathematical difficulties. We develop an approach based on iteration and give a new proof of the so-called "sub-additive inequality", which can simplify the standard process in the traditional sense. Under some very relaxed assumptions on the potential $V(x)$ and some other suitable assumptions on $g$, we obtain the existence of a ground state solution for prescribed $a>0$.
Submitted 1 August, 2021; v1 submitted 26 July, 2021;
originally announced July 2021.
-
Positive normalized solutions to nonlinear elliptic systems in $\R^4$ with critical Sobolev exponent
Authors:
Xiao Luo,
Xiaolong Yang,
Wenming Zou
Abstract:
In this paper, we consider the existence and asymptotic behavior on mass of the positive solutions to the following system: \begin{equation}\label{eqA0.1}\nonumber \begin{cases} -Δu+λ_1u=μ_1u^3+α_1|u|^{p-2}u+βv^2u\quad&\hbox{in}~\R^4,\\ -Δv+λ_2v=μ_2v^3+α_2|v|^{p-2}v+βu^2v\quad&\hbox{in}~\R^4,\\ \end{cases} \end{equation} under the mass constraint $$\int_{\R^4}u^2=a_1^2\quad\text{and}\quad\int_{\R^4}v^2=a_2^2,$$ where $a_1,a_2$ are prescribed, $μ_1,μ_2,β>0$; $α_1,α_2\in \R$, $p\!\in\! (2,4)$ and $λ_1,λ_2\!\in\!\R$ appear as Lagrange multipliers. Firstly, we establish a non-existence result for the repulsive interaction case, i.e., $α_i<0$ $(i=1,2)$. Then, turning to the case of $α_i>0$ $(i=1,2)$, if $2<p<3$, we show that the problem admits a ground state and an excited state, which are characterized respectively by a local minimizer and a mountain-pass critical point of the corresponding energy functional. Moreover, we give a precise asymptotic behavior of these two solutions as $(a_1,a_2)\to (0,0)$ and $a_1\sim a_2$. This seems to be the first contribution regarding the multiplicity as well as the synchronized mass collapse behavior of the normalized solutions to Schrödinger systems with Sobolev critical exponent. When $3\leq p<4$, we prove existence as well as non-existence ($p=3$) results for the ground states, which are characterized by constrained mountain-pass critical points of the corresponding energy functional. Furthermore, precise asymptotic behaviors of the ground states are obtained when the masses of the two components vanish and cluster to an upper bound (or infinity), respectively.
Submitted 19 July, 2021;
originally announced July 2021.
-
AdaL: Adaptive Gradient Transformation Contributes to Convergences and Generalizations
Authors:
Hongwei Zhang,
Weidong Zou,
Hongbo Zhao,
Qi Ming,
Ti** Yan,
Yuanqing Xia,
Weipeng Cao
Abstract:
Adaptive optimization methods have been widely used in deep learning. They scale the learning rates adaptively according to the past gradients, which has been shown to be effective in accelerating convergence. However, they suffer from poor generalization performance compared with SGD. Recent studies point out that smoothing exponential gradient noise leads to the generalization degeneration phenomenon. Inspired by this, we propose AdaL, which applies a transformation to the original gradient. AdaL accelerates convergence by amplifying the gradient in the early stage, and dampens oscillation and stabilizes the optimization by shrinking the gradient later. This modification alleviates the smoothness of the gradient noise, which produces better generalization performance. We have theoretically proved the convergence of AdaL and demonstrated its effectiveness on several benchmarks.
Submitted 3 July, 2021;
originally announced July 2021.
-
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Authors:
Guoguo Chen,
Shuzhou Chai,
Guanbo Wang,
Jiayu Du,
Wei-Qiang Zhang,
Chao Weng,
Dan Su,
Daniel Povey,
Jan Trmal,
Junbo Zhang,
Mingjie **,
Sanjeev Khudanpur,
Shinji Watanabe,
Shuaijiang Zhao,
Wei Zou,
Xiangang Li,
Xuchen Yao,
Yongqing Wang,
Yujun Wang,
Zhao You,
Zhiyong Yan
Abstract:
This paper introduces GigaSpeech, an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training, and 40,000 hours of total audio suitable for semi-supervised and unsupervised training. Around 40,000 hours of transcribed audio is first collected from audiobooks, podcasts and YouTube, covering both read and spontaneous speaking styles, and a variety of topics, such as arts, science, sports, etc. A new forced alignment and segmentation pipeline is proposed to create sentence segments suitable for speech recognition training, and to filter out segments with low-quality transcription. For system training, GigaSpeech provides five subsets of different sizes, 10h, 250h, 1000h, 2500h, and 10000h. For our 10,000-hour XL training subset, we cap the word error rate at 4% during the filtering/validation stage, and for all our other smaller training subsets, we cap it at 0%. The DEV and TEST evaluation sets, on the other hand, are re-processed by professional human transcribers to ensure high transcription quality. Baseline systems are provided for popular speech recognition toolkits, namely Athena, ESPnet, Kaldi and Pika.
Submitted 13 June, 2021;
originally announced June 2021.
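The word-error-rate cap mentioned in the abstract above can be illustrated with a minimal sketch of WER-based segment filtering. It uses the standard definition of WER (word-level edit distance divided by the reference length); the abstract does not say how the corpus pipeline computes WER or where the hypothesis text comes from, so the function names, the whitespace tokenization and the example sentences are assumptions made purely for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the ref words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # ref word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def keep_segment(reference: str, hypothesis: str, max_wer: float = 0.04) -> bool:
    """Keep a segment only if its WER does not exceed the cap (assumed 4%)."""
    return word_error_rate(reference, hypothesis) <= max_wer


# Hypothetical example: one deleted word out of six reference words.
example_wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
```

With a cap of 0.04 the example segment would be rejected, since one error in six words is far above 4%; a cap of 0.0 keeps only segments whose hypothesis matches the reference exactly.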
-
Optical coherent dot-product chip for sophisticated deep learning regression
Authors:
Shaofu Xu,
**g Wang,
Haowen Shu,
Zhike Zhang,
Sicheng Yi,
Bowen Bai,
Xingjun Wang,
Jianguo Liu,
Weiwen Zou
Abstract:
Optical implementations of neural networks (ONNs) herald next-generation high-speed and energy-efficient deep learning computing by harnessing the technical advantages of the large bandwidth and high parallelism of optics. However, due to the problems of an incomplete numerical domain, limited hardware scale, or inadequate numerical accuracy, the majority of existing ONNs were studied for basic classification tasks. Given that regression is a fundamental form of deep learning and accounts for a large part of current artificial intelligence applications, it is necessary to master deep learning regression for further development and deployment of ONNs. Here, we demonstrate a silicon-based optical coherent dot-product chip (OCDC) capable of completing deep learning regression tasks. The OCDC adopts optical fields to carry out operations in the complete real-value domain instead of only in the positive domain. Via reuse, a single chip conducts matrix multiplications and convolutions in neural networks of any complexity. Also, hardware deviations are compensated via in-situ backpropagation control, given the simplicity of the chip architecture. Therefore, the OCDC meets the requirements for sophisticated regression tasks, and we successfully demonstrate a representative neural network, the AUTOMAP (a cutting-edge neural network model for image reconstruction). The quality of the images reconstructed by the OCDC and by a 32-bit digital computer is comparable. To the best of our knowledge, there is no precedent for performing such state-of-the-art regression tasks on an ONN chip. It is anticipated that the OCDC can promote novel accomplishments of ONNs in modern AI applications including autonomous driving, natural language processing, and scientific study.
Submitted 15 December, 2021; v1 submitted 25 May, 2021;
originally announced May 2021.
-
Positive least energy solutions for $k$-coupled Schrödinger system with critical exponent: the higher dimension and cooperative case
Authors:
Xin Yin,
Wenming Zou
Abstract:
In this paper, we study the following $k$-coupled nonlinear Schrödinger system with Sobolev critical exponent:
\begin{equation*}
\left\{
\begin{aligned}
-Δu_i & +λ_iu_i =μ_i u_i^{2^*-1}+\sum_{j=1,j\ne i}^{k} β_{ij} u_{i}^{\frac{2^*}{2}-1}u_{j}^{\frac{2^*}{2}} \quad \hbox{in}\;Ω,\\
u_i&>0 \quad \hbox{in}\; Ω\quad \hbox{and}\quad u_i=0 \quad \hbox{on}\;\partialΩ, \quad i=1,2,\cdots, k.
\end{aligned}
\right.
\end{equation*}
Here $Ω\subset \mathbb{R}^N $ is a smooth bounded domain, $2^{*}=\frac{2N}{N-2}$ is the Sobolev critical exponent, $-λ_1(Ω)<λ_i<0, μ_i>0$ and $ β_{ij}=β_{ji}\ne 0$, where $λ_1(Ω)$ is the first eigenvalue of $-Δ$ with the Dirichlet boundary condition. We characterize the positive least energy solution of the $k$-coupled system for the purely cooperative case $β_{ij}>0$, in higher dimension $N\ge 5$. Since the $k$-coupled case is much more delicate, we introduce an induction argument. We point out that the key idea is to give a more accurate upper bound of the least energy. It is interesting to see that the least energy of the $k$-coupled system decreases as $k$ grows. Moreover, we establish the existence of a positive least energy solution of the limit system in $\mathbb{R}^N$, as well as classification results.
Submitted 21 May, 2021;
originally announced May 2021.
-
DPR-CAE: Capsule Autoencoder with Dynamic Part Representation for Image Parsing
Authors:
Canqun Xiang,
Zhennan Wang,
Wenbin Zou,
Chen Xu
Abstract:
Parsing an image into a hierarchy of objects, parts, and relations is important and also challenging in many computer vision tasks. This paper proposes a simple and effective capsule autoencoder to address this issue, called DPR-CAE. In our approach, the encoder parses the input into a set of part capsules, including pose, intensity, and dynamic vector. The decoder introduces a novel dynamic part representation (DPR) by combining the dynamic vector and a shared template bank. These part representations are then regulated by the corresponding capsules to compose the final output in an interpretable way. Besides, an extra translation-invariant module is proposed to avoid directly learning the uncertain scene-part relationship in our DPR-CAE, which enables the resulting method to achieve a promising performance gain on $rm$-MNIST and $rm$-Fashion-MNIST. DPR-CAE can be easily combined with the existing stacked capsule autoencoder, and experimental results show that it significantly improves performance in terms of unsupervised object classification. Our code is available in the Appendix.
Submitted 6 September, 2021; v1 submitted 29 April, 2021;
originally announced April 2021.
-
Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
Authors:
Jianwei Sun,
Zhiyuan Tang,
Hengxin Yin,
Wei Wang,
Xi Zhao,
Shuaijiang Zhao,
Xiaoning Lei,
Wei Zou,
Xiangang Li
Abstract:
End-to-end models have gradually become the preferred option for automatic speech recognition (ASR) applications. During the training of end-to-end ASR, data augmentation is a quite effective technique for regularizing the neural networks. This paper proposes a novel data augmentation technique based on semantic transposition of the transcriptions via syntax rules for end-to-end Mandarin ASR. Spec…
▽ More
End-to-end models have gradually become the preferred option for automatic speech recognition (ASR) applications. During the training of end-to-end ASR, data augmentation is a quite effective technique for regularizing the neural networks. This paper proposes a novel data augmentation technique based on semantic transposition of the transcriptions via syntax rules for end-to-end Mandarin ASR. Specifically, we first segment the transcriptions based on part-of-speech tags. Then transposition strategies, such as placing the object in front of the subject or swap** the subject and the object, are applied on the segmented sentences. Finally, the acoustic features corresponding to the transposed transcription are reassembled based on the audio-to-text forced-alignment produced by a pre-trained ASR system. The combination of original data and augmented one is used for training a new ASR system. The experiments are conducted on the Transformer[2] and Conformer[3] based ASR. The results show that the proposed method can give consistent performance gain to the system. Augmentation related issues, such as comparison of different strategies and ratios for data combination are also investigated.
Submitted 26 April, 2021;
originally announced April 2021.
-
Enormous Berry-Curvature-Driven Anomalous Hall Effect in Topological Insulator (Bi,Sb)2Te3 on Ferrimagnetic Europium Iron Garnet beyond 400 K
Authors:
Wei-Jhih Zou,
Meng-Xin Guo,
Jyun-Fong Wong,
Zih-Ping Huang,
Jui-Min Chia,
Wei-Nien Chen,
Sheng-Xin Wang,
Keng-Yung Lin,
Lawrence Boyu Young,
Yen-Hsun Glen Lin,
Mohammad Yahyavi,
Chien-Ting Wu,
Horng-Tay Jeng,
Shang-Fan Lee,
Tay-Rong Chang,
Minghwei Hong,
Jueinai Kwo
Abstract:
To realize the quantum anomalous Hall effect (QAHE) at elevated temperatures, the magnetic proximity effect (MPE) was adopted to break time-reversal symmetry in heterostructures of the topological insulator (Bi0.3Sb0.7)2Te3 (BST) and the ferrimagnetic insulator europium iron garnet (EuIG) with perpendicular magnetic anisotropy. Here we demonstrate a phenomenally large anomalous Hall resistance (RAHE) exceeding 8 Ω (ρAHE of 3.2 μΩ·cm) at 300 K and sustaining to 400 K in 35 BST/EuIG samples, surpassing the past record of 0.28 Ω (ρAHE of 0.14 μΩ·cm) at 300 K. The remarkably large RAHE is attributed to an atomically abrupt, Fe-rich interface between BST and EuIG. Importantly, the gate dependence of the AHE loops shows no sign change with varying chemical potential. This observation is supported by our first-principles calculations, which apply a gradient Zeeman field plus a contact potential to BST. Our calculations further demonstrate that the AHE in this heterostructure is attributed to the intrinsic Berry curvature. Furthermore, for gate-biased 4 nm BST on EuIG, a pronounced topological Hall effect (THE) coexisting with the AHE is observed at negative top-gate voltage up to 15 K. Interface tuning combined with theoretical calculations has opened up new opportunities to realize topologically distinct phenomena in tailored magnetic TI-based heterostructures.
Submitted 30 September, 2021; v1 submitted 30 March, 2021;
originally announced March 2021.