
Showing 1–27 of 27 results for author: Ke, D

  1. arXiv:2402.17450  [pdf]

    eess.SP

    Conformal Shield: A Novel Adversarial Attack Detection Framework for Automatic Modulation Classification

    Authors: Tailai Wen, Da Ke, Xiang Wang, Zhitao Huang

    Abstract: Deep learning algorithms have become an essential component in the field of cognitive radio, especially playing a pivotal role in automatic modulation classification. However, deep learning also presents risks and vulnerabilities. Despite their outstanding classification performance, they exhibit fragility when confronted with meticulously crafted adversarial examples, posing potential risks to the…

    Submitted 27 February, 2024; originally announced February 2024.

  2. arXiv:2312.11814  [pdf]

    physics.optics physics.app-ph

    Study on electromagnetically induced transparency effects in Dirac and VO$_2$ hybrid material structure

    Authors: Di Ke, Xie Meng, Xia Hua Rong, Cheng An Yu, Liu Yu, Du Jia Jia

    Abstract: In this paper, we present a metamaterial structure of Dirac and vanadium dioxide and investigate its optical properties using the finite-difference time-domain (FDTD) technique. Using the phase transition feature of vanadium dioxide, the design can realize active tuning of the PIT effect at terahertz frequency, thereby converting from a single PIT to a double PIT. When VO$_2$ is in the insulating…

    Submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2312.10358  [pdf, other]

    cs.CL cs.HC

    CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis

    Authors: Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li

    Abstract: Conversational speech synthesis (CSS) incorporates historical dialogue as supplementary information with the aim of generating speech that has dialogue-appropriate prosody. While previous methods have already delved into enhancing context comprehension, context representation still lacks effective representation capabilities and context-sensitive discriminability. In this paper, we introduce a con…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures, 3 tables, Accepted by ICASSP 2024

  4. arXiv:2310.05402  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Challenges for density functional theory in simulating metal-metal singlet bonding: a case study of dimerized VO2

    Authors: Yubo Zhang, Da Ke, Junxiong Wu, Chutong Zhang, Baichen Lin, Zuhuang Chen, John P. Perdew, Jianwei Sun

    Abstract: VO2 is renowned for its electric transition from an insulating monoclinic (M1) phase characterized by V-V dimerized structures, to a metallic rutile (R) phase above 340 Kelvin. This transition is accompanied by a magnetic change: the M1 phase exhibits a non-magnetic spin-singlet state, while the R phase exhibits a state with local magnetic moments. Simultaneous simulation of the structural, electr…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 14 pages, 6 figures

  5. Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis

    Authors: Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin

    Abstract: A regressive Text-to-Speech (TTS) system uses an attention mechanism to generate the alignment between the text and the acoustic feature sequence. Alignment determines synthesis robustness (e.g., the occurrence of skipping, repeating, and collapse) and rhythm via duration control. However, current attention algorithms used in speech synthesis cannot control rhythm using external duration information to generate…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, Published in: 2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP)

  6. arXiv:2303.15865  [pdf]

    eess.SP

    Chloride Ion Erosion of Pre-Stressed Concrete Bridges in Cold Regions

    Authors: Hongtao Cui, Yi Zhuo, Dongyuan Ke, Zhonglong Li, Shunlong Li

    Abstract: The erosion of chloride ions in concrete bridges will accelerate the corrosion of reinforcement, which is an important reason for the decline of bridge durability. The erosion process of chloride ion, especially deicing salt solution in cold regions, is complex and has many influencing factors. It is very important to use accurate and effective methods to analyze the chloride ion erosion process i…

    Submitted 28 March, 2023; originally announced March 2023.

  7. arXiv:2208.05228  [pdf]

    q-bio.QM q-bio.BM

    Current and perspective sensing methods for monkeypox virus: a reemerging zoonosis in its infancy

    Authors: Ijaz Gul, Changyue Liu, Yuan Xi, Zhicheng Du, Shiyao Zhai, Zhengyang Lei, Chen Qun, Muhammad Akmal Raheem, Qian He, Zhang Haihui, Canyang Zhang, Runming Wang, Sanyang Han, Du Ke, Peiwu Qin

    Abstract: Objectives: This review evaluates the current monkeypox virus (MPXV) detection methods, discusses their pros and cons, and provides recommended solutions to the problems. Methods: The literature for this review was identified through searches in PubMed, Web of Science, Google Scholar, ResearchGate, and Science Direct advanced search for articles published in English without any start dat…

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 36 pages, 5 figures, 1 table

  8. arXiv:2206.07289  [pdf, other]

    cs.SD cs.AI eess.AS

    Text-Aware End-to-end Mispronunciation Detection and Diagnosis

    Authors: Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang

    Abstract: Mispronunciation detection and diagnosis (MDD) technology is a key component of computer-assisted pronunciation training (CAPT) systems. In the field of assessing the pronunciation quality of constrained speech, the given transcriptions can play the role of a teacher. Conventional methods have fully utilized the prior texts for the model construction or improving the system performance, e.g. forced…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Rejected by Interspeech2022

  9. Backbone and shortest-path exponents of the two-dimensional $Q$-state Potts model

    Authors: Sheng Fang, Da Ke, Wei Zhong, Youjin Deng

    Abstract: We present a Monte Carlo study of the backbone and the shortest-path exponents of the two-dimensional $Q$-state Potts model in the Fortuin-Kasteleyn bond representation. We first use cluster algorithms to simulate the critical Potts model on the square lattice and obtain the backbone exponents $d_{\rm B} = 1.732 \, 0(3)$ and $1.794(2)$ for $Q=2,3$ respectively. However, for large $Q$, the study su…

    Submitted 19 April, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: 14 pages, 9 figures

    Journal ref: Phys. Rev. E 105, 044122 (2022)

  10. arXiv:2108.03008  [pdf, other]

    cs.SD cs.LG eess.AS

    An Empirical Study on End-to-End Singing Voice Synthesis with Encoder-Decoder Architectures

    Authors: Dengfeng Ke, Yuxing Lu, Xudong Liu, Yanyan Xu, Jing Sun, Cheng-Hao Cai

    Abstract: With the rapid development of neural network architectures and speech processing models, singing voice synthesis with neural networks is becoming the cutting-edge technique of digital music production. In this work, in order to explore how to improve the quality and efficiency of singing voice synthesis, we use encoder-decoder neural models and a number of vocoders to achieve singing…

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: 27 pages, 4 figures, 5 tables

  11. arXiv:2105.02509  [pdf, other]

    cs.SD cs.AI eess.AS

    Speech Enhancement using Separable Polling Attention and Global Layer Normalization followed with PReLU

    Authors: Dengfeng Ke, Jinsong Zhang, Yanlu Xie, Yanyan Xu, Binghuai Lin

    Abstract: Single-channel speech enhancement is a challenging task in the speech community. Recently, various neural-network-based methods have been applied to speech enhancement. Among these models, PHASEN and T-GSA achieve state-of-the-art performance on the publicly available VoiceBank+DEMAND corpus. Both models reach a COVL score of 3.62. PHASEN achieves the highest CSIG score of 4.21 while T-GSA get…

    Submitted 6 May, 2021; originally announced May 2021.

  12. arXiv:2104.08428  [pdf, other]

    cs.CL

    A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augmentation Techniques

    Authors: Kaiqi Fu, Jones Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang, Binghuai Lin

    Abstract: Recently, end-to-end mispronunciation detection and diagnosis (MD&D) systems have become a popular alternative to greatly simplify the model-building process of conventional hybrid DNN-HMM systems by representing complicated modules with a single deep network architecture. In this paper, in order to utilize the prior text in the end-to-end structure, we present a novel text-dependent model which is…

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: Submitted to INTERSPEECH2021

  13. arXiv:2009.12475  [pdf, ps, other]

    math.NT

    Extending Zeckendorf's Theorem to a Non-constant Recurrence and the Zeckendorf Game on this Non-constant Recurrence Relation

    Authors: Elżbieta Bołdyriew, Anna Cusenza, Linglong Dai, Pei Ding, Aidan Dunkelberg, John Haviland, Kate Huffman, Dianhui Ke, Daniel Kleber, Jason Kuretski, John Lentfer, Tianhao Luo, Steven J. Miller, Clayton Mizgerd, Vashisth Tiwari, Jingkai Ye, Yunhao Zhang, Xiaoyan Zheng, Weiduo Zhu

    Abstract: Zeckendorf's Theorem states that every positive integer can be uniquely represented as a sum of non-adjacent Fibonacci numbers, indexed from $1, 2, 3, 5,\ldots$. This has been generalized by many authors, in particular to constant coefficient fixed depth linear recurrences with positive (or in some cases non-negative) coefficients. In this work we extend this result to a recurrence with non-consta…

    Submitted 25 September, 2020; originally announced September 2020.

    Comments: 21 pages, 1 figure, from Zeckendorf Polymath REU and the Eureka Program

  14. arXiv:2009.09510  [pdf, ps, other]

    math.NT

    Bounds on Zeckendorf Games

    Authors: Anna Cusenza, Aiden Dunkelberg, Kate Huffman, Dianhui Ke, Micah McClatchey, Steven J. Miller, Clayton Mizgerd, Vashisth Tiwari, Jingkai Ye, Xiaoyan Zheng

    Abstract: Zeckendorf proved that every positive integer $n$ can be written uniquely as the sum of non-adjacent Fibonacci numbers. We use this decomposition to construct a two-player game. Given a fixed integer $n$ and an initial decomposition of $n=n F_1$, the two players alternate by using moves related to the recurrence relation $F_{n+1}=F_n+F_{n-1}$, and whoever moves last wins. The game always terminate…

    Submitted 20 September, 2020; originally announced September 2020.

    Comments: 15 pages, from Zeckendorf Polymath REU

  15. arXiv:2009.03708  [pdf, ps, other]

    math.NT

    Winning Strategy for the Multiplayer and Multialliance Zeckendorf Games

    Authors: Anna Cusenza, Aidan Dunkelberg, Kate Huffman, Dianhui Ke, Daniel Kleber, Steven J. Miller, Clayton Mizgerd, Vashisth Tiwari, Jingkai Ye, Xiaoyan Zheng

    Abstract: Edouard Zeckendorf proved that every positive integer $n$ can be uniquely written as the sum of non-adjacent Fibonacci numbers, known as the Zeckendorf decomposition. Based on Zeckendorf's decomposition, we have the Zeckendorf game for multiple players. We show that when the Zeckendorf game has at least $3$ players, none of the players have a winning strategy for $n\geq 5$. Then we exten…

    Submitted 20 October, 2020; v1 submitted 8 September, 2020; originally announced September 2020.

    Comments: 11 pages, from Zeckendorf Polymath REU; new version addresses minor typos, table of contents removed, inclusion of MSC subject code

  16. arXiv:2006.14563  [pdf, other]

    cs.CV cs.LG eess.AS stat.ML

    Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection

    Authors: Yongqiang Dou, Haocheng Yang, Maolin Yang, Yanyan Xu, Dengfeng Ke

    Abstract: It becomes urgent to design effective anti-spoofing algorithms for vulnerable automatic speaker verification systems due to the advancement of high-quality playback devices. Current studies mainly treat anti-spoofing as a binary classification problem between bonafide and spoofed utterances, while the lack of indistinguishable samples makes it difficult to train a robust spoofing detector. In this pap…

    Submitted 17 January, 2023; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: The 25th International Conference on Pattern Recognition (ICPR2020)

  17. arXiv:2006.09045  [pdf]

    cond-mat.mtrl-sci physics.chem-ph

    Effect of Cold Sintering Process (CSP) on the Electro-Chemo-Mechanical Properties of Gd-doped Ceria (GDC)

    Authors: Ahsanul Kabir, Daoyao Ke, Salvatore Grasso, Benoit Merle, Vincenzo Esposito

    Abstract: In this report, the effect of the cold sintering process (CSP) on the electro-chemo-mechanical properties of 10 mol% Gd-doped ceria (GDC) is investigated. High-purity nanoscale GDC powder is sintered via CSP in pure water followed by post-annealing at 1000 °C. The resultant CSP ceramics exhibit high relative density (~92%) with an ultrafine grain size of ~100 nm. This s…

    Submitted 16 June, 2020; originally announced June 2020.

  18. arXiv:2005.10803  [pdf, other]

    eess.AS cs.SD

    Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism

    Authors: Wang Dai, Jinsong Zhang, Yingming Gao, Wei Wei, Dengfeng Ke, Binghuai Lin, Yanlu Xie

    Abstract: Formant tracking is one of the most fundamental problems in speech processing. Traditionally, formants are estimated using signal processing methods. Recent studies showed that generic convolutional architectures can outperform recurrent networks on temporal tasks such as speech synthesis and machine translation. In this paper, we explored the use of Temporal Convolutional Network (TCN) for forman…

    Submitted 8 August, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted by Interspeech 2020

  19. arXiv:1904.08138  [pdf, other]

    cs.CL cs.SD eess.AS

    Complementary Fusion of Multi-Features and Multi-Modalities in Sentiment Analysis

    Authors: Feiyang Chen, Ziqian Luo, Yanyan Xu, Dengfeng Ke

    Abstract: Sentiment analysis, mostly based on text, has been rapidly developing in the last decade and has attracted widespread attention in both academia and industry. However, the information in the real world usually comes from multiple modalities, such as audio and text. Therefore, in this paper, based on audio and text, we consider the task of multimodal sentiment analysis and propose a novel fusion st…

    Submitted 11 December, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: Accepted by AAAI2020 Workshop: AffCon2020

  20. arXiv:1811.01244  [pdf, ps, other]

    math.AP

    Regularity and stability analysis for a class of semilinear nonlocal differential equations in Hilbert spaces

    Authors: Tran Dinh Ke, Nguyen Nhu Thang, Lam Tran Phuong Thuy

    Abstract: We deal with a class of semilinear nonlocal differential equations in Hilbert spaces which is a general model for some anomalous diffusion equations. By using the theory of integral equations with completely positive kernel together with local estimates, some existence, regularity and stability results are established. An application to nonlocal partial differential equations is shown to demonstra…

    Submitted 6 December, 2018; v1 submitted 3 November, 2018; originally announced November 2018.

  21. arXiv:1805.01357  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training

    Authors: Bin Liu, Shuai Nie, Yaping Zhang, Dengfeng Ke, Shan Liang, Wenju Liu

    Abstract: In realistic environments, speech is usually corrupted by various noise and reverberation, which dramatically degrades the performance of automatic speech recognition (ASR) systems. To alleviate this issue, the most common way is to use a well-designed speech enhancement approach as the front-end of ASR. However, more complex pipelines, more computations and even higher hardware costs (microphone a…

    Submitted 2 May, 2018; originally announced May 2018.

  22. Trainable back-propagated functional transfer matrices

    Authors: Cheng-Hao Cai, Yanyan Xu, Dengfeng Ke, Kaile Su, Jing Sun

    Abstract: Connections between nodes of fully connected neural networks are usually represented by weight matrices. In this article, functional transfer matrices are introduced as alternatives to the weight matrices: Instead of using real weights, a functional transfer matrix uses real functions with trainable parameters to represent connections between nodes. Multiple functional transfer matrices are then s…

    Submitted 28 October, 2017; originally announced October 2017.

    Comments: 39 pages, 4 figures, submitted as a journal article

    Journal ref: Appl. Intell. (2018)

  23. arXiv:1708.05878  [pdf, ps, other]

    cs.IR cs.AI

    Event-Radar: Real-time Local Event Detection System for Geo-Tagged Tweet Streams

    Authors: Sibo Zhang, Yuan Cheng, Deyuan Ke

    Abstract: Local event detection uses geotagged messages posted on social networks to reveal related ongoing events and their locations. Recent studies have demonstrated that the geo-tagged tweet stream serves as an unprecedentedly valuable source for local event detection. Nevertheless, how to effectively extract local events from large geo-tagged tweet streams in real time remains challeng…

    Submitted 5 October, 2017; v1 submitted 19 August, 2017; originally announced August 2017.

    Comments: 10 pages

  24. arXiv:1706.09995  [pdf]

    math.OC

    Stochastic Dynamic Optimal Power Flow in Distribution Network with Distributed Renewable Energy and Battery Energy Storage

    Authors: Chenghui Tang, Jian Xu, Yuanzhang Sun, Siyang Liao, Deping Ke, Xiong Li

    Abstract: The penetration of distributed renewable energy (DRE) greatly raises the risks of distribution network operation, such as those related to peak shaving and voltage stability. Battery energy storage (BES) has been widely accepted as the most promising application to cope with the challenge of high penetration of DRE. To cope with the uncertainties and variability of DRE, a stochastic day-ahead dynamic optimal power f…

    Submitted 29 June, 2017; originally announced June 2017.

  25. Learning of Human-like Algebraic Reasoning Using Deep Feedforward Neural Networks

    Authors: Cheng-Hao Cai, Dengfeng Ke, Yanyan Xu, Kaile Su

    Abstract: There is a wide gap between symbolic reasoning and deep learning. In this research, we explore the possibility of using deep learning to improve symbolic reasoning. Briefly, in a reasoning system, a deep feedforward neural network is used to guide rewriting processes after learning from algebraic reasoning examples produced by humans. To enable the neural network to recognise patterns of algebraic…

    Submitted 24 April, 2017; originally announced April 2017.

    Comments: 8 pages, 7 figures

    ACM Class: I.2.0; I.2.3; I.2.4; I.2.6; I.2.8; I.5.0; I.5.1; I.5.2; I.5.4; F.4.1

  26. arXiv:nlin/0603016  [pdf]

    nlin.CD

    Lattice complexity and fine graining of symbolic sequence

    Authors: Da-Guan Ke, Hong Zhang, Qin-Ye Tong

    Abstract: A new complexity measure named Lattice Complexity is presented for finite symbolic sequences. This measure is based on the symbolic dynamics of one-dimensional iterative maps and Lempel-Ziv Complexity. To make Lattice Complexity distinguishable from Lempel-Ziv Complexity, an approach called the fine-graining process is also proposed. When the control parameter fine-graining order is small enough,…

    Submitted 5 April, 2008; v1 submitted 9 March, 2006; originally announced March 2006.

    Comments: 16 pages, 8 figures; a revised English version of an article published in Chinese

    Journal ref: D. G. Ke, H. Zhang, Q. Y. Tong, Acta Physica Sinica 2005 54: 534

  27. arXiv:nlin/0505052  [pdf]

    nlin.CD

    Easily Adaptable Complexity Measure for Finite Time Series

    Authors: Da-Guan Ke, Qin-Ye Tong

    Abstract: We present a complexity measure for any finite time series. This measure has invariance under any monotonic transformation of the time series, has a degree of robustness against noise, and has the adaptability of satisfying almost all the widely accepted but conflicting criteria for complexity measurements. Surprisingly, the measure is developed from Kolmogorov complexity, which is traditionally…

    Submitted 25 November, 2008; v1 submitted 23 May, 2005; originally announced May 2005.

    Comments: 15 pages, 3 figures, 1 table; modifications make crucial points clearer and improve readability; completely rewritten

    Journal ref: Phys. Rev. E 77, 066215 (2008)