
Showing 1–50 of 80 results for author: Ni, C

  1. arXiv:2406.19666  [pdf, other]

    cs.CV eess.IV

    CSAKD: Knowledge Distillation with Cross Self-Attention for Hyperspectral and Multispectral Image Fusion

    Authors: Chih-Chung Hsu, Chih-Chien Ni, Chia-Ming Lee, Li-Wei Kang

    Abstract: Hyperspectral imaging, capturing detailed spectral information for each pixel, is pivotal in diverse scientific and industrial applications. Yet, the acquisition of high-resolution (HR) hyperspectral images (HSIs) often needs to be addressed due to the hardware limitations of existing imaging systems. A prevalent workaround involves capturing both a high-resolution multispectral image (HR-MSI) and…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Submitted to TIP 2024

  2. arXiv:2406.15743  [pdf, other]

    cs.SE

    CasModaTest: A Cascaded and Model-agnostic Self-directed Framework for Unit Test Generation

    Authors: Chao Ni, Xiaoya Wang, Liushan Chen, Dehai Zhao, Zhengong Cai, Shaohua Wang, Xiaohu Yang

    Abstract: Though many machine learning (ML)-based unit testing generation approaches have been proposed and indeed achieved remarkable performance, they still have several limitations in effectiveness and practical usage. More precisely, existing ML-based approaches (1) generate partial content of a unit test, mainly focusing on test oracle generation; (2) mismatch the test prefix with the test oracle seman…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 14 pages, 7 figures

  3. MegaVul: A C/C++ Vulnerability Dataset with Comprehensive Code Representation

    Authors: Chao Ni, Liyu Shen, Xiaohu Yang, Yan Zhu, Shaohua Wang

    Abstract: We constructed a new large-scale and comprehensive C/C++ vulnerability dataset named MegaVul by crawling the Common Vulnerabilities and Exposures (CVE) database and CVE-related open-source projects. Specifically, we collected all crawlable descriptive information of the vulnerabilities from the CVE database and extracted all vulnerability-related code changes from 28 Git-based websites. We adopt…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures

  4. arXiv:2406.02609  [pdf, other]

    cs.LG cs.AI

    Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation

    Authors: Jiayao Tan, Fan Lyu, Chenggong Ni, Tingliang Feng, Fuyuan Hu, Zhang Zhang, Shaochuang Zhao, Liang Wang

    Abstract: Continual Test-Time Adaptation (CTTA) aims to adapt a pre-trained model to a sequence of target domains during the test phase without accessing the source data. To adapt to unlabeled data from unknown domains, existing methods rely on constructing pseudo-labels for all samples and updating the model through self-training. However, these pseudo-labels often involve noise, leading to insufficient ad…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.03335 by other authors

  5. arXiv:2406.02009  [pdf, other]

    eess.AS cs.CL cs.SD

    Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis

    Authors: Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Nguyen Trung Hieu, Jia Qi Yip, Bin Ma

    Abstract: Recent language model-based text-to-speech (TTS) frameworks demonstrate scalability and in-context learning capabilities. However, they suffer from robustness issues due to the accumulation of errors in speech unit predictions during autoregressive language modeling. In this paper, we propose a phonetic enhanced language modeling method to improve the performance of TTS models. We leverage self-su…

    Submitted 11 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  6. arXiv:2405.17385  [pdf, other]

    quant-ph cond-mat.mes-hall cond-mat.str-el

    Thermalization and Criticality on an Analog-Digital Quantum Simulator

    Authors: Trond I. Andersen, Nikita Astrakhantsev, Amir Karamlou, Julia Berndtsson, Johannes Motruk, Aaron Szasz, Jonathan A. Gross, Tom Westerhout, Yaxing Zhang, Ebrahim Forati, Dario Rossi, Bryce Kobrin, Agustin Di Paolo, Andrey R. Klots, Ilya Drozdov, Vladislav D. Kurilovich, Andre Petukhov, Lev B. Ioffe, Andreas Elben, Aniket Rath, Vittorio Vitale, Benoit Vermersch, Rajeev Acharya, Laleh Aghababaie Beni, Kyle Anderson , et al. (202 additional authors not shown)

    Abstract: Understanding how interacting particles approach thermal equilibrium is a major challenge of quantum simulators. Unlocking the full potential of such systems toward this goal requires flexible initial state preparation, precise time evolution, and extensive probes for final state characterization. We present a quantum simulator comprising 69 superconducting qubits which supports both universal qua…

    Submitted 27 May, 2024; originally announced May 2024.

  7. arXiv:2405.11196  [pdf, other]

    cs.SE

    Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large Language Models

    Authors: Yan Wang, Xiaoning Li, Tien Nguyen, Shaohua Wang, Chao Ni, Ling Ding

    Abstract: Pre-trained Large Language Models (LLM) have achieved remarkable successes in several domains. However, code-oriented LLMs are computationally heavy, with a cost that grows quadratically with the length of the input. Toward simplifying the input program of an LLM, the state-of-the-art approach filters the input code tokens based on the attention scores given by the LLM. The decision…

    Submitted 18 May, 2024; originally announced May 2024.

  8. arXiv:2404.02056  [pdf, other]

    cs.SE

    Multitask-based Evaluation of Open-Source LLM on Software Vulnerability

    Authors: Xin Yin, Chao Ni, Shaohua Wang

    Abstract: This paper proposes a pipeline for quantitatively evaluating interactive LLMs using publicly available datasets. We carry out an extensive technical evaluation of LLMs using Big-Vul covering four different common software vulnerability tasks. We evaluate the multitask and multilingual aspects of LLMs based on this dataset. We find that the existing state-of-the-art methods are generally superior t…

    Submitted 25 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  9. arXiv:2403.13219  [pdf, other]

    cs.LG math.OC

    Diffusion Model for Data-Driven Black-Box Optimization

    Authors: Zihao Li, Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Yinyu Ye, Minshuo Chen, Mengdi Wang

    Abstract: Generative AI has redefined artificial intelligence, enabling the creation of innovative content and customized solutions that drive business practices into a new era of efficiency and creativity. In this paper, we focus on diffusion models, a powerful generative AI technology, and investigate their potential for black-box optimization over complex structured variables. Consider the practical scen…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2307.07055

  10. arXiv:2403.00807  [pdf]

    cs.IR cs.CL cs.DC cs.DL

    Enhancing Cloud-Based Large Language Model Processing with Elasticsearch and Transformer Models

    Authors: Chunhe Ni, Jiang Wu, Hongbo Wang, Wenran Lu, Chenwei Zhang

    Abstract: Large Language Models (LLMs) are a class of generative AI models built using the Transformer network, capable of leveraging vast datasets to identify, summarize, translate, predict, and generate language. LLMs promise to revolutionize society, yet training these foundational models poses immense challenges. Semantic vector search within large language models is a potent technique that can signific…

    Submitted 24 February, 2024; originally announced March 2024.

  11. arXiv:2403.00806  [pdf]

    cs.IR cs.CE cs.CL cs.CV

    Enhanced User Interaction in Operating Systems through Machine Learning Language Models

    Authors: Chenwei Zhang, Wenran Lu, Chunhe Ni, Hongbo Wang, Jiang Wu

    Abstract: With the large language model showing human-like logical reasoning and understanding ability, whether agents based on the large language model can simulate the interaction behavior of real users, so as to build a reliable virtual recommendation A/B test scene to help the application of recommendation research is an urgent, important and economic value problem. The combination of interaction design…

    Submitted 24 February, 2024; originally announced March 2024.

  12. arXiv:2402.12916  [pdf]

    cs.LG cs.AI

    Data Pipeline Training: Integrating AutoML to Optimize the Data Flow of Machine Learning Models

    Authors: Jiang Wu, Hongbo Wang, Chunhe Ni, Chenwei Zhang, Wenran Lu

    Abstract: Data Pipeline plays an indispensable role in tasks such as modeling machine learning and developing data products. With the increasing diversification and complexity of Data sources, as well as the rapid growth of data volumes, building an efficient Data Pipeline has become crucial for improving work efficiency and solving complex problems. This paper focuses on exploring how to optimize data flow…

    Submitted 20 February, 2024; originally announced February 2024.

  13. arXiv:2402.00927  [pdf, other]

    astro-ph.HE astro-ph.GA

    Ordered magnetic fields around the 3C 84 central black hole

    Authors: G. F. Paraschos, J. -Y. Kim, M. Wielgus, J. Röder, T. P. Krichbaum, E. Ros, I. Agudo, I. Myserlis, M. Moscibrodzka, E. Traianou, J. A. Zensus, L. Blackburn, C. -K. Chan, S. Issaoun, M. Janssen, M. D. Johnson, V. L. Fish, K. Akiyama, A. Alberdi, W. Alef, J. C. Algaba, R. Anantua, K. Asada, R. Azulay, U. Bach , et al. (258 additional authors not shown)

    Abstract: 3C84 is a nearby radio source with a complex total intensity structure, showing linear polarisation and spectral patterns. A detailed investigation of the central engine region necessitates the use of VLBI above the hitherto available maximum frequency of 86GHz. Using ultrahigh resolution VLBI observations at the highest available frequency of 228GHz, we aim to directly detect compact structures a…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 15 pages, 6 figures, published in A&A

    Journal ref: Issue: A&A Volume 682, February 2024; Article number: L3; Number of pages: 15

  14. arXiv:2312.11825  [pdf, other]

    cs.SD eess.AS

    MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

    Authors: Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jiaqi Yip, Dianwen Ng, Bin Ma

    Abstract: Our previously proposed MossFormer has achieved promising performance in monaural speech separation. However, it predominantly adopts a self-attention-based MossFormer module, which tends to emphasize longer-range, coarser-scale dependencies, with a deficiency in effectively modelling finer-scale recurrent patterns. In this paper, we introduce a novel hybrid model that provides the capabilities to…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, accepted by ICASSP 2024

  15. arXiv:2311.10261  [pdf, other]

    cs.CV eess.SP

    Vision meets mmWave Radar: 3D Object Perception Benchmark for Autonomous Driving

    Authors: Yizhou Wang, Jen-Hao Cheng, Jui-Te Huang, Sheng-Yao Kuan, Qiqian Fu, Chiming Ni, Shengyu Hao, Gaoang Wang, Guanbin Xing, Hui Liu, Jenq-Neng Hwang

    Abstract: Sensor fusion is crucial for an accurate and robust perception system on autonomous vehicles. Most existing datasets and perception solutions focus on fusing cameras and LiDAR. However, the collaboration between camera and radar is significantly under-exploited. The incorporation of rich semantic information from the camera, and reliable 3D information from the radar can potentially achieve an eff…

    Submitted 16 November, 2023; originally announced November 2023.

  16. arXiv:2309.12608  [pdf, other]

    eess.AS cs.SD

    SPGM: Prioritizing Local Features for enhanced speech separation performance

    Authors: Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma

    Abstract: Dual-path is a popular architecture for speech separation models (e.g. Sepformer) which splits long sequences into overlapping chunks for its intra- and inter-blocks that separately model intra-chunk local features and inter-chunk global relationships. However, it has been found that inter-blocks, which comprise half a dual-path model's parameters, contribute minimally to performance. Thus, we pro…

    Submitted 10 March, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This paper was accepted by ICASSP 2024

  17. arXiv:2309.09413  [pdf, other]

    cs.SD eess.AS

    Are Soft Prompts Good Zero-shot Learners for Speech Recognition?

    Authors: Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter-Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma

    Abstract: Large self-supervised pre-trained speech models require computationally expensive fine-tuning for downstream tasks. Soft prompt tuning offers a simple parameter-efficient alternative by utilizing minimal soft prompt guidance, enhancing portability while also maintaining competitive performance. However, not many people understand how and why this is so. In this study, we aim to deepen our understa…

    Submitted 17 September, 2023; originally announced September 2023.

  18. arXiv:2309.00154  [pdf, other]

    cs.CY

    Learning From Peers: A Survey of Perception and Utilization of Online Peer Support Among Informal Dementia Caregivers

    Authors: Zhijun Yin, Lauren Stratton, Qingyuan Song, Congning Ni, Lijun Song, Patricia A. Commiskey, Qingxia Chen, Monica Moreno, Sam Fazio, Bradley A. Malin

    Abstract: Informal dementia caregivers are those who care for a person living with dementia (PLWD) without receiving payment (e.g., family members, friends, or other unpaid caregivers). These informal caregivers are subject to substantial mental, physical, and financial burdens. Online communities enable these caregivers to exchange caregiving strategies and communicate experiences with other caregivers who…

    Submitted 31 August, 2023; originally announced September 2023.

  19. arXiv:2308.15381  [pdf, other]

    astro-ph.HE astro-ph.IM

    A search for pulsars around Sgr A* in the first Event Horizon Telescope dataset

    Authors: Pablo Torne, Kuo Liu, Ralph P. Eatough, Jompoj Wongphechauxsorn, James M. Cordes, Gregory Desvignes, Mariafelicia De Laurentis, Michael Kramer, Scott M. Ransom, Shami Chatterjee, Robert Wharton, Ramesh Karuppusamy, Lindy Blackburn, Michael Janssen, Chi-kwan Chan, Geoffrey B. Crew, Lynn D. Matthews, Ciriaco Goddi, Helge Rottmann, Jan Wagner, Salvador Sanchez, Ignacio Ruiz, Federico Abbate, Geoffrey C. Bower, Juan J. Salamanca , et al. (261 additional authors not shown)

    Abstract: The Event Horizon Telescope (EHT) observed in 2017 the supermassive black hole at the center of the Milky Way, Sagittarius A* (Sgr A*), at a frequency of 228.1 GHz ($λ$=1.3 mm). The fundamental physics tests that even a single pulsar orbiting Sgr A* would enable motivate searching for pulsars in EHT datasets. The high observing frequency means that pulsars - which typically exhibit steep emission…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 33 pages, 7 figures, 6 Tables. Accepted for publication in ApJ

  20. arXiv:2308.14566  [pdf]

    physics.flu-dyn

    Parametric reduced order models with machine learning for spatial emulation of mixing and combustion problems

    Authors: Chenxu Ni, Siyu Ding, Xingjian Wang

    Abstract: High-fidelity simulations of mixing and combustion processes are generally computationally demanding and time-consuming, hindering their wide application in industrial design and optimization. The present study proposes parametric reduced order models (ROMs) to emulate spatial distributions of physical fields for multi-species mixing and combustion problems in a fast and accurate manner. The model…

    Submitted 28 August, 2023; originally announced August 2023.

  21. arXiv:2308.11237  [pdf, other]

    cs.SE

    Distinguishing Look-Alike Innocent and Vulnerable Code by Subtle Semantic Representation Learning and Explanation

    Authors: Chao Ni, Xin Yin, Kaiwen Yang, Dehai Zhao, Zhenchang Xing, Xin Xia

    Abstract: Though many deep learning (DL)-based vulnerability detection approaches have been proposed and indeed achieved remarkable performance, they still have limitations in the generalization as well as the practical usage. More precisely, existing DL-based approaches (1) perform negatively on prediction tasks among functions that are lexically similar but have contrary semantics; (2) provide no intuitiv…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted By FSE'23

  22. arXiv:2307.07055  [pdf, other]

    cs.LG

    Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement

    Authors: Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang

    Abstract: We explore the methodology and theory of reward-directed generation via conditional diffusion models. Directed generation aims to generate samples with desired properties as measured by a reward function, which has broad applications in generative AI, reinforcement learning, and computational biology. We consider the common learning scenario where the data set consists of unlabeled data along with…

    Submitted 13 July, 2023; originally announced July 2023.

  23. arXiv:2306.17106   

    physics.flu-dyn

    Flow Dynamics of a Dodecane Jet in Oxygen Crossflow at Supercritical Pressures

    Authors: Siyu Ding, Jiabin Li, Longfei Wang, Chenxu Ni, Xingjian Wang

    Abstract: In advanced aero-propulsion engines, kerosene is often injected into the combustor at supercritical pressures, where flow dynamics is distinct from the subcritical counterpart. Large-eddy simulation combined with real-fluid thermodynamics and transport theories of a N-dodecane jet in oxygen crossflow at supercritical pressures is presented. Liquid dodecane at 600 K is injected into a supercritical…

    Submitted 11 October, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 1. The citations "Cortelezzi & Karagozian 2001; Yang & Wang 2005" in Page2 were wrong, which are not "compressible". This would mislead the readers. 2. The Reynolds number announced in Page7 was wrong by an order of magnitude

  24. arXiv:2305.12121  [pdf, other]

    cs.SD cs.LG eess.AS

    ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention

    Authors: Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma

    Abstract: In this paper, we propose ACA-Net, a lightweight, global context-aware speaker embedding extractor for Speaker Verification (SV) that improves upon existing work by using Asymmetric Cross Attention (ACA) to replace temporal pooling. ACA is able to distill large, variable-length sequences into small, fixed-sized latents by attending a small query to large key and value matrices. In ACA-Net, we buil…

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted to INTERSPEECH 2023

  25. arXiv:2305.01170  [pdf, other]

    cs.SD eess.AS

    Contrastive Speech Mixup for Low-resource Keyword Spotting

    Authors: Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma

    Abstract: Most of the existing neural-based models for keyword spotting (KWS) in smart devices require thousands of training samples to learn a decent audio representation. However, with the rising demand for smart devices to become more personalized, KWS models need to adapt quickly to smaller user samples. To tackle this challenge, we propose a contrastive speech mixup (CosMix) learning algorithm for low-…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted by ICASSP 2023

  26. arXiv:2304.05297  [pdf, other]

    q-fin.PM cs.CE q-fin.CP

    Neural Network Approach to Portfolio Optimization with Leverage Constraints: a Case Study on High Inflation Investment

    Authors: Chendi Ni, Yuying Li, Peter A. Forsyth

    Abstract: Motivated by the current global high inflation scenario, we aim to discover a dynamic multi-period allocation strategy to optimally outperform a passive benchmark while adhering to a bounded leverage limit. To this end, we formulate an optimal control problem to outperform a benchmark portfolio throughout the investment horizon. Assuming the asset prices follow the jump-diffusion model during high…

    Submitted 24 May, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

  27. arXiv:2303.15124  [pdf, other]

    cs.CV cs.LG eess.IV

    Blind Inpainting with Object-aware Discrimination for Artificial Marker Removal

    Authors: Xuechen Guo, Wenhao Hu, Chiming Ni, Wenhao Chai, Shiyan Li, Gaoang Wang

    Abstract: Medical images often contain artificial markers added by doctors, which can negatively affect the accuracy of AI-based diagnosis. To address this issue and recover the missing visual contents, inpainting techniques are highly needed. However, existing inpainting methods require manual mask input, limiting their application scenarios. In this paper, we introduce a novel blind inpainting method that…

    Submitted 27 March, 2023; originally announced March 2023.

  28. Comparison of Polarized Radiative Transfer Codes used by the EHT Collaboration

    Authors: Ben S. Prather, Jason Dexter, Monika Moscibrodzka, Hung-Yi Pu, Thomas Bronzwaer, Jordy Davelaar, Ziri Younsi, Charles F. Gammie, Roman Gold, George N. Wong, Kazunori Akiyama, Antxon Alberdi, Walter Alef, Juan Carlos Algaba, Richard Anantua, Keiichi Asada, Rebecca Azulay, Uwe Bach, Anne-Kathrin Baczko, David Ball, Mislav Baloković, John Barrett, Michi Bauböck, Bradford A. Benson, Dan Bintley , et al. (248 additional authors not shown)

    Abstract: Interpretation of resolved polarized images of black holes by the Event Horizon Telescope (EHT) requires predictions of the polarized emission observable by an Earth-based instrument for a particular model of the black hole accretion system. Such predictions are generated by general relativistic radiative transfer (GRRT) codes, which integrate the equations of polarized radiative transfer in curve…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in ApJ

  29. arXiv:2303.07610  [pdf, other]

    cs.CL

    Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences

    Authors: Yunjie Ji, Yan Gong, Yiping Peng, Chao Ni, Peiyan Sun, Dongyu Pan, Baochang Ma, Xiangang Li

    Abstract: As a natural language assistant, ChatGPT is capable of performing various tasks, including but not limited to article generation, code completion, and data analysis. Furthermore, ChatGPT has consistently demonstrated a remarkable level of accuracy and reliability in terms of content evaluation, exhibiting the capability of mimicking human preferences. To further explore ChatGPT's potential in this…

    Submitted 13 March, 2023; originally announced March 2023.

  30. arXiv:2303.00179  [pdf, other]

    cs.LG cs.DC

    A Unified Momentum-based Paradigm of Decentralized SGD for Non-Convex Models and Heterogeneous Data

    Authors: Haizhou Du, Chengdong Ni

    Abstract: Emerging distributed applications recently boosted the development of decentralized machine learning, especially in IoT and edge computing fields. In real-world scenarios, the common problems of non-convexity and data heterogeneity result in inefficiency, performance degradation, and development stagnation. The bulk of studies concentrates on one of the issues mentioned above without having a more…

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: 24 pages

    ACM Class: I.2.11; I.2.6

  31. arXiv:2302.14597  [pdf, other]

    cs.SD eess.AS

    deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition

    Authors: Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma

    Abstract: Existing self-supervised pre-trained speech models have offered an effective way to leverage massive unannotated corpora to build good automatic speech recognition (ASR). However, many current models are trained on a clean corpus from a single source, which tends to do poorly when noise is present during testing. Nonetheless, it is crucial to overcome the adverse influence of noise for real-world…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  32. arXiv:2212.02544  [pdf, other]

    astro-ph.IM astro-ph.HE

    Probing Accretion Turbulence in the Galactic Center with EHT Polarimetry

    Authors: Chunchong Ni, Avery E. Broderick, Roman Gold

    Abstract: Magnetic fields grown by instabilities driven by differential rotation are believed to be essential to accretion onto black holes. These instabilities saturate in a turbulent state; therefore, the spatial and temporal variability in the horizon-resolving images of Sagittarius A* (Sgr A*) will be able to empirically assess this critical aspect of accretion theory. However, interstellar scattering b…

    Submitted 5 December, 2022; originally announced December 2022.

    Journal ref: ApJ 940 149 (2022)

  33. arXiv:2211.05941  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Negative GMR Effect in current perpendicular-to-plane (Zn,Cr)Te/Cu/Co spin salves

    Authors: W. G. Wang, C. Ni, L. R. Shah, X. M. Kou, J. Q. Xiao

    Abstract: Magnetic and transport properties are explored in the current perpendicular-to-plane (CPP) spin valves with Cr doped wide band gap semiconductor ZnTe as one of the ferromagnetic electrodes. A negative magnetoresistance is observed in these CPP spin valves at low temperature, with a strong temperature dependence. This effect can be explained by the large difference of spin scattering asymmetry coef…

    Submitted 10 November, 2022; originally announced November 2022.

  34. arXiv:2210.16976  [pdf, other]

    cs.LG stat.ML

    Representation Learning for General-sum Low-rank Markov Games

    Authors: Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Chi Jin, Mengdi Wang

    Abstract: We study multi-agent general-sum Markov games with nonlinear function approximation. We focus on low-rank Markov games whose transition matrix admits a hidden low-rank structure on top of an unknown non-linear representation. The goal is to design an algorithm that (1) finds an $\varepsilon$-equilibrium policy sample efficiently without prior knowledge of the environment or the representation, and…

    Submitted 30 October, 2022; originally announced October 2022.

  35. arXiv:2210.16727  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Spin-polarized transport in magnetic tunnel junctions with ZnTe barriers

    Authors: W. G. Wang, C. Ni, A. Ozbay, L. R. Shah, X. Fan, X. M. Kou, E. R. Nowak, J. Q. Xiao

    Abstract: Magnetic tunnel junctions with wide band gap semiconductor ZnTe barrier were fabricated. A very low barrier height and sizable magnetoresistance were observed in the Fe/ZnTe/Fe junctions at room temperature. The nonlinear I-V characteristic curve confirmed the observed magnetoresistance is due to spin-dependent tunneling effect. Temperature dependent study indicated that the total conductance of t…

    Submitted 29 October, 2022; originally announced October 2022.

  36. Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages

    Authors: Lei Wang, Rong Tong, Cheung Chi Leung, Sunil Sivadas, Chongjia Ni, Bin Ma

    Abstract: This paper provides an overall introduction of our Automatic Speech Recognition (ASR) systems for Southeast Asian languages. As not much existing work has been carried out on such regional languages, a few difficulties should be addressed before building the systems: limitation on speech and text resources, lack of linguistic knowledge, etc. This work takes Bahasa Indonesia and Thai as examples to…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Published by the 2017 IEEE International Conference on Orange Technologies (ICOT 2017)

    ACM Class: I.2.7

  37. arXiv:2209.06360  [pdf, other]

    cs.SD eess.AS

    I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization

    Authors: Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma

    Abstract: Noise robustness in keyword spotting remains a challenge as many models fail to overcome the heavy influence of noises, causing the deterioration of the quality of feature embeddings. We proposed a contrastive regularization method called Inter-Intra Contrastive Regularization (I2CR) to improve the feature representations by guiding the model to learn the fundamental speech information specific to…

    Submitted 13 September, 2022; originally announced September 2022.

  38. The Photon Ring in M87*

    Authors: Avery E. Broderick, Dominic W. Pesce, Paul Tiede, Hung-Yi Pu, Roman Gold, Richard Anantua, Silke Britzen, Chiara Ceccobello, Koushik Chatterjee, Yongjun Chen, Nicholas S. Conroy, Geoffrey B. Crew, Alejandro Cruz-Osorio, Yuzhu Cui, Sheperd S. Doeleman, Razieh Emami, Joseph Farah, Christian M. Fromm, Peter Galison, Boris Georgiev, Luis C. Ho, David J. James, Britton Jeter, Alejandra Jimenez-Rosales, Jun Yi Koay , et al. (26 additional authors not shown)

    Abstract: We report measurements of the gravitationally lensed secondary image -- the first in an infinite series of so-called "photon rings" -- around the supermassive black hole M87* via simultaneous modeling and imaging of the 2017 Event Horizon Telescope (EHT) observations. The inferred ring size remains constant across the seven days of the 2017 EHT observing campaign and is consistent with theoretical…

    Submitted 18 August, 2022; originally announced August 2022.

    Journal ref: ApJ, 935, 61 (2022)

  39. arXiv:2206.02092  [pdf, other]

    cs.LG stat.ML

    Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization

    Authors: Hui Yuan, Chengzhuo Ni, Huazheng Wang, Xuezhou Zhang, Le Cong, Csaba Szepesvári, Mengdi Wang

    Abstract: Directed Evolution (DE), a landmark wet-lab method that originated in the 1960s, enables discovery of novel protein designs via evolving a population of candidate sequences. Recent advances in biotechnology have made it possible to collect high-throughput data, allowing the use of machine learning to map out a protein's sequence-to-function relation. There is a growing interest in machine learning-assisted…

    Submitted 4 June, 2022; originally announced June 2022.

  40. arXiv:2205.03996  [pdf, other]

    cs.AR cs.CV cs.LG eess.IV

    Hardware-Robust In-RRAM-Computing for Object Detection

    Authors: Yu-Hsiang Chiang, Cheng En Ni, Yun Sung, Tuo-Hung Hou, Tian-Sheuan Chang, Shyh Jye Jou

    Abstract: In-memory computing is becoming a popular architecture for deep-learning hardware accelerators recently due to its highly parallel computing, low power, and low area cost. However, in-RRAM computing (IRC) suffered from large device variation and numerous nonideal effects in hardware. Although previous approaches including these effects in model training successfully improved variation tolerance, t…

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: 10 pages, 18 figures

  41. arXiv:2204.04856  [pdf, other]

    cs.SE

    Defect Identification, Categorization, and Repair: Better Together

    Authors: Chao Ni, Kaiwen Yang, Xin Xia, David Lo, Xiang Chen, Xiaohu Yang

    Abstract: Just-In-Time defect prediction (JIT-DP) models can identify defect-inducing commits at check-in time. Even though previous studies have achieved great progress, these studies still have the following limitations: 1) useful information (e.g., semantic information and structure information) is not fully used; 2) existing work can only predict a commit as a buggy one or a clean one without more inform…

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: 22 pages, 4 figures

  42. Fast and Compute-efficient Sampling-based Local Exploration Planning via Distribution Learning

    Authors: Lukas Schmid, Chao Ni, Yuliang Zhong, Roland Siegwart, Olov Andersson

    Abstract: Exploration is a fundamental problem in robotics. While sampling-based planners have shown high performance, they are oftentimes compute intensive and can exhibit high variance. To this end, we propose to directly learn the underlying distribution of informative views based on the spatial context in the robot's map. We further explore a variety of methods to also learn the information gain. We sho…

    Submitted 22 June, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: Accepted for IEEE RA-L. Open-source code: https://github.com/ethz-asl/cvae_exploration_planning, 8 pages, 12 figures

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7810-7817, July 2022

  43. arXiv:2202.04970  [pdf, ps, other]

    stat.ML cs.LG

    Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory

    Authors: Ruiqi Zhang, Xuezhou Zhang, Chengzhuo Ni, Mengdi Wang

    Abstract: Off-Policy Evaluation (OPE) serves as one of the cornerstones in Reinforcement Learning (RL). Fitted Q Evaluation (FQE) with various function approximators, especially deep neural networks, has gained practical success. While statistical analysis has proved FQE to be minimax-optimal with tabular, linear and several nonparametric function families, its practical performance with more general functi…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 39 pages

  44. arXiv:2202.00076  [pdf, other]

    stat.ML cs.LG

    Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration

    Authors: Chengzhuo Ni, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang, Mengdi Wang

    Abstract: Policy gradient (PG) estimation becomes a challenge when we are not allowed to sample with the target policy but only have access to a dataset generated by some unknown behavior policy. Conventional methods for off-policy PG estimation often suffer from either significant bias or exponentially large variance. In this paper, we propose the double Fitted PG estimation (FPG) algorithm. FPG can work w…

    Submitted 19 June, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

  45. arXiv:2111.03356  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.HE

    Event Horizon Telescope observations of the jet launching and collimation in Centaurus A

    Authors: Michael Janssen, Heino Falcke, Matthias Kadler, Eduardo Ros, Maciek Wielgus, Kazunori Akiyama, Mislav Baloković, Lindy Blackburn, Katherine L. Bouman, Andrew Chael, Chi-kwan Chan, Koushik Chatterjee, Jordy Davelaar, Philip G. Edwards, Christian M. Fromm, José L. Gómez, Ciriaco Goddi, Sara Issaoun, Michael D. Johnson, Junhan Kim, Jun Yi Koay, Thomas P. Krichbaum, Jun Liu, Elisabetta Liuzzo, Sera Markoff, et al. (215 additional authors not shown)

    Abstract: Very-long-baseline interferometry (VLBI) observations of active galactic nuclei at millimeter wavelengths have the power to reveal the launching and initial collimation region of extragalactic radio jets, down to $10-100$ gravitational radii ($r_g=GM/c^2$) scales in nearby sources. Centaurus A is the closest radio-loud source to Earth. It bridges the gap in mass and accretion rate between the supe…

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 27 pages, 9 figures. This is a post-peer-review, pre-copyedit version of an article published in Nature Astronomy. The final authenticated version is available online at: https://doi.org/10.1038/s41550-021-01417-w

    Journal ref: Nature Astronomy, July 2021, Volume 5, p. 1017-1028

  46. The Variability of the Black-Hole Image in M87 at the Dynamical Time Scale

    Authors: Kaushik Satapathy, Dimitrios Psaltis, Feryal Ozel, Lia Medeiros, Sean T. Dougall, Chi-kwan Chan, Maciek Wielgus, Ben S. Prather, George N. Wong, Charles F. Gammie, Kazunori Akiyama, Antxon Alberdi, Walter Alef, Juan Carlos Algaba, Richard Anantua, Keiichi Asada, Rebecca Azulay, Anne-Kathrin Baczko, David R. Ball, Mislav Baloković, John Barrett, Bradford A. Benson, Dan Bintley, Lindy Blackburn, Raymond Blundell, et al. (213 additional authors not shown)

    Abstract: The black-hole images obtained with the Event Horizon Telescope (EHT) are expected to be variable at the dynamical timescale near their horizons. For the black hole at the center of the M87 galaxy, this timescale (5-61 days) is comparable to the 6-day extent of the 2017 EHT observations. Closure phases along baseline triangles are robust interferometric observables that are sensitive to the expect…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted for Publication in ApJ

  47. arXiv:2110.08545  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    A Unified Speaker Adaptation Approach for ASR

    Authors: Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq Joty, Eng Siong Chng, Bin Ma

    Abstract: Transformer models have been used successfully in automatic speech recognition (ASR) and yield state-of-the-art results. However, their performance is still affected by speaker mismatch between training and test data. Further finetuning a trained model with target speaker data is the most natural approach for adaptation, but it takes a lot of compute and may cause catastrophic forgetting to the exi…

    Submitted 16 October, 2021; originally announced October 2021.

    Comments: Accepted by EMNLP 2021

  48. Constraints on black-hole charges with the 2017 EHT observations of M87*

    Authors: Prashant Kocherlakota, Luciano Rezzolla, Heino Falcke, Christian M. Fromm, Michael Kramer, Yosuke Mizuno, Antonios Nathanail, Hector Olivares, Ziri Younsi, Kazunori Akiyama, Antxon Alberdi, Walter Alef, Juan Carlos Algaba, Richard Anantua, Keiichi Asada, Rebecca Azulay, Anne-Kathrin Baczko, David Ball, Mislav Balokovic, John Barrett, Bradford A. Benson, Dan Bintley, Lindy Blackburn, Raymond Blundell, Wilfred Boland, et al. (212 additional authors not shown)

    Abstract: Our understanding of strong gravity near supermassive compact objects has recently improved thanks to the measurements made by the Event Horizon Telescope (EHT). We use here the M87* shadow size to infer constraints on the physical charges of a large variety of nonrotating or rotating black holes. For example, we show that the quality of the measurements is already sufficient to rule out that M87*…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 15 pages, 4 figures, published in PRD on May 19

  49. The Polarized Image of a Synchrotron Emitting Ring of Gas Orbiting a Black Hole

    Authors: Ramesh Narayan, Daniel C. M. Palumbo, Michael D. Johnson, Zachary Gelles, Elizabeth Himwich, Dominic O. Chang, Angelo Ricarte, Jason Dexter, Charles F. Gammie, Andrew A. Chael, The Event Horizon Telescope Collaboration, Kazunori Akiyama, Antxon Alberdi, Walter Alef, Juan Carlos Algaba, Richard Anantua, Keiichi Asada, Rebecca Azulay, Anne-Kathrin Baczko, David Ball, Mislav Balokovic, John Barrett, Bradford A. Benson, Dan Bintley, et al. (215 additional authors not shown)

    Abstract: Synchrotron radiation from hot gas near a black hole results in a polarized image. The image polarization is determined by effects including the orientation of the magnetic field in the emitting region, relativistic motion of the gas, strong gravitational lensing by the black hole, and parallel transport in the curved spacetime. We explore these effects using a simple model of an axisymmetric, equ…

    Submitted 13 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: 29 pages, 11 figures, published in ApJ on May 3

    Journal ref: ApJ 912 35 (2021)

  50. arXiv:2105.01136  [pdf, other]

    stat.ML cs.LG

    Learning Good State and Action Representations via Tensor Decomposition

    Authors: Chengzhuo Ni, Yaqi Duan, Munther Dahleh, Anru Zhang, Mengdi Wang

    Abstract: The transition kernel of a continuous-state-action Markov decision process (MDP) admits a natural tensor structure. This paper proposes a tensor-inspired unsupervised learning method to identify meaningful low-dimensional state and action representations from empirical trajectories. The method exploits the MDP's tensor structure by kernelization, importance sampling and low-Tucker-rank approximati…

    Submitted 19 February, 2023; v1 submitted 3 May, 2021; originally announced May 2021.