Showing 1–23 of 23 results for author: Yip, J

  1. arXiv:2406.17395  [pdf, ps, other]

    math.CO

    The coherent rank of a graph with three eigenvalues

    Authors: Gary Greaves, Jose Yip

    Abstract: We characterise graphs that have three distinct eigenvalues and coherent ranks 8 and 9, linking the former to certain symmetric 2-designs and the latter to specific quasi-symmetric 2-designs. This characterisation leads to the discovery of a new biregular graph with three distinct eigenvalues. Additionally, we demonstrate that the coherent rank of a triregular graph with three distinct eigenvalues…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 36 pages

  2. arXiv:2406.14015  [pdf, other]

    cs.LG

    CohortNet: Empowering Cohort Discovery for Interpretable Healthcare Analytics

    Authors: Qingpeng Cai, Kaiping Zheng, H. V. Jagadish, Beng Chin Ooi, James Yip

    Abstract: Cohort studies are of significant importance in the field of healthcare analysis. However, existing methods typically involve manual, labor-intensive, and expert-driven pattern definitions or rely on simplistic clustering techniques that lack medical relevance. Automating cohort studies with interpretable patterns has great potential to facilitate healthcare analysis but remains an unmet need in p…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures

  3. arXiv:2406.12434  [pdf, other]

    cs.SD cs.LG eess.AS

    Towards Audio Codec-based Speech Separation

    Authors: Jia Qi Yip, Shengkui Zhao, Dianwen Ng, Eng Siong Chng, Bin Ma

    Abstract: Recent improvements in neural audio codec (NAC) models have generated interest in adopting pre-trained codecs for a variety of speech processing applications to take advantage of the efficiencies gained from high compression, but these have yet to be applied to the speech separation (SS) task. SS can benefit from high compression because the compute required for traditional SS models makes them imp…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: This paper was accepted by Interspeech 2024, Blue Sky Track

  4. arXiv:2406.02009  [pdf, other]

    eess.AS cs.CL cs.SD

    Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis

    Authors: Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Nguyen Trung Hieu, Jia Qi Yip, Bin Ma

    Abstract: Recent language model-based text-to-speech (TTS) frameworks demonstrate scalability and in-context learning capabilities. However, they suffer from robustness issues due to the accumulation of errors in speech unit predictions during autoregressive language modeling. In this paper, we propose a phonetic enhanced language modeling method to improve the performance of TTS models. We leverage self-su…

    Submitted 11 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  5. arXiv:2403.13985  [pdf, other]

    astro-ph.CO hep-th math.AT

    Cosmology with Persistent Homology: a Fisher Forecast

    Authors: Jacky H. T. Yip, Matteo Biagetti, Alex Cole, Karthik Viswanathan, Gary Shiu

    Abstract: Persistent homology naturally addresses the multi-scale topological characteristics of the large-scale structure as a distribution of clusters, loops, and voids. We apply this tool to the dark matter halo catalogs from the Quijote simulations, and build a summary statistic for comparison with the joint power spectrum and bispectrum statistic regarding their information content on cosmological para…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 24+16 pages, 22 figures

  6. arXiv:2312.11825  [pdf, other]

    cs.SD eess.AS

    MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

    Authors: Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jiaqi Yip, Dianwen Ng, Bin Ma

    Abstract: Our previously proposed MossFormer has achieved promising performance in monaural speech separation. However, it predominantly adopts a self-attention-based MossFormer module, which tends to emphasize longer-range, coarser-scale dependencies, with a deficiency in effectively modelling finer-scale recurrent patterns. In this paper, we introduce a novel hybrid model that provides the capabilities to…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, accepted by ICASSP 2024

  7. Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification

    Authors: Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng

    Abstract: Knowledge distillation (KD) is used to enhance automatic speaker verification performance by ensuring consistency between large teacher networks and lightweight student networks at the embedding level or label level. However, the conventional label-level KD overlooks the significant knowledge from non-target speakers, particularly their classification probabilities, which can be crucial for automa…

    Submitted 14 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted by ICASSP 2024

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024, pp. 10336-10340

  8. arXiv:2309.12608  [pdf, other]

    eess.AS cs.SD

    SPGM: Prioritizing Local Features for enhanced speech separation performance

    Authors: Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma

    Abstract: Dual-path is a popular architecture for speech separation models (e.g. Sepformer) which splits long sequences into overlapping chunks for its intra- and inter-blocks that separately model intra-chunk local features and inter-chunk global relationships. However, it has been found that inter-blocks, which comprise half a dual-path model's parameters, contribute minimally to performance. Thus, we pro…

    Submitted 10 March, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This paper was accepted by ICASSP 2024

  9. arXiv:2309.07466  [pdf, other]

    eess.AS cs.SD

    Codec Data Augmentation for Time-domain Heart Sound Classification

    Authors: Ansh Mishra, Jia Qi Yip, Eng Siong Chng

    Abstract: Heart auscultations are a low-cost and effective way of detecting valvular heart diseases early, which can save lives. Nevertheless, it has been difficult to scale this screening method since the effectiveness of auscultations is dependent on the skill of doctors. As such, there has been increasing research interest in the automatic classification of heart sounds using deep learning algorithms. Ho…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted by ICAICTA 2023

  10. arXiv:2309.07458  [pdf, other]

    cs.SD eess.AS

    Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures

    Authors: Jia Qi Yip, Dianwen Ng, Bin Ma, Chng Eng Siong

    Abstract: Despite recent strides made in Speech Separation, most models are trained on datasets with neutral emotions. Emotional speech has been known to degrade performance of models in a variety of speech tasks, which reduces the effectiveness of these models when deployed in real-world scenarios. In this paper we perform analysis to differentiate the performance degradation arising from the emotions in s…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted by APSIPA ASC 2023

  11. arXiv:2308.02636  [pdf, other]

    astro-ph.CO cs.LG math.AT

    Learning from Topology: Cosmological Parameter Estimation from the Large-scale Structure

    Authors: Jacky H. T. Yip, Adam Rouhiainen, Gary Shiu

    Abstract: The topology of the large-scale structure of the universe contains valuable information on the underlying cosmological parameters. While persistent homology can extract this topological information, the optimal method for parameter estimation from the tool remains an open question. To address this, we propose a neural network model to map persistence images to cosmological parameters. Through a pa…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: 7 pages, 4 figures. Accepted to the Synergy of Scientific and Machine Learning Modeling Workshop (ICML 2023)

  12. arXiv:2305.12121  [pdf, other]

    cs.SD cs.LG eess.AS

    ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention

    Authors: Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma

    Abstract: In this paper, we propose ACA-Net, a lightweight, global context-aware speaker embedding extractor for Speaker Verification (SV) that improves upon existing work by using Asymmetric Cross Attention (ACA) to replace temporal pooling. ACA is able to distill large, variable-length sequences into small, fixed-sized latents by attending a small query to large key and value matrices. In ACA-Net, we buil…

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted to INTERSPEECH 2023

  13. arXiv:2305.01170  [pdf, other]

    cs.SD eess.AS

    Contrastive Speech Mixup for Low-resource Keyword Spotting

    Authors: Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma

    Abstract: Most of the existing neural-based models for keyword spotting (KWS) in smart devices require thousands of training samples to learn a decent audio representation. However, with the rising demand for smart devices to become more personalized, KWS models need to adapt quickly to smaller user samples. To tackle this challenge, we propose a contrastive speech mixup (CosMix) learning algorithm for low-…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted by ICASSP 2023

  14. arXiv:2304.04468  [pdf, other]

    cs.LG cs.AI

    Toward Cohort Intelligence: A Universal Cohort Representation Learning Framework for Electronic Health Record Analysis

    Authors: Changshuo Liu, Wenqiao Zhang, Beng Chin Ooi, James Wei Luen Yip, Lingze Zeng, Kaiping Zheng

    Abstract: Electronic Health Records (EHR) are generated from clinical routine care recording valuable information of broad patient populations, which provide plentiful opportunities for improving patient management and intervention strategies in clinical practice. To exploit the enormous potential of EHR data, a popular EHR data analysis paradigm in machine learning is EHR representation learning, which fir…

    Submitted 12 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: 10 pages

  15. arXiv:2302.14597  [pdf, other]

    cs.SD eess.AS

    deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition

    Authors: Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma

    Abstract: Existing self-supervised pre-trained speech models have offered an effective way to leverage massive unannotated corpora to build good automatic speech recognition (ASR). However, many current models are trained on a clean corpus from a single source, which tends to do poorly when noise is present during testing. Nonetheless, it is crucial to overcome the adverse influence of noise for real-world…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  16. arXiv:2301.03829  [pdf, other]

    cs.LG cs.AI cs.CV cs.DB cs.MM

    From Plate to Prevention: A Dietary Nutrient-aided Platform for Health Promotion in Singapore

    Authors: Kaiping Zheng, Thao Nguyen, Jesslyn Hwei Sing Chong, Charlene Enhui Goh, Melanie Herschel, Hee Hoon Lee, Changshuo Liu, Beng Chin Ooi, Wei Wang, James Yip

    Abstract: Singapore has been striving to improve the provision of healthcare services to her people. In this course, the government has taken note of the deficiency in regulating and supervising people's nutrient intake, which is identified as a contributing factor to the development of chronic diseases. Consequently, this issue has garnered significant attention. In this paper, we share our experience in a…

    Submitted 28 March, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

  17. arXiv:2209.06360  [pdf, other]

    cs.SD eess.AS

    I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization

    Authors: Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma

    Abstract: Noise robustness in keyword spotting remains a challenge as many models fail to overcome the heavy influence of noises, causing the deterioration of the quality of feature embeddings. We proposed a contrastive regularization method called Inter-Intra Contrastive Regularization (I2CR) to improve the feature representations by guiding the model to learn the fundamental speech information specific to…

    Submitted 13 September, 2022; originally announced September 2022.

  18. arXiv:2208.00935  [pdf, other]

    q-bio.QM eess.AS

    Amino Acid Classification in 2D NMR Spectra via Acoustic Signal Embeddings

    Authors: Jia Qi Yip, Dianwen Ng, Bin Ma, Konstantin Pervushin, Eng Siong Chng

    Abstract: Nuclear Magnetic Resonance (NMR) is used in structural biology to experimentally determine the structure of proteins, which is used in many areas of biology and is an important part of drug development. Unfortunately, NMR data can cost thousands of dollars per sample to collect and it can take a specialist weeks to assign the observed resonances to specific chemical groups. There has thus been gro…

    Submitted 1 August, 2022; originally announced August 2022.

  19. arXiv:2206.10326  [pdf, other]

    cs.HC cs.AI cs.CV cs.DB cs.DC

    The Metaverse Data Deluge: What Can We Do About It?

    Authors: Beng Chin Ooi, Gang Chen, Mike Zheng Shou, Kian-Lee Tan, Anthony Tung, Xiaokui Xiao, James Wei Luen Yip, Meihui Zhang

    Abstract: In the Metaverse, the physical space and the virtual space co-exist, and interact simultaneously. While the physical space is virtually enhanced with information, the virtual space is continuously refreshed with real-time, real-world information. To allow users to process and manipulate information seamlessly between the real and digital spaces, novel technologies must be developed. These include…

    Submitted 10 November, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

  20. arXiv:1910.07813  [pdf, other]

    astro-ph.CO cs.LG physics.comp-ph

    From Dark Matter to Galaxies with Convolutional Neural Networks

    Authors: Jacky H. T. Yip, Xinyue Zhang, Yanfang Wang, Wei Zhang, Yueqiu Sun, Gabriella Contardo, Francisco Villaescusa-Navarro, Siyu He, Shy Genel, Shirley Ho

    Abstract: Cosmological simulations play an important role in the interpretation of astronomical data, in particular in comparing observed data to our theoretical expectations. However, to compare data with these simulations, the simulations in principle need to include gravity, magneto-hydrodynamics, radiative transfer, etc. These ideal large-volume simulations (gravo-magneto-hydrodynamical) are incredibly…

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: 5 pages, 2 figures. Accepted to the Second Workshop on Machine Learning and the Physical Sciences (NeurIPS 2019)

  21. Strong magnetic coupling in the hexagonal R5Pb3 compounds (R = Gd-Tm)

    Authors: Andrea Marcinkova, Clarina de la Cruz, Joshua Yip, Liang L. Zhao, Jiakui K. Wang, E. Svanidze, E. Morosan

    Abstract: We have synthesized R5Pb3 (R = Gd-Tm) compounds in polycrystalline form and performed structural analysis, magnetization, and neutron scattering measurements. For all R5Pb3 reported here the Weiss temperatures θW are several times smaller than the ordering temperatures TORD, while the latter are remarkably high (TORD up to 275 K for R = Gd) compared to other known R-M binaries (M = Si, Ge, Sn and…

    Submitted 24 October, 2014; originally announced October 2014.

  22. arXiv:1104.0063  [pdf]

    cond-mat.mtrl-sci physics.chem-ph physics.comp-ph

    Graphene Nucleation on Transition Metal Surface: Structure Transformation and Role of the Metal Step Edge

    Authors: Junfeng Gao, Joanne Yip, Jijun Zhao, Boris I. Yakobson, Feng Ding

    Abstract: The nucleation of graphene on a transition metal (TM) surface, either on a terrace or near a step edge, is systematically explored using density functional theory (DFT) calculations and applying the two-dimensional (2D) crystal nucleation theory. Careful optimization of the supported carbon clusters, CN (with size N ranging from 1 to 24), on the Ni(111) surface indicates a ground state structure t…

    Submitted 31 March, 2011; originally announced April 2011.

    Comments: 19 pages, 6 figures, accepted in Journal of the American Chemical Society

    Journal ref: J. Am. Chem. Soc., 2011, 133 (13), pp 5009-5015

  23. arXiv:0906.3772  [pdf]

    cs.SE cs.PL

    XML Data Integrity Based on Concatenated Hash Function

    Authors: Baolong Liu, Joan Lu, Jim Yip

    Abstract: Data integrity is fundamental to data authentication. A major problem for XML data authentication is that signed XML data can be copied to another document while the signature remains valid. This is caused by how XML data integrity is protected. Through investigation, the paper discovered that besides data content integrity, XML data integrity should also protect element location information, and co…

    Submitted 19 June, 2009; originally announced June 2009.

    Comments: 10 pages, International Journal of Computer Science and Information Security (IJCSIS)

    Journal ref: IJCSIS, May 2009, Vol. 1