Skip to main content

Showing 1–50 of 258 results for author: Wong, F

.
  1. arXiv:2407.08733  [pdf, other

    cs.CL

    Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

    Authors: Zihao Zhou, Shudong Liu, Maizhen Ning, Wei Liu, **dong Wang, Derek F. Wong, Xiaowei Huang, Qiufeng Wang, Kaizhu Huang

    Abstract: Exceptional mathematical reasoning ability is one of the key features that demonstrate the power of large language models (LLMs). How to comprehensively define and evaluate the mathematical abilities of LLMs, and even reflect the user experience in real-world scenarios, has emerged as a critical issue. Current benchmarks predominantly concentrate on problem-solving capabilities, which presents a s… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 35 pages, 10 figures, preprint

  2. arXiv:2407.08728  [pdf, other

    gr-qc astro-ph.IM

    The Potential Impact of Noise Correlation in Next-generation Gravitational Wave Detectors

    Authors: Isaac C. F. Wong, Peter T. H. Pang, Milan Wils, Francesco Cireddu, Walter Del Pozzo, Tjonnie G. F. Li

    Abstract: Building upon the statistical formulation for parameter estimation in the presence of correlated noise proposed by Cireddu et al., we present an initial study to incorporate the effects of correlated noise into the analyses of various detector designs' performance. We consider a two L-shaped detector configuration located in the European Union, and compare the expectation of parameter estimation b… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 10 pages, 2 figures

  3. arXiv:2406.13572  [pdf, other

    quant-ph

    Entanglement source and quantum memory analysis for zero added-loss multiplexing

    Authors: Jeffrey H. Shapiro, Michael G. Raymer, Clark Embleton, Franco N. C. Wong, Brian J. Smith

    Abstract: High-rate, high-fidelity entanglement distribution is essential to the creation of a quantum internet, but recent achievements in fiber and satellite-based entanglement distribution fall far short of what is needed. Chen et al. [Phys. Rev. Appl. 19, 054209 (2023)] proposed a means for dramatically increasing entanglement-distribution rates via zero added-loss multiplexing (ZALM). ZALM's quantum tr… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 26 pages, 15 figure, 1 table

  4. arXiv:2406.11432  [pdf, other

    cs.CV cs.AI

    AnyTrans: Translate AnyText in the Image with Large Scale Models

    Authors: Zhipeng Qian, Pei Zhang, Baosong Yang, Kai Fan, Yiwei Ma, Derek F. Wong, Xiaoshuai Sun, Rongrong Ji

    Abstract: This paper introduces AnyTrans, an all-encompassing framework for the task-Translate AnyText in the Image (TATI), which includes multilingual text translation and text fusion within images. Our framework leverages the strengths of large-scale models, such as Large Language Models (LLMs) and text-guided diffusion models, to incorporate contextual cues from both textual and visual elements during tr… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.07054  [pdf, other

    cs.CL cs.AI

    CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation

    Authors: Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang

    Abstract: In recent years, instruction fine-tuning (IFT) on large language models (LLMs) has garnered considerable attention to enhance model performance on unseen tasks. Attempts have been made on automatic construction and effective selection for IFT data. However, we posit that previous methods have not fully harnessed the potential of LLMs for enhancing data quality. The responses within IFT data could… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2406.03450  [pdf, other

    cs.CL cs.AI

    What is the Best Way for ChatGPT to Translate Poetry?

    Authors: Shanshan Wang, Derek F. Wong, **gming Yao, Lidia S. Chao

    Abstract: Machine translation (MT) has historically faced significant challenges when applied to literary works, particularly in the domain of poetry translation. The advent of Large Language Models such as ChatGPT holds potential for innovation in this field. This study examines ChatGPT's capabilities in English-Chinese poetry translation tasks, utilizing targeted prompts and small sample scenarios to asce… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 19 pages, 1 figure. The paper has been accepted by ACL 2024(Main Conference)

  7. arXiv:2406.00839  [pdf, other

    cs.CL cs.AI

    FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models

    Authors: Kaixin Lan, Tao Fang, Derek F. Wong, Yabo Xu, Lidia S. Chao, Cecilia G. Zhao

    Abstract: Pre-trained Language Models (PLMs) have shown impressive results in various Natural Language Generation (NLG) tasks, such as powering chatbots and generating stories. However, an ethical concern arises due to their potential to produce verbatim copies of paragraphs from their training data. This is problematic as PLMs are trained on corpora constructed by human authors. As such, there is a pressin… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures. The paper has been accepted by ACL 2024 (Findings), with Kaixin Lan and Tao Fang contributing equally, and Derek F. Wong serving as the corresponding author

  8. arXiv:2405.14039  [pdf, other

    cs.CL cs.AI cs.LG

    Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning

    Authors: Yiming Wang, Pei Zhang, Baosong Yang, Derek F. Wong, Zhuosheng Zhang, Rui Wang

    Abstract: Real-world data deviating from the independent and identically distributed (i.i.d.) assumption of in-distribution training data poses security threats to deep networks, thus advancing out-of-distribution (OOD) detection algorithms. Detection methods in generative language models (GLMs) mainly focus on uncertainty estimation and embedding distance measurement, with the latter proven to be most effe… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 27 pages, 6 figures, 12 tables

  9. arXiv:2405.04286  [pdf, other

    cs.CL

    Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore

    Authors: Junchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xuebo Liu, Lidia S. Chao, Min Zhang

    Abstract: The efficacy of an large language model (LLM) generated text detector depends substantially on the availability of sizable training data. White-box zero-shot detectors, which require no such data, are nonetheless limited by the accessibility of the source model of the LLM-generated text. In this paper, we propose an simple but effective black-box zero-shot detection approach, predicated on the obs… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  10. arXiv:2405.02925  [pdf, other

    cs.CL

    A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU

    Authors: Guanhua Chen, Yutong Yao, Derek F. Wong, Lidia S. Chao

    Abstract: Multi-intent natural language understanding (NLU) presents a formidable challenge due to the model confusion arising from multiple intents within a single utterance. While previous works train the model contrastively to increase the margin between different multi-intent labels, they are less suited to the nuances of multi-intent NLU. They ignore the rich information between the shared intents, whi… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: LREC-COLING 2024

  11. arXiv:2404.18413  [pdf, other

    cs.CV cs.AI

    3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset

    Authors: Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia S. Chao, Dacheng Tao, Min Zhang

    Abstract: Multimodal machine translation (MMT) is a challenging task that seeks to improve translation quality by incorporating visual information. However, recent studies have indicated that the visual information provided by existing MMT datasets is insufficient, causing models to disregard it and overestimate their capabilities. This issue presents a significant obstacle to the development of MMT researc… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  12. arXiv:2404.16766  [pdf, other

    cs.CL cs.AI

    Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model

    Authors: Runzhe Zhan, Xinyi Yang, Derek F. Wong, Lidia S. Chao, Yue Zhang

    Abstract: While supervised fine-tuning (SFT) has been a straightforward approach for tailoring the output of foundation large language model (LLM) to specific preferences, concerns have been raised about the depth of this alignment, with some critiques suggesting it is merely "superficial". We critically examine this hypothesis within the scope of cross-lingual generation tasks, proposing that the effective… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  13. arXiv:2403.11621  [pdf, other

    cs.CL

    Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model

    Authors: Haoyun Xu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao

    Abstract: Large Language Models (LLMs) are composed of neurons that exhibit various behaviors and roles, which become increasingly diversified as models scale. Recent studies have revealed that not all neurons are active across different datasets, and this sparsity correlates positively with the task-specific ability, leading to advancements in model pruning and training efficiency. Traditional fine-tuning… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  14. arXiv:2403.03004  [pdf, other

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  15. arXiv:2402.16705  [pdf, other

    cs.CL cs.AI cs.LG

    SelectIT: Selective Instruction Tuning for Large Language Models via Uncertainty-Aware Self-Reflection

    Authors: Liangxin Liu, Xuebo Liu, Derek F. Wong, Dongfang Li, Ziyi Wang, Baotian Hu, Min Zhang

    Abstract: Instruction tuning (IT) is crucial to tailoring large language models (LLMs) towards human-centric interactions. Recent advancements have shown that the careful selection of a small, high-quality subset of IT data can significantly enhance the performance of LLMs. Despite this, common approaches often rely on additional models or data sets, which increases costs and limits widespread adoption. In… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  16. arXiv:2402.15903  [pdf, other

    cs.LG cs.AI cs.NI

    ESFL: Efficient Split Federated Learning over Resource-Constrained Heterogeneous Wireless Devices

    Authors: Guangyu Zhu, Yiqin Deng, Xianhao Chen, Haixia Zhang, Yuguang Fang, Tan F. Wong

    Abstract: Federated learning (FL) allows multiple parties (distributed devices) to train a machine learning model without sharing raw data. How to effectively and efficiently utilize the resources on devices and the central server is a highly interesting yet challenging problem. In this paper, we propose an efficient split federated learning algorithm (ESFL) to take full advantage of the powerful computing… ▽ More

    Submitted 16 April, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  17. arXiv:2402.07616  [pdf, other

    cs.CL cs.AI

    Anchor-based Large Language Models

    Authors: Jianhui Pang, Fanghua Ye, Derek Fai Wong, Xin He, Wanshun Chen, Longyue Wang

    Abstract: Large language models (LLMs) predominantly employ decoder-only transformer architectures, necessitating the retention of keys/values information for historical tokens to provide contextual information and avoid redundant computation. However, the substantial size and parameter volume of these LLMs require massive GPU memory. This memory demand increases with the length of the input text, leading t… ▽ More

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: The paper has been accepted by the ACL2024 conference. Work was done when Jianhui Pang and Fanghua Ye were interning at Tencent AI Lab

  18. arXiv:2401.12794  [pdf, other

    cs.CL

    Benchmarking LLMs via Uncertainty Quantification

    Authors: Fanghua Ye, Mingming Yang, Jianhui Pang, Longyue Wang, Derek F. Wong, Emine Yilmaz, Shuming Shi, Zhaopeng Tu

    Abstract: The proliferation of open-source Large Language Models (LLMs) from various institutions has highlighted the urgent need for comprehensive evaluation methods. However, current evaluation platforms, such as the widely recognized HuggingFace open LLM leaderboard, neglect a crucial aspect -- uncertainty, which is vital for thoroughly assessing LLMs. To bridge this gap, we introduce a new benchmarking… ▽ More

    Submitted 25 April, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 25 pages, preprints

  19. arXiv:2401.09253  [pdf, other

    quant-ph

    The generative quantum eigensolver (GQE) and its application for ground state search

    Authors: Kouhei Nakaji, Lasse Bjørn Kristensen, Jorge A. Campos-Gonzalez-Angulo, Mohammad Ghazi Vakili, Haozhe Huang, Mohsen Bagherimehrab, Christoph Gorgulla, FuTe Wong, Alex McCaskey, **-Sung Kim, Thien Nguyen, Pooja Rao, Alan Aspuru-Guzik

    Abstract: We introduce the generative quantum eigensolver (GQE), a novel method for applying classical generative models for quantum simulation. The GQE algorithm optimizes a classical generative model to produce quantum circuits with desired properties. Here, we develop a transformer-based implementation, which we name the generative pre-trained transformer-based (GPT) quantum eigensolver (GPT-QE), leverag… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 16 pages, 7 figures

  20. arXiv:2401.08350  [pdf, other

    cs.CL

    Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models

    Authors: Jianhui Pang, Fanghua Ye, Longyue Wang, Dian Yu, Derek F. Wong, Shuming Shi, Zhaopeng Tu

    Abstract: The evolution of Neural Machine Translation (NMT) has been significantly influenced by six core challenges (Koehn and Knowles, 2017), which have acted as benchmarks for progress in this field. This study revisits these challenges, offering insights into their ongoing relevance in the context of advanced Large Language Models (LLMs): domain mismatch, amount of parallel data, rare word prediction, t… ▽ More

    Submitted 17 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 17 pages. Longyue Wang is the Corresponding Author

  21. arXiv:2312.14614  [pdf, other

    gr-qc astro-ph.IM

    Likelihood for a Network of Gravitational-Wave Detectors with Correlated Noise

    Authors: Francesco Cireddu, Milan Wils, Isaac C. F. Wong, Peter T. H. Pang, Tjonnie G. F. Li, Walter Del Pozzo

    Abstract: The Einstein Telescope faces a critical data analysis challenge with correlated noise, often overlooked in current parameter estimation analyses. We address this issue by presenting the statistical formulation of the likelihood function that includes correlated noise for the Einstein Telescope or any detector network. Neglecting these correlations may significantly reduce parameter estimation accu… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 7 pages, 3 figures

  22. arXiv:2312.00933  [pdf, other

    cs.IT

    Privacy Preserving Event Detection

    Authors: Xiaoshan Wang, Tan F. Wong

    Abstract: This paper presents a privacy-preserving event detection scheme based on measurements made by a network of sensors. A diameter-like decision statistic made up of the marginal types of the measurements observed by the sensors is employed. The proposed detection scheme can achieve the best type-I error exponent as the type-II error rate is required to be negligible. Detection performance with finite… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 26 pages, 9 figures, submitted to IEEE Transactions on Information Theory

  23. arXiv:2310.14724  [pdf, other

    cs.CL cs.AI

    A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

    Authors: Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Derek F. Wong, Lidia S. Chao

    Abstract: The powerful ability to understand, follow, and generate complex language emerging from large language models (LLMs) makes LLM-generated text flood many areas of our daily lives at an incredible speed and is widely accepted by humans. As LLMs continue to expand, there is an imperative need to develop detectors that can detect LLM-generated text. This is crucial to mitigate potential misuse of LLMs… ▽ More

    Submitted 19 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  24. arXiv:2310.08908  [pdf, other

    cs.CL

    Human-in-the-loop Machine Translation with Large Language Model

    Authors: Xinyi Yang, Runzhe Zhan, Derek F. Wong, Junchao Wu, Lidia S. Chao

    Abstract: The large language model (LLM) has garnered significant attention due to its in-context learning mechanisms and emergent capabilities. The research community has conducted several pilot studies to apply LLMs to machine translation tasks and evaluate their performance from diverse perspectives. However, previous research has primarily focused on the LLM itself and has not explored human interventio… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted to MT Summit 2023

  25. arXiv:2309.05234  [pdf

    quant-ph

    High-dimensional time-frequency entanglement in a singly-filtered biphoton frequency comb

    Authors: Xiang Cheng, Kai-Chi Chang, Murat Can Sarihan, Andrew Mueller, Maria Spiropulu, Matthew D. Shaw, Boris Korzh, Andrei Faraon, Franco N. C. Wong, Jeffrey H. Shapiro, Chee Wei Wong

    Abstract: High-dimensional quantum entanglement is a cornerstone for advanced technology enabling large-scale noise-tolerant quantum systems, fault-tolerant quantum computing, and distributed quantum networks. The recently developed biphoton frequency comb (BFC) provides a powerful platform for high-dimensional quantum information processing in its spectral and temporal quantum modes. Here we propose and ge… ▽ More

    Submitted 11 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 30 pages, 4 figures

  26. arXiv:2308.13666  [pdf, other

    astro-ph.HE

    A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run

    Authors: C. Fletcher, J. Wood, R. Hamburg, P. Veres, C. M. Hui, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. M. Giles, A. Goldstein, B. A. Hristov, D. Kocevski, S. Lesage, B. Mailyan, C. Malacaria, S. Poolakkil, A. von Kienlin, C. A. Wilson-Hodge, The Fermi Gamma-ray Burst Monitor Team, M. Crnogorčević, J. DeLaunay, A. Tohuvavohu, R. Caputo, S. B. Cenko , et al. (1674 additional authors not shown)

    Abstract: We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses,… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  27. arXiv:2308.03822  [pdf, other

    astro-ph.HE

    Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi , et al. (1750 additional authors not shown)

    Abstract: Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 24 pages, 5 figures

    Report number: LIGO-P2300080

  28. arXiv:2307.14349  [pdf, other

    cs.SE cs.AI

    Copilot for Xcode: Exploring AI-Assisted Programming by Prompting Cloud-based Large Language Models

    Authors: Chee Wei Tan, Shangxin Guo, Man Fai Wong, Ching Nam Hang

    Abstract: This paper presents an AI-assisted programming tool called Copilot for Xcode for program composition and design to support human software developers. By seamlessly integrating cloud-based Large Language Models (LLM) with Apple's local development environment, Xcode, this tool enhances productivity and unleashes creativity for software development in Apple software ecosystem (e.g., iOS apps, macOS)… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

  29. arXiv:2307.02503  [pdf, other

    cs.SE cs.AI cs.CL

    Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review

    Authors: Man Fai Wong, Shangxin Guo, Ching Nam Hang, Siu Wai Ho, Chee Wei Tan

    Abstract: This paper provides a comprehensive review of the literature concerning the utilization of Natural Language Processing (NLP) techniques, with a particular focus on transformer-based large language models (LLMs) trained using Big Code, within the domain of AI-assisted programming tasks. LLMs, augmented with software naturalness, have played a crucial role in facilitating AI-assisted programming app… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Journal ref: Entropy(2023), 25(6), 888

  30. arXiv:2305.19847  [pdf, other

    cs.CL cs.AI

    How Does Pretraining Improve Discourse-Aware Translation?

    Authors: Zhihong Huang, Longyue Wang, Siyou Liu, Derek F. Wong

    Abstract: Pretrained language models (PLMs) have produced substantial improvements in discourse-aware neural machine translation (NMT), for example, improved coherence in spoken language translation. However, the underlying reasons for their strong performance have not been well explained. To bridge this gap, we introduce a probing task to interpret the ability of PLMs to capture discourse relation knowledg… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Interspeech 2023

  31. A chip-scale polarization-spatial-momentum quantum SWAP gate in silicon nanophotonics

    Authors: Xiang Cheng, Kai-Chi Chang, Zhenda Xie, Murat Can Sarihan, Yoo Seung Lee, Yongnan Li, XinAn Xu, Abhinav Kumar Vinod, Serdar Kocaman, Mingbin Yu, Patrick Guo-Qiang Lo, Dim-Lee Kwong, Jeffrey H. Shapiro, Franco N. C. Wong, Chee Wei Wong

    Abstract: Recent progress in quantum computing and networking enables high-performance large-scale quantum processors by connecting different quantum modules. Optical quantum systems show advantages in both computing and communications, and integrated quantum photonics further increases the level of scaling and complexity. Here we demonstrate an efficient SWAP gate that deterministically swaps a photon's po… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 25 pages, 4 figures

    Journal ref: Nat. Photon. 17, 656-665 (2023)

  32. arXiv:2305.01951  [pdf, other

    cs.CL

    Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization

    Authors: Chi Seng Cheang, Hou Pong Chan, Derek F. Wong, Xuebo Liu, Zhaocong Li, Yanming Sun, Shudong Liu, Lidia S. Chao

    Abstract: Recent pre-trained language models (PLMs) achieve promising results in existing abstractive summarization datasets. However, existing summarization benchmarks overlap in time with the standard pre-training corpora and finetuning datasets. Hence, the strong performance of PLMs may rely on the parametric knowledge that is memorized during pre-training and fine-tuning. Moreover, the knowledge memoriz… ▽ More

    Submitted 2 November, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023

  33. arXiv:2305.01181  [pdf, other

    cs.CL

    A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models

    Authors: Chenyang Lyu, Zefeng Du, Jitao Xu, Yitao Duan, Minghao Wu, Teresa Lynn, Alham Fikri Aji, Derek F. Wong, Siyou Liu, Longyue Wang

    Abstract: Machine Translation (MT) has greatly advanced over the years due to the developments in deep neural networks. However, the emergence of Large Language Models (LLMs) like GPT-4 and ChatGPT is introducing a new phase in the MT domain. In this context, we believe that the future of MT is intricately tied to the capabilities of LLMs. These models not only offer vast linguistic understandings but also… ▽ More

    Submitted 1 April, 2024; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted to LREC-COLING 2024

  34. arXiv:2304.08393  [pdf, other

    gr-qc astro-ph.CO astro-ph.HE

    Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1670 additional authors not shown)

    Abstract: Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 28 pages, 11 figures

    Report number: LIGO-P2200031

  35. arXiv:2304.01746  [pdf, other

    cs.CL

    Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation

    Authors: Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, **peng Hu, Lidia S. Chao, Yue Zhang

    Abstract: ChatGPT, a large-scale language model based on the advanced GPT-3.5 architecture, has shown remarkable potential in various Natural Language Processing (NLP) tasks. However, there is currently a dearth of comprehensive study exploring its potential in the area of Grammatical Error Correction (GEC). To showcase its capabilities in GEC, we design zero-shot chain-of-thought (CoT) and few-shot CoT set… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  36. arXiv:2303.12723  [pdf, other

    cs.CV

    AdaOPC: A Self-Adaptive Mask Optimization Framework For Real Design Patterns

    Authors: Wenqian Zhao, Xufeng Yao, Ziyang Yu, Guo** Chen, Yuzhe Ma, Bei Yu, Martin D. F. Wong

    Abstract: Optical proximity correction (OPC) is a widely-used resolution enhancement technique (RET) for printability optimization. Recently, rigorous numerical optimization and fast machine learning are the research focus of OPC in both academia and industry, each of which complements the other in terms of robustness or efficiency. We inspect the pattern distribution on a design layer and find that differe… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  37. A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU

    Authors: Wenqian Zhao, Qi Sun, Yang Bai, Wenbo Li, Haisheng Zheng, Bei Yu, Martin D. F. Wong

    Abstract: Recent years have witnessed impressive progress in super-resolution (SR) processing. However, its real-time inference requirement sets a challenge not only for the model design but also for the on-chip implementation. In this paper, we implement a full-stack SR acceleration framework on embedded GPU devices. The special dictionary learning algorithm used in SR models was analyzed in detail and acc… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  38. arXiv:2303.08435  [pdf, other

    cs.CV cs.LG eess.IV

    Physics-Informed Optical Kernel Regression Using Complex-valued Neural Fields

    Authors: Guo** Chen, Zehua Pei, Haoyu Yang, Yuzhe Ma, Bei Yu, Martin D. F. Wong

    Abstract: Lithography is fundamental to integrated circuit fabrication, necessitating large computation overhead. The advancement of machine learning (ML)-based lithography models alleviates the trade-offs between manufacturing process expense and capability. However, all previous methods regard the lithography system as an image-to-image black box map**, utilizing network parameters to learn by rote mapp… ▽ More

    Submitted 9 April, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted by DAC23

  39. arXiv:2302.09870  [pdf, other

    gr-qc astro-ph.HE hep-ph

    Exploring the hidden Universe: A novel phenomenological approach for recovering arbitrary gravitational-wave millilensing configurations

    Authors: Anna Liu, Isaac C. F. Wong, Samson H. W. Leong, Anupreeta More, Otto A. Hannuksela, Tjonnie G. F. Li

    Abstract: Since the first detection of gravitational waves in 2015, gravitational-wave astronomy has emerged as a rapidly advancing field that holds great potential for studying the cosmos, from probing the properties of black holes to testing the limits of our current understanding of gravity. One important aspect of gravitational-wave astronomy is the phenomenon of gravitational lensing, where massive int… ▽ More

    Submitted 26 February, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 12 pages, 8 figures

  40. arXiv:2302.08975  [pdf, other

    cs.CL

    Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors

    Authors: Keqin Bao, Yu Wan, Dayiheng Liu, Baosong Yang, Wenqiang Lei, Xiangnan He, Derek F. Wong, Jun Xie

    Abstract: Fine-grained information on translation errors is helpful for the translation evaluation community. Existing approaches can not synchronously consider error position and type, failing to integrate the error information of both. In this paper, we propose Fine-Grained Translation Error Detection (FG-TED) task, aiming at identifying both the position and the type of translation errors on given source… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  41. Open data from the third observing run of LIGO, Virgo, KAGRA and GEO

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, A. Allocca , et al. (1719 additional authors not shown)

    Abstract: The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 27 pages, 3 figures

    Report number: LIGO-P2200316

  42. arXiv:2301.13007  [pdf, other

    cs.CV cs.AI cs.LG

    EuclidNet: Deep Visual Reasoning for Constructible Problems in Geometry

    Authors: Man Fai Wong, Xintong Qi, Chee Wei Tan

    Abstract: In this paper, we present a deep learning-based framework for solving geometric construction problems through visual reasoning, which is useful for automated geometry theorem proving. Constructible problems in geometry often ask for the sequence of straightedge-and-compass constructions to construct a given goal given some initial setup. Our EuclidNet framework leverages the neural network archite… ▽ More

    Submitted 27 December, 2022; originally announced January 2023.

    Comments: Accepted by 2nd MATH-AI Workshop at NeurIPS'22

    Journal ref: Adv. Artif. Intell. Mach. Learn.(2023), 3(1):839-852

  43. arXiv:2212.10179  [pdf, other

    cs.CL

    Toward Human-Like Evaluation for Natural Language Generation with Error Analysis

    Authors: Qingyu Lu, Liang Ding, Kanjian Zhang, Derek F. Wong, Dacheng Tao

    Abstract: The state-of-the-art language model-based automatic metrics, e.g. BARTScore, benefiting from large-scale contextualized pre-training, have been successfully used in a wide range of natural language generation (NLG) tasks, including machine translation, text summarization, and data-to-text. Recent studies show that considering both major errors (e.g. mistranslated tokens) and minor errors (e.g. imp… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: work in progress

  44. arXiv:2212.04262  [pdf, other

    cs.CL cs.AI cs.LG

    ConsistTL: Modeling Consistency in Transfer Learning for Low-Resource Neural Machine Translation

    Authors: Zhaocong Li, Xuebo Liu, Derek F. Wong, Lidia S. Chao, Min Zhang

    Abstract: Transfer learning is a simple and powerful method that can be used to boost model performance of low-resource neural machine translation (NMT). Existing transfer learning methods for NMT are static, which simply transfer knowledge from a parent model to a child model once via parameter initialization. In this paper, we propose a novel transfer learning method for NMT, namely ConsistTL, which can c… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted to EMNLP 2022

  45. arXiv:2212.04248  [pdf, other

    cs.GR cs.CV cs.SD eess.AS

    Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors

    Authors: Zhentao Yu, Zixin Yin, Deyu Zhou, Duomin Wang, Finn Wong, Baoyuan Wang

    Abstract: In this paper, we introduce a simple and novel framework for one-shot audio-driven talking head generation. Unlike prior works that require additional driving sources for controlled synthesis in a deterministic manner, we instead probabilistically sample all the holistic lip-irrelevant facial motions (i.e. pose, expression, blink, gaze, etc.) to semantically match the input audio while still maint… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 16 pages

  46. arXiv:2212.01477  [pdf, other

    astro-ph.HE astro-ph.CO

    Search for subsolar-mass black hole binaries in the second part of Advanced LIGO's and Advanced Virgo's third observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1680 additional authors not shown)

    Abstract: We describe a search for gravitational waves from compact binaries with at least one component with mass 0.2 $M_\odot$ -- $1.0 M_\odot$ and mass ratio $q \geq 0.1$ in Advanced LIGO and Advanced Virgo data collected between 1 November 2019, 15:00 UTC and 27 March 2020, 17:00 UTC. No signals were detected. The most significant candidate has a false alarm rate of 0.2 $\mathrm{yr}^{-1}$. We estimate t… ▽ More

    Submitted 26 January, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: https://dcc.ligo.org/P2200139

  47. arXiv:2210.10931  [pdf, other

    astro-ph.HE

    Search for gravitational-wave transients associated with magnetar bursts in Advanced LIGO and Advanced Virgo data from the third observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, K. Agatsuma, N. Aggarwal, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Allocca, P. A. Altin , et al. (1645 additional authors not shown)

    Abstract: Gravitational waves are expected to be produced from neutron star oscillations associated with magnetar giant flares and short bursts. We present the results of a search for short-duration (milliseconds to seconds) and long-duration ($\sim$ 100 s) transient gravitational waves from 13 magnetar short bursts observed during Advanced LIGO, Advanced Virgo and KAGRA's third observation run. These 13 bu… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 30 pages with appendices, 5 figures, 10 tables

    Report number: LIGO-P2100387

  48. arXiv:2210.10049  [pdf, other

    cs.CL

    Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task

    Authors: Keqin Bao, Yu Wan, Dayiheng Liu, Baosong Yang, Wenqiang Lei, Xiangnan He, Derek F. Wong, Jun Xie

    Abstract: In this paper, we present our submission to the sentence-level MQM benchmark at Quality Estimation Shared Task, named UniTE (Unified Translation Evaluation). Specifically, our systems employ the framework of UniTE, which combined three types of input formats during training with a pre-trained language model. First, we apply the pseudo-labeled data examples for the continuously pre-training phase.… ▽ More

    Submitted 17 February, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: WMT 2022 QE Shared Task. arXiv admin note: text overlap with arXiv:2210.09683

  49. arXiv:2210.09683  [pdf, other

    cs.CL

    Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

    Authors: Yu Wan, Keqin Bao, Dayiheng Liu, Baosong Yang, Derek F. Wong, Lidia S. Chao, Wenqiang Lei, Jun Xie

    Abstract: In this report, we present our submission to the WMT 2022 Metrics Shared Task. We build our system based on the core idea of UNITE (Unified Translation Evaluation), which unifies source-only, reference-only, and source-reference-combined evaluation scenarios into one single model. Specifically, during the model pre-training phase, we first apply the pseudo-labeled data examples to continuously pre… ▽ More

    Submitted 17 February, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: WMT 2022 Metrics Shared Task

  50. arXiv:2209.02863  [pdf

    astro-ph.HE gr-qc

    Model-based cross-correlation search for gravitational waves from the low-mass X-ray binary Scorpius X-1 in LIGO O3 data

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1670 additional authors not shown)

    Abstract: We present the results of a model-based search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1 using LIGO detector data from the third observing run of Advanced LIGO, Advanced Virgo and KAGRA. This is a semicoherent search which uses details of the signal model to coherently combine data separated by less than a specified coherence time, which can be adjusted to bala… ▽ More

    Submitted 2 January, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: 19 pages, Open Access Journal PDF

    Report number: LIGO-P2100110-v13

    Journal ref: The Astrophysical Journal Letters, 941, L30 (2022)