Skip to main content

Showing 1–50 of 131 results for author: Chu, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01926  [pdf

    physics.med-ph cs.CV

    Chemical Shift Encoding based Double Bonds Quantification in Triglycerides using Deep Image Prior

    Authors: Chaoxing Huang, Ziqiang Yu, Zijian Gao, Qiuyi Shen, Queenie Chan, Vincent Wai-Sun Wong, Winnie Chiu-Wing Chu, Weitian Chen

    Abstract: This study evaluated a deep learning-based method using Deep Image Prior (DIP) to quantify triglyceride double bonds from chemical-shift encoded multi-echo gradient echo images without network training. We employed a cost function based on signal constraints to iteratively update the neural network on a single dataset. The method was validated using phantom experiments and in vivo scans. Results s… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2405.17102  [pdf, other

    cs.CV cs.RO

    DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge

    Authors: Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Chunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu

    Abstract: Surround-view depth estimation is a crucial task aims to acquire the depth maps of the surrounding views. It has many applications in real world scenarios such as autonomous driving, AR/VR and 3D reconstruction, etc. However, given that most of the data in the autonomous driving dataset is collected in daytime scenarios, this leads to poor depth model performance in the face of out-of-distribution… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Outstanding Champion in the RoboDepth Challenge (ICRA24) https://robodrive-24.github.io/

  3. arXiv:2405.13233  [pdf, other

    cs.CL

    MELD-ST: An Emotion-aware Speech Translation Dataset

    Authors: Sirou Chen, Sakiko Yahata, Shuichiro Shimizu, Zhengdong Yang, Yihang Li, Chenhui Chu, Sadao Kurohashi

    Abstract: Emotion plays a crucial role in human conversation. This paper underscores the significance of considering emotion in speech translation. We present the MELD-ST dataset for the emotion-aware speech translation task, comprising English-to-Japanese and English-to-German language pairs. Each language pair includes about 10,000 utterances annotated with emotion labels from the MELD dataset. Baseline e… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 9 pages. Accepted to ACL 2024 Findings. Dataset: https://huggingface.co/datasets/ku-nlp/MELD-ST

  4. arXiv:2405.11607  [pdf, other

    cs.CR cs.AR

    OFHE: An Electro-Optical Accelerator for Discretized TFHE

    Authors: Mengxin Zheng, Cheng Chu, Qian Lou, Nathan Youngblood, Mo Li, Sajjad Moazeni, Lei Jiang

    Abstract: This paper presents \textit{OFHE}, an electro-optical accelerator designed to process Discretized TFHE (DTFHE) operations, which encrypt multi-bit messages and support homomorphic multiplications, lookup table operations and full-domain functional bootstrap**s. While DTFHE is more efficient and versatile than other fully homomorphic encryption schemes, it requires 32-, 64-, and 128-bit polynomia… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  5. arXiv:2405.08816  [pdf, other

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang , et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  6. arXiv:2405.03141  [pdf, other

    eess.IV cs.AI cs.CV physics.med-ph

    Automatic Ultrasound Curve Angle Measurement via Affinity Clustering for Adolescent Idiopathic Scoliosis Evaluation

    Authors: Yihao Zhou, Timothy Tin-Yan Lee, Kelly Ka-Lee Lai, Chonglin Wu, Hin Ting Lau, De Yang, Chui-Yi Chan, Winnie Chiu-Wing Chu, Jack Chun-Yiu Cheng, Tsz-** Lam, Yong-** Zheng

    Abstract: The current clinical gold standard for evaluating adolescent idiopathic scoliosis (AIS) is X-ray radiography, using Cobb angle measurement. However, the frequent monitoring of the AIS progression using X-rays poses a challenge due to the cumulative radiation exposure. Although 3D ultrasound has been validated as a reliable and radiation-free alternative for scoliosis assessment, the process of mea… ▽ More

    Submitted 6 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  7. arXiv:2403.18052  [pdf, other

    astro-ph.IM cs.LG eess.IV eess.SP

    R2D2 image reconstruction with model uncertainty quantification in radio astronomy

    Authors: Amir Aghabiglou, Chung San Chu, Arwa Dabbech, Yves Wiaux

    Abstract: The ``Residual-to-Residual DNN series for high-Dynamic range imaging'' (R2D2) approach was recently introduced for Radio-Interferometric (RI) imaging in astronomy. R2D2's reconstruction is formed as a series of residual images, iteratively estimated as outputs of Deep Neural Networks (DNNs) taking the previous iteration's image estimate and associated data residual as inputs. In this work, we inve… ▽ More

    Submitted 27 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to IEEE EUSIPCO 2024

  8. arXiv:2403.17905  [pdf, other

    eess.IV cs.CV cs.LG eess.SP

    Scalable Non-Cartesian Magnetic Resonance Imaging with R2D2

    Authors: Yiwei Chen, Chao Tang, Amir Aghabiglou, Chung San Chu, Yves Wiaux

    Abstract: We propose a new approach for non-Cartesian magnetic resonance image reconstruction. While unrolled architectures provide robustness via data-consistency layers, embedding measurement operators in Deep Neural Network (DNN) can become impractical at large scale. Alternative Plug-and-Play (PnP) approaches, where the denoising DNNs are blind to the measurement setting, are not affected by this limita… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to IEEE EUSIPCO 2024

  9. arXiv:2403.15765  [pdf, other

    cs.CV cs.AI cs.IR

    Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents

    Authors: Hao Wang, Tang Li, Chenhui Chu, Nengjun Zhu, Rui Wang, Pinpin Zhu

    Abstract: Key-value relations are prevalent in Visually-Rich Documents (VRDs), often depicted in distinct spatial regions accompanied by specific color and font styles. These non-textual cues serve as important indicators that greatly enhance human comprehension and acquisition of such relation triplets. However, current document AI approaches often fail to consider this valuable prior information related t… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 13 pages, 7 figures, accepted by LERC-COLING2024

  10. ChatGPT in Veterinary Medicine: A Practical Guidance of Generative Artificial Intelligence in Clinics, Education, and Research

    Authors: Candice P. Chu

    Abstract: ChatGPT, the most accessible generative artificial intelligence (AI) tool, offers considerable potential for veterinary medicine, yet a dedicated review of its specific applications is lacking. This review concisely synthesizes the latest research and practical applications of ChatGPT within the clinical, educational, and research domains of veterinary medicine. It intends to provide specific guid… ▽ More

    Submitted 25 February, 2024; originally announced March 2024.

  11. arXiv:2403.10790  [pdf, other

    quant-ph cs.CR cs.LG

    QuantumLeak: Stealing Quantum Neural Networks from Cloud-based NISQ Machines

    Authors: Zhenxiao Fu, Min Yang, Cheng Chu, Yilun Xu, Gang Huang, Fan Chen

    Abstract: Variational quantum circuits (VQCs) have become a powerful tool for implementing Quantum Neural Networks (QNNs), addressing a wide range of complex problems. Well-trained VQCs serve as valuable intellectual assets hosted on cloud-based Noisy Intermediate Scale Quantum (NISQ) computers, making them susceptible to malicious VQC stealing attacks. However, traditional model extraction techniques desig… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Journal ref: published in IJCNN 2024

  12. arXiv:2403.05452  [pdf, other

    astro-ph.IM cs.CV cs.LG

    The R2D2 deep neural network series paradigm for fast precision imaging in radio astronomy

    Authors: Amir Aghabiglou, Chung San Chu, Arwa Dabbech, Yves Wiaux

    Abstract: Radio-interferometric (RI) imaging entails solving high-resolution high-dynamic range inverse problems from large data volumes. Recent image reconstruction techniques grounded in optimization theory have demonstrated remarkable capability for imaging precision, well beyond CLEAN's capability. These range from advanced proximal algorithms propelled by handcrafted regularization operators, such as t… ▽ More

    Submitted 1 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in ApJS

  13. arXiv:2403.03690  [pdf

    cs.CL cs.AI

    Rapidly Develo** High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese

    Authors: Yikun Sun, Zhen Wan, Nobuhiro Ueda, Sakiko Yahata, Fei Cheng, Chenhui Chu, Sadao Kurohashi

    Abstract: The creation of instruction data and evaluation benchmarks for serving Large language models often involves enormous human annotation. This issue becomes particularly pronounced when rapidly develo** such resources for a non-English language like Japanese. Instead of following the popular practice of directly translating existing English resources into Japanese (e.g., Japanese-Alpaca), we propos… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: COLING 2024. Our code are available here: \href{https://github.com/hitoshizuku7/awesome-Ja-self-instruct}{self-instruct data} and \href{https://github.com/ku-nlp/ja-vicuna-qa-benchmark}{evaluation benchmark}

  14. arXiv:2403.00877  [pdf, other

    cs.LG cs.DC cs.IR

    Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation

    Authors: Liang Luo, Buyun Zhang, Michael Tsang, Yinbin Ma, Ching-Hsiang Chu, Yuxin Chen, Shen Li, Yuchen Hao, Yanli Zhao, Guna Lakshminarayanan, Ellie Dingqiao Wen, Jongsoo Park, Dheevatsa Mudigere, Maxim Naumov

    Abstract: We study a mismatch between the deep learning recommendation models' flat architecture, common distributed training paradigm and hierarchical data center topology. To address the associated inefficiencies, we propose Disaggregated Multi-Tower (DMT), a modeling technique that consists of (1) Semantic-preserving Tower Transform (SPTT), a novel training paradigm that decomposes the monolithic global… ▽ More

    Submitted 2 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  15. arXiv:2402.11021  [pdf, other

    quant-ph cs.ET

    TITAN: A Distributed Large-Scale Trapped-Ion NISQ Computer

    Authors: Cheng Chu, Zhenxiao Fu, Yilun Xu, Gang Huang, Hausi Muller, Fan Chen, Lei Jiang

    Abstract: Trapped-Ion (TI) technology offers potential breakthroughs for Noisy Intermediate Scale Quantum (NISQ) computing. TI qubits offer extended coherence times and high gate fidelity, making them appealing for large-scale NISQ computers. Constructing such computers demands a distributed architecture connecting Quantum Charge Coupled Devices (QCCDs) via quantum matter-links and photonic switches. Howeve… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  16. arXiv:2402.06127  [pdf, other

    cs.MA cs.LG

    CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models

    Authors: Longchao Da, Chen Chu, Weinan Zhang, Hua Wei

    Abstract: Traffic simulation is an essential tool for transportation infrastructure planning, intelligent traffic control policy learning, and traffic flow analysis. Its effectiveness relies heavily on the realism of the simulators used. Traditional traffic simulators, such as SUMO and CityFlow, are often limited by their reliance on rule-based models with hyperparameters that oversimplify driving behaviors… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 4 pages, 4 figures

    ACM Class: G.3

  17. arXiv:2401.13601  [pdf, other

    cs.CL

    MM-LLMs: Recent Advances in MultiModal Large Language Models

    Authors: Duzhen Zhang, Yahan Yu, Jiahua Dong, Chenxing Li, Dan Su, Chenhui Chu, Dong Yu

    Abstract: In the past year, MultiModal Large Language Models (MM-LLMs) have undergone substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs via cost-effective training strategies. The resulting models not only preserve the inherent reasoning and decision-making capabilities of LLMs but also empower a diverse range of MM tasks. In this paper, we provide a comprehensive surve… ▽ More

    Submitted 28 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted by ACL2024 (findings)

  18. arXiv:2401.13249  [pdf, other

    eess.AS cs.MM

    MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction

    Authors: Wang** Zhou, Zhengdong Yang, Chenhui Chu, Sheng Li, Raj Dabre, Yi Zhao, Tatsuya Kawahara

    Abstract: Automatic Mean Opinion Score (MOS) prediction is employed to evaluate the quality of synthetic speech. This study extends the application of predicted MOS to the task of Fake Audio Detection (FAD), as we expect that MOS can be used to assess how close synthesized speech is to the natural human voice. We propose MOS-FAD, where MOS can be leveraged at two key points in FAD: training data selection a… ▽ More

    Submitted 24 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted in ICASSP2024

  19. arXiv:2401.06129  [pdf, other

    cs.CV

    Distilling Vision-Language Models on Millions of Videos

    Authors: Yue Zhao, Long Zhao, Xingyi Zhou, Jialin Wu, Chun-Te Chu, Hui Miao, Florian Schroff, Hartwig Adam, Ting Liu, Boqing Gong, Philipp Krähenbühl, Liangzhe Yuan

    Abstract: The recent advance in vision-language models is largely attributed to the abundance of image-text data. We aim to replicate this success for video-language models, but there simply is not enough human-curated video-text data available. We thus resort to fine-tuning a video-language model from a strong image-language baseline with synthesized instructional data. The resulting video model by video-i… ▽ More

    Submitted 15 April, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: CVPR 2024. Project page: https://zhaoyue-zephyrus.github.io/video-instruction-tuning

  20. arXiv:2312.15392  [pdf

    cs.CR cs.SE

    Blockchain Smart Contract Threat Detection Technology Based on Symbolic Execution

    Authors: Chang Chu

    Abstract: The security of smart contracts, which are an important part of blockchain technology, has attracted much attention. In particular, reentrancy vulnerability, which is hidden and complex, poses a great threat to smart contracts. In order to improve the existing detection methods, which exhibit low efficiency and accuracy, in this paper, we propose a smart contract threat detection technology based… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  21. arXiv:2311.03696  [pdf, other

    cs.CL

    Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture Transcripts

    Authors: Haiyue Song, Raj Dabre, Chenhui Chu, Atsushi Fujita, Sadao Kurohashi

    Abstract: Lecture transcript translation helps learners understand online courses, however, building a high-quality lecture machine translation system lacks publicly available parallel corpora. To address this, we examine a framework for parallel corpus mining, which provides a quick and effective way to mine a parallel corpus from publicly available lectures on Coursera. To create the parallel corpora, we… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Submitted to the Journal of Information Processing (JIP). arXiv admin note: text overlap with arXiv:1912.11739

  22. arXiv:2310.20201  [pdf, other

    cs.CL

    Video-Helpful Multimodal Machine Translation

    Authors: Yihang Li, Shuichiro Shimizu, Chenhui Chu, Sadao Kurohashi, Wei Li

    Abstract: Existing multimodal machine translation (MMT) datasets consist of images and video captions or instructional video subtitles, which rarely contain linguistic ambiguity, making visual information ineffective in generating appropriate translations. Recent work has constructed an ambiguous subtitles dataset to alleviate this problem but is still limited to the problem that videos do not necessarily c… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 Main Conference (long paper)

  23. arXiv:2310.14802  [pdf, other

    cs.HC cs.CV cs.IR

    DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading

    Authors: Hao Wang, Qingxuan Wang, Yue Li, Changqing Wang, Chenhui Chu, Rui Wang

    Abstract: The use of visually-rich documents (VRDs) in various fields has created a demand for Document AI models that can read and comprehend documents like humans, which requires the overcoming of technical, linguistic, and cognitive barriers. Unfortunately, the lack of appropriate datasets has significantly hindered advancements in the field. To address this issue, we introduce \textsc{DocTrack}, a VRD d… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 14 pages, 8 figures, Accepted by Findings of EMNLP2023

  24. arXiv:2310.14785  [pdf, other

    cs.CV cs.AI

    Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning

    Authors: Hao Wang, Xiahua Chen, Rui Wang, Chenhui Chu

    Abstract: Extracting meaningful entities belonging to predefined categories from Visually-rich Form-like Documents (VFDs) is a challenging task. Visual and layout features such as font, background, color, and bounding box location and size provide important cues for identifying entities of the same type. However, existing models commonly train a visual encoder with weak cross-modal supervision signals, resu… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 14 pages, 6 figures, Accepted by EMNLP2023

  25. arXiv:2309.16509  [pdf, other

    cs.DC cs.PL

    SIMD Everywhere Optimization from ARM NEON to RISC-V Vector Extensions

    Authors: Ju-Hung Li, Jhih-Kuan Lin, Yung-Cheng Su, Chi-Wei Chu, Lai-Tak Kuok, Hung-Ming Lai, Chao-Lin Lee, Jenq-Kuen Lee

    Abstract: Many libraries, such as OpenCV, FFmpeg, XNNPACK, and Eigen, utilize Arm or x86 SIMD Intrinsics to optimize programs for performance. With the emergence of RISC-V Vector Extensions (RVV), there is a need to migrate these performance legacy codes for RVV. Currently, the migration of NEON code to RVV code requires manual rewriting, which is a time-consuming and error-prone process. In this work, we u… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  26. arXiv:2309.03291  [pdf, other

    astro-ph.IM cs.LG eess.IV eess.SP

    CLEANing Cygnus A deep and fast with R2D2

    Authors: Arwa Dabbech, Amir Aghabiglou, Chung San Chu, Yves Wiaux

    Abstract: A novel deep learning paradigm for synthesis imaging by radio interferometry in astronomy was recently proposed, dubbed "Residual-to-Residual DNN series for high-Dynamic range imaging" (R2D2). In this work, we start by shedding light on R2D2's algorithmic structure, interpreting it as a learned version of CLEAN with minor cycles substituted with a deep neural network (DNN) whose training is iterat… ▽ More

    Submitted 23 April, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: accepted for publication in ApJL

  27. arXiv:2308.13795  [pdf, other

    cs.CV

    VIDES: Virtual Interior Design via Natural Language and Visual Guidance

    Authors: Minh-Hien Le, Chi-Bien Chu, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

    Abstract: Interior design is crucial in creating aesthetically pleasing and functional indoor spaces. However, develo** and editing interior design concepts requires significant time and expertise. We propose Virtual Interior DESign (VIDES) system in response to this challenge. Leveraging cutting-edge technology in generative AI, our system can assist users in generating and editing indoor scene concepts… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted to ISMAR 2023 (Poster paper)

  28. arXiv:2308.01941  [pdf

    q-bio.NC cs.AI cs.NE

    Digital twin brain: a bridge between biological intelligence and artificial intelligence

    Authors: Hui Xiong, Congying Chu, Lingzhong Fan, Ming Song, Jiaqi Zhang, Yawei Ma, Ruonan Zheng, Junyang Zhang, Zhengyi Yang, Tianzi Jiang

    Abstract: In recent years, advances in neuroscience and artificial intelligence have paved the way for unprecedented opportunities for understanding the complexity of the brain and its emulation by computational systems. Cutting-edge advancements in neuroscience research have revealed the intricate relationship between brain structure and function, while the success of artificial neural networks highlights… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Journal ref: Intell Comput. 2023;2:0055

  29. arXiv:2308.00085  [pdf, other

    cs.CL cs.AI

    Reasoning before Responding: Integrating Commonsense-based Causality Explanation for Empathetic Response Generation

    Authors: Yahui Fu, Koji Inoue, Chenhui Chu, Tatsuya Kawahara

    Abstract: Recent approaches to empathetic response generation try to incorporate commonsense knowledge or reasoning about the causes of emotions to better understand the user's experiences and feelings. However, these approaches mainly focus on understanding the causalities of context from the user's perspective, ignoring the system's perspective. In this paper, we propose a commonsense-based causality expl… ▽ More

    Submitted 5 September, 2023; v1 submitted 27 July, 2023; originally announced August 2023.

    Comments: Accepted by the 24th Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2023)

  30. SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation

    Authors: Haiyue Song, Raj Dabre, Chenhui Chu, Sadao Kurohashi, Eiichiro Sumita

    Abstract: Sub-word segmentation is an essential pre-processing step for Neural Machine Translation (NMT). Existing work has shown that neural sub-word segmenters are better than Byte-Pair Encoding (BPE), however, they are inefficient as they require parallel corpora, days to train and hours to decode. This paper introduces SelfSeg, a self-supervised neural sub-word segmentation method that is much faster to… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Accepted to TALLIP journal

  31. QDoor: Exploiting Approximate Synthesis for Backdoor Attacks in Quantum Neural Networks

    Authors: Cheng Chu, Fan Chen, Philip Richerme, Lei Jiang

    Abstract: Quantum neural networks (QNNs) succeed in object recognition, natural language processing, and financial analysis. To maximize the accuracy of a QNN on a Noisy Intermediate Scale Quantum (NISQ) computer, approximate synthesis modifies the QNN circuit by reducing error-prone 2-qubit quantum gates. The success of QNNs motivates adversaries to attack QNNs via backdoors. However, naïvely transplanting… ▽ More

    Submitted 16 February, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  32. arXiv:2307.07012  [pdf, other

    quant-ph cs.ET

    CryptoQFL: Quantum Federated Learning on Encrypted Data

    Authors: Cheng Chu, Lei Jiang, Fan Chen

    Abstract: Recent advancements in Quantum Neural Networks (QNNs) have demonstrated theoretical and experimental performance superior to their classical counterparts in a wide range of applications. However, existing centralized QNNs cannot solve many real-world problems because collecting large amounts of training data to a common public site is time-consuming and, more importantly, violates data privacy. Fe… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  33. arXiv:2307.02736  [pdf

    physics.med-ph cs.CV

    An Uncertainty Aided Framework for Learning based Liver $T_1ρ$ Map** and Analysis

    Authors: Chaoxing Huang, Vincent Wai Sun Wong, Queenie Chan, Winnie Chiu Wing Chu, Weitian Chen

    Abstract: Objective: Quantitative $T_1ρ$ imaging has potential for assessment of biochemical alterations of liver pathologies. Deep learning methods have been employed to accelerate quantitative $T_1ρ$ imaging. To employ artificial intelligence-based quantitative imaging methods in complicated clinical environment, it is valuable to estimate the uncertainty of the predicated $T_1ρ$ values to provide the con… ▽ More

    Submitted 9 October, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  34. arXiv:2306.14105  [pdf, other

    cs.RO eess.SY

    Sequential Manipulation Planning for Over-actuated Unmanned Aerial Manipulators

    Authors: Yao Su, Jiarui Li, Ziyuan Jiao, Meng Wang, Chi Chu, Hang Li, Yixin Zhu, Hangxin Liu

    Abstract: We investigate the sequential manipulation planning problem for unmanned aerial manipulators (UAMs). Unlike prior work that primarily focuses on one-step manipulation tasks, sequential manipulations require coordinated motions of a UAM's floating base, the manipulator, and the object being manipulated, entailing a unified kinematics and dynamics model for motion planning under designated constrain… ▽ More

    Submitted 10 July, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Journal ref: IROS 2023

  35. arXiv:2305.12733  [pdf, other

    cs.CL

    MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation

    Authors: Jia-Chen Gu, Chao-Hong Tan, Caiyuan Chu, Zhen-Hua Ling, Chongyang Tao, Quan Liu, Cong Liu

    Abstract: Modeling multi-party conversations (MPCs) with graph neural networks has been proven effective at capturing complicated and graphical information flows. However, existing methods rely heavily on the necessary addressee labels and can only be applied to an ideal setting where each utterance must be tagged with an addressee label. To study the scarcity of addressee labels which is a common issue in… ▽ More

    Submitted 17 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by EMNLP 2023. arXiv admin note: text overlap with arXiv:2203.08500

  36. arXiv:2305.10190  [pdf, other

    cs.CL

    Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation

    Authors: Zhuoyuan Mao, Haiyue Song, Raj Dabre, Chenhui Chu, Sadao Kurohashi

    Abstract: The language-independency of encoded representations within multilingual neural machine translation (MNMT) models is crucial for their generalization ability on zero-shot translation. Neural interlingua representations have been shown as an effective method for achieving this. However, fixed-length neural interlingua representations introduced in previous work can limit its flexibility and represe… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to Multi3Generation workshop (held in conjunction with EAMT 2023)

  37. arXiv:2305.09312  [pdf, other

    cs.CL

    Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation

    Authors: Zhuoyuan Mao, Raj Dabre, Qianying Liu, Haiyue Song, Chenhui Chu, Sadao Kurohashi

    Abstract: This paper studies the impact of layer normalization (LayerNorm) on zero-shot translation (ZST). Recent efforts for ZST often utilize the Transformer architecture as the backbone, with LayerNorm at the input of layers (PreNorm) set as the default. However, Xu et al. (2019) has revealed that PreNorm carries the risk of overfitting the training data. Based on this, we hypothesize that PreNorm may ov… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023 main conference

  38. arXiv:2305.09210  [pdf, other

    cs.CL

    Towards Speech Dialogue Translation Mediating Speakers of Different Languages

    Authors: Shuichiro Shimizu, Chenhui Chu, Sheng Li, Sadao Kurohashi

    Abstract: We present a new task, speech dialogue translation mediating speakers of different languages. We construct the SpeechBSD dataset for the task and conduct baseline experiments. Furthermore, we consider context to be an important aspect that needs to be addressed in this task and propose two ways of utilizing context, namely monolingual context and bilingual context. We conduct cascaded speech trans… ▽ More

    Submitted 22 May, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: 11 pages, 4 figures. Accepted to ACL 2023 Findings. Dataset: https://github.com/ku-nlp/speechBSD

  39. arXiv:2304.09093  [pdf, other

    cs.IR cs.CL cs.LG

    Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation

    Authors: Huy Dao, Dung D. Le, Cuong Chu

    Abstract: State-of-the-art methods on conversational recommender systems (CRS) leverage external knowledge to enhance both items' and contextual words' representations to achieve high quality recommendations and responses generation. However, the representations of the items and words are usually modeled in two separated semantic spaces, which leads to misalignment issue between them. Consequently, this wil… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 14 pages, 3 figures, 9 tables

  40. arXiv:2304.06662  [pdf, other

    eess.IV cs.CV

    Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions

    Authors: Luyang Luo, Xi Wang, Yi Lin, Xiaoqi Ma, Andong Tan, Ronald Chan, Varut Vardhanabhuti, Winnie CW Chu, Kwang-Ting Cheng, Hao Chen

    Abstract: Breast cancer has reached the highest incidence rate worldwide among all malignancies since 2020. Breast imaging plays a significant role in early diagnosis and intervention to improve the outcome of breast cancer patients. In the past decade, deep learning has shown remarkable progress in breast cancer imaging analysis, holding great promise in interpreting the rich information and complex contex… ▽ More

    Submitted 20 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: IEEE RBME 2024

  41. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  42. arXiv:2303.05034  [pdf, other

    cs.CL

    Multi-Stage Coarse-to-Fine Contrastive Learning for Conversation Intent Induction

    Authors: Caiyuan Chu, Ya Li, Yifan Liu, Jia-Chen Gu, Quan Liu, Yongxin Ge, Guo** Hu

    Abstract: Intent recognition is critical for task-oriented dialogue systems. However, for emerging domains and new services, it is difficult to accurately identify the key intent of a conversation due to time-consuming data annotation and comparatively poor model transferability. Therefore, the automatic induction of dialogue intention is very important for intelligent dialogue systems. This paper presents… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: Ranked 1st on Track 2 at DSTC 11, Accepted by DSTC 11 Workshop

  43. arXiv:2302.08090  [pdf, other

    quant-ph cs.AI cs.CR

    QTrojan: A Circuit Backdoor Against Quantum Neural Networks

    Authors: Cheng Chu, Lei Jiang, Martin Swany, Fan Chen

    Abstract: We propose a circuit-level backdoor attack, \textit{QTrojan}, against Quantum Neural Networks (QNNs) in this paper. QTrojan is implemented by few quantum gates inserted into the variational quantum circuit of the victim QNN. QTrojan is much stealthier than a prior Data-Poisoning-based Backdoor Attack (DPBA), since it does not embed any trigger in the inputs of the victim QNN or require the access… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: ICASSP2023

  44. arXiv:2212.01618  [pdf

    cs.IT cs.CR

    An Overview of Trust Standards for Communication Networks and Future Digital World

    Authors: Huilin Wang, Xin Kang, Tieyan Li, Zhongding Lei, Cheng-Kang Chu, Haiguang Wang

    Abstract: With the development of Information and Communication Technologies, trust has been applied more and more in various scenarios. At the same time, different organizations have published a series of trust frameworks to support the implementation of trust. There are also academic paper discussing about these trust standards, however, most of them only focus on a specific application. Unlike existing w… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: 7 pages, 3 figures, Magazine paper under review

  45. arXiv:2212.00305  [pdf, other

    cs.CV

    Multilingual Communication System with Deaf Individuals Utilizing Natural and Visual Languages

    Authors: Tuan-Luc Huynh, Khoi-Nguyen Nguyen-Ngoc, Chi-Bien Chu, Minh-Triet Tran, Trung-Nghia Le

    Abstract: According to the World Federation of the Deaf, more than two hundred sign languages exist. Therefore, it is challenging to understand deaf individuals, even proficient sign language users, resulting in a barrier between the deaf community and the rest of society. To bridge this language barrier, we propose a novel multilingual communication system, namely MUGCAT, to improve the communication effic… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  46. arXiv:2211.03615  [pdf, other

    cs.LG cs.AI cs.DC eess.SP

    MAISON -- Multimodal AI-based Sensor platform for Older Individuals

    Authors: Ali Abedi, Faranak Dayyani, Charlene Chu, Shehroz S. Khan

    Abstract: There is a global aging population requiring the need for the right tools that can enable older adults' greater independence and the ability to age at home, as well as assist healthcare workers. It is feasible to achieve this objective by building predictive models that assist healthcare workers in monitoring and analyzing older adults' behavioral, functional, and psychological data. To develop su… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  47. arXiv:2211.01205  [pdf, other

    cs.MM cs.GR

    No-reference Point Cloud Geometry Quality Assessment Based on Pairwise Rank Learning

    Authors: Zhiyong Su, Chao Chu, Long Chen, Yong Li, Weiqing Li

    Abstract: Objective geometry quality assessment of point clouds is essential to evaluate the performance of a wide range of point cloud-based solutions, such as denoising, simplification, reconstruction, and watermarking. Existing point cloud quality assessment (PCQA) methods dedicate to assigning absolute quality scores to distorted point clouds. Their performance is strongly reliant on the quality and qua… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  48. arXiv:2210.17291  [pdf

    cs.IT

    SIX-Trust for 6G: Towards a Secure and Trustworthy 6G Network

    Authors: Yiying Wang, Xin Kang, Tieyan Li, Haiguang Wang, Cheng-Kang Chu, Zhongding Lei

    Abstract: Recent years have witnessed a digital explosion with the deployment of 5G and proliferation of 5G-enabled innovations. Compared with 5G, 6G is envisioned to achieve much higher performance in terms of latency, data rate, connectivity, energy efficiency, coverage and mobility. To fulfil these expectations, 6G will experience a number of paradigm shifts, such as exploiting new spectrum, applying ubi… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: 7 pages, 3 figures, under review

  49. arXiv:2208.10758  [pdf, other

    cs.CV cs.AI

    Learning More May Not Be Better: Knowledge Transferability in Vision and Language Tasks

    Authors: Tianwei Chen, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Hajime Nagahara

    Abstract: Is more data always better to train vision-and-language models? We study knowledge transferability in multi-modal tasks. The current tendency in machine learning is to assume that by joining multiple datasets from different tasks their overall performance will improve. However, we show that not all the knowledge transfers well or has a positive impact on related tasks, even when they share a commo… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  50. arXiv:2208.04910  [pdf

    cs.CV

    Deep Learning-Based Objective and Reproducible Osteosarcoma Chemotherapy Response Assessment and Outcome Prediction

    Authors: David Joon Ho, Narasimhan P. Agaram, Marc-Henri Jean, Stephanie D. Suser, Cynthia Chu, Chad M. Vanderbilt, Paul A. Meyers, Leonard H. Wexler, John H. Healey, Thomas J. Fuchs, Meera R. Hameed

    Abstract: Osteosarcoma is the most common primary bone cancer whose standard treatment includes pre-operative chemotherapy followed by resection. Chemotherapy response is used for predicting prognosis and further management of patients. Necrosis is routinely assessed post-chemotherapy from histology slides on resection specimens where necrosis ratio is defined as the ratio of necrotic tumor to overall tumor… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.