Skip to main content

Showing 1–50 of 172 results for author: jiang, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00617  [pdf, other

    cs.LG cs.AI cs.CL cs.GT

    Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

    Authors: Yuheng Zhang, Dian Yu, Baolin Peng, Linfeng Song, Ye Tian, Mingyue Huo, Nan Jiang, Haitao Mi, Dong Yu

    Abstract: Reinforcement Learning with Human Feedback (RLHF) has achieved great success in aligning large language models (LLMs) with human preferences. Prevalent RLHF approaches are reward-based, following the Bradley-Terry (BT) model assumption, which may not fully capture the complexity of human preferences. In this paper, we explore RLHF under a general preference framework and approach it from a game-th… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.12002  [pdf, other

    q-bio.PE cs.LG math.NA physics.soc-ph

    Modeling, Inference, and Prediction in Mobility-Based Compartmental Models for Epidemiology

    Authors: Ning Jiang, Weiqi Chu, Yao Li

    Abstract: Classical compartmental models in epidemiology often struggle to accurately capture real-world dynamics due to their inability to address the inherent heterogeneity of populations. In this paper, we introduce a novel approach that incorporates heterogeneity through a mobility variable, transforming the traditional ODE system into a system of integro-differential equations that describe the dynamic… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 16 pages, 7 figures

  3. arXiv:2405.18649  [pdf, other

    cs.CL cs.AI cs.SE

    Training LLMs to Better Self-Debug and Explain Code

    Authors: Nan Jiang, Xiaopeng Li, Shiqi Wang, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma, Anoop Deoras

    Abstract: In the domain of code generation, self-debugging is crucial. It allows LLMs to refine their generated code based on execution feedback. This is particularly important because generating correct solutions in one attempt proves challenging for complex tasks. Prior works on self-debugging mostly focus on prompting methods by providing LLMs with few-shot examples, which work poorly on small open-sourc… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2405.07863  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    RLHF Workflow: From Reward Modeling to Online RLHF

    Authors: Hanze Dong, Wei Xiong, Bo Pang, Haoxiang Wang, Han Zhao, Yingbo Zhou, Nan Jiang, Doyen Sahoo, Caiming Xiong, Tong Zhang

    Abstract: We present the workflow of Online Iterative Reinforcement Learning from Human Feedback (RLHF) in this technical report, which is widely reported to outperform its offline counterpart by a large margin in the recent large language model (LLM) literature. However, existing open-source RLHF projects are still largely confined to the offline learning setting. In this technical report, we aim to fill i… ▽ More

    Submitted 12 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  5. arXiv:2405.06979  [pdf, other

    cs.LG

    Robust Semi-supervised Learning by Wisely Leveraging Open-set Data

    Authors: Yang Yang, Nan Jiang, Yi Xu, De-Chuan Zhan

    Abstract: Open-set Semi-supervised Learning (OSSL) holds a realistic setting that unlabeled data may come from classes unseen in the labeled set, i.e., out-of-distribution (OOD) data, which could cause performance degradation in conventional SSL models. To handle this issue, except for the traditional in-distribution (ID) classifier, some existing OSSL approaches employ an extra OOD detection module to avoi… ▽ More

    Submitted 20 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

  6. arXiv:2404.16666  [pdf, other

    cs.CV

    PhyRecon: Physically Plausible Neural Scene Reconstruction

    Authors: Junfeng Ni, Yixin Chen, Bohan **g, Nan Jiang, Bin Wang, Bo Dai, Puhao Li, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

    Abstract: Neural implicit representations have gained popularity in multi-view 3D reconstruction. However, most previous work struggles to yield physically plausible results, limiting their utility in domains requiring rigorous physical accuracy, such as embodied AI and robotics. This lack of plausibility stems from the absence of physics modeling in existing methods and their inability to recover intricate… ▽ More

    Submitted 2 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: project page: https://phyrecon.github.io/. arXiv admin note: text overlap with arXiv:2303.08605 by other authors

  7. arXiv:2404.11595  [pdf, other

    cs.SE

    A Deep Dive into Large Language Models for Automated Bug Localization and Repair

    Authors: Soneya Binta Hossain, Nan Jiang, Qiang Zhou, Xiaopeng Li, Wen-Hao Chiang, Yingjun Lyu, Hoan Nguyen, Omer Tripp

    Abstract: Large language models (LLMs) have shown impressive effectiveness in various software engineering tasks, including automated program repair (APR). In this study, we take a deep dive into automated bug fixing utilizing LLMs. In contrast to many deep learning-based APR methods that assume known bug locations, rely on line-level localization tools, or address bug prediction and fixing in one step, our… ▽ More

    Submitted 10 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  8. arXiv:2404.09946  [pdf, other

    cs.LG cs.AI stat.ML

    A Note on Loss Functions and Error Compounding in Model-based Reinforcement Learning

    Authors: Nan Jiang

    Abstract: This note clarifies some confusions (and perhaps throws out more) around model-based reinforcement learning and their theoretical understanding in the context of deep RL. Main topics of discussion are (1) how to reconcile model-based RL's bad empirical reputation on error compounding with its superior theoretical properties, and (2) the limitations of empirically popular losses. For the latter, co… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  9. arXiv:2404.05774  [pdf, other

    cs.LG cs.AI

    STMGF: An Effective Spatial-Temporal Multi-Granularity Framework for Traffic Forecasting

    Authors: Zhengyang Zhao, Haitao Yuan, Nan Jiang, Minxiao Chen, Ning Liu, Zengxiang Li

    Abstract: Accurate Traffic Prediction is a challenging task in intelligent transportation due to the spatial-temporal aspects of road networks. The traffic of a road network can be affected by long-distance or long-term dependencies where existing methods fall short in modeling them. In this paper, we introduce a novel framework known as Spatial-Temporal Multi-Granularity Framework (STMGF) to enhance the ca… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  10. arXiv:2404.04271  [pdf, other

    cs.IR cs.AI cs.DB

    Towards Effective Next POI Prediction: Spatial and Semantic Augmentation with Remote Sensing Data

    Authors: Nan Jiang, Haitao Yuan, Jianing Si, Minxiao Chen, Shangguang Wang

    Abstract: The next point-of-interest (POI) prediction is a significant task in location-based services, yet its complexity arises from the consolidation of spatial and semantic intent. This fusion is subject to the influences of historical preferences, prevailing location, and environmental factors, thereby posing significant challenges. In addition, the uneven POI distribution further complicates the next… ▽ More

    Submitted 22 March, 2024; originally announced April 2024.

    Comments: 12 pages, 11 figures, Accepted by ICDE 2024

  11. Performance Analysis of Integrated Sensing and Communication Networks with Blockage Effects

    Authors: Zezhong Sun, Shi Yan, Ning Jiang, Jiaen Zhou, Mugen Peng

    Abstract: Communication-sensing integration represents an up-and-coming area of research, enabling wireless networks to simultaneously perform communication and sensing tasks. However, in urban cellular networks, the blockage of buildings results in a complex signal propagation environment, affecting the performance analysis of integrated sensing and communication (ISAC) networks. To overcome this obstacle,… ▽ More

    Submitted 2 July, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted by IEEE Transactions on Vehicular Technology

  12. arXiv:2403.12556  [pdf, other

    cs.CL

    Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

    Authors: Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu, Guoqing Zhao

    Abstract: Previous Sign Language Translation (SLT) methods achieve superior performance by relying on gloss annotations. However, labeling high-quality glosses is a labor-intensive task, which limits the further development of SLT. Although some approaches work towards gloss-free SLT through jointly training the visual encoder and translation network, these efforts still suffer from poor performance and ine… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING-2024

  13. arXiv:2403.12031  [pdf, other

    cs.LG cs.AI

    RouterBench: A Benchmark for Multi-LLM Routing System

    Authors: Qitian Jason Hu, Jacob Bieker, Xiuyu Li, Nan Jiang, Benjamin Keigwin, Gaurav Ranganath, Kurt Keutzer, Shriyash Kaustubh Upadhyay

    Abstract: As the range of applications for Large Language Models (LLMs) continues to grow, the demand for effective serving solutions becomes increasingly critical. Despite the versatility of LLMs, no single model can optimally address all tasks and applications, particularly when balancing performance with cost. This limitation has led to the development of LLM routing systems, which combine the strengths… ▽ More

    Submitted 28 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  14. arXiv:2403.08629  [pdf, other

    cs.CV

    Scaling Up Dynamic Human-Scene Interaction Modeling

    Authors: Nan Jiang, Zhiyuan Zhang, Hongjie Li, Xiaoxuan Ma, Zan Wang, Yixin Chen, Tengyu Liu, Yixin Zhu, Siyuan Huang

    Abstract: Confronting the challenges of data scarcity and advanced motion synthesis in human-scene interaction modeling, we introduce the TRUMANS dataset alongside a novel HSI motion synthesis method. TRUMANS stands as the most comprehensive motion-captured HSI dataset currently available, encompassing over 15 hours of human interactions across 100 indoor scenes. It intricately captures whole-body human mot… ▽ More

    Submitted 24 May, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  15. arXiv:2402.14703  [pdf, ps, other

    cs.LG cs.AI stat.ML

    On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation

    Authors: Yuheng Zhang, Nan Jiang

    Abstract: We study off-policy evaluation (OPE) in partially observable environments with complex observations, with the goal of develo** estimators whose guarantee avoids exponential dependence on the horizon. While such estimators exist for MDPs and POMDPs can be converted to history-based MDPs, their estimation errors depend on the state-density ratio for MDPs which becomes history ratios after conversi… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  16. arXiv:2402.07314  [pdf, other

    cs.LG stat.ML

    Online Iterative Reinforcement Learning from Human Feedback with General Preference Model

    Authors: Chenlu Ye, Wei Xiong, Yuheng Zhang, Nan Jiang, Tong Zhang

    Abstract: We study Reinforcement Learning from Human Feedback (RLHF) under a general preference oracle. In particular, we do not assume that there exists a reward function and the preference signal is drawn from the Bradley-Terry model as most of the prior works do. We consider a standard mathematical formulation, the reverse-KL regularized minimax game between two LLMs for RLHF under general preference ora… ▽ More

    Submitted 25 April, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: RLHF, Preference Learning, Alignment for LLMs

  17. arXiv:2402.00254  [pdf, other

    cs.LG cs.AI

    Vertical Symbolic Regression via Deep Policy Gradient

    Authors: Nan Jiang, Md Nasim, Yexiang Xue

    Abstract: Vertical Symbolic Regression (VSR) recently has been proposed to expedite the discovery of symbolic equations with many independent variables from experimental data. VSR reduces the search spaces following the vertical discovery path by building from reduced-form equations involving a subset of independent variables to full-fledged ones. Proved successful by many symbolic regressors, deep neural n… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: see animated demo at: vsr-dpg.github.io

  18. arXiv:2401.09681  [pdf, other

    cs.LG stat.ML

    Harnessing Density Ratios for Online Reinforcement Learning

    Authors: Philip Amortila, Dylan J. Foster, Nan Jiang, Ayush Sekhari, Tengyang Xie

    Abstract: The theories of offline and online reinforcement learning, despite having evolved in parallel, have begun to show signs of the possibility for a unification, with algorithms and analysis techniques for one setting often having natural counterparts in the other. However, the notion of density ratio modeling, an emerging paradigm in offline RL, has been largely absent from online RL, perhaps for goo… ▽ More

    Submitted 4 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: ICLR 2024

  19. arXiv:2401.03697  [pdf, other

    cs.SD eess.AS

    An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

    Authors: Runduo Han, Xiaopeng Yan, Weiming Xu, Pengcheng Guo, Jiayao Sun, He Wang, Quan Lu, Ning Jiang, Lei Xie

    Abstract: This paper describes our audio-quality-based multi-strategy approach for the audio-visual target speaker extraction (AVTSE) task in the Multi-modal Information based Speech Processing (MISP) 2023 Challenge. Specifically, our approach adopts different extraction strategies based on the audio quality, striking a balance between interference removal and speech preservation, which benifits the back-en… ▽ More

    Submitted 6 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  20. arXiv:2312.11955  [pdf, other

    cs.AI

    Vertical Symbolic Regression

    Authors: Nan Jiang, Md Nasim, Yexiang Xue

    Abstract: Automating scientific discovery has been a grand goal of Artificial Intelligence (AI) and will bring tremendous societal impact. Learning symbolic expressions from experimental data is a vital step in AI-driven scientific discovery. Despite exciting progress, most endeavors have focused on the horizontal discovery paths, i.e., they directly search for the best expression in the full hypothesis spa… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.08057

  21. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  22. arXiv:2312.11456  [pdf, other

    cs.LG cs.AI stat.ML

    Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

    Authors: Wei Xiong, Hanze Dong, Chenlu Ye, Ziqi Wang, Han Zhong, Heng Ji, Nan Jiang, Tong Zhang

    Abstract: This paper studies the alignment process of generative models with Reinforcement Learning from Human Feedback (RLHF). We first identify the primary challenges of existing popular methods like offline PPO and offline DPO as lacking in strategical exploration of the environment. Then, to understand the mathematical principle of RLHF, we consider a standard mathematical formulation, the reverse-KL re… ▽ More

    Submitted 1 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 53 pages; theoretical study and algorithmic design of iterative RLHF and DPO

  23. arXiv:2312.05572  [pdf, other

    cs.CV

    R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning

    Authors: Zhiling Ye, LiangGuo Zhang, Dingheng Zeng, Quan Lu, Ning Jiang

    Abstract: Dynamic NeRFs have recently garnered growing attention for 3D talking portrait synthesis. Despite advances in rendering speed and visual quality, challenges persist in enhancing efficiency and effectiveness. We present R2-Talker, an efficient and effective framework enabling realistic real-time talking head synthesis. Specifically, using multi-resolution hash grids, we introduce a novel approach f… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  24. arXiv:2312.02781  [pdf, other

    cs.CV cs.AI

    PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features

    Authors: Tianshun Han, Shengnan Gui, Yiqing Huang, Baihui Li, Lijian Liu, Benjia Zhou, Ning Jiang, Quan Lu, Ruicong Zhi, Yanyan Liang, Du Zhang, Jun Wan

    Abstract: Speech-driven 3D facial animation has improved a lot recently while most related works only utilize acoustic modality and neglect the influence of visual and textual cues, leading to unsatisfactory results in terms of precision and coherence. We argue that visual and textual cues are not trivial information. Therefore, we present a novel framework, namely PMMTalk, using complementary Pseudo Multi-… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  25. arXiv:2311.13721  [pdf, other

    cs.SE cs.AI

    Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning

    Authors: Nan Jiang, Chengxiao Wang, Kevin Liu, Xiangzhe Xu, Lin Tan, Xiangyu Zhang

    Abstract: Binary code analysis is the foundation of crucial tasks in the security domain; thus building effective binary analysis techniques is more important than ever. Large language models (LLMs) although have brought impressive improvement to source code tasks, do not directly generalize to assembly code due to the unique challenges of assembly: (1) the low information density of assembly and (2) the di… ▽ More

    Submitted 24 June, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  26. arXiv:2311.00457  [pdf, other

    cs.CV

    Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture

    Authors: Yixin Chen, Junfeng Ni, Nan Jiang, Yaowei Zhang, Yixin Zhu, Siyuan Huang

    Abstract: Reconstructing detailed 3D scenes from single-view images remains a challenging task due to limitations in existing approaches, which primarily focus on geometric shape recovery, overlooking object appearances and fine shape details. To address these challenges, we propose a novel framework for simultaneous high-fidelity recovery of object shapes and textures from single-view images. Our approach… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 3DV 2024, project page: https://dali-jack.github.io/SSR/

  27. arXiv:2310.17101  [pdf, other

    eess.AS cs.SD

    Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning

    Authors: Xinfa Zhu, Yuke Li, Yi Lei, Ning Jiang, Guoqing Zhao, Lei Xie

    Abstract: This paper aims to build a multi-speaker expressive TTS system, synthesizing a target speaker's speech with multiple styles and emotions. To this end, we propose a novel contrastive learning-based TTS approach to transfer style and emotion across speakers. Specifically, contrastive learning from different levels, i.e. utterance and category level, is leveraged to extract the disentangled style, em… ▽ More

    Submitted 25 April, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 6 pages, 4 figures; Accepted by ICME 2024

  28. arXiv:2310.14278  [pdf, other

    cs.SD cs.CL eess.AS

    Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation

    Authors: Kun Wei, Bei Li, Hang Lv, Quan Lu, Ning Jiang, Lei Xie

    Abstract: Automatic Speech Recognition (ASR) in conversational settings presents unique challenges, including extracting relevant contextual information from previous conversational turns. Due to irrelevant content, error propagation, and redundancy, existing methods struggle to extract longer and more effective contexts. To address this issue, we introduce a novel conversational ASR system, extending the C… ▽ More

    Submitted 27 April, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: TASLP

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

  29. arXiv:2310.13850  [pdf, other

    cs.CL

    Ecologically Valid Explanations for Label Variation in NLI

    Authors: Nan-Jiang Jiang, Chenhao Tan, Marie-Catherine de Marneffe

    Abstract: Human label variation, or annotation disagreement, exists in many natural language processing (NLP) tasks, including natural language inference (NLI). To gain direct evidence of how NLI label variation arises, we build LiveNLI, an English dataset of 1,415 ecologically valid explanations (annotators explain the NLI labels they chose) for 122 MNLI items (at least 10 explanations per item). The LiveN… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Findings at EMNLP 2023. Overlap with previous version arXiv:2304.12443

  30. arXiv:2310.08049  [pdf, other

    cs.LG

    Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability

    Authors: Ivan Lee, Nan Jiang, Taylor Berg-Kirkpatrick

    Abstract: What is the relationship between model architecture and the ability to perform in-context learning? In this empirical study, we take the first steps toward answering this question. We evaluate thirteen model architectures capable of causal language modeling across a suite of synthetic in-context learning tasks. These selected architectures represent a broad range of paradigms, including recurrent… ▽ More

    Submitted 1 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  31. arXiv:2310.04760  [pdf, other

    eess.AS cs.SD

    Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification

    Authors: Ze Li, Yuke Lin, Ning Jiang, Xiaoyi Qin, Guoqing Zhao, Haiying Wu, Ming Li

    Abstract: Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for semi-supervised domain adaptation in speaker verification tasks. In this paper, we propose a novel pseudo-labeling method named Multi-objective Progressive Clustering (MoPC), specifically designed for semi-supervised domain adaptation. Firstly, we utilize limited labeled data from the target domain to deriv… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  32. Driving behavior-guided battery health monitoring for electric vehicles using machine learning

    Authors: Nanhua Jiang, Jiawei Zhang, Weiran Jiang, Yao Ren, **g Lin, Edwin Khoo, Ziyou Song

    Abstract: An accurate estimation of the state of health (SOH) of batteries is critical to ensuring the safe and reliable operation of electric vehicles (EVs). Feature-based machine learning methods have exhibited enormous potential for rapidly and precisely monitoring battery health status. However, simultaneously using various health indicators (HIs) may weaken estimation performance due to feature redunda… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Journal ref: Applied Energy (2024)

  33. arXiv:2309.14109  [pdf, other

    eess.AS cs.SD

    Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification

    Authors: Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li

    Abstract: It is widely acknowledged that discriminative representation for speaker verification can be extracted from verbal speech. However, how much speaker information that non-verbal vocalization carries is still a puzzle. This paper explores speaker verification based on the most ubiquitous form of non-verbal voice, laughter. First, we use a semi-automatic pipeline to collect a new Haha-Pod dataset fro… ▽ More

    Submitted 9 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: accepted by ASRU 2023

  34. arXiv:2309.08883  [pdf, other

    cs.AI cs.CC cs.LO

    Solving Satisfiability Modulo Counting for Symbolic and Statistical AI Integration With Provable Guarantees

    Authors: **zhao Li, Nan Jiang, Yexiang Xue

    Abstract: Satisfiability Modulo Counting (SMC) encompasses problems that require both symbolic decision-making and statistical reasoning. Its general formulation captures many real-world problems at the intersection of symbolic and statistical Artificial Intelligence. SMC searches for policy interventions to control probabilistic outcomes. Solving SMC is challenging because of its highly intractable nature(… ▽ More

    Submitted 30 December, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

  35. arXiv:2309.07934  [pdf, other

    cs.NE cs.AI cs.LG

    Racing Control Variable Genetic Programming for Symbolic Regression

    Authors: Nan Jiang, Yexiang Xue

    Abstract: Symbolic regression, as one of the most crucial tasks in AI for science, discovers governing equations from experimental data. Popular approaches based on genetic programming, Monte Carlo tree search, or deep reinforcement learning learn symbolic regression from a fixed dataset. They require massive datasets and long training time especially when learning complex equations involving many variables… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  36. arXiv:2309.06256  [pdf, other

    cs.LG

    Mitigating the Alignment Tax of RLHF

    Authors: Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang

    Abstract: LLMs acquire a wide range of abilities during pre-training, but aligning LLMs under Reinforcement Learning with Human Feedback (RLHF) can lead to forgetting, which is also known as the alignment tax. To empirically verify this hypothesis, we conducted experiments with existing RLHF algorithms using OpenLLaMA-3B, which revealed a pronounced alignment tax in NLP tasks. On the other hand, despite var… ▽ More

    Submitted 5 February, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 28 Pages

  37. arXiv:2309.01807  [pdf, other

    cs.LG cs.AI cs.RO

    Marginalized Importance Sampling for Off-Environment Policy Evaluation

    Authors: Pulkit Katdare, Nan Jiang, Katherine Driggs-Campbell

    Abstract: Reinforcement Learning (RL) methods are typically sample-inefficient, making it challenging to train and deploy RL-policies in real world robots. Even a robust policy trained in simulation requires a real-world deployment to assess their performance. This paper proposes a new approach to evaluate the real-world performance of agent policies prior to deploying them in the real world. Our approach i… ▽ More

    Submitted 4 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  38. arXiv:2308.08766  [pdf, other

    eess.AS cs.SD

    The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023

    Authors: Ze Li, Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li

    Abstract: This paper is the system description of the DKU-MSXF System for the track1, track2 and track3 of the VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC-23). For Track 1, we utilize a network structure based on ResNet for training. By constructing a cross-age QMF training set, we achieve a substantial improvement in system performance. For Track 2, we inherite the pre-trained model from Track 1 an… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.05092

  39. arXiv:2308.07056  [pdf, other

    eess.AS cs.MM cs.SD

    VoxBlink: A Large Scale Speaker Verification Dataset on Camera

    Authors: Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiyang Wu, Ming Li

    Abstract: In this paper, we introduce a large-scale and high-quality audio-visual speaker verification dataset, named VoxBlink. We propose an innovative and robust automatic audio-visual data mining pipeline to curate this dataset, which contains 1.45M utterances from 38K speakers. Due to the inherent nature of automated data collection, introducing noisy data is inevitable. Therefore, we also utilize a mul… ▽ More

    Submitted 12 December, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted By ICASSP2024

  40. arXiv:2307.13332  [pdf, other

    cs.LG cs.AI stat.ML

    The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation

    Authors: Philip Amortila, Nan Jiang, Csaba Szepesvári

    Abstract: Theoretical guarantees in reinforcement learning (RL) are known to suffer multiplicative blow-up factors with respect to the misspecification error of function approximation. Yet, the nature of such \emph{approximation factors} -- especially their optimal form in a given learning problem -- is poorly understood. In this paper we study this question in linear off-policy value function estimation, w… ▽ More

    Submitted 14 December, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to ICML 2023. The arXiv version contains improved results

  41. arXiv:2307.04630  [pdf, other

    cs.SD eess.AS

    The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task

    Authors: Kun Song, Yi lei, Peikun Chen, Yiqing Cao, Kun Wei, Yongmao Zhang, Lei Xie, Ning Jiang, Guoqing Zhao

    Abstract: This paper describes the NPU-MSXF system for the IWSLT 2023 speech-to-speech translation (S2ST) task which aims to translate from English speech of multi-source to Chinese speech. The system is built in a cascaded manner consisting of automatic speech recognition (ASR), machine translation (MT), and text-to-speech (TTS). We make tremendous efforts to handle the challenging multi-source input. Spec… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: IWSLT@ACL 2023 system paper. Our submitted system ranks 1st in the S2ST task of the IWSLT 2023 evaluation campaign

  42. arXiv:2307.04133  [pdf, other

    eess.IV cs.CV

    Ultrasonic Image's Annotation Removal: A Self-supervised Noise2Noise Approach

    Authors: Yuanheng Zhang, Nan Jiang, Zhaoheng Xie, Junying Cao, Yueyang Teng

    Abstract: Accurately annotated ultrasonic images are vital components of a high-quality medical report. Hospitals often have strict guidelines on the types of annotations that should appear on imaging results. However, manually inspecting these images can be a cumbersome task. While a neural network could potentially automate the process, training such a model typically requires a dataset of paired input an… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: 10 pages, 7 figures

  43. arXiv:2306.11378  [pdf

    cs.CV

    Multi-task Collaborative Pre-training and Individual-adaptive-tokens Fine-tuning: A Unified Framework for Brain Representation Learning

    Authors: Ning Jiang, Gongshu Wang, Tianyi Yan

    Abstract: Structural magnetic resonance imaging (sMRI) provides accurate estimates of the brain's structural organization and learning invariant brain representations from sMRI is an enduring issue in neuroscience. Previous deep representation learning models ignore the fact that the brain, as the core of human cognitive activity, is distinct from other organs whose primary attribute is anatomy. Therefore,… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  44. arXiv:2306.08057  [pdf, other

    cs.NE cs.AI cs.LG

    Symbolic Regression via Control Variable Genetic Programming

    Authors: Nan Jiang, Yexiang Xue

    Abstract: Learning symbolic expressions directly from experiment data is a vital step in AI-driven scientific discovery. Nevertheless, state-of-the-art approaches are limited to learning simple expressions. Regressing expressions involving many independent variables still remain out of reach. Motivated by the control variable experiments widely utilized in science, we propose Control Variable Genetic Progra… ▽ More

    Submitted 25 May, 2023; originally announced June 2023.

  45. arXiv:2306.06388  [pdf, other

    cs.CV

    From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm

    Authors: Kun Zhou, Wenbo Li, Nianjuan Jiang, Xiaoguang Han, Jiangbo Lu

    Abstract: Neural radiance fields (NeRF) have shown great success in novel view synthesis. However, recovering high-quality details from real-world scenes is still challenging for the existing NeRF-based approaches, due to the potential imperfect calibration information and scene representation inaccuracy. Even with high-quality training frames, the synthetic novel views produced by NeRF models still suffer… ▽ More

    Submitted 13 December, 2023; v1 submitted 10 June, 2023; originally announced June 2023.

    Comments: 17 pages, 17 figures. To appear in TPAMI2023. Project Page: https://redrock303.github.io/nerflix_plus/. arXiv admin note: text overlap with arXiv:2303.06919

  46. arXiv:2306.05297  [pdf

    eess.IV cs.CV

    Connectional-Style-Guided Contextual Representation Learning for Brain Disease Diagnosis

    Authors: Gongshu Wang, Ning Jiang, Yunxiao Ma, Tiantian Liu, Duanduan Chen, **glong Wu, Guoqi Li, Dong Liang, Tianyi Yan

    Abstract: Structural magnetic resonance imaging (sMRI) has shown great clinical value and has been widely used in deep learning (DL) based computer-aided brain disease diagnosis. Previous approaches focused on local shapes and textures in sMRI that may be significant only within a particular domain. The learned representations are likely to contain spurious information and have a poor generalization ability… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  47. arXiv:2306.03324  [pdf, other

    cs.SE

    Impact of Large Language Models on Generating Software Specifications

    Authors: Danning Xie, Byungwoo Yoo, Nan Jiang, Mijung Kim, Lin Tan, Xiangyu Zhang, Judy S. Lee

    Abstract: Software specifications are essential for ensuring the reliability of software systems. Existing specification extraction approaches, however, suffer from limited generalizability and require manual efforts. The recent emergence of Large Language Models (LLMs), which have been successfully applied to numerous software engineering tasks, offers a promising avenue for automating this process. In thi… ▽ More

    Submitted 2 October, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  48. arXiv:2306.02546  [pdf, other

    cs.SE

    Leveraging Generative Models to Recover Variable Names from Stripped Binary

    Authors: Xiangzhe Xu, Zhuo Zhang, Zian Su, Ziyang Huang, Shiwei Feng, Yapeng Ye, Nan Jiang, Danning Xie, Siyuan Cheng, Lin Tan, Xiangyu Zhang

    Abstract: Decompilation aims to recover the source code form of a binary executable. It has many security applications such as malware analysis, vulnerability detection and code hardening. A prominent challenge in decompilation is to recover variable names. We propose a novel technique that leverages the strengths of generative models while suppressing potential hallucinations and overcoming the input token… ▽ More

    Submitted 30 April, 2024; v1 submitted 4 June, 2023; originally announced June 2023.

  49. arXiv:2305.18607  [pdf, other

    cs.SE cs.AI cs.CR

    How Effective Are Neural Networks for Fixing Security Vulnerabilities

    Authors: Yi Wu, Nan Jiang, Hung Viet Pham, Thibaud Lutellier, Jordan Davis, Lin Tan, Petr Babkin, Sameena Shah

    Abstract: Security vulnerability repair is a difficult task that is in dire need of automation. Two groups of techniques have shown promise: (1) large code language models (LLMs) that have been pre-trained on source code for tasks such as code completion, and (2) automated program repair (APR) techniques that use deep learning (DL) models to automatically fix software bugs. This paper is the first to stud… ▽ More

    Submitted 1 April, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: This paper was accepted in the proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2023), and was presented at the conference, that was held in Seattle, USA, 17-21 July 2023

  50. arXiv:2305.12798  [pdf, other

    cs.CL cs.AI cs.LG

    Word Embeddings Are Steers for Language Models

    Authors: Chi Han, Jialiang Xu, Manling Li, Yi Fung, Chenkai Sun, Nan Jiang, Tarek Abdelzaher, Heng Ji

    Abstract: Language models (LMs) automatically learn word embeddings during pre-training on language corpora. Although word embeddings are usually interpreted as feature vectors for individual words, their roles in language model generation remain underexplored. In this work, we theoretically and empirically revisit output word embeddings and find that their linear transformations are equivalent to steering… ▽ More

    Submitted 6 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2024 Long Paper, 9 pages, 3 figures