Skip to main content

Showing 1–50 of 84 results for author: Ji, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19280  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

    Authors: Junying Chen, Ruyi Ouyang, Anningzhe Gao, Shunian Chen, Guiming Hardy Chen, Xidong Wang, Ruifei Zhang, Zhenyang Cai, Ke Ji, Guangjun Yu, Xiang Wan, Benyou Wang

    Abstract: The rapid development of multimodal large language models (MLLMs), such as GPT-4V, has led to significant advancements. However, these models still face challenges in medical multimodal capabilities due to limitations in the quantity and quality of medical vision-text data, stemming from data privacy concerns and high annotation costs. While pioneering approaches utilize PubMed's large-scale, de-i… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.18034  [pdf, other

    cs.CL

    LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them

    Authors: Wenya Xie, Qingying Xiao, Yu Zheng, Xidong Wang, Junying Chen, Ke Ji, Anningzhe Gao, Xiang Wan, Feng Jiang, Benyou Wang

    Abstract: The recent success of Large Language Models (LLMs) has had a significant impact on the healthcare field, providing patients with medical advice, diagnostic information, and more. However, due to a lack of professional medical knowledge, patients are easily misled by generated erroneous information from LLMs, which may result in serious medical problems. To address this issue, we focus on tuning th… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.16087  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy

    Authors: Chen Wang, Kaiyi Ji, Junyi Geng, Zhongqiang Ren, Taimeng Fu, Fan Yang, Yifan Guo, Haonan He, Xiangyu Chen, Zitong Zhan, Qiwei Du, Shaoshu Su, Bowen Li, Yuheng Qiu, Yi Du, Qihang Li, Yifan Yang, Xiao Lin, Zhipeng Zhao

    Abstract: Data-driven methods such as reinforcement and imitation learning have achieved remarkable success in robot autonomy. However, their data-centric nature still hinders them from generalizing well to ever-changing environments. Moreover, collecting large datasets for robotic tasks is often impractical and expensive. To overcome these challenges, we introduce a new self-supervised neural-symbolic (NeS… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  4. arXiv:2406.00606  [pdf, other

    cs.CL

    LLMs Could Autonomously Learn Without External Supervision

    Authors: Ke Ji, Junying Chen, Anningzhe Gao, Wenya Xie, Xiang Wan, Benyou Wang

    Abstract: In the quest for super-human performance, Large Language Models (LLMs) have traditionally been tethered to human-annotated datasets and predefined training objectives-a process that is both labor-intensive and inherently limited. This paper presents a transformative approach: Autonomous Learning for LLMs, a self-sufficient learning paradigm that frees models from the constraints of human supervisi… ▽ More

    Submitted 6 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: 20 pages, 8 figures

  5. arXiv:2405.19440  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence of Multi-objective Optimization under Generalized Smoothness

    Authors: Qi Zhang, Peiyao Xiao, Kaiyi Ji, Shaofeng Zou

    Abstract: Multi-objective optimization (MOO) is receiving more attention in various fields such as multi-task learning. Recent works provide some effective algorithms with theoretical analysis but they are limited by the standard $L$-smooth or bounded-gradient assumptions, which are typically unsatisfactory for neural networks, such as recurrent neural networks (RNNs) and transformers. In this paper, we stu… ▽ More

    Submitted 1 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.17583  [pdf, other

    cs.LG

    Understanding Forgetting in Continual Learning with Linear Regression

    Authors: Meng Ding, Kaiyi Ji, Di Wang, **hui Xu

    Abstract: Continual learning, focused on sequentially learning multiple tasks, has gained significant attention recently. Despite the tremendous progress made in the past, the theoretical understanding, especially factors contributing to catastrophic forgetting, remains relatively unexplored. In this paper, we provide a general theoretical analysis of forgetting in the linear regression model via Stochastic… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: To be published in The 41st International Conference on Machine Learning

  7. arXiv:2405.17137  [pdf, other

    cs.CV

    Jump-teaching: Ultra Efficient and Robust Learning with Noisy Label

    Authors: Kangye Ji, Fei Cheng, Zeqing Wang, Bohu Huang

    Abstract: Sample selection is the most straightforward technique to combat label noise, aiming to distinguish mislabeled samples during training and avoid the degradation of the robustness of the model. In the workflow, $\textit{selecting possibly clean data}$ and $\textit{model update}$ are iterative. However, their interplay and intrinsic characteristics hinder the robustness and efficiency of learning wi… ▽ More

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2405.16077  [pdf, ps, other

    cs.LG

    Finite-Time Analysis for Conflict-Avoidant Multi-Task Reinforcement Learning

    Authors: Yudan Wang, Peiyao Xiao, Hao Ban, Kaiyi Ji, Shaofeng Zou

    Abstract: Multi-task reinforcement learning (MTRL) has shown great promise in many real-world applications. Existing MTRL algorithms often aim to learn a policy that optimizes individual objective functions simultaneously with a given prior preference (or weights) on different tasks. However, these methods often suffer from the issue of \textit{gradient conflict} such that the tasks with larger gradients do… ▽ More

    Submitted 10 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: Initial submission at the 41$^{st}$ International Conference on Machine Learning

  9. arXiv:2405.12480  [pdf, other

    cs.HC cs.IR

    Towards Detecting and Mitigating Cognitive Bias in Spoken Conversational Search

    Authors: Kaixin Ji, Sachin Pathiyan Cherumanal, Johanne R. Trippas, Danula Hettiachchi, Flora D. Salim, Falk Scholer, Damiano Spina

    Abstract: Instruments such as eye-tracking devices have contributed to understanding how users interact with screen-based search engines. However, user-system interactions in audio-only channels -- as is the case for Spoken Conversational Search (SCS) -- are harder to characterize, given the lack of instruments to effectively and precisely capture interactions. Furthermore, in this era of information overlo… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  10. arXiv:2405.04453  [pdf, other

    cs.AI

    Towards Continual Knowledge Graph Embedding via Incremental Distillation

    Authors: Jiajun Liu, Wenjun Ke, Peng Wang, Ziyu Shang, **hua Gao, Guozheng Li, Ke Ji, Yanhe Liu

    Abstract: Traditional knowledge graph embedding (KGE) methods typically require preserving the entire knowledge graph (KG) with significant training costs when new knowledge emerges. To address this issue, the continual knowledge graph embedding (CKGE) task has been proposed to train the KGE model by learning emerging knowledge efficiently while simultaneously preserving decent old knowledge. However, the e… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by AAAI 2024

  11. arXiv:2405.00675  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Self-Play Preference Optimization for Language Model Alignment

    Authors: Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu

    Abstract: Traditional reinforcement learning from human feedback (RLHF) approaches relying on parametric models like the Bradley-Terry model fall short in capturing the intransitivity and irrationality in human preferences. Recent advancements suggest that directly working with preference probabilities can yield a more accurate reflection of human preferences, enabling more flexible and accurate language mo… ▽ More

    Submitted 14 June, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: 27 pages, 4 figures, 5 tables

  12. Characterizing Information Seeking Processes with Multiple Physiological Signals

    Authors: Kaixin Ji, Danula Hettiachchi, Flora D. Salim, Falk Scholer, Damiano Spina

    Abstract: Information access systems are getting complex, and our understanding of user behavior during information seeking processes is mainly drawn from qualitative methods, such as observational studies or surveys. Leveraging the advances in sensing technologies, our study aims to characterize user behaviors with physiological signals, particularly in relation to cognitive load, affective arousal, and va… ▽ More

    Submitted 7 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    ACM Class: H.5; H.3.3; C.3

    Journal ref: In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024, Washington, DC, USA. ACM, New York, NY, USA, 12 pages

  13. arXiv:2404.17809  [pdf, other

    cs.CL cs.AI

    Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction

    Authors: Guozheng Li, Peng Wang, Wenjun Ke, Yikai Guo, Ke Ji, Ziyu Shang, Jiajun Liu, Zijie Xu

    Abstract: Relation extraction (RE) aims to identify relations between entities mentioned in texts. Although large language models (LLMs) have demonstrated impressive in-context learning (ICL) abilities in various tasks, they still suffer from poor performances compared to most supervised fine-tuned RE methods. Utilizing ICL for RE with LLMs encounters two challenges: (1) retrieving good demonstrations from… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  14. arXiv:2404.17807  [pdf, other

    cs.CL cs.AI

    Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors

    Authors: Guozheng Li, Peng Wang, Jiajun Liu, Yikai Guo, Ke Ji, Ziyu Shang, Zijie Xu

    Abstract: Relation extraction (RE) is an important task that aims to identify the relationships between entities in texts. While large language models (LLMs) have revealed remarkable in-context learning (ICL) capability for general zero and few-shot learning, recent studies indicate that current LLMs still struggle with zero and few-shot RE. Previous studies are mainly dedicated to design prompt formats and… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  15. arXiv:2404.17802  [pdf, other

    cs.CL cs.AI

    Empirical Analysis of Dialogue Relation Extraction with Large Language Models

    Authors: Guozheng Li, Zijie Xu, Ziyu Shang, Jiajun Liu, Ke Ji, Yikai Guo

    Abstract: Dialogue relation extraction (DRE) aims to extract relations between two arguments within a dialogue, which is more challenging than standard RE due to the higher person pronoun frequency and lower information density in dialogues. However, existing DRE methods still suffer from two serious issues: (1) hard to capture long and sparse multi-turn information, and (2) struggle to extract golden relat… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  16. arXiv:2404.04890  [pdf, other

    cs.CV

    A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals

    Authors: Jiangnan Tang, **gya Wang, Kaiyang Ji, Lan Xu, **gyi Yu, Ye Shi

    Abstract: Estimating full-body human motion via sparse tracking signals from head-mounted displays and hand controllers in 3D scenes is crucial to applications in AR/VR. One of the biggest challenges to this task is the one-to-many map** from sparse observations to dense full-body motions, which endowed inherent ambiguities. To help resolve this ambiguous problem, we introduce a new framework to combine r… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  17. arXiv:2404.00520  [pdf, other

    cs.RO math.OC

    Competition-Aware Decision-Making Approach for Mobile Robots in Racing Scenarios

    Authors: Kyoungtae Ji, Sangjae Bae, Nan Li, Kyoungseok Han

    Abstract: This paper presents a game-theoretic strategy for racing, where the autonomous ego agent seeks to block a racing opponent that aims to overtake the ego agent. After a library of trajectory candidates and an associated reward matrix are constructed, the optimal trajectory in terms of maximizing the cumulative reward over the planning horizon is determined based on the level-K reasoning framework. I… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 7 pages, 8 figures

  18. arXiv:2402.17241  [pdf, other

    cs.CR cs.PL cs.SE

    HardTaint: Production-Run Dynamic Taint Analysis via Selective Hardware Tracing

    Authors: Yiyu Zhang, Tianyi Liu, Yueyang Wang, Yun Qi, Kai Ji, Jian Tang, Xiaoliang Wang, Xuandong Li, Zhiqiang Zuo

    Abstract: Dynamic taint analysis (DTA), as a fundamental analysis technique, is widely used in security, privacy, and diagnosis, etc. As DTA demands to collect and analyze massive taint data online, it suffers extremely high runtime overhead. Over the past decades, numerous attempts have been made to lower the overhead of DTA. Unfortunately, the reductions they achieved are marginal, causing DTA only applic… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  19. arXiv:2402.15638  [pdf, other

    cs.LG

    Fair Resource Allocation in Multi-Task Learning

    Authors: Hao Ban, Kaiyi Ji

    Abstract: By jointly learning multiple tasks, multi-task learning (MTL) can leverage the shared knowledge across tasks, resulting in improved data efficiency and generalization performance. However, a major challenge in MTL lies in the presence of conflicting gradients, which can hinder the fair optimization of some tasks and subsequently impede MTL's ability to achieve better overall performance. Inspired… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  20. arXiv:2402.13741  [pdf, other

    cs.CL cs.AI

    Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction

    Authors: Guozheng Li, Wenjun Ke, Peng Wang, Zijie Xu, Ke Ji, Jiajun Liu, Ziyu Shang, Qiqing Luo

    Abstract: The in-context learning (ICL) for relational triple extraction (RTE) has achieved promising performance, but still encounters two key challenges: (1) how to design effective prompts and (2) how to select proper demonstrations. Existing methods, however, fail to address these challenges appropriately. On the one hand, they usually recast RTE task to text-to-text prompting formats, which is unnatura… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: LREC-COLING 2024

  21. arXiv:2402.10210  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

    Authors: Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu

    Abstract: Fine-tuning Diffusion Models remains an underexplored frontier in generative artificial intelligence (GenAI), especially when compared with the remarkable progress made in fine-tuning Large Language Models (LLMs). While cutting-edge diffusion models such as Stable Diffusion (SD) and SDXL rely on supervised fine-tuning, their performance inevitably plateaus after seeing a certain volume of data. Re… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 28 pages, 8 figures, 10 tables

  22. arXiv:2402.09401  [pdf, other

    cs.LG cs.AI cs.CL math.OC stat.ML

    Reinforcement Learning from Human Feedback with Active Queries

    Authors: Kaixuan Ji, Jiafan He, Quanquan Gu

    Abstract: Aligning large language models (LLM) with human preference plays a key role in building modern generative models and can be achieved by reinforcement learning from human feedback (RLHF). Despite their superior performance, current RLHF approaches often require a large amount of human-labelled preference data, which is expensive to collect. In this paper, inspired by the success of active learning,… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 28 pages, 1 figure, 4 table

  23. arXiv:2402.06864  [pdf, other

    cs.LG cs.AI

    Discriminative Adversarial Unlearning

    Authors: Rohan Sharma, Shijie Zhou, Kaiyi Ji, Changyou Chen

    Abstract: We introduce a novel machine unlearning framework founded upon the established principles of the min-max optimization paradigm. We capitalize on the capabilities of strong Membership Inference Attacks (MIA) to facilitate the unlearning of specific samples from a trained model. We consider the scenario of two networks, the attacker $\mathbf{A}$ and the trained defender $\mathbf{D}$ pitted against e… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 13 pages including references, 2 tables, 2 figures and 1 algorithm

  24. arXiv:2401.10559  [pdf, other

    cs.LG cs.AI cs.CL

    OrchMoE: Efficient Multi-Adapter Learning with Task-Skill Synergy

    Authors: Haowen Wang, Tao Sun, Kaixiang Ji, Jian Wang, Cong Fan, **jie Gu

    Abstract: We advance the field of Parameter-Efficient Fine-Tuning (PEFT) with our novel multi-adapter method, OrchMoE, which capitalizes on modular skill architecture for enhanced forward transfer in neural networks. Unlike prior models that depend on explicit task identification inputs, OrchMoE automatically discerns task categories, streamlining the learning process. This is achieved through an integrated… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 9 pages, 3 figures

  25. Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered Chatbot

    Authors: Sachin Pathiyan Cherumanal, Lin Tian, Futoon M. Abushaqra, Angel Felipe Magnossao de Paula, Kaixin Ji, Danula Hettiachchi, Johanne R. Trippas, Halil Ali, Falk Scholer, Damiano Spina

    Abstract: Creating and deploying customized applications is crucial for operational success and enriching user experiences in the rapidly evolving modern business world. A prominent facet of modern user experiences is the integration of chatbots or voice assistants. The rapid evolution of Large Language Models (LLMs) has provided a powerful tool to build conversational applications. We present Walert, a cus… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: Accepted at 2024 ACM SIGIR CHIIR

  26. arXiv:2401.01335  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

    Authors: Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu

    Abstract: Harnessing the power of human-annotated data through Supervised Fine-Tuning (SFT) is pivotal for advancing Large Language Models (LLMs). In this paper, we delve into the prospect of growing a strong LLM out of a weak one without the need for acquiring additional human-annotated data. We propose a new fine-tuning method called Self-Play fIne-tuNing (SPIN), which starts from a supervised fine-tuned… ▽ More

    Submitted 14 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: 22 pages, 6 figures, 7 tables. In ICML 2024

  27. arXiv:2312.08212  [pdf, other

    cs.CV

    LAMM: Label Alignment for Multi-Modal Prompt Learning

    Authors: **gsheng Gao, Jiacheng Ruan, Suncheng Xiang, Zefang Yu, Ke Ji, Mingye Xie, Ting Liu, Yuzhuo Fu

    Abstract: With the success of pre-trained visual-language (VL) models such as CLIP in visual representation tasks, transferring pre-trained models to downstream tasks has become a crucial paradigm. Recently, the prompt tuning paradigm, which draws inspiration from natural language processing (NLP), has made significant progress in VL field. However, preceding methods mainly focus on constructing prompt temp… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024 Main Conference

  28. arXiv:2312.03807  [pdf, other

    math.OC cs.LG stat.ML

    Achieving ${O}(ε^{-1.5})$ Complexity in Hessian/Jacobian-free Stochastic Bilevel Optimization

    Authors: Yifan Yang, Peiyao Xiao, Kaiyi Ji

    Abstract: In this paper, we revisit the bilevel optimization problem, in which the upper-level objective function is generally nonconvex and the lower-level objective function is strongly convex. Although this type of problem has been studied extensively, it still remains an open question how to achieve an ${O}(ε^{-1.5})$ sample complexity in Hessian/Jacobian-free stochastic bilevel optimization without any… ▽ More

    Submitted 20 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

  29. arXiv:2311.00186  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR cs.CV

    Image Restoration with Point Spread Function Regularization and Active Learning

    Authors: Peng Jia, Jiameng Lv, Runyu Ning, Yu Song, Nan Li, Kaifan Ji, Chenzhou Cui, Shanshan Li

    Abstract: Large-scale astronomical surveys can capture numerous images of celestial objects, including galaxies and nebulae. Analysing and processing these images can reveal intricate internal structures of these objects, allowing researchers to conduct comprehensive studies on their morphology, evolution, and physical properties. However, varying noise levels and point spread functions can hamper the accur… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: To be published in the MNRAS

  30. arXiv:2310.16242  [pdf, other

    cs.LG cs.CL

    ZzzGPT: An Interactive GPT Approach to Enhance Sleep Quality

    Authors: Yonchanok Khaokaew, Kaixin Ji, Thuc Hanh Nguyen, Hiruni Kegalle, Marwah Alaofi, Hao Xue, Flora D. Salim

    Abstract: This paper explores the intersection of technology and sleep pattern comprehension, presenting a cutting-edge two-stage framework that harnesses the power of Large Language Models (LLMs). The primary objective is to deliver precise sleep predictions paired with actionable feedback, addressing the limitations of existing solutions. This innovative approach involves leveraging the GLOBEM dataset alo… ▽ More

    Submitted 6 May, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  31. arXiv:2310.13304  [pdf, other

    cs.HC

    "Living Within Four Walls": Exploring Emotional and Social Dynamics in Mobile Usage During Home Confinement

    Authors: Nan Gao, Sam Nolan, Kaixin Ji, Shakila Khan Rumi, Judith Simone Heinisch, Christoph Anderson, Klaus David, Flora D. Salim

    Abstract: Home confinement, a situation experienced by individuals for reasons ranging from medical quarantines, rehabilitation needs, disability accommodations, and remote working, is a common yet impactful aspect of modern life. While essential in various scenarios, confinement within the home environment can profoundly influence psychological well-being and digital device usage. In this study, we delve i… ▽ More

    Submitted 8 June, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  32. arXiv:2310.10590  [pdf, other

    cs.CL

    Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment

    Authors: Ji Qi, Kaixuan Ji, Xiaozhi Wang, Jifan Yu, Kaisheng Zeng, Lei Hou, Juanzi Li, Bin Xu

    Abstract: Open Information Extraction (OIE) aims to extract objective structured knowledge from natural texts, which has attracted growing attention to build dedicated models with human experience. As the large language models (LLMs) have exhibited remarkable in-context learning capabilities, a question arises as to whether the task of OIE can be effectively tackled with this paradigm? In this paper, we exp… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  33. arXiv:2310.10586  [pdf, other

    cs.CV cs.CL

    VidCoM: Fast Video Comprehension through Large Language Models with Multimodal Tools

    Authors: Ji Qi, Kaixuan Ji, Jifan Yu, Duokang Wang, Bin Xu, Lei Hou, Juanzi Li

    Abstract: Building models that comprehends videos and responds specific user instructions is a practical and challenging topic, as it requires mastery of both vision understanding and knowledge reasoning. Compared to language and image modalities, training efficiency remains a serious problem as existing studies train models on massive sparse videos paired with brief descriptions. In this paper, we introduc… ▽ More

    Submitted 27 April, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  34. Designing and Evaluating Presentation Strategies for Fact-Checked Content

    Authors: Danula Hettiachchi, Kaixin Ji, Jenny Kennedy, Anthony McCosker, Flora D. Salim, Mark Sanderson, Falk Scholer, Damiano Spina

    Abstract: With the rapid growth of online misinformation, it is crucial to have reliable fact-checking methods. Recent research on finding check-worthy claims and automated fact-checking have made significant advancements. However, limited guidance exists regarding the presentation of fact-checked content to effectively convey verified information to users. We address this research gap by exploring the crit… ▽ More

    Submitted 23 December, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: Accepted to the 32nd ACM International Conference on Information and Knowledge Management (CIKM '23)

    ACM Class: H.3.3; H.5.2

  35. arXiv:2308.03811  [pdf, other

    math.OC cs.LG

    Non-Convex Bilevel Optimization with Time-Varying Objective Functions

    Authors: Sen Lin, Daouda Sow, Kaiyi Ji, Yingbin Liang, Ness Shroff

    Abstract: Bilevel optimization has become a powerful tool in a wide variety of machine learning problems. However, the current nonconvex bilevel optimization considers an offline dataset and static functions, which may not work well in emerging online applications with streaming data and time-varying functions. In this work, we study online bilevel optimization (OBO) where the functions can be time-varying… ▽ More

    Submitted 8 November, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Journal ref: NeurIPS 2023

  36. arXiv:2308.02583  [pdf, other

    quant-ph cs.IT hep-th math-ph

    Postselected communication over quantum channels

    Authors: Kaiyuan Ji, Bartosz Regula, Mark M. Wilde

    Abstract: The single-letter characterisation of the entanglement-assisted capacity of a quantum channel is one of the seminal results of quantum information theory. In this paper, we consider a modified communication scenario in which the receiver is allowed an additional, `inconclusive' measurement outcome, and we employ an error metric given by the error probability in decoding the transmitted message con… ▽ More

    Submitted 21 March, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: 38 pages, 5 figures, submitted to International Journal of Quantum Information (IJQI) as part of a special issue dedicated to Alexander S. Holevo on the occasion of his 80th birthday

  37. arXiv:2308.02570  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Learning Implicit Entity-object Relations by Bidirectional Generative Alignment for Multimodal NER

    Authors: Feng Chen, Jiajia Liu, Kaixiang Ji, Wang Ren, Jian Wang, **gdong Wang

    Abstract: The challenge posed by multimodal named entity recognition (MNER) is mainly two-fold: (1) bridging the semantic gap between text and image and (2) matching the entity with its associated object in image. Existing methods fail to capture the implicit entity-object relations, due to the lack of corresponding annotation. In this paper, we propose a bidirectional generative alignment method named BGA-… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  38. arXiv:2305.19442  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    SimFBO: Towards Simple, Flexible and Communication-efficient Federated Bilevel Learning

    Authors: Yifan Yang, Peiyao Xiao, Kaiyi Ji

    Abstract: Federated bilevel optimization (FBO) has shown great potential recently in machine learning and edge computing due to the emerging nested optimization structure in meta-learning, fine-tuning, hyperparameter tuning, etc. However, existing FBO algorithms often involve complicated computations and require multiple sub-loops per iteration, each of which contains a number of communication rounds. In th… ▽ More

    Submitted 27 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  39. arXiv:2305.18409  [pdf, other

    cs.LG math.OC stat.ML

    Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms

    Authors: Peiyao Xiao, Hao Ban, Kaiyi Ji

    Abstract: Multi-objective optimization (MOO) has become an influential framework in many machine learning problems with multiple objectives such as learning with multiple criteria and multi-task learning (MTL). In this paper, we propose a new direction-oriented multi-objective problem by regularizing the common descent direction within a neighborhood of a direction that optimizes a linear combination of obj… ▽ More

    Submitted 28 November, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

  40. arXiv:2305.17820  [pdf, other

    cs.CV

    Analysis of ROC for Edge Detectors

    Authors: Kai Yi Ji

    Abstract: This paper presents an evaluation of edge detectors using receiver operating characteristic (ROC) analysis on the BIPED dataset. Our study examines the benefits and drawbacks of applying this technique in Matlab. We observed that while ROC analysis is suitable for certain edge filters, but for filters such as Laplacian, Laplacian of Gaussian, and Canny, it presents challenges when accurately measu… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  41. arXiv:2305.16885  [pdf, other

    cs.CL

    Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification

    Authors: Ke Ji, Yixin Lian, **gsheng Gao, Baoyuan Wang

    Abstract: Due to the complex label hierarchy and intensive labeling cost in practice, the hierarchical text classification (HTC) suffers a poor performance especially when low-resource or few-shot settings are considered. Recently, there is a growing trend of applying prompts on pre-trained language models (PLMs), which has exhibited effectiveness in the few-shot flat text classification tasks. However, lim… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 14 pages, 8 figures, Accepted by ACL 2023

  42. arXiv:2305.08359  [pdf, other

    cs.LG math.OC stat.ML

    Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

    Authors: Kaixuan Ji, Qingyue Zhao, Jiafan He, Weitong Zhang, Quanquan Gu

    Abstract: Recent studies have shown that episodic reinforcement learning (RL) is no harder than bandits when the total reward is bounded by $1$, and proved regret bounds that have a polylogarithmic dependence on the planning horizon $H$. However, it remains an open question that if such results can be carried over to adversarial RL, where the reward is adversarially chosen at each episode. In this paper, we… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 34 pages

  43. Examining the Impact of Uncontrolled Variables on Physiological Signals in User Studies for Information Processing Activities

    Authors: Kaixin Ji, Damiano Spina, Danula Hettiachchi, Flora Dilys Salim, Falk Scholer

    Abstract: Physiological signals can potentially be applied as objective measures to understand the behavior and engagement of users interacting with information access systems. However, the signals are highly sensitive, and many controls are required in laboratory user studies. To investigate the extent to which controlled or uncontrolled (i.e., confounding) variables such as task sequence or duration influ… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted to the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23)

  44. arXiv:2302.05412  [pdf, other

    cs.LG math.OC stat.ML

    Achieving Linear Speedup in Non-IID Federated Bilevel Learning

    Authors: Minhui Huang, Dewei Zhang, Kaiyi Ji

    Abstract: Federated bilevel optimization has received increasing attention in various emerging machine learning and communication applications. Recently, several Hessian-vector-based algorithms have been proposed to solve the federated bilevel optimization problem. However, several important properties in federated learning such as the partial client participation and the linear speedup for convergence (i.e… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 25 pages. 10 figures, 1 table

  45. arXiv:2302.04969  [pdf, other

    cs.LG math.OC stat.ML

    Communication-Efficient Federated Hypergradient Computation via Aggregated Iterative Differentiation

    Authors: Peiyao Xiao, Kaiyi Ji

    Abstract: Federated bilevel optimization has attracted increasing attention due to emerging machine learning and communication applications. The biggest challenge lies in computing the gradient of the upper-level objective function (i.e., hypergradient) in the federated setting due to the nonlinear and distributed construction of a series of global Hessian matrices. In this paper, we propose a novel communi… ▽ More

    Submitted 15 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: 38 pages, 15 figures

  46. arXiv:2301.01801  [pdf, other

    cs.LG cs.NI math.OC stat.ML

    Network Utility Maximization with Unknown Utility Functions: A Distributed, Data-Driven Bilevel Optimization Approach

    Authors: Kaiyi Ji, Lei Ying

    Abstract: Fair resource allocation is one of the most important topics in communication networks. Existing solutions almost exclusively assume each user utility function is known and concave. This paper seeks to answer the following question: how to allocate resources when utility functions are unknown, even to the users? This answer has become increasingly important in the next-generation AI-aware communic… ▽ More

    Submitted 5 January, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 28 pages, 12 figures

  47. arXiv:2207.07087  [pdf, other

    cs.CL cs.IR cs.LG

    Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

    Authors: Weng Lam Tam, Xiao Liu, Kaixuan Ji, Lilong Xue, Xingjian Zhang, Yuxiao Dong, Jiahua Liu, Maodi Hu, Jie Tang

    Abstract: Prompt tuning attempts to update few task-specific parameters in pre-trained models. It has achieved comparable performance to fine-tuning of the full parameter set on both language understanding and generation tasks. In this work, we study the problem of prompt tuning for neural text retrievers. We introduce parameter-efficient prompt tuning for text retrieval across in-domain, cross-domain, and… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  48. arXiv:2205.14224  [pdf, other

    cs.LG math.OC stat.ML

    Will Bilevel Optimizers Benefit from Loops

    Authors: Kaiyi Ji, Mingrui Liu, Yingbin Liang, Lei Ying

    Abstract: Bilevel optimization has arisen as a powerful tool for solving a variety of machine learning problems. Two current popular bilevel optimizers AID-BiO and ITD-BiO naturally involve solving one or two sub-problems, and consequently, whether we solve these problems with loops (that take many iterations) or without loops (that take only a few iterations) can significantly affect the overall computatio… ▽ More

    Submitted 31 May, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: 32 pages, 2 figures, 3 tables

  49. arXiv:2204.00006  [pdf, other

    cs.LG math.OC

    Data Sampling Affects the Complexity of Online SGD over Dependent Data

    Authors: Shaocong Ma, Ziyi Chen, Yi Zhou, Kaiyi Ji, Yingbin Liang

    Abstract: Conventional machine learning applications typically assume that data samples are independently and identically distributed (i.i.d.). However, practical scenarios often involve a data-generating process that produces highly dependent data samples, which are known to heavily bias the stochastic optimization process and slow down the convergence of learning. In this paper, we conduct a fundamental s… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

  50. arXiv:2203.01123  [pdf, other

    math.OC cs.LG stat.ML

    A Primal-Dual Approach to Bilevel Optimization with Multiple Inner Minima

    Authors: Daouda Sow, Kaiyi Ji, Ziwei Guan, Yingbin Liang

    Abstract: Bilevel optimization has found extensive applications in modern machine learning problems such as hyperparameter optimization, neural architecture search, meta-learning, etc. While bilevel problems with a unique inner minimal point (e.g., where the inner function is strongly convex) are well understood, such a problem with multiple inner minimal points remains to be challenging and open. Existing… ▽ More

    Submitted 8 June, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: Submitted for publication