Skip to main content

Showing 1–50 of 172 results for author: Feng, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19693  [pdf, other

    cs.RO cs.CV

    MMRo: Are Multimodal LLMs Eligible as the Brain for In-Home Robotics?

    Authors: **ming Li, Yichen Zhu, Zhiyuan Xu, **dong Gu, Minjie Zhu, Xin Liu, Ning Liu, Yaxin Peng, Feifei Feng, Jian Tang

    Abstract: It is fundamentally challenging for robots to serve as useful assistants in human environments because this requires addressing a spectrum of sub-problems across robotics, including perception, language understanding, reasoning, and planning. The recent advancements in Multimodal Large Language Models (MLLMs) have demonstrated their exceptional abilities in solving complex mathematical problems, m… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.17503  [pdf, other

    cs.LG

    WAVE: Weight Template for Adaptive Initialization of Variable-sized Models

    Authors: Fu Feng, Yucheng Xie, **g Wang, Xin Geng

    Abstract: The expansion of model parameters underscores the significance of pre-trained models; however, the constraints encountered during model deployment necessitate models of variable sizes. Consequently, the traditional pre-training and fine-tuning paradigm fails to address the initialization problem when target models are incompatible with pre-trained models. We tackle this issue from a multitasking p… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.17182  [pdf, other

    cs.IR cs.LG

    Debiased Recommendation with Noisy Feedback

    Authors: Haoxuan Li, Chunyuan Zheng, Wenjie Wang, Hao Wang, Fuli Feng, Xiao-Hua Zhou

    Abstract: Ratings of a user to most items in recommender systems are usually missing not at random (MNAR), largely because users are free to choose which items to rate. To achieve unbiased learning of the prediction model under MNAR data, three typical solutions have been proposed, including error-imputation-based (EIB), inverse-propensity-scoring (IPS), and doubly robust (DR) methods. However, these method… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: KDD 24 Research Track Paper

  4. arXiv:2406.14900  [pdf, other

    cs.IR

    Decoding Matters: Addressing Amplification Bias and Homogeneity Issue for LLM-based Recommendation

    Authors: Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng

    Abstract: Adapting Large Language Models (LLMs) for recommendation requires careful consideration of the decoding process, given the inherent differences between generating items and natural language. Existing approaches often directly apply LLMs' original decoding methods. However, we find these methods encounter significant challenges: 1) amplification bias -- where standard length normalization inflates… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  5. arXiv:2406.14868  [pdf, other

    cs.CL cs.LG

    Direct Multi-Turn Preference Optimization for Language Agents

    Authors: Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng

    Abstract: Adapting Large Language Models (LLMs) for agent tasks is critical in develo** language agents. Direct Preference Optimization (DPO) is a promising technique for this adaptation with the alleviation of compounding errors, offering a means to directly optimize Reinforcement Learning (RL) objectives. However, applying DPO to multi-turn tasks presents challenges due to the inability to cancel the pa… ▽ More

    Submitted 25 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  6. arXiv:2406.13443  [pdf, other

    cs.CL

    Dual-Phase Accelerated Prompt Optimization

    Authors: Muchen Yang, Moxin Li, Yongle Li, Zijun Chen, Chongming Gao, Junqi Zhang, Yangyang Li, Fuli Feng

    Abstract: Gradient-free prompt optimization methods have made significant strides in enhancing the performance of closed-source Large Language Models (LLMs) across a wide range of tasks. However, existing approaches make light of the importance of high-quality prompt initialization and the identification of effective optimization directions, thus resulting in substantial optimization steps to obtain satisfa… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  7. arXiv:2406.11514  [pdf, other

    cs.CL

    Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs

    Authors: Yi Fang, Moxin Li, Wenjie Wang, Hui Lin, Fuli Feng

    Abstract: Large Language Models (LLMs) excel in various natural language processing tasks but struggle with hallucination issues. Existing solutions have considered utilizing LLMs' inherent reasoning abilities to alleviate hallucination, such as self-correction and diverse sampling methods. However, these methods often overtrust LLMs' initial answers due to inherent biases. The key to alleviating this issue… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.11497  [pdf, other

    cs.CL

    CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG

    Authors: Boyi Deng, Wenjie Wang, Fengbin Zhu, Qifan Wang, Fuli Feng

    Abstract: Retrieval-Augmented Generation (RAG) can alleviate hallucinations of Large Language Models (LLMs) by referencing external documents. However, the misinformation in external documents may mislead LLMs' generation. To address this issue, we explore the task of "credibility-aware RAG", in which LLMs automatically adjust the influence of retrieved documents based on their credibility scores to counter… ▽ More

    Submitted 27 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  9. arXiv:2406.06839  [pdf, other

    cs.CL

    EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction

    Authors: Li Yang, Qifan Wang, Jianfeng Chi, Jiahao Liu, **gang Wang, Fuli Feng, Zenglin Xu, Yi Fang, Lifu Huang, Dongfang Liu

    Abstract: Product attribute value extraction involves identifying the specific values associated with various attributes from a product profile. While existing methods often prioritize the development of effective models to improve extraction performance, there has been limited emphasis on extraction efficiency. However, in real-world scenarios, products are typically associated with multiple attributes, ne… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  10. arXiv:2406.04594  [pdf, other

    cs.DC cs.AI cs.LG

    Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

    Authors: Jianbo Dong, Bin Luo, Jun Zhang, Pengcheng Zhang, Fei Feng, Yikai Zhu, Ang Liu, Zian Chen, Yi Shi, Hairong Jiao, Gang Lu, Yu Guan, Ennan Zhai, Wencong Xiao, Hanyu Zhao, Man Yuan, Siran Yang, Xiang Li, Jiamang Wang, Rui Men, Jianwei Zhang, Huang Zhong, Dennis Cai, Yuan Xie, Binzhang Fu

    Abstract: The emergence of Large Language Models (LLMs) has necessitated the adoption of parallel training techniques, involving the deployment of thousands of GPUs to train a single model. Unfortunately, we have found that the efficiency of current parallel training is often suboptimal, largely due to the following two main issues. Firstly, hardware failures are inevitable, leading to interruptions in the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  11. arXiv:2406.03255  [pdf, other

    cs.LG

    On the Maximal Local Disparity of Fairness-Aware Classifiers

    Authors: **qiu **, Haoxuan Li, Fuli Feng

    Abstract: Fairness has become a crucial aspect in the development of trustworthy machine learning algorithms. Current fairness metrics to measure the violation of demographic parity have the following drawbacks: (i) the average difference of model predictions on two groups cannot reflect their distribution disparity, and (ii) the overall calculation along all possible predictions conceals the extreme local… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Journal ref: ICML 2024

  12. arXiv:2406.03210  [pdf, other

    cs.IR

    Text-like Encoding of Collaborative Information in Large Language Models for Recommendation

    Authors: Yang Zhang, Keqin Bao, Ming Yan, Wenjie Wang, Fuli Feng, Xiangnan He

    Abstract: When adapting Large Language Models for Recommendation (LLMRec), it is crucial to integrate collaborative information. Existing methods achieve this by learning collaborative embeddings in LLMs' latent space from scratch or by map** from external models. However, they fail to represent the information in a text-like format, which may not align optimally with LLMs. To bridge this gap, we introduc… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024

    ACM Class: H.3.3

  13. arXiv:2406.00755  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction

    Authors: Xiaoyuan Li, Wenjie Wang, Moxin Li, Junrong Guo, Yang Zhang, Fuli Feng

    Abstract: The rapid advancement of Large Language Models (LLMs) in the realm of mathematical reasoning necessitates comprehensive evaluations to gauge progress and inspire future directions. Existing assessments predominantly focus on problem-solving from the examinee perspective, overlooking a dual perspective of examiner regarding error identification and correction. From the examiner perspective, we defi… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  14. arXiv:2405.20626  [pdf, other

    cs.IR cs.IT

    Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems

    Authors: Shengyu Zhang, Ziqi Jiang, Jiangchao Yao, Fuli Feng, Kun Kuang, Zhou Zhao, Shuo Li, Hongxia Yang, Tat-Seng Chua, Fei Wu

    Abstract: Recommendation performance usually exhibits a long-tail distribution over users -- a small portion of head users enjoy much more accurate recommendation services than the others. We reveal two sources of this performance heterogeneity problem: the uneven distribution of historical interactions (a natural source); and the biased training of recommender models (a model source). As addressing this pr… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: TKDE 2023

  15. arXiv:2405.07314  [pdf, other

    cs.IR

    Learnable Tokenizer for LLM-based Generative Recommendation

    Authors: Wenjie Wang, Honghui Bao, Xinyu Lin, Jizhi Zhang, Yongqi Li, Fuli Feng, See-Kiong Ng, Tat-Seng Chua

    Abstract: Harnessing Large Language Models (LLMs) for generative recommendation has garnered significant attention due to LLMs' powerful capacities such as rich world knowledge and reasoning. However, a critical challenge lies in transforming recommendation data into the language space of LLMs through effective item tokenization. Existing approaches, such as ID identifiers, textual identifiers, and codebook… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  16. arXiv:2405.01063  [pdf, other

    cs.IR cs.CY cs.LG

    Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach

    Authors: Tianhao Shi, Yang Zhang, Jizhi Zhang, Fuli Feng, Xiangnan He

    Abstract: As recommender systems are indispensable in various domains such as job searching and e-commerce, providing equitable recommendations to users with different sensitive attributes becomes an imperative requirement. Prior approaches for enhancing fairness in recommender systems presume the availability of all sensitive attributes, which can be difficult to obtain due to privacy concerns or inadequat… ▽ More

    Submitted 27 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures, accepted by SIGIR'24

  17. arXiv:2404.19620  [pdf, other

    cs.LG cs.IR stat.ML

    Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference

    Authors: Haoxuan Li, Chunyuan Zheng, Sihao Ding, Peng Wu, Zhi Geng, Fuli Feng, Xiangnan He

    Abstract: Selection bias in recommender system arises from the recommendation process of system filtering and the interactive process of user selection. Many previous studies have focused on addressing selection bias to achieve unbiased learning of the prediction model, but ignore the fact that potential outcomes for a given user-item pair may vary with the treatments assigned to other user-item pairs, name… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: ICLR 24

  18. arXiv:2404.16924  [pdf, other

    cs.IR cs.CL

    A Survey of Generative Search and Recommendation in the Era of Large Language Models

    Authors: Yongqi Li, Xinyu Lin, Wenjie Wang, Fuli Feng, Liang Pang, Wenjie Li, Liqiang Nie, Xiangnan He, Tat-Seng Chua

    Abstract: With the information explosion on the Web, search and recommendation are foundational infrastructures to satisfying users' information needs. As the two sides of the same coin, both revolve around the same core research problem, matching queries with documents or users with items. In the recent few decades, search and recommendation have experienced synchronous technological paradigm shifts, inclu… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  19. arXiv:2404.10327  [pdf, other

    cs.IR

    Exact and Efficient Unlearning for Large Language Model-based Recommendation

    Authors: Zhiyu Hu, Yang Zhang, Minghao Xiao, Wenjie Wang, Fuli Feng, Xiangnan He

    Abstract: The evolving paradigm of Large Language Model-based Recommendation (LLMRec) customizes Large Language Models (LLMs) through parameter-efficient fine-tuning (PEFT) using recommendation data. The inclusion of user data in LLMs raises privacy concerns. To protect users, the unlearning process in LLMRec, specifically removing unusable data (e.g., historical behaviors) from established LLMRec models, b… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  20. arXiv:2404.06139  [pdf, other

    cs.CV

    DiffHarmony: Latent Diffusion Model Meets Image Harmonization

    Authors: Pengfei Zhou, Fangxiang Feng, Xiaojie Wang

    Abstract: Image harmonization, which involves adjusting the foreground of a composite image to attain a unified visual consistency with the background, can be conceptualized as an image-to-image translation task. Diffusion models have recently promoted the rapid development of image-to-image translation tasks . However, training diffusion models from scratch is computationally intensive. Fine-tuning pre-tra… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted by ICMR 2024

  21. arXiv:2403.17745  [pdf, other

    cs.LG

    Leave No Patient Behind: Enhancing Medication Recommendation for Rare Disease Patients

    Authors: Zihao Zhao, Yi **g, Fuli Feng, Jiancan Wu, Chongming Gao, Xiangnan He

    Abstract: Medication recommendation systems have gained significant attention in healthcare as a means of providing tailored and effective drug combinations based on patients' clinical information. However, existing approaches often suffer from fairness issues, as recommendations tend to be more accurate for patients with common diseases compared to those with rare conditions. In this paper, we propose a no… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  22. arXiv:2403.09972  [pdf, other

    cs.CL

    Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection

    Authors: Moxin Li, Wenjie Wang, Fuli Feng, Fengbin Zhu, Qifan Wang, Tat-Seng Chua

    Abstract: Self-detection for Large Language Model (LLM) seeks to evaluate the LLM output trustability by leveraging LLM's own capabilities, alleviating the output hallucination issue. However, existing self-detection approaches only retrospectively evaluate answers generated by LLM, typically leading to the over-trust in incorrectly generated answers. To tackle this limitation, we propose a novel self-detec… ▽ More

    Submitted 4 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Under review

  23. Proactive Recommendation with Iterative Preference Guidance

    Authors: Shuxian Bi, Wenjie Wang, Hang Pan, Fuli Feng, Xiangnan He

    Abstract: Recommender systems mainly tailor personalized recommendations according to user interests learned from user feedback. However, such recommender systems passively cater to user interests and even reinforce existing interests in the feedback loop, leading to problems like filter bubbles and opinion polarization. To counteract this, proactive recommendation actively steers users towards develo** n… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024 (Short)

  24. arXiv:2403.06199  [pdf, other

    cs.CV cs.CL

    Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models

    Authors: Minjie Zhu, Yichen Zhu, Xin Liu, Ning Liu, Zhiyuan Xu, Chaomin Shen, Yaxin Peng, Zhicai Ou, Feifei Feng, Jian Tang

    Abstract: Multimodal Large Language Models (MLLMs) have showcased impressive skills in tasks related to visual understanding and reasoning. Yet, their widespread application faces obstacles due to the high computational demands during both the training and inference phases, restricting their use to a limited audience within the research and user communities. In this paper, we investigate the design aspects… ▽ More

    Submitted 25 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  25. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  26. arXiv:2403.04399  [pdf, other

    cs.IR

    The 2nd Workshop on Recommendation with Generative Models

    Authors: Wenjie Wang, Yang Zhang, Xinyu Lin, Fuli Feng, Weiwen Liu, Yong Liu, Xiangyu Zhao, Wayne Xin Zhao, Yang Song, Xiangnan He

    Abstract: The rise of generative models has driven significant advancements in recommender systems, leaving unique opportunities for enhancing users' personalized recommendations. This workshop serves as a platform for researchers to explore and exchange innovative concepts related to the integration of generative models into recommender systems. It primarily focuses on five key perspectives: (i) improving… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  27. Uplift Modeling for Target User Attacks on Recommender Systems

    Authors: Wenjie Wang, Changsheng Wang, Fuli Feng, Wentao Shi, Daizong Ding, Tat-Seng Chua

    Abstract: Recommender systems are vulnerable to injective attacks, which inject limited fake users into the platforms to manipulate the exposure of target items to all users. In this work, we identify that conventional injective attackers overlook the fact that each item has its unique potential audience, and meanwhile, the attack difficulty across different users varies. Blindly attacking all users will re… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW'24 as an oral paper

  28. Lower-Left Partial AUC: An Effective and Efficient Optimization Metric for Recommendation

    Authors: Wentao Shi, Chenxu Wang, Fuli Feng, Yang Zhang, Wenjie Wang, Junkang Wu, Xiangnan He

    Abstract: Optimization metrics are crucial for building recommendation systems at scale. However, an effective and efficient metric for practical use remains elusive. While Top-K ranking metrics are the gold standard for optimization, they suffer from significant computational overhead. Alternatively, the more efficient accuracy and AUC metrics often fall short of capturing the true targets of recommendatio… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: WWW 2024; 15 pages

  29. arXiv:2403.00843  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Large Language Models are Learnable Planners for Long-Term Recommendation

    Authors: Wentao Shi, Xiangnan He, Yang Zhang, Chongming Gao, Xinyue Li, Jizhi Zhang, Qifan Wang, Fuli Feng

    Abstract: Planning for both immediate and long-term benefits becomes increasingly important in recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning capacity by maximizing cumulative reward for long-term recommendation. However, the scarcity of recommendation data presents challenges such as instability and susceptibility to overfitting when training RL models from scratch, re… ▽ More

    Submitted 26 April, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: 11 pages, 5 figures

  30. arXiv:2402.18240  [pdf, other

    cs.IR cs.CL

    Prospect Personalized Recommendation on Large Language Model-based Agent Platform

    Authors: Jizhi Zhang, Keqin Bao, Wenjie Wang, Yang Zhang, Wentao Shi, Wanhong Xu, Fuli Feng, Tat-Seng Chua

    Abstract: The new kind of Agent-oriented information system, exemplified by GPTs, urges us to inspect the information system infrastructure to support Agent-level information processing and to adapt to the characteristics of Large Language Model (LLM)-based Agents, such as interactivity. In this work, we envisage the prospect of the recommender system on LLM-based Agent platforms and introduce a novel recom… ▽ More

    Submitted 5 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  31. Diffusion Models for Generative Outfit Recommendation

    Authors: Yiyan Xu, Wenjie Wang, Fuli Feng, Yunshan Ma, Jizhi Zhang, Xiangnan He

    Abstract: Outfit Recommendation (OR) in the fashion domain has evolved through two stages: Pre-defined Outfit Recommendation and Personalized Outfit Composition. However, both stages are constrained by existing fashion products, limiting their effectiveness in addressing users' diverse fashion needs. Recently, the advent of AI-generated content provides the opportunity for OR to transcend these limitations,… ▽ More

    Submitted 20 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: accepted for publication in SIGIR'24

  32. arXiv:2402.15215  [pdf, other

    cs.IR

    Item-side Fairness of Large Language Model-based Recommendation System

    Authors: Meng Jiang, Keqin Bao, Jizhi Zhang, Wenjie Wang, Zhengyi Yang, Fuli Feng, Xiangnan He

    Abstract: Recommendation systems for Web content distribution intricately connect to the information access and exposure opportunities for vulnerable populations. The emergence of Large Language Models-based Recommendation System (LRS) may introduce additional societal challenges to recommendation systems due to the inherent biases in Large Language Models (LLMs). From the perspective of item-side fairness,… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted by the Proceedings of the ACM Web Conference 2024

  33. arXiv:2402.03600  [pdf, other

    cs.IR

    Understanding and Counteracting Feature-Level Bias in Click-Through Rate Prediction

    Authors: **qiu **, Sihao Ding, Wenjie Wang, Fuli Feng

    Abstract: Common click-through rate (CTR) prediction recommender models tend to exhibit feature-level bias, which leads to unfair recommendations among item groups and inaccurate recommendations for users. While existing methods address this issue by adjusting the learning of CTR models, such as through additional optimization objectives, they fail to consider how the bias is caused within these models. To… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  34. arXiv:2401.17197  [pdf, other

    cs.IR

    Data-efficient Fine-tuning for LLM-based Recommendation

    Authors: Xinyu Lin, Wenjie Wang, Yongqi Li, Shuo Yang, Fuli Feng, Yinwei Wei, Tat-Seng Chua

    Abstract: Leveraging Large Language Models (LLMs) for recommendation has recently garnered considerable attention, where fine-tuning plays a key role in LLMs' adaptation. However, the cost of fine-tuning LLMs on rapidly expanding recommendation data limits their practical application. To address this challenge, few-shot fine-tuning offers a promising approach to quickly adapt LLMs to new recommendation data… ▽ More

    Submitted 4 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by SIGIR 2024

  35. arXiv:2401.13223  [pdf, other

    cs.CL cs.AI

    TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data

    Authors: Fengbin Zhu, Ziyang Liu, Fuli Feng, Chao Wang, Moxin Li, Tat-Seng Chua

    Abstract: In this work, we address question answering (QA) over a hybrid of tabular and textual data that are very common content on the Web (e.g. SEC filings), where discrete reasoning capabilities are often required. Recently, large language models (LLMs) like GPT-4 have demonstrated strong multi-step reasoning capabilities. We then consider harnessing the amazing power of LLMs to solve our task. We abstr… ▽ More

    Submitted 22 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: ACL 2024 (Under Review)

  36. arXiv:2401.09038  [pdf, other

    cs.RO

    Visual Robotic Manipulation with Depth-Aware Pretraining

    Authors: Wanying Wang, **ming Li, Yichen Zhu, Zhiyuan Xu, Zheng** Che, Yaxin Peng, Chaomin Shen, Dong Liu, Feifei Feng, Jian Tang

    Abstract: Recent work on visual representation learning has shown to be efficient for robotic manipulation tasks. However, most existing works pretrained the visual backbone solely on 2D images or egocentric videos, ignoring the fact that robots learn to act in 3D space, which is hard to learn from 2D observation. In this paper, we examine the effectiveness of pretraining for vision backbone with public-ava… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: submitted to ICRA2024

  37. arXiv:2401.08139  [pdf, other

    cs.LG cs.NE

    Transferring Core Knowledge via Learngenes

    Authors: Fu Feng, **g Wang, Xin Geng

    Abstract: The pre-training paradigm fine-tunes the models trained on large-scale datasets to downstream tasks with enhanced performance. It transfers all knowledge to downstream tasks without discriminating which part is necessary or unnecessary, which may lead to negative transfer. In comparison, knowledge transfer in nature is much more efficient. When passing genetic information to descendants, ancestors… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  38. arXiv:2401.06982  [pdf, other

    cs.IR

    Denoising Diffusion Recommender Model

    Authors: Jujia Zhao, Wenjie Wang, Yiyan Xu, Teng Sun, Fuli Feng, Tat-Seng Chua

    Abstract: Recommender systems often grapple with noisy implicit feedback. Most studies alleviate the noise issues from data cleaning perspective such as data resampling and reweighting, but they are constrained by heuristic assumptions. Another denoising avenue is from model perspective, which proactively injects noises into user-item interactions and enhances the intrinsic denoising ability of models. Howe… ▽ More

    Submitted 16 June, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

    Comments: Accepted by SIGIR 2024

  39. arXiv:2401.04181  [pdf, other

    cs.RO cs.CV

    Language-Conditioned Robotic Manipulation with Fast and Slow Thinking

    Authors: Minjie Zhu, Yichen Zhu, **ming Li, Junjie Wen, Zhiyuan Xu, Zheng** Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

    Abstract: The language-conditioned robotic manipulation aims to transfer natural language instructions into executable actions, from simple pick-and-place to tasks requiring intent recognition and visual reasoning. Inspired by the dual process theory in cognitive science, which suggests two parallel systems of fast and slow thinking in human decision-making, we introduce Robotics with Fast and Slow Thinking… ▽ More

    Submitted 1 February, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: accepted to ICRA2024

  40. arXiv:2401.02814  [pdf, other

    cs.RO cs.CV

    Object-Centric Instruction Augmentation for Robotic Manipulation

    Authors: Junjie Wen, Yichen Zhu, Minjie Zhu, **ming Li, Zhiyuan Xu, Zheng** Che, Chaomin Shen, Yaxin Peng, Dong Liu, Feifei Feng, Jian Tang

    Abstract: Humans interpret scenes by recognizing both the identities and positions of objects in their observations. For a robot to perform tasks such as \enquote{pick and place}, understanding both what the objects are and where they are located is crucial. While the former has been extensively discussed in the literature that uses the large language model to enrich the text descriptions, the latter remain… ▽ More

    Submitted 1 February, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: accepted to ICRA2024

  41. arXiv:2312.15599  [pdf, other

    cs.IR

    Preliminary Study on Incremental Learning for Large Language Model-based Recommender Systems

    Authors: Tianhao Shi, Yang Zhang, Zhijian Xu, Chong Chen, Fuli Feng, Xiangnan He, Qi Tian

    Abstract: Adapting Large Language Models for recommendation (LLM4Rec)has garnered substantial attention and demonstrated promising results. However, the challenges of practically deploying LLM4Rec are largely unexplored, with the need for incremental adaptation to evolving user preferences being a critical concern. Nevertheless, the suitability of traditional incremental learning within LLM4Rec remains ambi… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 8 pages, 8 figures

  42. arXiv:2312.14862  [pdf, other

    cs.CL cs.AI

    YAYI 2: Multilingual Open-Source Large Language Models

    Authors: Yin Luo, Qingchao Kong, Nan Xu, Jia Cao, Bao Hao, Baoyu Qu, Bo Chen, Chao Zhu, Chenyang Zhao, Donglei Zhang, Fan Feng, Feifei Zhao, Hailong Sun, Hanxuan Yang, Haojun Pan, Hongyu Liu, Jianbin Guo, Jiangtao Du, **gyi Wang, Junfeng Li, Lei Sun, Liduo Liu, Lifeng Dong, Lili Liu, Lin Wang , et al. (28 additional authors not shown)

    Abstract: As the latest advancements in natural language processing, large language models (LLMs) have achieved human-level language understanding and generation abilities in many real-world tasks, and even have been regarded as a potential path to the artificial general intelligence. To better facilitate research on LLMs, many open-source LLMs, such as Llama 2 and Falcon, have recently been proposed and ga… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  43. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  44. LabelCraft: Empowering Short Video Recommendations with Automated Label Crafting

    Authors: Yimeng Bai, Yang Zhang, **g Lu, Jianxin Chang, Xiaoxue Zang, Yanan Niu, Yang Song, Fuli Feng

    Abstract: Short video recommendations often face limitations due to the quality of user feedback, which may not accurately depict user interests. To tackle this challenge, a new task has emerged: generating more dependable labels from original feedback. Existing label generation methods rely on manual rules, demanding substantial human effort and potentially misaligning with the desired objectives of the pl… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by WSDM'24

    ACM Class: H.3.3; H.3.5

  45. arXiv:2312.09901  [pdf, other

    cs.IR

    Temporally and Distributionally Robust Optimization for Cold-Start Recommendation

    Authors: Xinyu Lin, Wenjie Wang, Jujia Zhao, Yongqi Li, Fuli Feng, Tat-Seng Chua

    Abstract: Collaborative Filtering (CF) recommender models highly depend on user-item interactions to learn CF representations, thus falling short of recommending cold-start items. To address this issue, prior studies mainly introduce item features (e.g., thumbnails) for cold-start item recommendation. They learn a feature extractor on warm-start items to align feature representations with interactions, and… ▽ More

    Submitted 2 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI'24

  46. arXiv:2312.04032  [pdf, other

    cs.CL cs.LG

    RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training

    Authors: Jaehyung Kim, Yuning Mao, Rui Hou, Hanchao Yu, Davis Liang, Pascale Fung, Qifan Wang, Fuli Feng, Lifu Huang, Madian Khabsa

    Abstract: Fine-tuning pre-trained language models (LMs) has become the de facto standard in many NLP tasks. Nevertheless, fine-tuned LMs are still prone to robustness issues, such as adversarial robustness and model calibration. Several perspectives of robustness for LMs have been studied independently, but lacking a unified consideration in multiple perspectives. In this paper, we propose Robustifying LMs… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 33 pages, accepted at EMNLP 2023 Findings

  47. arXiv:2311.08080  [pdf, other

    astro-ph.IM astro-ph.HE astro-ph.SR cs.CV

    Identifying Light-curve Signals with a Deep Learning Based Object Detection Algorithm. II. A General Light Curve Classification Framework

    Authors: Kaiming Cui, D. J. Armstrong, Fabo Feng

    Abstract: Vast amounts of astronomical photometric data are generated from various projects, requiring significant efforts to identify variable stars and other object classes. In light of this, a general, widely applicable classification framework would simplify the task of designing custom classifiers. We present a novel deep learning framework for classifying light curves using a weakly supervised object… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 26 pages, 19 figures, 6 tables. Submitted to AAS Journal. Code is available on https://github.com/ckm3/Deep-LC

  48. arXiv:2311.03105  [pdf

    cs.CV cs.AI

    Pelvic floor MRI segmentation based on semi-supervised deep learning

    Authors: Jianwei Zuo, Fei Feng, Zhuhui Wang, James A. Ashton-Miller, John O. L. Delancey, Jiajia Luo

    Abstract: The semantic segmentation of pelvic organs via MRI has important clinical significance. Recently, deep learning-enabled semantic segmentation has facilitated the three-dimensional geometric reconstruction of pelvic floor organs, providing clinicians with accurate and intuitive diagnostic results. However, the task of labeling pelvic floor MRI segmentation, typically performed by clinicians, is lab… ▽ More

    Submitted 22 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

  49. arXiv:2310.19488  [pdf, other

    cs.IR

    CoLLM: Integrating Collaborative Embeddings into Large Language Models for Recommendation

    Authors: Yang Zhang, Fuli Feng, Jizhi Zhang, Keqin Bao, Qifan Wang, Xiangnan He

    Abstract: Leveraging Large Language Models as Recommenders (LLMRec) has gained significant attention and introduced fresh perspectives in user preference modeling. Existing LLMRec approaches prioritize text semantics, usually neglecting the valuable collaborative information from user-item interactions in recommendations. While these text-emphasizing approaches excel in cold-start scenarios, they may yield… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  50. arXiv:2310.14626  [pdf, other

    cs.CL cs.IR

    Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue

    Authors: Yuanxing Liu, Wei-Nan Zhang, Yifan Chen, Yuchi Zhang, Haopeng Bai, Fan Feng, Hengbin Cui, Yongbin Li, Wanxiang Che

    Abstract: E-commerce pre-sales dialogue aims to understand and elicit user needs and preferences for the items they are seeking so as to provide appropriate recommendations. Conversational recommender systems (CRSs) learn user representation and provide accurate recommendations based on dialogue context, but rely on external knowledge. Large language models (LLMs) generate responses that mimic pre-sales dia… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings