Skip to main content

Showing 1–50 of 254 results for author: Tan, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00367  [pdf, other

    cs.CV

    SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix

    Authors: Peng Dai, Feitong Tan, Qiangeng Xu, David Futschik, Ruofei Du, Sean Fanello, Xiaojuan Qi, Yinda Zhang

    Abstract: Video generation models have demonstrated great capabilities of producing impressive monocular videos, however, the generation of 3D stereoscopic video remains under-explored. We propose a pose-free and training-free approach for generating 3D stereoscopic videos using an off-the-shelf monocular video generation model. Our method warps a generated monocular video into camera views on stereoscopic… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 3D stereoscopic video generation, video diffusion, inpainting

  2. arXiv:2406.12835  [pdf, other

    cs.LG cs.AI cs.IR cs.SI

    Influence Maximization via Graph Neural Bandits

    Authors: Yuting Feng, Vincent Y. F. Tan, Bogdan Cautis

    Abstract: We consider a ubiquitous scenario in the study of Influence Maximization (IM), in which there is limited knowledge about the topology of the diffusion network. We set the IM problem in a multi-round diffusion campaign, aiming to maximize the number of distinct users that are influenced. Leveraging the capability of bandit algorithms to effectively balance the objectives of exploration and exploita… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: To appear at the 2024 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD)

  3. arXiv:2406.12205  [pdf, other

    cs.LG cs.AI cs.IT math.ST stat.ML

    Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback

    Authors: Zhirui Chen, Vincent Y. F. Tan

    Abstract: We consider offline reinforcement learning (RL) with preference feedback in which the implicit reward is a linear function of an unknown parameter. Given an offline dataset, our objective consists in ascertaining the optimal action for each state, with the ultimate goal of minimizing the {\em simple regret}. We propose an algorithm, \underline{RL} with \underline{L}ocally \underline{O}ptimal \unde… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted to Models of Human Feedback for AI Alignment Workshop, ICML 2024

  4. arXiv:2406.04881  [pdf, other

    cs.IT eess.SP

    MIMO Capacity Analysis and Channel Estimation for Electromagnetic Information Theory

    Authors: Jieao Zhu, Vincent Y. F. Tan, Linglong Dai

    Abstract: Electromagnetic information theory (EIT) is an interdisciplinary subject that serves to integrate deterministic electromagnetic theory with stochastic Shannon's information theory. Existing EIT analysis operates in the continuous space domain, which is not aligned with the practical algorithms working in the discrete space domain. This mismatch leads to a significant difficulty in application of E… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Submitted to the IEEE TWC. In this paper, we established the discrete-continuous correspondence for electromagnetic information theory (EIT), thus enabling analytical tools in the continuous space domain to be applied to discrete space MIMO architectures. Simulation codes will be provided at http://oa.ee.tsinghua.edu.cn/dailinglong/publications/publications.html

  5. arXiv:2405.17921  [pdf

    cs.AI cs.CY

    Towards Clinical AI Fairness: Filling Gaps in the Puzzle

    Authors: Mingxuan Liu, Yilin Ning, Salinelat Teixayavong, Xiaoxuan Liu, Mayli Mertens, Yuqing Shang, Xin Li, Di Miao, Jie Xu, Daniel Shu Wei Ting, Lionel Tim-Ee Cheng, Jasmine Chiat Ling Ong, Zhen Ling Teo, Ting Fang Tan, Narrendar RaviChandran, Fei Wang, Leo Anthony Celi, Marcus Eng Hock Ong, Nan Liu

    Abstract: The ethical integration of Artificial Intelligence (AI) in healthcare necessitates addressing fairness-a concept that is highly context-specific across medical fields. Extensive studies have been conducted to expand the technical components of AI fairness, while tremendous calls for AI fairness have been raised from healthcare. Despite this, a significant disconnect persists between technical adva… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2405.15200  [pdf, other

    cs.LG cs.IT

    Indexed Minimum Empirical Divergence-Based Algorithms for Linear Bandits

    Authors: Jie Bian, Vincent Y. F. Tan

    Abstract: The Indexed Minimum Empirical Divergence (IMED) algorithm is a highly effective approach that offers a stronger theoretical guarantee of the asymptotic optimality compared to the Kullback--Leibler Upper Confidence Bound (KL-UCB) algorithm for the multi-armed bandit problem. Additionally, it has been observed to empirically outperform UCB-based algorithms and Thompson Sampling. Despite its effectiv… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted to the Transactions on Machine Learning Research (TMLR)

  7. arXiv:2405.13722  [pdf, other

    cs.CV

    InstaDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos

    Authors: Yujun Shi, Jun Hao Liew, Hanshu Yan, Vincent Y. F. Tan, Jiashi Feng

    Abstract: Accuracy and speed are critical in image editing tasks. Pan et al. introduced a drag-based image editing framework that achieves pixel-level control using Generative Adversarial Networks (GANs). A flurry of subsequent studies enhanced this framework's generality by leveraging large-scale diffusion models. However, these methods often suffer from inordinately long processing times (exceeding 1 minu… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Project page: https://instadrag.github.io/

  8. arXiv:2405.13532  [pdf, other

    cs.CV

    What Makes Good Few-shot Examples for Vision-Language Models?

    Authors: Zhaojun Guo, **ghui Lu, Xue**g Liu, Rui Zhao, ZhenXing Qian, Fei Tan

    Abstract: Despite the notable advancements achieved by leveraging pre-trained vision-language (VL) models through few-shot tuning for downstream tasks, our detailed empirical study highlights a significant dependence of few-shot learning outcomes on the careful selection of training examples - a facet that has been previously overlooked in research. In this study, we delve into devising more effective strat… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 8 pages, 4 figures

  9. arXiv:2404.10306  [pdf, other

    cs.CL

    Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model

    Authors: Hengyuan Zhang, Yanru Wu, Dawei Li, Sak Yang, Rui Zhao, Yong Jiang, Fei Tan

    Abstract: Aligned Large Language Models (LLMs) showcase remarkable versatility, capable of handling diverse real-world tasks. Meanwhile, aligned LLMs are also expected to exhibit speciality, excelling in specific applications. However, fine-tuning with extra data, a common practice to gain speciality, often leads to catastrophic forgetting (CF) of previously acquired versatility, hindering the model's perfo… ▽ More

    Submitted 3 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 43 pages, 10 figures, accepted by ACL 2024 Findings

  10. arXiv:2404.06892  [pdf, other

    cs.CV

    SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving

    Authors: Diankun Zhang, Guoan Wang, Runwen Zhu, Jianbo Zhao, Xiwu Chen, Siyu Zhang, Jiahao Gong, Qibin Zhou, Wenyuan Zhang, Ningzi Wang, Feiyang Tan, Hangning Zhou, Ziyao Xu, Haotian Yao, Chi Zhang, Xiaojun Liu, Xiaoguang Di, Bin Li

    Abstract: End-to-End paradigms use a unified framework to implement multi-tasks in an autonomous driving system. Despite simplicity and clarity, the performance of end-to-end autonomous driving methods on sub-tasks is still far behind the single-task methods. Meanwhile, the widely used dense BEV features in previous end-to-end methods make it costly to extend to more modalities or tasks. In this paper, we p… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  11. Adversarial Combinatorial Bandits with Switching Costs

    Authors: Yanyan Dong, Vincent Y. F. Tan

    Abstract: We study the problem of adversarial combinatorial bandit with a switching cost $λ$ for a switch of each selected arm in each round, considering both the bandit feedback and semi-bandit feedback settings. In the oblivious adversarial case with $K$ base arms and time horizon $T$, we derive lower bounds for the minimax regret and design algorithms to approach them. To prove these lower bounds, we des… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: The work has been accepted in IEEE Transactions on Information Theory. https://ieeexplore.ieee.org/document/10487974

  12. arXiv:2404.01543  [pdf, other

    cs.CV cs.GR

    Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

    Authors: Ziqian Bai, Feitong Tan, Sean Fanello, Rohit Pandey, Mingsong Dou, Shichen Liu, ** Tan, Yinda Zhang

    Abstract: 3D head avatars built with neural implicit volumetric representations have achieved unprecedented levels of photorealism. However, the computational cost of these methods remains a significant barrier to their widespread adoption, particularly in real-time applications such as virtual reality and teleconferencing. While attempts have been made to develop fast neural rendering approaches for static… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: In CVPR2024. Project page: https://augmentedperception.github.io/monoavatar-plus

  13. arXiv:2403.10803  [pdf, other

    cs.LG cs.AI cs.CV

    Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion

    Authors: Jiawei Li, Sitong Li, Shanshan Wang, Yicheng Zeng, Falong Tan, Chuanlong Xie

    Abstract: Deploying machine learning in open environments presents the challenge of encountering diverse test inputs that differ significantly from the training data. These out-of-distribution samples may exhibit shifts in local or global features compared to the training distribution. The machine learning (ML) community has responded with a number of methods aimed at distinguishing anomalous inputs from or… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  14. arXiv:2403.02246  [pdf

    cs.CL

    PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

    Authors: Fiona Anting Tan, Gerard Christopher Yeo, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Kokil Jaidka, Yang Liu, See-Kiong Ng

    Abstract: Recent advances in large language models (LLMs) demonstrate that their capabilities are comparable, or even superior, to humans in many tasks in natural language processing. Despite this progress, LLMs are still inadequate at social-cognitive reasoning, which humans are naturally good at. Drawing inspiration from psychological research on the links between certain personality traits and Theory-of-… ▽ More

    Submitted 18 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  15. arXiv:2402.17944  [pdf, other

    cs.CL

    Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey

    Authors: Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang, Ziqing Hu, Yanjun Qi, Scott Nickleach, Diego Socolinsky, Srinivasan Sengamedu, Christos Faloutsos

    Abstract: Recent breakthroughs in large language modeling have facilitated rigorous exploration of their application in diverse tasks related to tabular data modeling, such as prediction, tabular data synthesis, question answering, and table understanding. Each task presents unique challenges and opportunities. However, there is currently a lack of comprehensive review that summarizes and compares the key t… ▽ More

    Submitted 21 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 41 pages, 4 figures, 8 tables

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: TMLR 2024

  16. arXiv:2402.17411   

    cs.CL

    Consistency Matters: Explore LLMs Consistency From a Black-Box Perspective

    Authors: Fufangchen Zhao, Guoqiang **, Jiaheng Huang, Rui Zhao, Fei Tan

    Abstract: Nowadays both commercial and open-source academic LLM have become the mainstream models of NLP. However, there is still a lack of research on LLM consistency, meaning that throughout the various stages of LLM research and deployment, its internal parameters and capabilities should remain unchanged. This issue exists in both the industrial and academic sectors. The solution to this problem is often… ▽ More

    Submitted 2 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: This paper is not ready

  17. arXiv:2402.15127  [pdf, other

    cs.LG cs.IT stat.ML

    Multi-Armed Bandits with Abstention

    Authors: Junwen Yang, Tianyuan **, Vincent Y. F. Tan

    Abstract: We introduce a novel extension of the canonical multi-armed bandit problem that incorporates an additional strategic element: abstention. In this enhanced framework, the agent is not only tasked with selecting an arm at each time step, but also has the option to abstain from accepting the stochastic instantaneous reward before observing it. When opting for abstention, the agent either suffers a fi… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Preprint

  18. arXiv:2402.11909  [pdf, other

    cs.CV

    One2Avatar: Generative Implicit Head Avatar For Few-shot User Adaptation

    Authors: Zhixuan Yu, Ziqian Bai, Abhimitra Meka, Feitong Tan, Qiangeng Xu, Rohit Pandey, Sean Fanello, Hyun Soo Park, Yinda Zhang

    Abstract: Traditional methods for constructing high-quality, personalized head avatars from monocular videos demand extensive face captures and training time, posing a significant challenge for scalability. This paper introduces a novel approach to create high quality head avatar utilizing only a single or a few images per user. We learn a generative model for 3D animatable photo-realistic head avatar from… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  19. arXiv:2402.10083  [pdf

    cs.AI

    Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots in Ophthalmology and LLM-based evaluation using GPT-4

    Authors: Ting Fang Tan, Kabilan Elangovan, Liyuan **, Yao Jie, Li Yong, Joshua Lim, Stanley Poh, Wei Yan Ng, Daniel Lim, Yuhe Ke, Nan Liu, Daniel Shu Wei Ting

    Abstract: Purpose: To assess the alignment of GPT-4-based evaluation to human clinician experts, for the evaluation of responses to ophthalmology-related patient queries generated by fine-tuned LLM chatbots. Methods: 400 ophthalmology questions and paired answers were created by ophthalmologists to represent commonly asked patient questions, divided into fine-tuning (368; 92%), and testing (40; 8%). We find… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 13 Pages, 1 Figure, 8 Tables

  20. arXiv:2401.16726  [pdf, ps, other

    cs.IT

    Variable-Length Feedback Codes over Known and Unknown Channels with Non-vanishing Error Probabilities

    Authors: Recep Can Yavas, Vincent Y. F. Tan

    Abstract: We study variable-length feedback (VLF) codes with noiseless feedback for discrete memoryless channels. We present a novel non-asymptotic bound, which analyzes the average error probability and average decoding time of our modified Yamamoto--Itoh scheme. We then optimize the parameters of our code in the asymptotic regime where the average error probability $ε$ remains a constant as the average de… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Submitted to ISIT 2024, 14 pages

    ACM Class: E.4

  21. arXiv:2401.09073  [pdf, other

    cs.LG cs.AI cs.IT math.ST stat.ML

    Fixed-Budget Differentially Private Best Arm Identification

    Authors: Zhirui Chen, P. N. Karthik, Yeow Meng Chee, Vincent Y. F. Tan

    Abstract: We study best arm identification (BAI) in linear bandits in the fixed-budget regime under differential privacy constraints, when the arm rewards are supported on the unit interval. Given a finite budget $T$ and a privacy parameter $\varepsilon>0$, the goal is to minimise the error probability in finding the arm with the largest mean after $T$ sampling rounds, subject to the constraint that the pol… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024

  22. arXiv:2401.05750  [pdf, other

    cs.CV

    GO-NeRF: Generating Virtual Objects in Neural Radiance Fields

    Authors: Peng Dai, Feitong Tan, Xin Yu, Yinda Zhang, Xiaojuan Qi

    Abstract: Despite advances in 3D generation, the direct creation of 3D objects within an existing 3D scene represented as NeRF remains underexplored. This process requires not only high-quality 3D object generation but also seamless composition of the generated 3D content into the existing NeRF. To this end, we propose a new method, GO-NeRF, capable of utilizing scene context for high-quality and harmonious… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 12 pages

    MSC Class: ACM-class

  23. arXiv:2312.12030  [pdf, other

    cs.CV cs.AI

    Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method

    Authors: Jiachun Pan, Hanshu Yan, Jun Hao Liew, Jiashi Feng, Vincent Y. F. Tan

    Abstract: Training-free guided sampling in diffusion models leverages off-the-shelf pre-trained networks, such as an aesthetic evaluation model, to guide the generation process. Current training-free guided sampling algorithms obtain the guidance energy function based on a one-step estimate of the clean image. However, since the off-the-shelf pre-trained networks are trained on clean images, the one-step es… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  24. arXiv:2312.04875  [pdf, other

    cs.CV

    MVDD: Multi-View Depth Diffusion Models

    Authors: Zhen Wang, Qiangeng Xu, Feitong Tan, Menglei Chai, Shichen Liu, Rohit Pandey, Sean Fanello, Achuta Kadambi, Yinda Zhang

    Abstract: Denoising diffusion models have demonstrated outstanding results in 2D image generation, yet it remains a challenge to replicate its success in 3D shape generation. In this paper, we propose leveraging multi-view depth, which represents complex 3D shapes in a 2D data format that is easy to denoise. We pair this representation with a diffusion model, MVDD, that is capable of generating high-quality… ▽ More

    Submitted 19 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

  25. arXiv:2312.03763  [pdf, other

    cs.CV cs.GR cs.LG

    Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing

    Authors: Yushi Lan, Feitong Tan, Di Qiu, Qiangeng Xu, Kyle Genova, Zeng Huang, Sean Fanello, Rohit Pandey, Thomas Funkhouser, Chen Change Loy, Yinda Zhang

    Abstract: We present a novel framework for generating photorealistic 3D human head and subsequently manipulating and reposing them with remarkable flexibility. The proposed approach leverages an implicit function representation of 3D human heads, employing 3D Gaussians anchored on a parametric face model. To enhance representational capabilities and encode spatial information, we embed a lightweight tri-pla… ▽ More

    Submitted 19 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: project webpage: https://nirvanalan.github.io/projects/gaussian3diff/

  26. arXiv:2312.01244  [pdf, ps, other

    cs.CL

    Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2023): Workshop and Shared Task Report

    Authors: Ali Hürriyetoğlu, Hristo Tanev, Osman Mutlu, Surendrabikram Thapa, Fiona Anting Tan, Erdem Yörük

    Abstract: We provide a summary of the sixth edition of the CASE workshop that is held in the scope of RANLP 2023. The workshop consists of regular papers, three keynotes, working papers of shared task participants, and shared task overview papers. This workshop series has been bringing together all aspects of event information collection across technical and social science fields. In addition to contributin… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: https://aclanthology.org/2023.case-1.22

  27. arXiv:2311.07306  [pdf, other

    cs.CV

    What Large Language Models Bring to Text-rich VQA?

    Authors: Xue**g Liu, Wei Tang, Xinzhe Ni, **ghui Lu, Rui Zhao, Zechao Li, Fei Tan

    Abstract: Text-rich VQA, namely Visual Question Answering based on text recognition in the images, is a cross-modal task that requires both image comprehension and text recognition. In this work, we focus on investigating the advantages and bottlenecks of LLM-based approaches in addressing this problem. To address the above concern, we separate the vision and language modules, where we leverage external OCR… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  28. arXiv:2311.00481  [pdf, ps, other

    cs.LG stat.ML

    Fixed-Budget Best-Arm Identification in Sparse Linear Bandits

    Authors: Recep Can Yavas, Vincent Y. F. Tan

    Abstract: We study the best-arm identification problem in sparse linear bandits under the fixed-budget setting. In sparse linear bandits, the unknown feature vector $θ^*$ may be of large dimension $d$, but only a few, say $s \ll d$ of these features have non-zero values. We design a two-phase algorithm, Lasso and Optimal-Design- (Lasso-OD) based linear best-arm identification. The first phase of Lasso-OD le… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 28 pages, Submitted to TMLR

    ACM Class: I.2.6

  29. arXiv:2310.17531  [pdf, ps, other

    cs.GT cs.LG stat.ML

    Learning Regularized Graphon Mean-Field Games with Unknown Graphons

    Authors: Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang

    Abstract: We design and analyze reinforcement learning algorithms for Graphon Mean-Field Games (GMFGs). In contrast to previous works that require the precise values of the graphons, we aim to learn the Nash Equilibrium (NE) of the regularized GMFGs when the graphons are unknown. Our contributions are threefold. First, we propose the Proximal Policy Optimization for GMFG (GMFG-PPO) algorithm and show that i… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  30. arXiv:2310.13393  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Optimal Best Arm Identification with Fixed Confidence in Restless Bandits

    Authors: P. N. Karthik, Vincent Y. F. Tan, Arpan Mukherjee, Ali Tajer

    Abstract: We study best arm identification in a restless multi-armed bandit setting with finitely many arms. The discrete-time data generated by each arm forms a homogeneous Markov chain taking values in a common, finite state space. The state transitions in each arm are captured by an ergodic transition probability matrix (TPM) that is a member of a single-parameter exponential family of TPMs. The real-val… ▽ More

    Submitted 23 June, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to the IEEE Transactions on Information Theory

  31. arXiv:2310.11005  [pdf, ps, other

    cs.IT cs.CR

    Optimal Private Discrete Distribution Estimation with One-bit Communication

    Authors: Seung-Hyun Nam, Vincent Y. F. Tan, Si-Hyeon Lee

    Abstract: We consider a private discrete distribution estimation problem with one-bit communication constraint. The privacy constraints are imposed with respect to the local differential privacy and the maximal leakage. The estimation error is quantified by the worst-case mean squared error. We completely characterize the first-order asymptotics of this privacy-utility trade-off under the one-bit communicat… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 13 pages, 5 figures, and 1 page of supplementary material

  32. arXiv:2310.08089  [pdf, other

    cs.GT eess.SY stat.ML

    Learning Regularized Monotone Graphon Mean-Field Games

    Authors: Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang

    Abstract: This paper studies two fundamental problems in regularized Graphon Mean-Field Games (GMFGs). First, we establish the existence of a Nash Equilibrium (NE) of any $λ$-regularized GMFG (for $λ\geq 0$). This result relies on weaker conditions than those in previous works for analyzing both unregularized GMFGs ($λ=0$) and $λ$-regularized MFGs, which are special cases of GMFGs. Second, we propose provab… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  33. arXiv:2309.03190  [pdf, other

    cs.LG cs.CR

    Blink: Link Local Differential Privacy in Graph Neural Networks via Bayesian Estimation

    Authors: Xiaochen Zhu, Vincent Y. F. Tan, Xiaokui Xiao

    Abstract: Graph neural networks (GNNs) have gained an increasing amount of popularity due to their superior capability in learning node embeddings for various graph inference tasks, but training them can raise privacy concerns. To address this, we propose using link local differential privacy over decentralized nodes, enabling collaboration with an untrusted server to train GNNs without revealing the existe… ▽ More

    Submitted 7 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 17 pages, accepted by ACM CCS 2023 as a conference paper

  34. arXiv:2307.10711  [pdf, other

    cs.CV cs.AI cs.LG

    AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

    Authors: Jiachun Pan, Jun Hao Liew, Vincent Y. F. Tan, Jiashi Feng, Hanshu Yan

    Abstract: Existing customization methods require access to multiple reference examples to align pre-trained diffusion probabilistic models (DPMs) with user-provided concepts. This paper aims to address the challenge of DPM customization when the only available supervision is a differentiable metric defined on the generated contents. Since the sampling procedure of DPMs involves recursive calls to the denois… ▽ More

    Submitted 20 March, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

  35. arXiv:2307.05893  [pdf, ps, other

    eess.SP cs.LG

    Deep Unrolling for Nonconvex Robust Principal Component Analysis

    Authors: Elizabeth Z. C. Tan, Caroline Chaux, Emmanuel Soubies, Vincent Y. F. Tan

    Abstract: We design algorithms for Robust Principal Component Analysis (RPCA) which consists in decomposing a matrix into the sum of a low rank matrix and a sparse matrix. We propose a deep unrolled algorithm based on an accelerated alternating projection algorithm which aims to solve RPCA in its nonconvex form. The proposed procedure combines benefits of deep neural networks and the interpretability of the… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 7 pages, 3 figures; Accepted to the 2023 IEEE International Workshop on Machine Learning for Signal Processing

  36. arXiv:2306.14435  [pdf, other

    cs.CV cs.LG

    DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

    Authors: Yujun Shi, Chuhui Xue, Jun Hao Liew, Jiachun Pan, Hanshu Yan, Wenqing Zhang, Vincent Y. F. Tan, Song Bai

    Abstract: Accurate and controllable image editing is a challenging task that has attracted significant attention recently. Notably, DragGAN is an interactive point-based image editing framework that achieves impressive editing results with pixel-level precision. However, due to its reliance on generative adversarial networks (GANs), its generality is limited by the capacity of pretrained GAN models. In this… ▽ More

    Submitted 7 April, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Code is released at https://github.com/Yujun-Shi/DragDiffusion

  37. arXiv:2306.07464  [pdf, other

    cs.AI cs.LG stat.ML

    Unlocking Sales Growth: Account Prioritization Engine with Explainable AI

    Authors: Suvendu Jena, Jilei Yang, Fangfang Tan

    Abstract: B2B sales requires effective prediction of customer growth, identification of upsell potential, and mitigation of churn risks. LinkedIn sales representatives traditionally relied on intuition and fragmented data signals to assess customer performance. This resulted in significant time investment in data understanding as well as strategy formulation and under-investment in active selling. To overco… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 9 pages, 11 figures, 2 tables

  38. arXiv:2305.17903  [pdf, other

    cs.CV

    Deeply Coupled Cross-Modal Prompt Learning

    Authors: Xue**g Liu, Wei Tang, **ghui Lu, Rui Zhao, Zhaojun Guo, Fei Tan

    Abstract: Recent advancements in multimodal foundation models (e.g., CLIP) have excelled in zero-shot generalization. Prompt tuning involved in the knowledge transfer from foundation models to downstream tasks has gained significant attention recently. Existing prompt-tuning methods in cross-modal learning, however, either solely focus on language branch, or learn vision-language interaction in a shallow me… ▽ More

    Submitted 6 December, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023 findings

  39. arXiv:2305.09359  [pdf, other

    cs.CL

    Constructing and Interpreting Causal Knowledge Graphs from News

    Authors: Fiona Anting Tan, Debdeep Paul, Sahim Yamaura, Miura Koji, See-Kiong Ng

    Abstract: Many financial jobs rely on news to learn about causal events in the past and present, to make informed decisions and predictions about the future. With the ever-increasing amount of news available online, there is a need to automate the extraction of causal events from unstructured texts. In this work, we propose a methodology to construct causal knowledge graphs (KGs) from news using two steps:… ▽ More

    Submitted 30 July, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to AAAI Summer Symposium 2023 (AI4FinTech)

  40. arXiv:2305.06213  [pdf, other

    cs.CY physics.ed-ph

    Motivation, inclusivity, and realism should drive data science education

    Authors: Candace Savonen, Carrie Wright, Ava M. Hoffman, Elizabeth M. Humphries, Katherine E. L. Cox, Frederick J. Tan, Jeffrey T. Leek

    Abstract: Data science education provides tremendous opportunities but remains inaccessible to many communities. Increasing the accessibility of data science to these communities not only benefits the individuals entering data science, but also increases the field's innovation and potential impact as a whole. Education is the most scalable solution to meet these needs, but many data science educators lack f… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: This has been submitted to F1000 and is under review (as of 5/9/23)

  41. arXiv:2305.05292  [pdf, other

    cs.HC

    Heads-Up Computing: Moving Beyond the Device-Centered Paradigm

    Authors: Shengdong Zhao, Felicia Tan, Katherine Fennedy

    Abstract: This article introduces our vision for a new interaction paradigm: Heads-Up Computing, a concept involving the provision of seamless computing support for daily activities. Its synergistic and user-centric approach frees humans from common constraints caused by existing interactions (e.g. smartphone zombies), made possible by matching input and output channels between the device and human. Wearabl… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 11 pages, 3 figures

  42. arXiv:2304.13493  [pdf

    cs.CY cs.AI

    Towards clinical AI fairness: A translational perspective

    Authors: Mingxuan Liu, Yilin Ning, Salinelat Teixayavong, Mayli Mertens, Jie Xu, Daniel Shu Wei Ting, Lionel Tim-Ee Cheng, Jasmine Chiat Ling Ong, Zhen Ling Teo, Ting Fang Tan, Ravi Chandran Narrendar, Fei Wang, Leo Anthony Celi, Marcus Eng Hock Ong, Nan Liu

    Abstract: Artificial intelligence (AI) has demonstrated the ability to extract insights from data, but the issue of fairness remains a concern in high-stakes fields such as healthcare. Despite extensive discussion and efforts in algorithm development, AI fairness and clinical concerns have not been adequately addressed. In this paper, we discuss the misalignment between technical and clinical perspectives o… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  43. arXiv:2304.12680  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Communication-Constrained Bandits under Additive Gaussian Noise

    Authors: Prathamesh Mayekar, Jonathan Scarlett, Vincent Y. F. Tan

    Abstract: We study a distributed stochastic multi-armed bandit where a client supplies the learner with communication-constrained feedback based on the rewards for the corresponding arm pulls. In our setup, the client must encode the rewards such that the second moment of the encoded rewards is no more than $P$, and this encoded reward is further corrupted by additive Gaussian noise of variance $σ^2$; the l… ▽ More

    Submitted 6 June, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  44. arXiv:2304.01436  [pdf, other

    cs.CV cs.GR

    Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

    Authors: Ziqian Bai, Feitong Tan, Zeng Huang, Kripasindhu Sarkar, Danhang Tang, Di Qiu, Abhimitra Meka, Ruofei Du, Mingsong Dou, Sergio Orts-Escolano, Rohit Pandey, ** Tan, Thabo Beeler, Sean Fanello, Yinda Zhang

    Abstract: We propose a method to learn a high-quality implicit 3D head avatar from a monocular RGB video captured in the wild. The learnt avatar is driven by a parametric face model to achieve user-controlled facial expressions and head poses. Our hybrid pipeline combines the geometry prior and dynamic tracking of a 3DMM with a neural radiance field to achieve fine-grained control and photorealism. To reduc… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: In CVPR2023. Project page: https://augmentedperception.github.io/monoavatar/

  45. arXiv:2302.02754  [pdf, ps, other

    cs.IT

    Codes for Correcting $t$ Limited-Magnitude Sticky Deletions

    Authors: Shuche Wang, Van Khu Vu, Vincent Y. F. Tan

    Abstract: Codes for correcting sticky insertions/deletions and limited-magnitude errors have attracted significant attention due to their applications of flash memories, racetrack memories, and DNA data storage systems. In this paper, we first consider the error type of $t$-sticky deletions with $\ell$-limited-magnitude and propose a non-systematic code for correcting this type of error with redundancy… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2301.11680

  46. arXiv:2301.13393  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits

    Authors: Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong

    Abstract: Motivated by concerns about making online decisions that incur undue amount of risk at each time step, in this paper, we formulate the probably anytime-safe stochastic combinatorial semi-bandits problem. In this problem, the agent is given the option to select a subset of size at most $K$ from a set of $L$ ground items. Each item is associated to a certain mean reward as well as a variance that re… ▽ More

    Submitted 2 June, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: To be presented at ICML 2023. 57 pages, 6 figures

  47. arXiv:2301.11680  [pdf, other

    cs.IT

    Codes for Correcting Asymmetric Adjacent Transpositions and Deletions

    Authors: Shuche Wang, Van Khu Vu, Vincent Y. F. Tan

    Abstract: Codes in the Damerau--Levenshtein metric have been extensively studied recently owing to their applications in DNA-based data storage. In particular, Gabrys, Yaakobi, and Milenkovic (2017) designed a length-$n$ code correcting a single deletion and $s$ adjacent transpositions with at most $(1+2s)\log n$ bits of redundancy. In this work, we consider a new setting where both asymmetric adjacent tran… ▽ More

    Submitted 29 June, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  48. arXiv:2212.12720  [pdf, other

    cs.LG cs.AI cs.CV

    Boosting Out-of-Distribution Detection with Multiple Pre-trained Models

    Authors: Feng Xue, Zi He, Chuanlong Xie, Falong Tan, Zhenguo Li

    Abstract: Out-of-Distribution (OOD) detection, i.e., identifying whether an input is sampled from a novel distribution other than the training distribution, is a critical task for safely deploying machine learning systems in the open world. Recently, post hoc detection utilizing pre-trained models has shown promising performance and can be scaled to large-scale problems. This advance raises a natural questi… ▽ More

    Submitted 12 January, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

  49. arXiv:2212.10817  [pdf, other

    eess.IV cs.CV

    High-fidelity Direct Contrast Synthesis from Magnetic Resonance Fingerprinting

    Authors: Ke Wang, Mariya Doneva, Jakob Meineke, Thomas Amthor, Ekin Karasan, Fei Tan, Jonathan I. Tamir, Stella X. Yu, Michael Lustig

    Abstract: Magnetic Resonance Fingerprinting (MRF) is an efficient quantitative MRI technique that can extract important tissue and system parameters such as T1, T2, B0, and B1 from a single scan. This property also makes it attractive for retrospectively synthesizing contrast-weighted images. In general, contrast-weighted images like T1-weighted, T2-weighted, etc., can be synthesized directly from parameter… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 19 pages, 8 figures

  50. arXiv:2211.14838  [pdf, other

    cs.CL cs.AI

    PUnifiedNER: A Prompting-based Unified NER System for Diverse Datasets

    Authors: **ghui Lu, Rui Zhao, Brian Mac Namee, Fei Tan

    Abstract: Much of named entity recognition (NER) research focuses on develo** dataset-specific models based on data from the domain of interest, and a limited set of related entity types. This is frustrating as each new dataset requires a new model to be trained and stored. In this work, we present a ``versatile'' model -- the Prompting-based Unified NER system (PUnifiedNER) -- that works with data from d… ▽ More

    Submitted 22 February, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI 2023