Skip to main content

Showing 1–50 of 314 results for author: Lan, Y

.
  1. arXiv:2407.00300  [pdf, other

    math.AP

    On the near soliton dynamics for the 2D cubic Zakharov-Kuznetsov equations

    Authors: Gong Chen, Yang Lan, Xu Yuan

    Abstract: In this article, we consider the Cauchy problem for the cubic (mass-critical) Zakharov-Kuznetsov equations in dimension two: $$\partial_t u+\partial_{x_1}(Δu+u^3)=0,\quad (t,x)\in [0,\infty)\times \mathbb{R}^{2}.$$ For initial data in $H^1$ close to the soliton with a suitable space-decay property, we fully describe the asymptotic behavior of the corresponding solution. More precisely, for such in… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: 65 pages

  2. arXiv:2406.17797  [pdf, other

    physics.chem-ph cs.AI cs.LG

    MoleculeCLA: Rethinking Molecular Benchmark via Computational Ligand-Target Binding Analysis

    Authors: Shikun Feng, Jiaxin Zheng, Yinjun Jia, Yanwen Huang, Fengfeng Zhou, Wei-Ying Ma, Yanyan Lan

    Abstract: Molecular representation learning is pivotal for various molecular property prediction tasks related to drug discovery. Robust and accurate benchmarks are essential for refining and validating current methods. Existing molecular property benchmarks derived from wet experiments, however, face limitations such as data volume constraints, unbalanced label distribution, and noisy labels. To address th… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.10584  [pdf, other

    cs.CL

    Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models

    Authors: Chengzhengxu Li, Xiaoming Liu, Zhaohan Zhang, Yichen Wang, Chen Liu, Yu Lan, Chao Shen

    Abstract: Recent advances in prompt optimization have notably enhanced the performance of pre-trained language models (PLMs) on downstream tasks. However, the potential of optimized prompts on domain generalization has been under-explored. To explore the nature of prompt generalization on unknown domains, we conduct pilot experiments and find that (i) Prompts gaining more attention weight from PLMs' deep la… ▽ More

    Submitted 27 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPS 2024, Preprint, Under review

  4. arXiv:2406.08980  [pdf, other

    q-bio.BM cs.LG

    From Theory to Therapy: Reframing SBDD Model Evaluation via Practical Metrics

    Authors: Bowen Gao, Haichuan Tan, Yanwen Huang, Minsi Ren, Xiao Huang, Wei-Ying Ma, Ya-Qin Zhang, Yanyan Lan

    Abstract: Recent advancements in structure-based drug design (SBDD) have significantly enhanced the efficiency and precision of drug discovery by generating molecules tailored to bind specific protein pockets. Despite these technological strides, their practical application in real-world drug development remains challenging due to the complexities of synthesizing and testing these molecules. The reliability… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.08961  [pdf, other

    q-bio.BM cs.LG

    SIU: A Million-Scale Structural Small Molecule-Protein Interaction Dataset for Unbiased Bioactivity Prediction

    Authors: Yanwen Huang, Bowen Gao, Yinjun Jia, Hongbo Ma, Wei-Ying Ma, Ya-Qin Zhang, Yanyan Lan

    Abstract: Small molecules play a pivotal role in modern medicine, and scrutinizing their interactions with protein targets is essential for the discovery and development of novel, life-saving therapeutics. The term "bioactivity" encompasses various biological effects resulting from these interactions, including both binding and functional responses. The magnitude of bioactivity dictates the therapeutic or t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2406.03238  [pdf, ps, other

    math.RT math.QA math.RA

    The parity of Lusztig's restriction functor and Green's formula for a quiver with automorphism

    Authors: Jiepeng Fang, Yixin Lan, Yumeng Wu

    Abstract: In [8], Fang-Lan-Xiao proved a formula about Lusztig's induction and restriction functors which can induce Green's formula for the path algebra of a quiver over a finite field via the trace map. In this paper, we generalize their formula to that for the mixed semisimple perverse sheaves for a quiver with an automorphism. By applying the trace map, we obtain Green's formula for any finite-dimension… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    MSC Class: 16G20; 17B37

  7. arXiv:2405.19909  [pdf, other

    cs.LG cs.AI cs.RO

    Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning

    Authors: Tenglong Liu, Yang Li, Yixing Lan, Hao Gao, Wei Pan, Xin Xu

    Abstract: In offline reinforcement learning, the challenge of out-of-distribution (OOD) is pronounced. To address this, existing methods often constrain the learned policy through policy regularization. However, these methods often suffer from the issue of unnecessary conservativeness, hampering policy improvement. This occurs due to the indiscriminate use of all actions from the behavior policy that genera… ▽ More

    Submitted 1 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: ICML 2024, 19 pages

  8. arXiv:2405.17802  [pdf, other

    cs.LG cs.AI q-bio.BM

    Multi-level Interaction Modeling for Protein Mutational Effect Prediction

    Authors: Yuanle Mo, Xin Hong, Bowen Gao, Yinjun Jia, Yanyan Lan

    Abstract: Protein-protein interactions are central mediators in many biological processes. Accurately predicting the effects of mutations on interactions is crucial for guiding the modulation of these interactions, thereby playing a significant role in therapeutic development and drug discovery. Mutations generally affect interactions hierarchically across three levels: mutated residues exhibit different si… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2405.16457  [pdf, other

    gr-qc

    Entanglement island and Page curve for one-sided charged black hole

    Authors: Yun-Feng Qu, Yi-Ling Lan, Hongwei Yu, Wen-Cong Gan, Fu-Wen Shu

    Abstract: In this paper, we extend the method of calculating the entanglement entropy of Hawking radiation of black holes using the "in" vacuum state, which describes one-sided asymptotically flat neutral black hole formed by gravitational collapse, to dynamic charged black holes. We explore the influence of charge on the position of the boundary of island $\partial I$ and the Page time. Due to their distin… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  10. arXiv:2405.16150  [pdf, other

    cs.CL

    5W1H Extraction With Large Language Models

    Authors: Yang Cao, Yangsong Lan, Feiyan Zhai, Piji Li

    Abstract: The extraction of essential news elements through the 5W1H framework (\textit{What}, \textit{When}, \textit{Where}, \textit{Why}, \textit{Who}, and \textit{How}) is critical for event extraction and text summarization. The advent of Large language models (LLMs) such as ChatGPT presents an opportunity to address language-related tasks through simple prompts without fine-tuning models with much time… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: IJCNN 2024

  11. arXiv:2405.10481  [pdf, other

    cs.LG cs.AI

    Multi-Evidence based Fact Verification via A Confidential Graph Neural Network

    Authors: Yuqing Lan, Zhenghao Liu, Yu Gu, Xiaoyuan Yi, Xiaohua Li, Liner Yang, Ge Yu

    Abstract: Fact verification tasks aim to identify the integrity of textual contents according to the truthful corpus. Existing fact verification models usually build a fully connected reasoning graph, which regards claim-evidence pairs as nodes and connects them with edges. They employ the graph to propagate the semantics of the nodes. Nevertheless, the noisy nodes usually propagate their semantics via the… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 12pages

  12. arXiv:2405.10343  [pdf, other

    q-bio.BM cs.AI cs.LG

    UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning

    Authors: Shikun Feng, Yuyan Ni, Minghao Li, Yanwen Huang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan

    Abstract: Recently, a noticeable trend has emerged in develo** pre-trained foundation models in the domains of CV and NLP. However, for molecular pre-training, there lacks a universal model capable of effectively applying to various categories of molecular tasks, since existing prevalent pre-training methods exhibit effectiveness for specific types of downstream tasks. Furthermore, the lack of profound un… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  13. arXiv:2404.19335  [pdf, other

    cs.CL

    StablePT: Towards Stable Prompting for Few-shot Learning via Input Separation

    Authors: Xiaoming Liu, Chen Liu, Zhaohan Zhang, Chengzhengxu Li, Longtian Wang, Yu Lan, Chao Shen

    Abstract: Large language models have shown their ability to become effective few-shot learners with prompting, revoluting the paradigm of learning with data scarcity. However, this approach largely depends on the quality of prompt initialization, and always exhibits large variability among different runs. Such property makes prompt tuning highly unreliable and vulnerable to poorly constructed prompts, which… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Submitted to ACL 2024

  14. arXiv:2404.11467  [pdf, other

    cs.SE cs.CR

    A Large-scale Fine-grained Analysis of Packages in Open-Source Software Ecosystems

    Authors: Xiaoyan Zhou, Feiran Liang, Zhaojie Xie, Yang Lan, Wenjia Niu, Jiqiang Liu, Haining Wang, Qiang Li

    Abstract: Package managers such as NPM, Maven, and PyPI play a pivotal role in open-source software (OSS) ecosystems, streamlining the distribution and management of various freely available packages. The fine-grained details within software packages can unveil potential risks within existing OSS ecosystems, offering valuable insights for detecting malicious packages. In this study, we undertake a large-sca… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  15. arXiv:2404.03205  [pdf, other

    quant-ph

    Optimal Dynamical Gauge in the Quantum Rabi Model

    Authors: Yuqi Qing, Wen-Long You, Yueheng Lan, Maoxin Liu

    Abstract: In this paper, we investigate the gauge dependence of various physical observables in the quantum Rabi model (QRM) under different potential fields, arising from the Hilbert-space truncation of the atomic degree of freedom. We discover that in both the square-well potential and oscillator potential,the optimal gauges for the ground-state energy of the QRM vary with respect to the cavity frequency,… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  16. arXiv:2403.17411  [pdf, other

    cs.CL

    PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models

    Authors: **yi Li, Yihuai Lan, Lei Wang, Hao Wang

    Abstract: Prompt compression is an innovative method for efficiently condensing input prompts while preserving essential information. To facilitate quick-start services, user-friendly interfaces, and compatibility with common datasets and metrics, we present the Prompt Compression Toolkit (PCToolkit). This toolkit is a unified plug-and-play solution for compressing prompts in Large Language Models (LLMs), f… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: For open-source repository, see https://github.com/3DAgentWorld/Toolkit-for-Prompt-Compression

  17. arXiv:2403.14736  [pdf, other

    q-bio.QM cs.AI cs.LG

    NaNa and MiGu: Semantic Data Augmentation Techniques to Enhance Protein Classification in Graph Neural Networks

    Authors: Yi-Shan Lan, Pin-Yu Chen, Tsung-Yi Ho

    Abstract: Protein classification tasks are essential in drug discovery. Real-world protein structures are dynamic, which will determine the properties of proteins. However, the existing machine learning methods, like ProNet (Wang et al., 2022a), only access limited conformational characteristics and protein side-chain features, leading to impractical protein structure and inaccuracy of protein classes in th… ▽ More

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  18. arXiv:2403.12987  [pdf, other

    q-bio.BM cs.LG

    Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion

    Authors: Bowen Gao, Minsi Ren, Yuyan Ni, Yanwen Huang, Bo Qiang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan

    Abstract: In the field of Structure-based Drug Design (SBDD), deep learning-based generative models have achieved outstanding performance in terms of docking score. However, further study shows that the existing molecular generative methods and docking scores both have lacked consideration in terms of specificity, which means that generated molecules bind to almost every protein pocket with high affinity. T… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  19. arXiv:2403.12019  [pdf, other

    cs.CV

    LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

    Authors: Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy

    Abstract: The field of neural rendering has witnessed significant progress with advancements in generative models and differentiable rendering techniques. Though 2D diffusion has achieved success, a unified 3D diffusion pipeline remains unsettled. This paper introduces a novel framework called LN3Diff to address this gap and enable fast, high-quality, and generic conditional 3D generation. Our approach harn… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: project webpage: https://nirvanalan.github.io/projects/ln3diff/

  20. arXiv:2402.17971  [pdf, other

    cs.CV cs.AI cs.CL

    All in an Aggregated Image for In-Image Learning

    Authors: Lei Wang, Wanyu Xu, Zhiqiang Hu, Yihuai Lan, Shan Dong, Hao Wang, Roy Ka-Wei Lee, Ee-Peng Lim

    Abstract: This paper introduces a new in-context learning (ICL) mechanism called In-Image Learning (I$^2$L) that combines demonstration examples, visual cues, and chain-of-thought reasoning into an aggregated image to enhance the capabilities of Large Multimodal Models (e.g., GPT-4V) in multimodal reasoning tasks. Unlike previous approaches that rely on converting images to text or incorporating visual inpu… ▽ More

    Submitted 2 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Preprint

  21. arXiv:2402.16567  [pdf, other

    cs.CL cs.AI cs.DB

    Aligning Large Language Models to a Domain-specific Graph Database

    Authors: Yuanyuan Liang, Keren Tan, Tingyu Xie, Wenbiao Tao, Siyuan Wang, Yunshi Lan, Weining Qian

    Abstract: Graph Databases (Graph DB) are widely applied in various fields, including finance, social networks, and medicine. However, translating Natural Language (NL) into the Graph Query Language (GQL), commonly known as NL2GQL, proves to be challenging due to its inherent complexity and specialized nature. Some approaches have sought to utilize Large Language Models (LLMs) to address analogous tasks like… ▽ More

    Submitted 28 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 13 pages,2 figures

  22. arXiv:2402.14704  [pdf, other

    cs.CL

    An LLM-Enhanced Adversarial Editing System for Lexical Simplification

    Authors: Keren Tan, Kangyang Luo, Yunshi Lan, Zheng Yuan, **long Shu

    Abstract: Lexical Simplification (LS) aims to simplify text at the lexical level. Existing methods rely heavily on annotated data, making it challenging to apply in low-resource scenarios. In this paper, we propose a novel LS method without parallel corpora. This method employs an Adversarial Editing System with guidance from a confusion loss and an invariance loss to predict lexical edits in the original s… ▽ More

    Submitted 22 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by COLING 2024 main conference

  23. arXiv:2402.13779  [pdf, other

    cs.LG cs.AI q-bio.BM

    Contextual Molecule Representation Learning from Chemical Reaction Knowledge

    Authors: Han Tang, Shikun Feng, Bicheng Lin, Yuyan Ni, JIng**g Liu, Wei-Ying Ma, Yanyan Lan

    Abstract: In recent years, self-supervised learning has emerged as a powerful tool to harness abundant unlabelled data for representation learning and has been broadly adopted in diverse areas. However, when applied to molecular representation learning (MRL), prevailing techniques such as masked sub-unit reconstruction often fall short, due to the high degree of freedom in the possible combinations of atoms… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint. Under Review

  24. arXiv:2402.13647  [pdf, other

    cs.CL cs.AI

    Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions

    Authors: Lei Pan, Yunshi Lan, Yang Li, Weining Qian

    Abstract: Unsupervised Text Style Transfer (UTST) has emerged as a critical task within the domain of Natural Language Processing (NLP), aiming to transfer one stylistic aspect of a sentence into another style without changing its semantics, syntax, or other attributes. This task is especially challenging given the intrinsic lack of parallel text pairings. Among existing methods for UTST tasks, attention ma… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  25. arXiv:2402.13125  [pdf, other

    cs.CL cs.AI

    TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

    Authors: Xiang Li, Yunshi Lan, Chao Yang

    Abstract: Recently, numerous new benchmarks have been established to evaluate the performance of large language models (LLMs) via either computing a holistic score or employing another LLM as a judge. However, these approaches suffer from data leakage due to the open access of the benchmark and inflexible evaluation process. To address this issue, we introduce $\textbf{TreeEval}$, a benchmark-free evaluatio… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  26. arXiv:2402.00357  [pdf, other

    cs.CV

    Safety of Multimodal Large Language Models on Images and Texts

    Authors: Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao

    Abstract: Attracted by the impressive power of Multimodal Large Language Models (MLLMs), the public is increasingly utilizing them to improve the efficiency of daily work. Nonetheless, the vulnerabilities of MLLMs to unsafe instructions bring huge safety risks when these models are deployed in real-world scenarios. In this paper, we systematically survey current efforts on the evaluation, attack, and defens… ▽ More

    Submitted 20 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at IJCAI2024

  27. arXiv:2402.00263  [pdf, other

    cs.CL

    Does DetectGPT Fully Utilize Perturbation? Bridge Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better

    Authors: Shengchao Liu, Xiaoming Liu, Yichen Wang, Zehua Cheng, Chengzhengxu Li, Zhaohan Zhang, Yu Lan, Chao Shen

    Abstract: The burgeoning generative capabilities of large language models (LLMs) have raised growing concerns about abuse, demanding automatic machine-generated text detectors. DetectGPT, a zero-shot metric-based detector, first introduces perturbation and shows great performance improvement. However, in DetectGPT, random perturbation strategy could introduce noise, and logit regression depends on threshold… ▽ More

    Submitted 24 February, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  28. arXiv:2401.07518  [pdf, other

    cs.CL cs.AI

    Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends

    Authors: Yunshi Lan, Xinyuan Li, Hanyue Du, Xuesong Lu, Ming Gao, Weining Qian, Aoying Zhou

    Abstract: Natural Language Processing (NLP) aims to analyze text or speech via techniques in the computer science field. It serves the applications in domains of healthcare, commerce, education and so on. Particularly, NLP has been widely applied to the education domain and its applications have enormous potential to help teaching and learning. In this survey, we review recent advances in NLP with the focus… ▽ More

    Submitted 15 March, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  29. arXiv:2312.11057  [pdf, other

    cs.CR cs.AI cs.CV

    DataElixir: Purifying Poisoned Dataset to Mitigate Backdoor Attacks via Diffusion Models

    Authors: Jiachen Zhou, Peizhuo Lv, Yibing Lan, Guozhu Meng, Kai Chen, Hualong Ma

    Abstract: Dataset sanitization is a widely adopted proactive defense against poisoning-based backdoor attacks, aimed at filtering out and removing poisoned samples from training datasets. However, existing methods have shown limited efficacy in countering the ever-evolving trigger functions, and often leading to considerable degradation of benign accuracy. In this paper, we propose DataElixir, a novel sanit… ▽ More

    Submitted 19 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  30. arXiv:2312.10422  [pdf, other

    cs.CV

    Learning Dense Correspondence for NeRF-Based Face Reenactment

    Authors: Songlin Yang, Wei Wang, Yushi Lan, Xiangyu Fan, Bo Peng, Lei Yang, **g Dong

    Abstract: Face reenactment is challenging due to the need to establish dense correspondence between various face representations for motion transfer. Recent studies have utilized Neural Radiance Field (NeRF) as fundamental representation, which further enhanced the performance of multi-view face reenactment in photo-realism and 3D consistency. However, establishing dense correspondence between different fac… ▽ More

    Submitted 18 December, 2023; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted by Proceedings of the AAAI Conference on Artificial Intelligence, 2024

  31. arXiv:2312.10389  [pdf, other

    cs.CV

    ElasticLaneNet: An Efficient Geometry-Flexible Approach for Lane Detection

    Authors: Yaxin Feng, Yuan Lan, Luchan Zhang, Yang Xiang

    Abstract: The task of lane detection involves identifying the boundaries of driving areas in real-time. Recognizing lanes with variable and complex geometric structures remains a challenge. In this paper, we explore a novel and flexible way of implicit lanes representation named \textit{Elastic Lane map (ELM)}, and introduce an efficient physics-informed end-to-end lane detection framework, namely, ElasticL… ▽ More

    Submitted 3 April, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

  32. Scaling Computational Fluid Dynamics: In Situ Visualization of NekRS using SENSEI

    Authors: Victor A. Mateevitsi, Mathis Bode, Nicola Ferrier, Paul Fischer, Jens Henrik Göbbert, Joseph A. Insley, Yu-Hsiang Lan, Misun Min, Michael E. Papka, Saumil Patel, Silvio Rizzi, Jonathan Windgassen

    Abstract: In the realm of Computational Fluid Dynamics (CFD), the demand for memory and computation resources is extreme, necessitating the use of leadership-scale computing platforms for practical domain sizes. This intensive requirement renders traditional checkpointing methods ineffective due to the significant slowdown in simulations while saving state data to disk. As we progress towards exascale and G… ▽ More

    Submitted 18 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

  33. arXiv:2312.07586  [pdf, other

    cs.CV cs.AI cs.LG physics.data-an

    Characteristic Guidance: Non-linear Correction for Diffusion Model at Large Guidance Scale

    Authors: Candi Zheng, Yuan Lan

    Abstract: Popular guidance for denoising diffusion probabilistic model (DDPM) linearly combines distinct conditional models together to provide enhanced control over samples. However, this approach overlooks nonlinear effects that become significant when guidance scale is large. To address this issue, we propose characteristic guidance, a guidance method that provides first-principle non-linear correction f… ▽ More

    Submitted 3 June, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: 8 pages, 7 figures

  34. arXiv:2312.07168  [pdf, other

    cs.LG cs.AI

    Equivariant Flow Matching with Hybrid Probability Transport

    Authors: Yuxuan Song, **g**g Gong, Minkai Xu, Ziyao Cao, Yanyan Lan, Stefano Ermon, Hao Zhou, Wei-Ying Ma

    Abstract: The generation of 3D molecules requires simultaneously deciding the categorical features~(atom types) and continuous features~(atom coordinates). Deep generative models, especially Diffusion Models (DMs), have demonstrated effectiveness in generating feature-rich geometries. However, existing DMs typically suffer from unstable probability dynamics with inefficient sampling speed. In this paper, we… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  35. arXiv:2312.03763  [pdf, other

    cs.CV cs.GR cs.LG

    Gaussian3Diff: 3D Gaussian Diffusion for 3D Full Head Synthesis and Editing

    Authors: Yushi Lan, Feitong Tan, Di Qiu, Qiangeng Xu, Kyle Genova, Zeng Huang, Sean Fanello, Rohit Pandey, Thomas Funkhouser, Chen Change Loy, Yinda Zhang

    Abstract: We present a novel framework for generating photorealistic 3D human head and subsequently manipulating and reposing them with remarkable flexibility. The proposed approach leverages an implicit function representation of 3D human heads, employing 3D Gaussians anchored on a parametric face model. To enhance representational capabilities and encode spatial information, we embed a lightweight tri-pla… ▽ More

    Submitted 19 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: project webpage: https://nirvanalan.github.io/projects/gaussian3diff/

  36. arXiv:2312.01972  [pdf, other

    cond-mat.supr-con

    Do** dependence of superconductivity on a honeycomb lattice within the framework of kinetic-energy-driven superconductivity

    Authors: Yu Lan, Xian-Feng Yu, Li-Ting Zhang

    Abstract: Unconventional superconductivity on a honeycomb lattice has received increasing interest since the discovery of graphene primarily due to the similarities between materials with a honeycomb lattice and cuprate superconductors. Many theoretical studies have been conducted on superconductivity on a honeycomb lattice, however, a consistent picture is still lacking. In this article we have extended th… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 8 pages, 2 figures

  37. arXiv:2311.17600  [pdf, other

    cs.CV

    MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models

    Authors: Xin Liu, Yichen Zhu, **dong Gu, Yunshi Lan, Chao Yang, Yu Qiao

    Abstract: The security concerns surrounding Large Language Models (LLMs) have been extensively explored, yet the safety of Multimodal Large Language Models (MLLMs) remains understudied. In this paper, we observe that Multimodal Large Language Models (MLLMs) can be easily compromised by query-relevant images, as if the text query itself were malicious. To address this, we introduce MM-SafetyBench, a comprehe… ▽ More

    Submitted 19 June, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  38. arXiv:2311.16160  [pdf, other

    q-bio.BM cs.LG

    Protein-ligand binding representation learning from fine-grained interactions

    Authors: Shikun Feng, Minghao Li, Yinjun Jia, Weiying Ma, Yanyan Lan

    Abstract: The binding between proteins and ligands plays a crucial role in the realm of drug discovery. Previous deep learning approaches have shown promising results over traditional computationally intensive methods, but resulting in poor generalization due to limited supervised data. In this paper, we propose to learn protein-ligand binding representation in a self-supervised learning manner. Different f… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  39. arXiv:2311.12035  [pdf, other

    q-bio.QM cs.LG

    Delta Score: Improving the Binding Assessment of Structure-Based Drug Design Methods

    Authors: Minsi Ren, Bowen Gao, Bo Qiang, Yanyan Lan

    Abstract: Structure-based drug design (SBDD) stands at the forefront of drug discovery, emphasizing the creation of molecules that target specific binding pockets. Recent advances in this area have witnessed the adoption of deep generative models and geometric deep learning techniques, modeling SBDD as a conditional generation task where the target structure serves as context. Historically, evaluation of th… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  40. arXiv:2311.09050  [pdf, other

    cs.CV cs.AI

    Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts

    Authors: Yunshi Lan, Xiang Li, Xin Liu, Yang Li, Wei Qin, Weining Qian

    Abstract: Zero-shot Visual Question Answering (VQA) is a prominent vision-language task that examines both the visual and textual understanding capability of systems in the absence of training data. Recently, by converting the images into captions, information across multi-modalities is bridged and Large Language Models (LLMs) can apply their strong zero-shot generalization capability to unseen questions. T… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  41. FlaCGEC: A Chinese Grammatical Error Correction Dataset with Fine-grained Linguistic Annotation

    Authors: Hanyue Du, Yike Zhao, Qingyuan Tian, Jiani Wang, Lei Wang, Yunshi Lan, Xuesong Lu

    Abstract: Chinese Grammatical Error Correction (CGEC) has been attracting growing attention from researchers recently. In spite of the fact that multiple CGEC datasets have been developed to support the research, these datasets lack the ability to provide a deep linguistic topology of grammar errors, which is critical for interpreting and diagnosing CGEC approaches. To address this limitation, we introduce… ▽ More

    Submitted 26 September, 2023; originally announced November 2023.

  42. arXiv:2311.03955  [pdf

    cs.IT cs.AI

    Elastic Information Bottleneck

    Authors: Yuyan Ni, Yanyan Lan, Ao Liu, Zhiming Ma

    Abstract: Information bottleneck is an information-theoretic principle of representation learning that aims to learn a maximally compressed representation that preserves as much information about labels as possible. Under this principle, two different methods have been proposed, i.e., information bottleneck (IB) and deterministic information bottleneck (DIB), and have gained significant progress in explaini… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  43. arXiv:2311.02124  [pdf, other

    q-bio.BM cs.AI cs.LG

    Sliced Denoising: A Physics-Informed Molecular Pre-Training Method

    Authors: Yuyan Ni, Shikun Feng, Wei-Ying Ma, Zhi-Ming Ma, Yanyan Lan

    Abstract: While molecular pre-training has shown great potential in enhancing drug discovery, the lack of a solid physical interpretation in current methods raises concerns about whether the learned representation truly captures the underlying explanatory factors in observed data, ultimately resulting in limited generalization and robustness. Although denoising methods offer a physical interpretation, their… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  44. arXiv:2310.18682  [pdf, ps, other

    math.RT math.QA

    Lusztig sheaves and tensor products of integrable highest weight modules

    Authors: Jiepeng Fang, Yixin Lan

    Abstract: By introducing $N$-framed quivers, we define the localization of Lusztig's sheaves for $N$-framed quivers and functors $E^{(n)}_{i}, F^{(n)}_{i}, K^{\pm}_i$ for localizations. This gives a categorical realization of tensor products of integrable highest weight modules of the quantized envelo** algebra. The simple perverse sheaves in the localization provide the canonical basis of tensor products… ▽ More

    Submitted 30 October, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 35 pages, arXiv admin note: text overlap with arXiv:2307.16131

    MSC Class: 16G20; 17B37

  45. arXiv:2310.16535  [pdf, other

    cs.CL cs.AI

    R$^3$ Prompting: Review, Rephrase and Resolve for Chain-of-Thought Reasoning in Large Language Models under Noisy Context

    Authors: Qingyuan Tian, Hanlun Zhu, Lei Wang, Yang Li, Yunshi Lan

    Abstract: With the help of Chain-of-Thought (CoT) prompting, Large Language Models (LLMs) have achieved remarkable performance on various reasoning tasks. However, most of them have been evaluated under noise-free context and the dilemma for LLMs to produce inaccurate results under the noisy context has not been fully investigated. Existing studies utilize trigger sentences to encourage LLMs to concentrate… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  46. arXiv:2310.14985  [pdf, other

    cs.CL

    LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

    Authors: Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang

    Abstract: This paper aims to investigate the open research problem of uncovering the social behaviors of LLM-based agents. To achieve this goal, we adopt Avalon, a representative communication game, as the environment and use system prompts to guide LLM agents to play the game. While previous studies have conducted preliminary investigations into gameplay with LLM agents, there lacks research on their socia… ▽ More

    Submitted 7 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  47. Mock Observations: Formation and Evolution of diffuse light in Galaxy Groups and Clusters in the IllustrisTNG Simulations

    Authors: Lin Tang, Weipeng Lin, Yang Wang, **g Li, Yanyao Lan

    Abstract: In this paper, by analyzing mock images from the IllustrisTNG100-1 simulation, we examine the properties of the diffuse light and compare them to those of central and satellite galaxies. Our findings suggest that the majority of the diffuse light originates from satellites. This claim is supported by the similarity between the age and metallicity distributions of the diffuse light and those of the… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 8 figures, 2 tables, Accepted for publication in ApJ

    Journal ref: The Astrophysical Journal, 2023, Volume 959, Number 2

  48. arXiv:2310.14216  [pdf, other

    cs.LG cs.AI q-bio.BM

    UniMAP: Universal SMILES-Graph Representation Learning

    Authors: Shikun Feng, Lixin Yang, Weiying Ma, Yanyan Lan

    Abstract: Molecular representation learning is fundamental for many drug related applications. Most existing molecular pre-training models are limited in using single molecular modality, either SMILES or graph representation. To effectively leverage both modalities, we argue that it is critical to capture the fine-grained 'semantics' between SMILES and graph, because subtle sequence/graph differences may le… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  49. arXiv:2310.11295  [pdf, other

    cs.CV cs.CG

    CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation

    Authors: Zhaojie Chu, Kailing Guo, Xiaofen Xing, Yilin Lan, Bolun Cai, Xiangmin Xu

    Abstract: Speech-driven 3D facial animation is a challenging cross-modal task that has attracted growing research interest. During speaking activities, the mouth displays strong motions, while the other facial regions typically demonstrate comparatively weak activity levels. Existing approaches often simplify the process by directly map** single-level speech features to the entire facial animation, which… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  50. arXiv:2310.08395  [pdf, other

    cs.CL cs.AI

    Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation

    Authors: Yuanyuan Liang, Jianing Wang, Hanlun Zhu, Lei Wang, Weining Qian, Yunshi Lan

    Abstract: The task of Question Generation over Knowledge Bases (KBQG) aims to convert a logical form into a natural language question. For the sake of expensive cost of large-scale question annotation, the methods of KBQG under low-resource scenarios urgently need to be developed. However, current methods heavily rely on annotated data for fine-tuning, which is not well-suited for few-shot question generati… ▽ More

    Submitted 23 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 main conference