Showing 1–50 of 367 results for author: Zhou, F

Searching in archive cs.
  1. arXiv:2406.19611  [pdf, other]

    q-bio.QM cs.AI

    Multimodal Data Integration for Precision Oncology: Challenges and Future Directions

    Authors: Huajun Zhou, Fengtao Zhou, Chenyu Zhao, Yingxue Xu, Luyang Luo, Hao Chen

    Abstract: The essence of precision oncology lies in its commitment to tailor targeted treatments and care measures to each patient based on the individual characteristics of the tumor. The inherent heterogeneity of tumors necessitates gathering information from diverse data sources to provide valuable insights from various perspectives, fostering a holistic comprehension of the tumor. Over the past decade,…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 15 pages, 4 figures

  2. arXiv:2406.18364  [pdf]

    cs.CL cs.AI

    Research on Information Extraction of LCSTS Dataset Based on an Improved BERTSum-LSTM Model

    Authors: Yiming Chen, Haobin Chen, Simin Liu, Yunyun Liu, Fanhao Zhou, Bing Wei

    Abstract: With the continuous advancement of artificial intelligence, natural language processing technology has become widely utilized in various fields. At the same time, there are many challenges in creating Chinese news summaries. First of all, the semantics of Chinese news is complex, and the amount of information is enormous. Extracting critical information from Chinese news presents a significant cha…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: submitted to ICMIII 2024

  3. arXiv:2406.17797  [pdf, other]

    physics.chem-ph cs.AI cs.LG

    MoleculeCLA: Rethinking Molecular Benchmark via Computational Ligand-Target Binding Analysis

    Authors: Shikun Feng, Jiaxin Zheng, Yinjun Jia, Yanwen Huang, Fengfeng Zhou, Wei-Ying Ma, Yanyan Lan

    Abstract: Molecular representation learning is pivotal for various molecular property prediction tasks related to drug discovery. Robust and accurate benchmarks are essential for refining and validating current methods. Existing molecular property benchmarks derived from wet experiments, however, face limitations such as data volume constraints, unbalanced label distribution, and noisy labels. To address th…

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2406.14887  [pdf, other]

    cs.CL

    InternLM-Law: An Open Source Chinese Legal Large Language Model

    Authors: Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge

    Abstract: While large language models (LLMs) have showcased impressive capabilities, they struggle with addressing legal queries due to the intricate complexities and specialized expertise required in the legal field. In this paper, we introduce InternLM-Law, a specialized LLM tailored for addressing diverse legal queries related to Chinese laws, spanning from responding to standard legal questions (e.g., l…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Our dataset, code and models will be released at https://github.com/InternLM/InternLM-Law

  5. arXiv:2406.13555  [pdf, other]

    cs.CL cs.AI

    BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation

    Authors: Minchong Li, Feng Zhou, Xiaohui Song

    Abstract: In recent years, large language models (LLMs) have shown exceptional capabilities across various natural language processing (NLP) tasks. However, such impressive performance often comes with the trade-off of an increased parameter size, posing significant challenges for widespread deployment. Knowledge distillation (KD) provides a solution by transferring knowledge from a large teacher model to a…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Submitted to ARR June (for EMNLP 2024)

  6. arXiv:2406.12753  [pdf, other]

    cs.CL cs.AI

    OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

    Authors: Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang, et al. (3 additional authors not shown)

    Abstract: The evolution of Artificial Intelligence (AI) has been significantly accelerated by advancements in Large Language Models (LLMs) and Large Multimodal Models (LMMs), gradually showcasing potential cognitive reasoning abilities in problem-solving and scientific discovery (i.e., AI4Science) once exclusive to human intellect. To comprehensively evaluate current models' performance in cognitive reasoni…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 44 pages

  7. GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting

    Authors: Fan Zhou, Chen Pan, Lintao Ma, Yu Liu, James Zhang, Jun Zhou, Hongyuan Mei, Weitao Lin, Zi Zhuang, Wenxin Ning, Yunhua Hu, Siqiao Xue

    Abstract: Time series forecasts of different temporal granularity are widely used in real-world applications, e.g., sales prediction in days and weeks for making different inventory plans. However, these tasks are usually solved separately without ensuring coherence, which is crucial for aligning downstream decisions. Previous works mainly focus on ensuring coherence with some straightforward methods, e.g.,…

    Submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.11434  [pdf, other]

    cs.DB

    DB-GPT-Hub: Towards Open Benchmarking Text-to-SQL Empowered by Large Language Models

    Authors: Fan Zhou, Siqiao Xue, Danrui Qi, Wenhui Shi, Wang Zhao, Ganglin Wei, Hongyang Zhang, Caigai Jiang, Gangwei Jiang, Zhixuan Chu, Faqiang Chen

    Abstract: Large language models (LLMs) becomes the dominant paradigm for the challenging task of text-to-SQL. LLM-empowered text-to-SQL methods are typically categorized into prompting-based and tuning approaches. Compared to prompting-based methods, benchmarking fine-tuned LLMs for text-to-SQL is important yet under-explored, partially attributed to the prohibitively high computational cost. In this paper,…

    Submitted 17 June, 2024; originally announced June 2024.

  9. arXiv:2406.10869  [pdf, other]

    eess.IV cs.CV

    Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution

    Authors: Cuixin Yang, Rongkang Dong, Jun Xiao, Cong Zhang, Kin-Man Lam, Fei Zhou, Guoping Qiu

    Abstract: As virtual and augmented reality applications gain popularity, omnidirectional image (ODI) super-resolution has become increasingly important. Unlike 2D plain images that are formed on a plane, ODIs are projected onto spherical surfaces. Applying established image super-resolution methods to ODIs, therefore, requires performing equirectangular projection (ERP) to map the ODIs onto a plane. ODI sup…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 13 pages, 12 figures, journal

  10. arXiv:2406.09858  [pdf, other]

    cs.CV

    Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment

    Authors: Fei Zhou, Zhicong Huang, Tianhao Gu, Guoping Qiu

    Abstract: The visual quality of an image is confounded by a number of intertwined factors including its semantic content, distortion characteristics and appearance properties such as brightness, contrast, sharpness, and colourfulness. Distilling high level knowledge about all these quality bearing attributes is crucial for developing objective Image Quality Assessment (IQA). While existing solutions have mod…

    Submitted 21 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  11. arXiv:2406.05036  [pdf, other]

    cs.LG cs.AI

    TimeSieve: Extracting Temporal Dynamics through Information Bottlenecks

    Authors: Ninghui Feng, Songning Lai, Fobao Zhou, Zhenxiao Yin, Hang Zhao

    Abstract: Time series forecasting has become an increasingly popular research area due to its critical applications in various real-world domains such as traffic management, weather prediction, and financial analysis. Despite significant advancements, existing models face notable challenges, including the necessity of manual hyperparameter tuning for different datasets, and difficulty in effectively disting…

    Submitted 7 June, 2024; originally announced June 2024.

  12. arXiv:2406.03421  [pdf, other]

    cs.CV

    Post-hoc Part-prototype Networks

    Authors: Andong Tan, Fengtao Zhou, Hao Chen

    Abstract: Post-hoc explainability methods such as Grad-CAM are popular because they do not influence the performance of a trained model. However, they mainly reveal "where" a model looks at for a given input, fail to explain "what" the model looks for (e.g., what is important to classify a bird image to a Scott Oriole?). Existing part-prototype networks leverage part-prototypes (e.g., characteristic Scott O…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  13. arXiv:2405.16940  [pdf, other]

    cs.CV

    Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models

    Authors: Fengfan Zhou, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Lizhuang Ma, Hefei Ling

    Abstract: Adversarial attacks on Face Recognition (FR) systems have proven highly effective in compromising pure FR models, yet adversarial examples may be ineffective to the complete FR systems as Face Anti-Spoofing (FAS) models are often incorporated and can detect a significant number of them. To address this under-explored and essential problem, we propose a novel setting of adversarially attacking both…

    Submitted 27 May, 2024; originally announced May 2024.

  14. arXiv:2405.16197  [pdf, other]

    cs.CV eess.IV

    A 7K Parameter Model for Underwater Image Enhancement based on Transmission Map Prior

    Authors: Fuheng Zhou, Dikai Wei, Ye Fan, Yulong Huang, Yonggang Zhang

    Abstract: Although deep learning based models for underwater image enhancement have achieved good performance, they face limitations in both lightweight and effectiveness, which prevents their deployment and application on resource-constrained platforms. Moreover, most existing deep learning based models use data compression to get high-level semantic information in latent space instead of using the origina…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 10 pages

  15. arXiv:2405.16059  [pdf, other]

    cs.SI

    Interpretable Transformer Hawkes Processes: Unveiling Complex Interactions in Social Networks

    Authors: Zizhuo Meng, Ke Wan, Yadong Huang, Zhidong Li, Yang Wang, Feng Zhou

    Abstract: Social networks represent complex ecosystems where the interactions between users or groups play a pivotal role in information dissemination, opinion formation, and social interactions. Effectively harnessing event sequence data within social networks to unearth interactions among users or groups has persistently posed a challenging frontier within the realm of point processes. Current deep point…

    Submitted 28 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  16. arXiv:2405.15599  [pdf, ps, other]

    cs.LG stat.ML

    On the Computational Landscape of Replicable Learning

    Authors: Alkis Kalavasis, Amin Karbasi, Grigoris Velegkas, Felix Zhou

    Abstract: We study computational aspects of algorithmic replicability, a notion of stability introduced by Impagliazzo, Lei, Pitassi, and Sorrell [2022]. Motivated by a recent line of work that established strong statistical connections between replicability and other notions of learnability such as online learning, private learning, and SQ learning, we aim to understand better the computational connections…

    Submitted 24 May, 2024; originally announced May 2024.

  17. arXiv:2405.12209  [pdf, other]

    cs.CL

    MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark

    Authors: Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen

    Abstract: Recent advancements in large language models (LLMs) have showcased significant improvements in mathematics. However, traditional math benchmarks like GSM8k offer a unidimensional perspective, falling short in providing a holistic assessment of the LLMs' math capabilities. To address this gap, we introduce MathBench, a new benchmark that rigorously assesses the mathematical capabilities of large la…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Project: https://github.com/open-compass/MathBench

  18. arXiv:2405.11281  [pdf, other]

    cs.DC cs.AI

    Cooperative Cognitive Dynamic System in UAV Swarms: Reconfigurable Mechanism and Framework

    Authors: Ziye Jia, Jiahao You, Chao Dong, Qihui Wu, Fuhui Zhou, Dusit Niyato, Zhu Han

    Abstract: As the demands for immediate and effective responses increase in both civilian and military domains, the unmanned aerial vehicle (UAV) swarms emerge as effective solutions, in which multiple cooperative UAVs can work together to achieve specific goals. However, how to manage such complex systems to ensure real-time adaptability lack sufficient researches. Hence, in this paper, we propose the coope…

    Submitted 18 May, 2024; originally announced May 2024.

  19. arXiv:2405.08938  [pdf, ps, other]

    cs.DS

    Pointwise Lipschitz Continuous Graph Algorithms via Proximal Gradient Analysis

    Authors: Quanquan C. Liu, Grigoris Velegkas, Yuichi Yoshida, Felix Zhou

    Abstract: In many real-world applications, it is prohibitively expensive to drastically change the solution to a problem after a small perturbation in the environment. Therefore, the stability of an algorithm is a very desirable property. In this paper, we study the class of pointwise Lipschitz continuous algorithms as introduced in the recent work of Kumabe and Yoshida [KY23b, FOCS'23]. The Lipschitz const…

    Submitted 14 May, 2024; originally announced May 2024.

  20. arXiv:2405.08603  [pdf, other]

    cs.CL

    A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine

    Authors: Hanguang Xiao, Feizhong Zhou, Xingyue Liu, Tianqi Liu, Zhipeng Li, Xin Liu, Xiaoxuan Huang

    Abstract: Since the release of ChatGPT and GPT-4, large language models (LLMs) and multimodal large language models (MLLMs) have garnered significant attention due to their powerful and general capabilities in understanding, reasoning, and generation, thereby offering new paradigms for the integration of artificial intelligence with medicine. This survey comprehensively overviews the development background…

    Submitted 14 May, 2024; originally announced May 2024.

  21. arXiv:2405.08005  [pdf, other]

    math.OC cs.AI cs.GT cs.LG stat.ML

    Graphon Mean Field Games with a Representative Player: Analysis and Learning Algorithm

    Authors: Fuzhong Zhou, Chenyu Zhang, Xu Chen, Xuan Di

    Abstract: We propose a discrete time graphon game formulation on continuous state and action spaces using a representative player to study stochastic games with heterogeneous interaction among agents. This formulation admits both philosophical and mathematical advantages, compared to a widely adopted formulation using a continuum of players. We prove the existence and uniqueness of the graphon equilibrium w…

    Submitted 4 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: Published as a conference paper at ICML 2024

  22. arXiv:2405.07088  [pdf, other]

    cs.HC

    Towards Context-Aware Modeling of Situation Awareness in Conditionally Automated Driving

    Authors: Lilit Avetisyan, X. Jessie Yang, Feng Zhou

    Abstract: Maintaining adequate situation awareness (SA) is crucial for the safe operation of conditionally automated vehicles (AVs), which requires drivers to regain control during takeover (TOR) events. This study developed a predictive model for real-time assessment of driver SA using multimodal data (e.g., galvanic skin response, heart rate and eye tracking data, and driver characteristics) collected in…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 37 Pages, 8 figures

  23. arXiv:2405.01507  [pdf, other]

    cs.LG stat.ML

    Accelerating Convergence in Bayesian Few-Shot Classification

    Authors: Tianjun Ke, Haoqun Cao, Feng Zhou

    Abstract: Bayesian few-shot classification has been a focal point in the field of few-shot learning. This paper seamlessly integrates mirror descent-based variational inference into Gaussian process-based few-shot classification, addressing the challenge of non-conjugate inference. By leveraging non-Euclidean geometry, mirror descent achieves accelerated convergence by providing the steepest descent directi…

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  24. arXiv:2404.17771  [pdf, ps, other]

    cs.CV

    Characterization of dim light response in DVS pixel: Discontinuity of event triggering time

    Authors: Xiao Jiang, Fei Zhou

    Abstract: Dynamic Vision Sensors (DVS) have recently generated great interest because of the advantages of wide dynamic range and low latency compared with conventional frame-based cameras. However, the complicated behaviors in dim light conditions are still not clear, restricting the applications of DVS. In this paper, we analyze the typical DVS circuit, and find that there exists discontinuity of event tr…

    Submitted 30 April, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures

  25. arXiv:2404.15891  [pdf, other]

    cs.CV

    OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation

    Authors: Lizhi Wang, Feng Zhou, Jianqin Yin

    Abstract: Recent advancements in 3D reconstruction technologies have paved the way for high-quality and real-time rendering of complex 3D scenes. Despite these achievements, a notable challenge persists: it is difficult to precisely reconstruct specific objects from large scenes. Current scene reconstruction techniques frequently result in the loss of object detail textures and are unable to reconstruct obj…

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.17061 by other authors

  26. arXiv:2404.12367  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Information theory unifies atomistic machine learning, uncertainty quantification, and materials thermodynamics

    Authors: Daniel Schwalbe-Koda, Sebastien Hamel, Babak Sadigh, Fei Zhou, Vincenzo Lordi

    Abstract: An accurate description of information is relevant for a range of problems in atomistic modeling, such as sampling methods, detecting rare events, analyzing datasets, or performing uncertainty quantification (UQ) in machine learning (ML)-driven simulations. Although individual methods have been proposed for each of these tasks, they lack a common theoretical background integrating their solutions.…

    Submitted 18 April, 2024; originally announced April 2024.

    Report number: LLNL-JRNL-862887-DRAFT

  27. arXiv:2404.10209  [pdf, other]

    cs.AI cs.LG

    Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models

    Authors: Siqiao Xue, Danrui Qi, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang, Zhiping Zhang, Jianshan He, Hongyang Zhang, Ganglin Wei, Wang Zhao, Fan Zhou, Hong Yi, Shaodong Liu, Hongjun Yang, Faqiang Chen

    Abstract: The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. The technologies of interacting with data particularly have an important entanglement with LLMs as efficient and intuitive data interactions are paramount. In this paper, we present DB-GPT, a revolutionary and product-ready Python library that integrates LLMs into traditional data interact…

    Submitted 24 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  28. arXiv:2404.09738  [pdf]

    q-bio.BM cs.AI q-bio.QM

    AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides

    Authors: Kewei Li, Yuqian Wu, Yutong Guo, Yinheng Li, Yusi Fan, Ruochi Zhang, Lan Huang, Fengfeng Zhou

    Abstract: Activity cliff (AC) is a phenomenon that a pair of similar molecules differ by a small structural alternation but exhibit a large difference in their biochemical activities. The AC of small molecules has been extensively investigated but limited knowledge is accumulated about the AC phenomenon in peptides with canonical amino acids. This study introduces a quantitative definition and benchmarking…

    Submitted 15 April, 2024; originally announced April 2024.

  29. arXiv:2404.08661  [pdf, other]

    cs.CL

    The Comparison of Translationese in Machine Translation and Human Transation in terms of Translation Relations

    Authors: Fan Zhou

    Abstract: This study explores the distinctions between neural machine translation (NMT) and human translation (HT) through the lens of translation relations. It benchmarks HT to assess the translation techniques produced by an NMT system and aims to address three key research questions: the differences in overall translation relations between NMT and HT, how each utilizes non-literal translation techniques,…

    Submitted 27 March, 2024; originally announced April 2024.

  30. arXiv:2404.07032  [pdf, other]

    cs.CV

    An Evidential-enhanced Tri-Branch Consistency Learning Method for Semi-supervised Medical Image Segmentation

    Authors: Zhenxi Zhang, Heng Zhou, Xiaoran Shi, Ran Ran, Chunna Tian, Feng Zhou

    Abstract: Semi-supervised segmentation presents a promising approach for large-scale medical image analysis, effectively reducing annotation burdens while achieving comparable performance. This methodology holds substantial potential for streamlining the segmentation process and enhancing its feasibility within clinical settings for translational investigations. While cross-supervised training, based on dis…

    Submitted 10 April, 2024; originally announced April 2024.

  31. arXiv:2404.02394  [pdf, other]

    eess.IV cs.CV

    Cohort-Individual Cooperative Learning for Multimodal Cancer Survival Analysis

    Authors: Huajun Zhou, Fengtao Zhou, Hao Chen

    Abstract: Recently, we have witnessed impressive achievements in cancer survival analysis by integrating multimodal data, e.g., pathology images and genomic profiles. However, the heterogeneity and high dimensionality of these modalities pose significant challenges for extracting discriminative representations while maintaining good generalization. In this paper, we propose a Cohort-individual Cooperative L…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 10 pages, 9 figures

  32. arXiv:2404.01192  [pdf, other]

    eess.IV cs.CV

    iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric Cancer

    Authors: Fengtao Zhou, Yingxue Xu, Yanfen Cui, Shenyan Zhang, Yun Zhu, Weiyang He, Jiguang Wang, Xin Wang, Ronald Chan, Louis Ho Shing Lau, Chu Han, Dafu Zhang, Zhenhui Li, Hao Chen

    Abstract: Gastric cancer (GC) is a prevalent malignancy worldwide, ranking as the fifth most common cancer with over 1 million new cases and 700 thousand deaths in 2020. Locally advanced gastric cancer (LAGC) accounts for approximately two-thirds of GC diagnoses, and neoadjuvant chemotherapy (NACT) has emerged as the standard treatment for LAGC. However, the effectiveness of NACT varies significantly among…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 27 pages, 9 figures, 3 tables (under review)

  33. arXiv:2404.00312  [pdf, other]

    cs.CV cs.AI

    Bayesian Exploration of Pre-trained Models for Low-shot Image Classification

    Authors: Yibo Miao, Yu Lei, Feng Zhou, Zhijie Deng

    Abstract: Low-shot image classification is a fundamental task in computer vision, and the emergence of large-scale vision-language models such as CLIP has greatly advanced the forefront of research in this field. However, most existing CLIP-based methods lack the flexibility to effectively incorporate other pre-trained models that encompass knowledge distinct from CLIP. To bridge the gap, this work proposes…

    Submitted 30 March, 2024; originally announced April 2024.

  34. arXiv:2403.17297  [pdf, other]

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m…

    Submitted 25 March, 2024; originally announced March 2024.

  35. arXiv:2403.14454  [pdf, other]

    cs.CL

    Prediction of Translation Techniques for the Translation Process

    Authors: Fan Zhou, Vincent Vandeghinste

    Abstract: Machine translation (MT) encompasses a variety of methodologies aimed at enhancing the accuracy of translations. In contrast, the process of human-generated translation relies on a wide range of translation techniques, which are crucial for ensuring linguistic adequacy and fluency. This study suggests that these translation techniques could further optimize machine translation if they are automati…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 11 pages, 6 figures, conference

  36. arXiv:2403.09433  [pdf, other]

    cs.CV

    Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization

    Authors: Zhao Wang, Aoxue Li, Fengwei Zhou, Zhenguo Li, Qi Dou

    Abstract: Classical object detectors are incapable of detecting novel class objects that are not encountered before. Regarding this issue, Open-Vocabulary Object Detection (OVOD) is proposed, which aims to detect the objects in the candidate class list. However, current OVOD models are suffering from overfitting on the base classes, heavily relying on the large-scale extra data, and complex training process…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: BMVC 2023

  37. arXiv:2403.08599  [pdf, other]

    cs.SI

    The role of susceptible individuals in spreading dynamics

    Authors: Chang Su, Fang Zhou, Linyuan Lü

    Abstract: Exploring the internal mechanism of information spreading is critical for understanding and controlling the process. Traditional spreading models often assume individuals play the same role in the spreading process. In reality, however, individuals' diverse characteristics contribute differently to the spreading performance, leading to a heterogeneous infection rate across the system. To investiga…

    Submitted 13 March, 2024; originally announced March 2024.

  38. arXiv:2403.04279  [pdf, other]

    cs.CV

    Controllable Generation with Text-to-Image Diffusion Models: A Survey

    Authors: Pu Cao, Feng Zhou, Qing Song, Lu Yang

    Abstract: In the rapidly advancing realm of visual generation, diffusion models have revolutionized the landscape, marking a significant shift in capabilities with their impressive text-guided generative functions. However, relying solely on text for conditioning these models does not fully cater to the varied and complex requirements of different applications and scenarios. Acknowledging this shortfall, a…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: A collection of resources on controllable generation with text-to-image diffusion models: https://github.com/PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models

  39. arXiv:2403.00625  [pdf, other]

    cs.LG cs.CY

    Bias Mitigation in Fine-tuning Pre-trained Models for Enhanced Fairness and Efficiency

    Authors: Yixuan Zhang, Feng Zhou

    Abstract: Fine-tuning pre-trained models is a widely employed technique in numerous real-world applications. However, fine-tuning these models on new tasks can lead to unfair outcomes. This is due to the absence of generalization guarantees for fairness properties, regardless of whether the original pre-trained model was developed with fairness considerations. To tackle this issue, we introduce an efficient…

    Submitted 1 March, 2024; originally announced March 2024.

  40. arXiv:2402.17228  [pdf, other]

    cs.CV

    Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology

    Authors: Wenhao Tang, Fengtao Zhou, Sheng Huang, Xiang Zhu, Yi Zhang, Bo Liu

    Abstract: Multiple instance learning (MIL) is the most widely used framework in computational pathology, encompassing sub-typing, diagnosis, prognosis, and more. However, the existing MIL paradigm typically requires an offline instance feature extractor, such as a pre-trained ResNet or a foundation model. This approach lacks the capability for feature fine-tuning within the specific downstream tasks, limiti…

    Submitted 8 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR2024

  41. arXiv:2402.16586  [pdf, other]

    cs.CV

    Improving the JPEG-resistance of Adversarial Attacks on Face Recognition by Interpolation Smoothing

    Authors: Kefu Guo, Fengfan Zhou, Hefei Ling, Ping Li, Hui Liu

    Abstract: JPEG compression can significantly impair the performance of adversarial face examples, which previous adversarial attacks on face recognition (FR) have not adequately addressed. Considering this challenge, we propose a novel adversarial attack on FR that aims to improve the resistance of adversarial examples against JPEG compression. Specifically, during the iterative process of generating advers…

    Submitted 26 February, 2024; originally announced February 2024.

  42. arXiv:2402.13857  [pdf, ps, other]

    cs.LG

    Replicable Learning of Large-Margin Halfspaces

    Authors: Alkis Kalavasis, Amin Karbasi, Kasper Green Larsen, Grigoris Velegkas, Felix Zhou

    Abstract: We provide efficient replicable algorithms for the problem of learning large-margin halfspaces. Our results improve upon the algorithms provided by Impagliazzo, Lei, Pitassi, and Sorrell [STOC, 2022]. We design the first dimension-independent replicable algorithms for this task which runs in polynomial time, is proper, and has strictly improved sample complexity compared to the one achieved by Imp…

    Submitted 1 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: to be published in ICML 2024

  43. arXiv:2402.11296  [pdf, other]

    cs.CL cs.AI

    Dissecting Human and LLM Preferences

    Authors: Junlong Li, Fan Zhou, Shichao Sun, Yikai Zhang, Hai Zhao, Pengfei Liu

    Abstract: As a relative quality comparison of model responses, human and Large Language Model (LLM) preferences serve as common alignment goals in model fine-tuning and criteria in evaluation. Yet, these preferences merely reflect broad tendencies, resulting in less explainable and controllable models with potential safety risks. In this work, we dissect the preferences of human and 32 different LLMs to und…

    Submitted 17 February, 2024; originally announced February 2024.

  44. arXiv:2402.07834  [pdf, other

    cs.LG

    Generalizing across Temporal Domains with Koopman Operators

    Authors: Qiuhao Zeng, Wei Wang, Fan Zhou, Gezheng Xu, Ruizhi Pu, Changjian Shui, Christian Gagne, Shichun Yang, Boyu Wang, Charles X. Ling

    Abstract: In the field of domain generalization, the task of constructing a predictive model capable of generalizing to a target domain without access to target data remains challenging. This problem becomes further complicated when considering evolving dynamics between domains. While various approaches have been proposed to address this issue, a comprehensive understanding of the underlying generalization…

    Submitted 15 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 15 pages, 7 figures, Accepted by AAAI 2024. arXiv admin note: text overlap with arXiv:2206.00047

  45. arXiv:2402.06332  [pdf, other

    cs.CL

    InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

    Authors: Huaiyuan Ying, Shuo Zhang, Linyang Li, Zhejian Zhou, Yunfan Shao, Zhaoye Fei, Yichuan Ma, Jiawei Hong, Kuikun Liu, Ziyi Wang, Yudong Wang, Zijian Wu, Shuaibin Li, Fengzhe Zhou, Hongwei Liu, Songyang Zhang, Wenwei Zhang, Hang Yan, Xipeng Qiu, Jiayu Wang, Kai Chen, Dahua Lin

    Abstract: The math abilities of large language models can represent their abstract reasoning ability. In this paper, we introduce and open-source our math reasoning LLMs InternLM-Math which is continue pre-trained from InternLM2. We unify chain-of-thought reasoning, reward modeling, formal reasoning, data augmentation, and code interpreter in a unified seq2seq format and supervise our model to be a versatil…

    Submitted 24 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  46. arXiv:2402.02697  [pdf, ps, other

    cs.LG stat.ML

    Deep Equilibrium Models are Almost Equivalent to Not-so-deep Explicit Models for High-dimensional Gaussian Mixtures

    Authors: Zenan Ling, Longbo Li, Zhanbo Feng, Yixuan Zhang, Feng Zhou, Robert C. Qiu, Zhenyu Liao

    Abstract: Deep equilibrium models (DEQs), as a typical implicit neural network, have demonstrated remarkable success on various tasks. There is, however, a lack of theoretical understanding of the connections and differences between implicit DEQs and explicit neural network models. In this paper, leveraging recent advances in random matrix theory (RMT), we perform an in-depth analysis on the eigenspectra of…

    Submitted 19 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024

  47. arXiv:2402.00853  [pdf, other

    cs.LG cond-mat.mtrl-sci

    LTAU-FF: Loss Trajectory Analysis for Uncertainty in Atomistic Force Fields

    Authors: Joshua A. Vita, Amit Samanta, Fei Zhou, Vincenzo Lordi

    Abstract: Model ensembles are effective tools for estimating prediction uncertainty in deep learning atomistic force fields. However, their widespread adoption is hindered by high computational costs and overconfident error estimates. In this work, we address these challenges by leveraging distributions of per-sample errors obtained during training and employing a distance-based similarity search in the mod…

    Submitted 22 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  48. arXiv:2401.11360  [pdf

    cs.LG cs.AI cs.CE q-bio.BM

    PepHarmony: A Multi-View Contrastive Learning Framework for Integrated Sequence and Structure-Based Peptide Encoding

    Authors: Ruochi Zhang, Haoran Wu, Chang Liu, Huaping Li, Yuqian Wu, Kewei Li, Yifan Wang, Yifan Deng, Jiahui Chen, Fengfeng Zhou, Xin Gao

    Abstract: Recent advances in protein language models have catalyzed significant progress in peptide sequence representation. Despite extensive exploration in this field, pre-trained models tailored for peptide-specific needs remain largely unaddressed due to the difficulty in capturing the complex and sometimes unstable structures of peptides. This study introduces a novel multi-view contrastive learning fr…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 25 pages, 5 figures, 3 tables

  49. arXiv:2401.08903  [pdf, other

    cs.CV cs.LG

    Rethinking Impersonation and Dodging Attacks on Face Recognition Systems

    Authors: Fengfan Zhou, Qianyu Zhou, Bangjie Yin, Hui Zheng, Xuequan Lu, Lizhuang Ma, Hefei Ling

    Abstract: Face Recognition (FR) systems can be easily deceived by adversarial examples that manipulate benign face images through imperceptible perturbations. Adversarial attacks on FR encompass two types: impersonation (targeted) attacks and dodging (untargeted) attacks. Previous methods often achieve a successful impersonation attack on FR; however, it does not necessarily guarantee a successful dodging a…

    Submitted 25 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  50. arXiv:2401.08449  [pdf, other

    cs.MM

    CLIPRerank: An Extremely Simple Method for Improving Ad-hoc Video Search

    Authors: Aozhu Chen, Fangming Zhou, Ziyuan Wang, Xirong Li

    Abstract: Ad-hoc Video Search (AVS) enables users to search for unlabeled video content using on-the-fly textual queries. Current deep learning-based models for AVS are trained to optimize holistic similarity between short videos and their associated descriptions. However, due to the diversity of ad-hoc queries, even for a short video, its truly relevant part w.r.t. a given query can be of shorter duration.…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024