Skip to main content

Showing 1–50 of 161 results for author: Peng, D

.
  1. arXiv:2407.01031  [pdf, other

    cs.LG cs.CL

    PocketLLM: Enabling On-Device Fine-Tuning for Personalized LLMs

    Authors: Dan Peng, Zhihui Fu, Jun Wang

    Abstract: Recent advancements in large language models (LLMs) have indeed showcased their impressive capabilities. On mobile devices, the wealth of valuable, non-public data generated daily holds great promise for locally fine-tuning personalized LLMs, while maintaining privacy through on-device processing. However, the constraints of mobile device resources pose challenges to direct on-device LLM fine-tuni… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted to the ACL 2024 Workshop on Privacy in Natural Language Processing (PrivateNLP)

  2. arXiv:2406.18681  [pdf, other

    stat.ME

    Data Sketching and Stacking: A Confluence of Two Strategies for Predictive Inference in Gaussian Process Regressions with High-Dimensional Features

    Authors: Samuel Gailliot, Rajarshi Guhaniyogi, Roger D. Peng

    Abstract: This article focuses on drawing computationally-efficient predictive inference from Gaussian process (GP) regressions with a large number of features when the response is conditionally independent of the features given the projection to a noisy low dimensional manifold. Bayesian estimation of the regression relationship using Markov Chain Monte Carlo and subsequent predictive inference is computat… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 32 Pages, 10 Figures

  3. arXiv:2405.17732  [pdf, other

    cs.CL

    C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models

    Authors: Jiahuan Cao, Yongxin Shi, Dezhi Peng, Yang Liu, Lianwen **

    Abstract: Classical Chinese Understanding (CCU) holds significant value in preserving and exploration of the outstanding traditional Chinese culture. Recently, researchers have attempted to leverage the potential of Large Language Models (LLMs) for CCU by capitalizing on their remarkable comprehension and semantic capabilities. However, no comprehensive benchmark is available to assess the CCU capabilities… ▽ More

    Submitted 30 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.11336  [pdf, other

    cs.CV

    UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual Checkers

    Authors: Duo Peng, Qiuhong Ke, Jun Liu

    Abstract: Text-to-Image (T2I) models have raised security concerns due to their potential to generate inappropriate or harmful images. In this paper, we propose UPAM, a novel framework that investigates the robustness of T2I models from the attack perspective. Unlike most existing attack methods that focus on deceiving textual defenses, UPAM aims to deceive both textual and visual defenses in T2I models. UP… ▽ More

    Submitted 25 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML2024

    ACM Class: I.2.6

  5. arXiv:2405.08740  [pdf, other

    cs.LG

    Reinformer: Max-Return Sequence Modeling for Offline RL

    Authors: Zifeng Zhuang, Dengyun Peng, **xin Liu, Ziqi Zhang, Donglin Wang

    Abstract: As a data-driven paradigm, offline reinforcement learning (RL) has been formulated as sequence modeling that conditions on the hindsight information including returns, goal or future trajectory. Although promising, this supervised paradigm overlooks the core objective of RL that maximizes the return. This overlook directly leads to the lack of trajectory stitching capability that affects the seque… ▽ More

    Submitted 2 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  6. arXiv:2405.04408  [pdf, other

    cs.CV

    DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

    Authors: Jiaxin Zhang, Dezhi Peng, Chongyu Liu, Peirong Zhang, Lianwen **

    Abstract: Document image restoration is a crucial aspect of Document AI systems, as the quality of document images significantly influences the overall performance. Prevailing methods address distinct restoration tasks independently, leading to intricate systems and the incapability to harness the potential synergies of multi-task learning. To overcome this challenge, we propose DocRes, a generalist model t… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024

  7. arXiv:2404.12567  [pdf

    cs.HC

    Impact of Vibrotactile Triggers on Mental Well-Being through ASMR Experience in VR

    Authors: Danyang Peng, Tanner Person, Ximing Shen, Yun Suen Pai, Giulia Barbareschi, Shengyin Li, Kouta Minamizawa

    Abstract: Watching Autonomous Sensory Meridian Response (ASMR) videos is a popular approach to support mental well-being, as the triggered ASMR tingling sensation supports de-stressing and regulating emotions. Therefore, there is increasing research on how to efficiently trigger ASMR tingling sensation. Tactile sensation remains unexplored because current popular ASMR approaches focus on the visual and audi… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  8. arXiv:2404.07503  [pdf, other

    cs.CL

    Best Practices and Lessons Learned on Synthetic Data for Language Models

    Authors: Ruibo Liu, Jerry Wei, Fangyu Liu, Chenglei Si, Yanzhe Zhang, **meng Rao, Steven Zheng, Daiyi Peng, Diyi Yang, Denny Zhou, Andrew M. Dai

    Abstract: The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs. Synthetic data has emerged as a promising solution by generating artificial data that mimics real-world patterns. This paper provides an overview of synthetic data research, discussing its applications, challeng… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  9. arXiv:2403.19386  [pdf, other

    cs.CV cs.AI

    PointCloud-Text Matching: Benchmark Datasets and a Baseline

    Authors: Yanglin Feng, Yang Qin, Dezhong Peng, Hongyuan Zhu, Xi Peng, Peng Hu

    Abstract: In this paper, we present and study a new instance-level retrieval task: PointCloud-Text Matching~(PTM), which aims to find the exact cross-modal instance that matches a given point-cloud query or text query. PTM could be applied to various scenarios, such as indoor/urban-canyon localization and scene retrieval. However, there exists no suitable and targeted dataset for PTM in practice. Therefore,… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  10. arXiv:2403.18802  [pdf, other

    cs.CL cs.AI cs.LG

    Long-form factuality in large language models

    Authors: Jerry Wei, Chengrun Yang, Xinying Song, Yifeng Lu, Nathan Hu, Jie Huang, Dustin Tran, Daiyi Peng, Ruibo Liu, Da Huang, Cosmo Du, Quoc V. Le

    Abstract: Large language models (LLMs) often generate content that contains factual errors when responding to fact-seeking prompts on open-ended topics. To benchmark a model's long-form factuality in open domains, we first use GPT-4 to generate LongFact, a prompt set comprising thousands of questions spanning 38 topics. We then propose that LLM agents can be used as automated evaluators for long-form factua… ▽ More

    Submitted 3 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  11. arXiv:2403.13761  [pdf, other

    cs.CV

    HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

    Authors: Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen **

    Abstract: Text recognition, especially for complex scripts like Chinese, faces unique challenges due to its intricate character structures and vast vocabulary. Traditional one-hot encoding methods struggle with the representation of hierarchical radicals, recognition of Out-Of-Vocabulary (OOV) characters, and on-device deployment due to their computational intensity. To address these challenges, we propose… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  12. arXiv:2402.14547  [pdf, other

    cs.LG cs.AI cs.CL cs.DB

    OmniPred: Language Models as Universal Regressors

    Authors: Xingyou Song, Oscar Li, Chansoo Lee, Bangding Yang, Daiyi Peng, Sagi Perel, Yutian Chen

    Abstract: Over the broad landscape of experimental design, regression has been a powerful tool to accurately predict the outcome metrics of a system or model given a set of parameters, but has been traditionally restricted to methods which are only applicable to a specific task. In this paper, we propose OmniPred, a framework for training language models as universal end-to-end regressors over $(x,y)$ evalu… ▽ More

    Submitted 4 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 24 pages, 10 figures. Code can be found in https://github.com/google-research/optformer/tree/main/optformer/omnipred

  13. arXiv:2402.08562  [pdf, other

    cs.CL cs.AI

    Higher Layers Need More LoRA Experts

    Authors: Chongyang Gao, Kezhen Chen, **meng Rao, Baochen Sun, Ruibo Liu, Daiyi Peng, Yawen Zhang, Xiaoyuan Guo, Jie Yang, VS Subrahmanian

    Abstract: Parameter-efficient tuning (PEFT) techniques like low-rank adaptation (LoRA) offer training efficiency on Large Language Models, but their impact on model performance remains limited. Recent efforts integrate LoRA and Mixture-of-Experts (MoE) to improve the performance of PEFT methods. Despite promising results, research on improving the efficiency of LoRA with MoE is still in its early stages. Re… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: The code is available at https://github.com/GCYZSL/MoLA

  14. arXiv:2402.06512  [pdf, other

    cs.LG cs.CL

    Multimodal Clinical Trial Outcome Prediction with Large Language Models

    Authors: Wenhao Zheng, Dongsheng Peng, Hongxia Xu, Yun Li, Hongtu Zhu, Tianfan Fu, Huaxiu Yao

    Abstract: The clinical trial is a pivotal and costly process, often spanning multiple years and requiring substantial financial resources. Therefore, the development of clinical trial outcome prediction models aims to exclude drugs likely to fail and holds the potential for significant cost savings. Recent data-driven attempts leverage deep learning methods to integrate multimodal data for predicting clinic… ▽ More

    Submitted 8 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  15. arXiv:2402.02988  [pdf, other

    physics.chem-ph

    Ultrafast Nuclear Dynamics in Double-Core Ionized Water Molecules

    Authors: Iyas Ismail, Ludger Inhester, Tatiana Marchenko, Florian Trinter, Abhishek Verma, Alberto De Fanis, Anthony Ferte, Daniel E. Rivas, Dawei Peng, Dimitris Koulentianos, Edwin Kukk, Francis Penent, Gilles Doumy, Giuseppe Sansone, John D. Bozek, Kai Li, Linda Young, Markus Ilchen, Maria Novella Piancastelli, Michael Meyer, Nicolas Velasquez, Oksana Travnikova, Rebecca Boll, Renaud Guillemin, Reinhard Dorner , et al. (8 additional authors not shown)

    Abstract: Double-core-hole (DCH) states in isolated water and heavy water molecules, resulting from the sequential absorption of two x-ray photons, have been investigated. A comparison of the subsequent Auger emission spectra from the two isotopes provides direct evidence of ultrafast nuclear motion during the 1.5 fs lifetime of these DCH states. Our numerical results align well with the experimental data,… ▽ More

    Submitted 11 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  16. arXiv:2402.00585  [pdf, other

    cs.RO

    SATac: A Thermoluminescence Enabled Tactile Sensor for Concurrent Perception of Temperature, Pressure, and Shear

    Authors: Ziwu Song, Ran Yu, Xuan Zhang, Kit Wa Sou, Shilong Mu, Dengfeng Peng, Xiao-** Zhang, Wenbo Ding

    Abstract: Most vision-based tactile sensors use elastomer deformation to infer tactile information, which can not sense some modalities, like temperature. As an important part of human tactile perception, temperature sensing can help robots better interact with the environment. In this work, we propose a novel multimodal vision-based tactile sensor, SATac, which can simultaneously perceive information of te… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  17. arXiv:2401.12754  [pdf, other

    astro-ph.HE astro-ph.IM physics.ins-det

    Research on the knee region of cosmic ray by using a novel type of electron-neutron detector array

    Authors: Bing-Bing Li, Xin-Hua Ma, Shu-Wang Cui, Hao-Kun Chen, Tian-Lu Chen, Danzengluobu, Wei Gao, Hai-Bing Hu, Denis Kuleshov, Kirill Kurinov, Hu Liu, Mao-Yuan Liu, Ye Liu, Da-Yu Peng, Yao-Hui Qi, Oleg Shchegolev, Yuri Stenkin, Li-Qiao Yin, Heng-Yu Zhang, Liang-Wei Zhang

    Abstract: By accurately measuring composition and energy spectrum of cosmic ray, the origin problem of so called "keen" region (energy > 1 PeV) can be solved. However, up to the present, the results of the spectrum in the knee region obtained by several previous experiments have shown obvious differences, so they cannot give effective evidence for judging the theoretical models on the origin of the knee. Re… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  18. arXiv:2401.07641  [pdf, other

    cs.CV

    SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting

    Authors: Mingxin Huang, Dezhi Peng, Hongliang Li, Zhenghao Peng, Chongyu Liu, Dahua Lin, Yuliang Liu, Xiang Bai, Lianwen **

    Abstract: End-to-end scene text spotting, which aims to read the text in natural images, has garnered significant attention in recent years. However, recent state-of-the-art methods usually incorporate detection and recognition simply by sharing the backbone, which does not directly take advantage of the feature interaction between the two tasks. In this paper, we propose a new end-to-end scene text spottin… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2203.10209

  19. arXiv:2401.03486  [pdf, ps, other

    physics.app-ph physics.optics

    Nanofabrication beyond optical diffraction limit: Optical driven assembly enabled by superlubricity

    Authors: Liu Jiang-tao, Deli Peng, Qin Yang, Ze Liu, Zhenhua Wu

    Abstract: The optical manipulation of nanoparticles on superlubricity surfaces is investigated. The research revealed that, due to the near-zero static friction and extremely low dynamic friction at superlubricity interfaces, the maximum intensity for controlling the optical field can be less than 100 W/cm$^2$, which is nine orders of magnitude lower than controlling nanoparticles on traditional interfaces.… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  20. arXiv:2401.01100  [pdf

    cs.LG

    Scalable manifold learning by uniform landmark sampling and constrained locally linear embedding

    Authors: Dehua Peng, Zhipeng Gui, Wenzhang Wei, Huayi Wu

    Abstract: As a pivotal approach in machine learning and data science, manifold learning aims to uncover the intrinsic low-dimensional structure within complex nonlinear manifolds in high-dimensional space. By exploiting the manifold hypothesis, various techniques for nonlinear dimension reduction have been developed to facilitate visualization, classification, clustering, and gaining key insights. Although… ▽ More

    Submitted 5 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: 33 pages, 10 figures

    ACM Class: I.5.3

  21. arXiv:2401.00422  [pdf

    cs.LG cs.DS

    Interpreting the Curse of Dimensionality from Distance Concentration and Manifold Effect

    Authors: Dehua Peng, Zhipeng Gui, Huayi Wu

    Abstract: The characteristics of data like distribution and heterogeneity, become more complex and counterintuitive as the dimensionality increases. This phenomenon is known as curse of dimensionality, where common patterns and relationships (e.g., internal and boundary pattern) that hold in low-dimensional space may be invalid in higher-dimensional space. It leads to a decreasing performance for the regres… ▽ More

    Submitted 7 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: 17 pages, 11 figures

  22. arXiv:2312.17024  [pdf, other

    cs.DS cs.IT eess.IV eess.SP

    Selective Run-Length Encoding

    Authors: Xutan Peng, Yi Zhang, Dejia Peng, Jiafa Zhu

    Abstract: Run-Length Encoding (RLE) is one of the most fundamental tools in data compression. However, its compression power drops significantly if there lacks consecutive elements in the sequence. In extreme cases, the output of the encoder may require more space than the input (aka size inflation). To alleviate this issue, using combinatorics, we quantify RLE's space savings for a given input distribution… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted at DCC 2024

  23. arXiv:2312.16012  [pdf, other

    cs.CV cs.AI

    Detection-based Intermediate Supervision for Visual Question Answering

    Authors: Yuhang Liu, Daowan Peng, Wei Wei, Yuanyuan Fu, Wenfeng Xie, Dangyang Chen

    Abstract: Recently, neural module networks (NMNs) have yielded ongoing success in answering compositional visual questions, especially those involving multi-hop visual and logical reasoning. NMNs decompose the complex question into several sub-tasks using instance-modules from the reasoning paths of that question and then exploit intermediate supervisions to guide answer prediction, thereby improving infere… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI24

  24. arXiv:2312.12142  [pdf, other

    cs.CV cs.AI

    FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

    Authors: Zhenhua Yang, Dezhi Peng, Yuxin Kong, Yuyi Zhang, Cong Yao, Lianwen **

    Abstract: Automatic font generation is an imitation task, which aims to create a font library that mimics the style of reference images while preserving the content from source images. Although existing font generation methods have achieved satisfactory performance, they still struggle with complex characters and large style variations. To address these issues, we propose FontDiffuser, a diffusion-based ima… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024; Github Page: https://github.com/yeungchenwa/FontDiffuser

    Journal ref: 38th AAAI Conference on Artificial Intelligence (AAAI2024), Vancouver, BC, Canada, 2024

  25. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  26. arXiv:2312.07616  [pdf, other

    stat.ME math.ST stat.AP

    Evaluating the Alignment of a Data Analysis between Analyst and Audience

    Authors: Lucy D'Agostino McGowan, Roger D. Peng, Stephanie C. Hicks

    Abstract: A challenge that data analysts face is building a data analysis that is useful for a given consumer. Previously, we defined a set of principles for describing data analyses that can be used to create a data analysis and to characterize the variation between analyses. Here, we introduce a concept that we call the alignment of a data analysis between the data analyst and a consumer. We define a succ… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  27. arXiv:2312.04067  [pdf

    cs.LG

    MeanCut: A Greedy-Optimized Graph Clustering via Path-based Similarity and Degree Descent Criterion

    Authors: Dehua Peng, Zhipeng Gui, Huayi Wu

    Abstract: As the most typical graph clustering method, spectral clustering is popular and attractive due to the remarkable performance, easy implementation, and strong adaptability. Classical spectral clustering measures the edge weights of graph using pairwise Euclidean-based metric, and solves the optimal graph partition by relaxing the constraints of indicator matrix and performing Laplacian decompositio… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 17 pages, 8 figures, 6 tables

    ACM Class: I.5.3

  28. arXiv:2312.04065  [pdf

    cs.LG

    A Robust and Efficient Boundary Point Detection Method by Measuring Local Direction Dispersion

    Authors: Dehua Peng, Zhipeng Gui, Huayi Wu

    Abstract: Boundary points pose a significant challenge for machine learning tasks, including classification, clustering, and dimensionality reduction. Due to the similarity of features, boundary areas can result in mixed-up classes or clusters, leading to a crowding problem in dimensionality reduction. To address this challenge, numerous boundary point detection methods have been developed, but they are ins… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 11 pages, 6 figures, 3 tables

    ACM Class: I.5.2

  29. arXiv:2312.02694  [pdf, other

    cs.CV

    UPOCR: Towards Unified Pixel-Level OCR Interface

    Authors: Dezhi Peng, Zhenhua Yang, Jiaxin Zhang, Chongyu Liu, Yongxin Shi, Kai Ding, Fengjun Guo, Lianwen **

    Abstract: In recent years, the optical character recognition (OCR) field has been proliferating with plentiful cutting-edge approaches for a wide spectrum of tasks. However, these approaches are task-specifically designed with divergent paradigms, architectures, and training strategies, which significantly increases the complexity of research and maintenance and hinders the fast deployment in applications.… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  30. arXiv:2311.16610  [pdf, other

    cs.HC

    The Empathic Metaverse: An Assistive Bioresponsive Platform For Emotional Experience Sharing

    Authors: Yun Suen Pai, Mark Armstrong, Kinga Skiers, Anish Kundu, Danyang Peng, Yixin Wang, Tamil Selvan Gunasekaran, Chi-Lan Yang, Kouta Minamizawa

    Abstract: The Metaverse is poised to be a future platform that redefines what it means to communicate, socialize, and interact with each other. Yet, it is important for us to consider avoiding the pitfalls of social media platforms we use today; cyberbullying, lack of transparency and an overall false mental model of society. In this paper, we propose the Empathic Metaverse, a virtual platform that prioriti… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 5 pages including references, 4 figures, presented at the Towards an Inclusive and Accessible Metaverse (TIAM) Workshop at CHI 2023

  31. arXiv:2311.09622  [pdf

    cs.RO

    Homography Initialization and Dynamic Weighting Algorithm Based on a Downward-Looking Camera and IMU

    Authors: Bo Dong, Yongkang Tao, Deng Peng, Zhigang Fu

    Abstract: In recent years, the technology in visual-inertial odometry (VIO) has matured considerably and has been widely used in many applications. However, we still encounter challenges when applying VIO to a micro air vehicle (MAV) equipped with a downward-looking camera. Specifically, VIO cannot compute the correct initialization results during take-off and the cumulative drift is large when the MAV is f… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  32. arXiv:2311.08001  [pdf

    cs.SI cs.CL physics.soc-ph

    A Comparative Analysis of the COVID-19 Infodemic in English and Chinese: Insights from Social Media Textual Data

    Authors: Jia Luo, Daiyun Peng, Lei Shi, Didier El Baz, Xinran Liu

    Abstract: The COVID-19 infodemic, characterized by the rapid spread of misinformation and unverified claims related to the pandemic, presents a significant challenge. This paper presents a comparative analysis of the COVID-19 infodemic in the English and Chinese languages, utilizing textual data extracted from social media platforms. To ensure a balanced representation, two infodemic datasets were created b… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Journal ref: Frontiers in Public Health, 2023, 11

  33. arXiv:2311.07353  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Superconductivity in trilayer nickelate La$_4$Ni$_3$O$_{10}$ single crystals

    Authors: Yinghao Zhu, Enkang Zhang, Bingying Pan, Xu Chen, Di Peng, Lixing Chen, Huifen Ren, Feiyang Liu, Nana Li, Zhenfang Xing, Jiyuan Han, Junjie Wang, Donghan Jia, Hongliang Wo, Yiqing Gu, Yimeng Gu, Li Ji, Wenbin Wang, Huiyang Gou, Yao Shen, Tian** Ying, Xiaolong Chen, Wenge Yang, Changlin Zheng, Qiaoshi Zeng , et al. (2 additional authors not shown)

    Abstract: The pursuit of discovering new high-temperature superconductors that diverge from the copper-based paradigm carries profound implications for elucidating mechanisms behind superconductivity and may also enable new applications. Here, our investigation reveals that application of pressure effectively suppresses the spin and charge order in trilayer nickelate La$_4$Ni$_3$O$_{10}$ single crystals, le… ▽ More

    Submitted 2 January, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Updated: Zero Resistance, Meissner Effect, and Detailed Structural Property Measurements

  34. arXiv:2310.17506  [pdf, other

    stat.AP

    Predicting Patient No-Shows in Community Health Clinics: A Case Study in Designing a Data Analytic Product

    Authors: Roger D. Peng

    Abstract: The data science revolution has highlighted the varying roles that data analytic products can play in a different industries and applications. There has been particular interest in using analytic products coupled with algorithmic prediction models to aid in human decision-making. However, detailed descriptions of the decision-making process that leads to the design and development of analytic prod… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  35. arXiv:2310.17468  [pdf, other

    cs.CV cs.LG

    Cross-modal Active Complementary Learning with Self-refining Correspondence

    Authors: Yang Qin, Yuan Sun, Dezhong Peng, Joey Tianyi Zhou, Xi Peng, Peng Hu

    Abstract: Recently, image-text matching has attracted more and more attention from academia and industry, which is fundamental to understanding the latent correspondence across visual and textual modalities. However, most existing methods implicitly assume the training pairs are well-aligned while ignoring the ubiquitous annotation noise, a.k.a noisy correspondence (NC), thereby inevitably leading to a perf… ▽ More

    Submitted 7 January, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: This paper is accepted by NeurIPS 2023

  36. arXiv:2310.16809  [pdf, other

    cs.CV

    Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation

    Authors: Yongxin Shi, Dezhi Peng, Wenhui Liao, Zening Lin, Xinhong Chen, Chongyu Liu, Yuyi Zhang, Lianwen **

    Abstract: This paper presents a comprehensive evaluation of the Optical Character Recognition (OCR) capabilities of the recently released GPT-4V(ision), a Large Multimodal Model (LMM). We assess the model's performance across a range of OCR tasks, including scene text recognition, handwritten text recognition, handwritten mathematical expression recognition, table structure recognition, and information extr… ▽ More

    Submitted 29 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  37. arXiv:2310.11989  [pdf, other

    cs.LG

    Image Clustering with External Guidance

    Authors: Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Jian** Fan, Xi Peng

    Abstract: The core of clustering is incorporating prior knowledge to construct supervision signals. From classic k-means based on data compactness to recent contrastive clustering guided by self-supervision, the evolution of clustering methods intrinsically corresponds to the progression of supervision signals. At present, substantial efforts have been devoted to mining internal supervision signals from dat… ▽ More

    Submitted 16 May, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Journal ref: ICML 2024

  38. arXiv:2310.08269  [pdf, ps, other

    math.GN math.GR

    The Lattice of Group Topologies on a Group

    Authors: Dekui Peng

    Abstract: For an infinite group $G$, the set of group topologies, $\mathcal{L}_G$, forms a complete lattice. It is known that $\mathcal{L}_G$ is modular if $G$ is abelian but the same result does not hold for nilpotent groups. We prove that the lattice $\mathcal{L}_G$ is semi-modular if and only if $\mathcal{L}_{G/Z}$ is semi-modular, where $Z$ is the centre. As a corollary, for every nilpotent group $G$, t… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 11 pages

    MSC Class: 54A10; 20F18

  39. arXiv:2310.06276  [pdf, other

    astro-ph.IM astro-ph.EP

    GPI 2.0: Performance Evaluation of the Wavefront Sensor's EMCCD

    Authors: Clarissa R. Do Ó, Saavidra Perera, Jérôme Maire, Jayke S. Nguyen, Vincent Chambouleyron, Quinn M. Konopacky, Jeffrey Chilcote, Joeleff Fitzsimmons, Randall Hamper, Dan Kerley, Bruce Macintosh, Christian Marois, Fredrik Rantakyrö, Dmitry Savranksy, Jean-Pierre Veran, Guido Agapito, S. Mark Ammons, Marco Bonaglia, Marc-Andre Boucher, Jennifer Dunn, Simone Esposito, Guillaume Filion, Jean Thomas Landry, Olivier Lardiere, Duan Li , et al. (4 additional authors not shown)

    Abstract: The Gemini Planet Imager (GPI) is a high contrast imaging instrument that aims to detect and characterize extrasolar planets. GPI is being upgraded to GPI 2.0, with several subsystems receiving a re-design to improve the instrument's contrast. To enable observations on fainter targets and increase stability on brighter ones, one of the upgrades is to the adaptive optics system. The current Shack-H… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 16 pages, 14 figures. Conference Proceedings for AO4ELT7, held in June 2023 in Avignon, France

  40. arXiv:2309.17065  [pdf, other

    math.GN math.GR math.NT

    Minimality of the inner automorphism group

    Authors: Dekui Peng, Menachem Shlossberg

    Abstract: By [6], a minimal group $G$ is called $z$-minimal if $G/Z(G)$ is minimal. In this paper, we present the $z$-Minimality Criterion for dense subgroups with some applications to topological matrix groups. For a locally compact group $G$, let $\operatorname{Inn}(G)$ be the group of all inner automorphisms of $G,$ endowed with the Birkhoff topology. Using a theorem by Goto [14], we obtain our main resu… ▽ More

    Submitted 28 June, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  41. arXiv:2309.08494  [pdf, other

    stat.ME

    Modeling Data Analytic Iteration With Probabilistic Outcome Sets

    Authors: Roger D. Peng, Stephanie C. Hicks

    Abstract: In 1977 John Tukey described how in exploratory data analysis, data analysts use tools, such as data visualizations, to separate their expectations from what they observe. In contrast to statistical theory, an underappreciated aspect of data analysis is that a data analyst must make decisions by comparing the observed data or output from a statistical tool to what the analyst previously expected f… ▽ More

    Submitted 1 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 30 pages

  42. arXiv:2309.08154  [pdf, other

    cs.CV cs.IR

    Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking

    Authors: Wenzhang Wei, Zhipeng Gui, Changguang Wu, Anqi Zhao, Dehua Peng, Huayi Wu

    Abstract: The core of cross-modal matching is to accurately measure the similarity between different modalities in a unified representation space. However, compared to textual descriptions of a certain perspective, the visual modality has more semantic variations. So, images are usually associated with multiple textual captions in databases. Although popular symmetric embedding methods have explored numerou… ▽ More

    Submitted 20 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  43. Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images

    Authors: Lei Ding, Kun Zhu, Daifeng Peng, Hao Tang, Kuiwu Yang, Lorenzo Bruzzone

    Abstract: Vision Foundation Models (VFMs) such as the Segment Anything Model (SAM) allow zero-shot or interactive segmentation of visual contents, thus they are quickly applied in a variety of visual scenes. However, their direct use in many Remote Sensing (RS) applications is often unsatisfactory due to the special imaging characteristics of RS images. In this work, we aim to utilize the strong visual reco… ▽ More

    Submitted 25 January, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

  44. arXiv:2308.13893  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation via Domain-Adaptive Diffusion

    Authors: Duo Peng, Qiuhong Ke, Yinjie Lei, Jun Liu

    Abstract: Unsupervised Domain Adaptation (UDA) is quite challenging due to the large distribution discrepancy between the source domain and the target domain. Inspired by diffusion models which have strong capability to gradually convert data distributions across a large gap, we consider to explore the diffusion technique to handle the challenging UDA task. However, using diffusion models to convert data di… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 11 pages, 4 figures

  45. arXiv:2308.13241  [pdf, other

    cs.RO cond-mat.mtrl-sci physics.optics

    WSTac: Interactive Surface Perception based on Whisker-Inspired and Self-Illuminated Vision-Based Tactile Sensor

    Authors: Kai Chong Lei, Kit Wa Sou, Wang Sing Chan, Jiayi Yan, Siqi **, Dengfeng Peng, Wenbo Ding, Xiao-** Zhang

    Abstract: Modern Visual-Based Tactile Sensors (VBTSs) use cost-effective cameras to track elastomer deformation, but struggle with ambient light interference. Solutions typically involve using internal LEDs and blocking external light, thus adding complexity. Creating a VBTS resistant to ambient light with just a camera and an elastomer remains a challenge. In this work, we introduce WStac, a self-illuminat… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  46. arXiv:2308.12350  [pdf, other

    cs.CV

    Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation

    Authors: Duo Peng, ** Hu, Qiuhong Ke, Jun Liu

    Abstract: Translating images from a source domain to a target domain for learning target models is one of the most common strategies in domain adaptive semantic segmentation (DASS). However, existing methods still struggle to preserve semantically-consistent local details between the original and translated images. In this work, we present an innovative approach that addresses this challenge by using source… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV2023

  47. arXiv:2308.11164  [pdf, other

    cs.CV

    Decoupled Contrastive Multi-View Clustering with High-Order Random Walks

    Authors: Yiding Lu, Yijie Lin, Mouxing Yang, Dezhong Peng, Peng Hu, Xi Peng

    Abstract: In recent, some robust contrastive multi-view clustering (MvC) methods have been proposed, which construct data pairs from neighborhoods to alleviate the false negative issue, i.e., some intra-cluster samples are wrongly treated as negative pairs. Although promising performance has been achieved by these methods, the false negative issue is still far from addressed and the false positive issue eme… ▽ More

    Submitted 18 January, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted by AAAI 2024

  48. arXiv:2308.10147  [pdf, other

    cs.CV

    ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

    Authors: Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen **

    Abstract: In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-based framework. While previous studies have shown the crucial importance of the intrinsic synergy between text detection and recognition, recent advances in Transformer-based methods usually adopt an implicit synergy strategy with shared query, which can not fully realize the potential of these two interact… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  49. arXiv:2308.09911  [pdf, other

    cs.CV cs.MM

    Noisy-Correspondence Learning for Text-to-Image Person Re-identification

    Authors: Yang Qin, Yingke Chen, Dezhong Peng, Xi Peng, Joey Tianyi Zhou, Peng Hu

    Abstract: Text-to-image person re-identification (TIReID) is a compelling topic in the cross-modal community, which aims to retrieve the target person based on a textual query. Although numerous TIReID methods have been proposed and achieved promising performance, they implicitly assume the training image-text pairs are correctly aligned, which is not always the case in real-world scenarios. In practice, th… ▽ More

    Submitted 28 March, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

  50. arXiv:2308.07405  [pdf, ps, other

    math.CO

    More on Rainbow Cliques in Edge-Colored Graphs

    Authors: Xiao-Chuan Liu, Danni Peng, Xu Yang

    Abstract: In an edge-colored graph $G$, a rainbow clique $K_k$ is a $k$-complete subgraph in which all the edges have distinct colors. Let $e(G)$ and $c(G)$ be the number of edges and colors in $G$, respectively. In this paper, we show that for any $\varepsilon>0$, if $e(G)+c(G) \geq (1+\frac{k-3}{k-2}+2\varepsilon) {n\choose 2}$ and $k\geq 3$, then for sufficiently large $n$, the number of rainbow cliques… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 16pages