Skip to main content

Showing 1–50 of 801 results for author: Yin., Y

.
  1. arXiv:2407.06579  [pdf, other

    cs.CL

    NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification

    Authors: Hongfei Huang, Tingting Liang, Xixi Sun, Zikang **, Yuyu Yin

    Abstract: Existing research on learning with noisy labels predominantly focuses on synthetic label noise. Although synthetic noise possesses well-defined structural properties, it often fails to accurately replicate real-world noise patterns. In recent years, there has been a concerted effort to construct generalizable and controllable instance-dependent noise datasets for image classification, significantl… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 20 pages , 13 figure

  2. arXiv:2407.05676  [pdf, other

    physics.atom-ph physics.app-ph

    Continuous broadband Rydberg receiver using AC Stark shifts and Floquet States

    Authors: Danni Song, Yuechun Jiao, **lian Hu, Yuwen Yin, Zhenhua Li, Yunhui He, **gxu Bai, Jianming Zhao, Suotang Jia

    Abstract: We demonstrate the continuous broadband microwave receivers based on AC Stark shifts and Floquet States of Rydberg levels in a cesium atomic vapor cell. The resonant transition frequency of two adjacent Rydberg states 78$S_{1/2}$ and 78$P_{1/2}$ is tuned based on AC Stark effect of 70~MHz Radio frequency (RF) field that is applied outside the vapor cell. Meanwhile, the Rydberg states also exhibit… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures

  3. arXiv:2407.03363  [pdf, other

    math.NA

    A novel direct imaging method for passive inverse obstacle scattering problem

    Authors: Yunwen Yin, Liang Yan

    Abstract: This paper investigates the inverse scattering problem of recovering a sound-soft obstacle using passive measurements taken from randomly distributed point sources. The randomness introduced by these sources poses significant challenges, leading to the failure of classical direct sampling methods that rely on scattered field measurements. To address this issue, we introduce the Doubly Cross-Correl… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2407.03000  [pdf, other

    cs.CL cs.CV

    VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values

    Authors: Zhe Hu, Yixiao Ren, **g Li, Yu Yin

    Abstract: This paper introduces VIVA, a benchmark for VIsion-grounded decision-making driven by human VAlues. While most large vision-language models (VLMs) focus on physical-level skills, our work is the first to examine their multimodal capabilities in leveraging human values to make decisions under a vision-depicted situation. VIVA contains 1,062 images depicting diverse real-world situations and the man… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  5. arXiv:2407.02883  [pdf, other

    cs.IR cs.CL

    CoIR: A Comprehensive Benchmark for Code Information Retrieval Models

    Authors: Xiangyang Li, Kuicai Dong, Yi Quan Lee, Wei Xia, Yichun Yin, Hao Zhang, Yong Liu, Yasheng Wang, Ruiming Tang

    Abstract: Despite the substantial success of Information Retrieval (IR) in various NLP tasks, most IR systems predominantly handle queries and corpora in natural language, neglecting the domain of code retrieval. Code retrieval is critically important yet remains under-explored, with existing methods and benchmarks inadequately representing the diversity of code in various domains and tasks. Addressing this… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  6. arXiv:2407.01008  [pdf

    physics.optics

    Periodic domain inversion in single crystal barium titanate-on-insulator thin film

    Authors: Pragati Aashna, Hong-Lin Lin, Yu Cao, Yuhui Yin, Yuan Gao, Sakthi Sanjeev Mohanraj, Di Zhu, Aaron Danner

    Abstract: We report experimentally achieving first-ever electric field periodic poling of single crystal barium titanate (BTO, or BaTiO3) thin film on insulator. Owing to the outstanding optical nonlinearities of BTO, this result is a key step towards achieving quasi-phase-matching in BTO. We first grow the BTO thin film on a dysprosium scandate substrate using pulsed laser deposition with a thin layer of s… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  7. arXiv:2406.19643  [pdf, other

    cs.CL cs.AI

    Unlocking Varied Perspectives: A Persona-Based Multi-Agent Framework with Debate-Driven Text Planning for Argument Generation

    Authors: Zhe Hu, Hou Pong Chan, **g Li, Yu Yin

    Abstract: Writing persuasive arguments is a challenging task for both humans and machines. It entails incorporating high-level beliefs from various perspectives on the topic, along with deliberate reasoning and planning to construct a coherent narrative. Current language models often generate surface tokens autoregressively, lacking explicit integration of these underlying controls, resulting in limited out… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  8. arXiv:2406.18832  [pdf, other

    cs.CL

    OutlierTune: Efficient Channel-Wise Quantization for Large Language Models

    Authors: **guang Wang, Yuexi Yin, Haifeng Sun, Qi Qi, **gyu Wang, Zirui Zhuang, Tingting Yang, Jianxin Liao

    Abstract: Quantizing the activations of large language models (LLMs) has been a significant challenge due to the presence of structured outliers. Most existing methods focus on the per-token or per-tensor quantization of activations, making it difficult to achieve both accuracy and hardware efficiency. To address this problem, we propose OutlierTune, an efficient per-channel post-training quantization (PTQ)… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  9. arXiv:2406.18770  [pdf, other

    cs.LG

    ADO-LLM: Analog Design Bayesian Optimization with In-Context Learning of Large Language Models

    Authors: Yuxuan Yin, Yu Wang, Boxun Xu, Peng Li

    Abstract: Analog circuit design requires substantial human expertise and involvement, which is a significant roadblock to design productivity. Bayesian Optimization (BO), a popular machine learning based optimization strategy, has been leveraged to automate analog design given its applicability across various circuit topologies and technologies. Traditional BO methods employ black box Gaussian Process surro… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures

  10. arXiv:2406.18536  [pdf, other

    eess.SY cs.AI cs.AR

    Reliable Interval Prediction of Minimum Operating Voltage Based on On-chip Monitors via Conformalized Quantile Regression

    Authors: Yuxuan Yin, Xiaoxiao Wang, Rebecca Chen, Chen He, Peng Li

    Abstract: Predicting the minimum operating voltage ($V_{min}$) of chips is one of the important techniques for improving the manufacturing testing flow, as well as ensuring the long-term reliability and safety of in-field systems. Current $V_{min}$ prediction methods often provide only point estimates, necessitating additional techniques for constructing prediction confidence intervals to cover uncertaintie… ▽ More

    Submitted 3 May, 2024; originally announced June 2024.

    Comments: Accepted by DATE 2024. Camera-ready version

  11. arXiv:2406.16505  [pdf, other

    q-fin.CP cs.AI

    $\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning

    Authors: Feng Xu, Yan Yin, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Zongzhang Zhang

    Abstract: Alphas are pivotal in providing signals for quantitative trading. The industry highly values the discovery of formulaic alphas for their interpretability and ease of analysis, compared with the expressive yet overfitting-prone black-box alphas. In this work, we focus on discovering formulaic alphas. Prior studies on automatically generating a collection of formulaic alphas were mostly based on gen… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  12. arXiv:2406.16441  [pdf, other

    cs.CL

    UniCoder: Scaling Code Large Language Model via Universal Code

    Authors: Tao Sun, Linzheng Chai, Jian Yang, Yuwei Yin, Hongcheng Guo, Jiaheng Liu, Bing Wang, Liqun Yang, Zhoujun Li

    Abstract: Intermediate reasoning or acting steps have successfully improved large language models (LLMs) for handling various downstream natural language processing (NLP) tasks. When applying LLMs for code generation, recent works mainly focus on directing the models to articulate intermediate natural-language reasoning steps, as in chain-of-thought (CoT) prompting, and then output code with the natural lan… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 (Main)

  13. PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials

    Authors: Wenmiao Hu, Yichen Zhang, Yuxuan Liang, Xian**g Han, Yifang Yin, Hannes Kruppa, See-Kiong Ng, Roger Zimmermann

    Abstract: Satellite-based street-view information extraction by cross-view matching refers to a task that extracts the location and orientation information of a given street-view image query by using one or multiple geo-referenced satellite images. Recent work has initiated a new research direction to find accurate information within a local area covered by one satellite image centered at a location prior (… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by ACM Multimedia 2023. This version contains additional supplementary materials

    Journal ref: Proceedings of the 31st ACM International Conference on Multimedia (2023) 56-66

  14. arXiv:2406.13157  [pdf, other

    physics.atom-ph physics.comp-ph

    Genetics-based deperturbation analysis for the spin-orbit coupled ${\rm A}^1Σ^+$ and ${\rm b}^3Π_{0^+}$ states of LiRb

    Authors: Yide Yin, Xuhui Bai, Xuechun Li, Xin-Yu Luo, Jie Yu, Gaoren Wang, Yongchang Han

    Abstract: We present a deperturbation analysis of the spin-orbit coupled $\rm A^1Σ^+$ and $\rm b^3Π_{0^+}$ states of LiRb based on the rovibrational energy levels observed previously by photoassociation spectroscopy in bosonic $^7$Li$^{85}$Rb molecule. Using the genetic algorithm, we fit the potential energy curves of the $\rm A^1Σ^+$ state and the $\rm b^3Π$ state into point-wise form. We then fit these po… ▽ More

    Submitted 4 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 figures

  15. arXiv:2406.10671  [pdf

    cs.CL

    Augmenting Biomedical Named Entity Recognition with General-domain Resources

    Authors: Yu Yin, Hyunjae Kim, Xiao Xiao, Chih Hsuan Wei, Jaewoo Kang, Zhiyong Lu, Hua Xu, Meng Fang, Qingyu Chen

    Abstract: Training a neural network-based biomedical named entity recognition (BioNER) model usually requires extensive and costly human annotations. While several studies have employed multi-task learning with multiple BioNER datasets to reduce human effort, this approach does not consistently yield performance improvements and may introduce label ambiguity in different biomedical corpora. We aim to tackle… ▽ More

    Submitted 18 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: We make data, codes, and models publicly available via https://github.com/qingyu-qc/bioner_gerbera

  16. arXiv:2406.08756  [pdf, other

    cs.DC cs.LG

    Optimizing Large Model Training through Overlapped Activation Recomputation

    Authors: ** Chen, Wenjie Zhang, Shuibing He, Yingjie Gu, Zhuwei Peng, Kexin Huang, Xuan Zhan, Weijian Chen, Yi Zheng, Zhefeng Wang, Yanlong Yin, Gang Chen

    Abstract: Large model training has been using recomputation to alleviate the memory pressure and pipelining to exploit the parallelism of data, tensor, and devices. The existing recomputation approaches may incur up to 40% overhead when training real-world models, e.g., the GPT model with 22B parameters. This is because they are executed on demand in the critical training path. In this paper, we design a ne… ▽ More

    Submitted 27 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 13 pages

  17. arXiv:2406.07436  [pdf, other

    cs.PL

    McEval: Massively Multilingual Code Evaluation

    Authors: Linzheng Chai, Shukai Liu, Jian Yang, Yuwei Yin, Ke **, Jiaheng Liu, Tao Sun, Ge Zhang, Changyu Ren, Hongcheng Guo, Zekun Wang, Boyang Wang, Xianjie Wu, Bing Wang, Tongliang Li, Liqun Yang, Sufeng Duan, Zhoujun Li

    Abstract: Code large language models (LLMs) have shown remarkable advances in code understanding, completion, and generation tasks. Programming benchmarks, comprised of a selection of code challenges and corresponding test cases, serve as a standard to evaluate the capability of different LLMs in such tasks. However, most existing benchmarks primarily focus on Python and are still restricted to a limited nu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 22 pages

  18. arXiv:2406.06382  [pdf, other

    cs.CV cs.CL cs.LG

    Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization

    Authors: Yi Gu, Zhendong Wang, Yueqin Yin, Yujia Xie, Mingyuan Zhou

    Abstract: Aligning large language models with human preferences has emerged as a critical focus in language modeling research. Yet, integrating preference learning into Text-to-Image (T2I) generative models is still relatively uncharted territory. The Diffusion-DPO technique made initial strides by employing pairwise preference learning in diffusion models tailored for specific text prompts. We introduce Di… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  19. arXiv:2406.06279  [pdf, other

    cs.CL

    Multi-Prompting Decoder Helps Better Language Understanding

    Authors: Zifeng Cheng, Zhaoling Chen, Zhiwei Jiang, Yafeng Yin, Shi** Ge, Yuliang Liu, Qing Gu

    Abstract: Recent Pre-trained Language Models (PLMs) usually only provide users with the inference APIs, namely the emerging Model-as-a-Service (MaaS) setting. To adapt MaaS PLMs to downstream tasks without accessing their parameters and gradients, some existing methods focus on the output-side adaptation of PLMs, viewing the PLM as an encoder and then optimizing a task-specific decoder for decoding the outp… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  20. arXiv:2406.05058  [pdf, other

    q-bio.PE

    Accurate stochastic simulation algorithm for multiscale models of infectious diseases

    Authors: Yuan Yin, Jennifer A. Flegg, Mark B. Flegg

    Abstract: In the infectious disease literature, significant effort has been devoted to studying dynamics at a single scale. For example, compartmental models describing population-level dynamics are often formulated using differential equations. In cases where small numbers or noise play a crucial role, these differential equations are replaced with memoryless Markovian models, where discrete individuals ca… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 23 pages, 7 figures

  21. arXiv:2406.01441  [pdf, other

    cs.CL

    LexMatcher: Dictionary-centric Data Collection for LLM-based Machine Translation

    Authors: Yong**g Yin, Jiali Zeng, Yafu Li, Fandong Meng, Yue Zhang

    Abstract: The fine-tuning of open-source large language models (LLMs) for machine translation has recently received considerable attention, marking a shift towards data-centric research from traditional neural machine translation. However, the area of data collection for instruction fine-tuning in machine translation remains relatively underexplored. In this paper, we present LexMatcher, a simple yet effect… ▽ More

    Submitted 2 July, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  22. arXiv:2405.20830  [pdf, other

    cs.CL cs.LG

    Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment

    Authors: Yueqin Yin, Zhendong Wang, Yujia Xie, Weizhu Chen, Mingyuan Zhou

    Abstract: Traditional language model alignment methods, such as Direct Preference Optimization (DPO), are limited by their dependence on static, pre-collected paired preference data, which hampers their adaptability and practical applicability. To overcome this limitation, we introduce Self-Augmented Preference Optimization (SAPO), an effective and scalable training paradigm that does not require existing p… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  23. arXiv:2405.19088  [pdf, other

    cs.CL cs.CV

    Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions

    Authors: Zhe Hu, Tuo Liang, **g Li, Yiren Lu, Yunlai Zhou, Yiran Qiao, **g Ma, Yu Yin

    Abstract: Recent advancements in large multimodal language models have demonstrated remarkable proficiency across a wide range of tasks. Yet, these models still struggle with understanding the nuances of human humor through juxtaposition, particularly when it involves nonlinear narratives that underpin many jokes and humor cues. This paper investigates this challenge by focusing on comics with contradictory… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  24. arXiv:2405.18855  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Electric Field Control of Molecular Charge State in a Single-Component 2D Organic Nanoarray

    Authors: Dhaneesh Kumar, Cornelius Krull, Yuefeng Yin, Nikhil V. Medhekar, Agustin Schiffrin

    Abstract: Quantum dots (QD) with electric-field-controlled charge state are promising for electronics applications, e.g., digital information storage, single-electron transistors and quantum computing. Inorganic QDs consisting of semiconductor nanostructures or heterostructures often offer limited control on size and composition distribution, as well as low potential for scalability and/or nanoscale miniatu… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  25. arXiv:2405.17532  [pdf, other

    cs.CV

    ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance

    Authors: Jiannan Huang, Jun Hao Liew, Hanshu Yan, Yuyang Yin, Yao Zhao, Yunchao Wei

    Abstract: Recent text-to-image customization works have been proven successful in generating images of given concepts by fine-tuning the diffusion models on a few examples. However, these methods tend to overfit the concepts, resulting in failure to create the concept under multiple conditions (e.g. headphone is missing when generating a <sks> dog wearing a headphone'). Interestingly, we notice that the bas… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  26. arXiv:2405.16645  [pdf, other

    cs.CV

    Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models

    Authors: Hanwen Liang, Yuyang Yin, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei

    Abstract: The availability of large-scale multimodal datasets and advancements in diffusion models have significantly accelerated progress in 4D content generation. Most prior approaches rely on multiple image or video diffusion models, utilizing score distillation sampling for optimization or generating pseudo novel views for direct supervision. However, these methods are hindered by slow optimization spee… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Project page: https://vita-group.github.io/Diffusion4D

  27. arXiv:2405.16093  [pdf, other

    cs.CV

    Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch

    Authors: Qikai Wang, Rundong He, Yongshun Gong, Chunxiao Ren, Haoliang Sun, Xiaoshui Huang, Yilong Yin

    Abstract: Semi-supervised learning can significantly boost model performance by leveraging unlabeled data, particularly when labeled data is scarce. However, real-world unlabeled data often contain unseen-class samples, which can hinder the classification of seen classes. To address this issue, mainstream safe SSL methods suggest detecting and discarding unseen-class samples from unlabeled data. Nevertheles… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  28. arXiv:2405.14588  [pdf, other

    astro-ph.HE

    A Study of the Spectral properties of Gamma-Ray Bursts with the Precursors and Main bursts

    Authors: Hui-Ying Deng, Zhao-Yang Peng, Jia-Ming Chen, Yue Yin, Ting Li

    Abstract: There is no consensus yet on whether the precursor and the main burst of gamma-ray bursts (GRBs) have the same origin, and their jet composition is still unclear. In order to further investigate this issue, we systematically search 21 Fermi GRBs with both precursor and main burst for spectral analysis. We first perform Bayesian time-resolved spectral analysis and find that almost all the precursor… ▽ More

    Submitted 23 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 36 pages,13 figures. Accepted for publication in ApJ

  29. arXiv:2405.12969  [pdf, other

    cs.LG

    Can We Treat Noisy Labels as Accurate?

    Authors: Yuxiang Zheng, Zhongyi Han, Yilong Yin, Xin Gao, Tongliang Liu

    Abstract: Noisy labels significantly hinder the accuracy and generalization of machine learning models, particularly due to ambiguous instance features. Traditional techniques that attempt to correct noisy labels directly, such as those using transition matrices, often fail to address the inherent complexities of the problem sufficiently. In this paper, we introduce EchoAlign, a transformative paradigm shif… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 10 pages

  30. arXiv:2405.12788  [pdf, other

    cs.CL

    What Have We Achieved on Non-autoregressive Translation?

    Authors: Yafu Li, Huajian Zhang, Jianhao Yan, Yong**g Yin, Yue Zhang

    Abstract: Recent advances have made non-autoregressive (NAT) translation comparable to autoregressive methods (AT). However, their evaluation using BLEU has been shown to weakly correlate with human annotations. Limited research compares non-autoregressive translation and autoregressive translation comprehensively, leaving uncertainty about the true proximity of NAT to AT. To address this gap, we systematic… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings

  31. arXiv:2405.12452  [pdf, other

    cs.LG cs.AI

    Prompt-Enhanced Spatio-Temporal Graph Transfer Learning

    Authors: Junfeng Hu, Xu Liu, Zhencheng Fan, Yifang Yin, Shili Xiang, Savitha Ramasamy, Roger Zimmermann

    Abstract: Spatio-temporal graph neural networks have demonstrated efficacy in capturing complex dependencies for urban computing tasks such as forecasting and kriging. However, their performance is constrained by the reliance on extensive data for training on specific tasks, which limits their adaptability to new urban domains with varied demands. Although transfer learning has been proposed to address this… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  32. arXiv:2405.11895  [pdf, other

    cs.LG eess.SY

    Sparse Attention-driven Quality Prediction for Production Process Optimization in Digital Twins

    Authors: Yanlei Yin, Lihua Wang, Wenbo Wang, Dinh Thai Hoang

    Abstract: In the process industry, optimizing production lines for long-term efficiency requires real-time monitoring and analysis of operation states to fine-tune production line parameters. However, the complexity in operational logic and the intricate coupling of production process parameters make it difficult to develop an accurate mathematical model for the entire process, thus hindering the deployment… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  33. arXiv:2405.11420  [pdf, ps, other

    cond-mat.mes-hall cond-mat.str-el

    Generic Approach to Intrinsic Magnetic Second-order Topological Insulators via Inverted $p-d$ Orbitals

    Authors: Zhao Liu, Bing Liu, Yuefeng Yin, Nikhil V. Medhekar

    Abstract: The integration of intrinsically magnetic and topologically nontrivial two-dimensional materials holds tantalizing prospects for the exotic quantum anomalous Hall insulators and magnetic second-order topological insulators (SOTIs). Compared with the well-studied nonmagnetic counterparts, the pursuit of intrinsic magnetic SOTIs remains limited. In this work, we address this gap by focusing on… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 4 figures, all comments are welcomed !

  34. arXiv:2405.07554  [pdf

    physics.optics

    Shape Measurement of Single Gold Nanorods in Water Using Open-access Optical Microcavities

    Authors: Yumeng Yin, Aurelien Trichet, Jiangrui Qian, Jason Smith

    Abstract: Shape measurement of rod-shaped particles in fluids is an outstanding challenge with applications in characterising synthetic functional nanoparticles and in early warning detection of rod-shaped pathogens in water supplies. However, it is challenging to achieve accurate and real-time measurements at a single particle scale in solution with existing methods. Here we introduce a novel technique to… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  35. arXiv:2405.03349  [pdf, other

    cs.CV

    Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement

    Authors: Jiesong Bai, Yuhao Yin, Qiyuan He, Yuanxian Li, Xiaofeng Zhang

    Abstract: In the field of low-light image enhancement, both traditional Retinex methods and advanced deep learning techniques such as Retinexformer have shown distinct advantages and limitations. Traditional Retinex methods, designed to mimic the human eye's perception of brightness and color, decompose images into illumination and reflection components but struggle with noise management and detail preserva… ▽ More

    Submitted 19 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  36. arXiv:2405.02572  [pdf, other

    cs.LG cs.AI

    Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline

    Authors: Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan

    Abstract: Policy-based methods have achieved remarkable success in solving challenging reinforcement learning problems. Among these methods, off-policy policy gradient methods are particularly important due to that they can benefit from off-policy data. However, these methods suffer from the high variance of the off-policy policy gradient (OPPG) estimator, which results in poor sample efficiency during trai… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 12 pages, 3 figures

  37. arXiv:2404.18961  [pdf, other

    cs.LG cs.AI cs.CV

    Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras

    Authors: Jun Yu, Yutong Dai, Xiaokang Liu, ** Huang, Yishan Shen, Ke Zhang, Rong Zhou, Eashan Adhikarla, Wenxuan Ye, Yixin Liu, Zhaoming Kong, Kai Zhang, Yilong Yin, Vinod Namboodiri, Brian D. Davison, Jason H. Moore, Yong Chen

    Abstract: MTL is a learning paradigm that effectively leverages both task-specific and shared information to address multiple related tasks simultaneously. In contrast to STL, MTL offers a suite of benefits that enhance both the training process and the inference efficiency. MTL's key advantages encompass streamlined model architecture, performance enhancement, and cross-domain generalizability. Over the pa… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 60 figures, 116 pages, 500+ references

  38. arXiv:2404.18155  [pdf, other

    cs.CV

    ShapeMoiré: Channel-Wise Shape-Guided Network for Image Demoiréing

    Authors: **ming Cao, Sicheng Shen, Qiu Zhou, Yifang Yin, Yangyan Li, Roger Zimmermann

    Abstract: Photographing optoelectronic displays often introduces unwanted moiré patterns due to analog signal interference between the pixel grids of the display and the camera sensor arrays. This work identifies two problems that are largely ignored by existing image demoiréing approaches: 1) moiré patterns vary across different channels (RGB); 2) repetitive patterns are constantly observed. However, emplo… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 12 pages

  39. arXiv:2404.16425  [pdf, other

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  40. arXiv:2404.16263  [pdf, other

    astro-ph.HE

    New Timing Results of MSPs from NICER Observations

    Authors: Shijie Zheng, Dawei Han, Heng Xu, Kejia Lee, Jian** Yuan, Haoxi Wang, Mingyu Ge, Liang Zhang, Yongye Li, Yitao Yin, Xiang Ma, Yong Chen, Shuangnan Zhang

    Abstract: Millisecond pulsars (MSPs) are known for their long-term stability. Using six years of observations from the Neutron Star Interior Composition Explorer (NICER), we have conducted an in-depth analysis of the X-ray timing results for six MSPs: PSRs B1937+21, B1821$-$24, J0437$-$4715, J0030+0451, J0218+4232, and J2124$-$3358. The timing stability parameter $σ_z$ has been calculated, revealing remarka… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  41. arXiv:2404.12180  [pdf, other

    cond-mat.quant-gas physics.atom-ph

    Microwave seeding time crystal in Floquet driven Rydberg atoms

    Authors: Bang Liu, Li-Hua Zhang, Yu Ma, Tian-Yu Han, Qi-Feng Wang, Jun Zhang, Zheng-Yuan Zhang, Shi-Yao Shao, Qing Li, Han-Chao Chen, Ya-Jun Wang, Jia-Dou Nan, Yi-Ming Yin, Dong-Sheng Ding, Bao-Sen Shi

    Abstract: Crystal seeding enables a deeper understanding of phase behavior, leading to the development of methods for controlling and manipulating phase transitions in various applications such as materials synthesis, crystallization processes, and phase transformation engineering. How to seed a crystalline in time domain is an open question, which is of great significant and may provide an avenue to unders… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  42. arXiv:2404.10096  [pdf, other

    cs.CV cs.AI

    Vision Augmentation Prediction Autoencoder with Attention Design (VAPAAD)

    Authors: Yiqiao Yin

    Abstract: Recent advancements in sequence prediction have significantly improved the accuracy of video data interpretation; however, existing models often overlook the potential of attention-based mechanisms for next-frame prediction. This study introduces the Vision Augmentation Prediction Autoencoder with Attention Design (VAPAAD), an innovative approach that integrates attention mechanisms into sequence… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures

  43. arXiv:2404.04668  [pdf, ps, other

    cs.DS math.PR

    Spectral Independence Beyond Total Influence on Trees and Related Graphs

    Authors: Xiaoyu Chen, Xiongxin Yang, Yitong Yin, Xinyuan Zhang

    Abstract: We study how to establish $\textit{spectral independence}$, a key concept in sampling, without relying on total influence bounds, by applying an $\textit{approximate inverse}$ of the influence matrix. Our method gives constant upper bounds on spectral independence for two foundational Gibbs distributions known to have unbounded total influences: $\bullet$ The monomer-dimer model on graphs with l… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  44. arXiv:2404.01157  [pdf, other

    cs.CL cs.PF

    Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training

    Authors: Vivian Liu, Yiqiao Yin

    Abstract: Prominent works in the field of Natural Language Processing have long attempted to create new innovative models by improving upon previous model training approaches, altering model architecture, and develo** more in-depth datasets to better their performance. However, with the quickly advancing field of NLP comes increased greenhouse gas emissions, posing concerns over the environmental damage c… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  45. arXiv:2404.00323  [pdf, other

    cs.CV cs.LG

    CLIP-driven Outliers Synthesis for few-shot OOD detection

    Authors: Hao Sun, Rundong He, Zhongyi Han, Zhicong Lin, Yongshun Gong, Yilong Yin

    Abstract: Few-shot OOD detection focuses on recognizing out-of-distribution (OOD) images that belong to classes unseen during training, with the use of only a small number of labeled in-distribution (ID) images. Up to now, a mainstream strategy is based on large-scale vision-language models, such as CLIP. However, these methods overlook a crucial issue: the lack of reliable OOD supervision information, whic… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 9 pages,5 figures

  46. arXiv:2403.19899  [pdf, other

    cs.IR

    Inclusive Design Insights from a Preliminary Image-Based Conversational Search Systems Evaluation

    Authors: Yue Zheng, Lei Yu, Junmian Chen, Tianyu Xia, Yuanyuan Yin, Shan Wang, Haiming Liu

    Abstract: The digital realm has witnessed the rise of various search modalities, among which the Image-Based Conversational Search System stands out. This research delves into the design, implementation, and evaluation of this specific system, juxtaposing it against its text-based and mixed counterparts. A diverse participant cohort ensures a broad evaluation spectrum. Advanced tools facilitate emotion anal… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  47. arXiv:2403.19470  [pdf, other

    math.NA cs.LG eess.SP

    Deep decomposition method for the limited aperture inverse obstacle scattering problem

    Authors: Yunwen Yin, Liang Yan

    Abstract: In this paper, we consider a deep learning approach to the limited aperture inverse obstacle scattering problem. It is well known that traditional deep learning relies solely on data, which may limit its performance for the inverse problem when only indirect observation data and a physical model are available. A fundamental question arises in light of these limitations: is it possible to enable de… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  48. arXiv:2403.17556  [pdf, other

    cs.CL cs.AI

    m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt

    Authors: Jian Yang, Hongcheng Guo, Yuwei Yin, Jiaqi Bai, Bing Wang, Jiaheng Liu, Xinnian Liang, Linzheng Cahi, Liqun Yang, Zhoujun Li

    Abstract: Multilingual translation supports multiple translation directions by projecting all languages in a shared space, but the translation quality is undermined by the difference between languages in the text-only modality, especially when the number of languages is large. To bridge this gap, we introduce visual context as the universal language-independent representation to facilitate multilingual tran… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: COLING 2024

  49. arXiv:2403.17507  [pdf, other

    cs.LG physics.chem-ph

    EL-MLFFs: Ensemble Learning of Machine Leaning Force Fields

    Authors: Bangchen Yin, Yue Yin, Yuda W. Tang, Hai Xiao

    Abstract: Machine learning force fields (MLFFs) have emerged as a promising approach to bridge the accuracy of quantum mechanical methods and the efficiency of classical force fields. However, the abundance of MLFF models and the challenge of accurately predicting atomic forces pose significant obstacles in their practical application. In this paper, we propose a novel ensemble learning framework, EL-MLFFs,… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 3 figures

  50. arXiv:2403.17148  [pdf, ps, other

    hep-th

    Low spin solutions of Higher Spin Gravity: BPST instanton

    Authors: Evgeny Skvortsov, Yihao Yin

    Abstract: Higher spin gravities do not have a low energy limit where higher-spin fields decouple from gravity. Nevertheless, it is possible to construct fine-tuned exact solutions that activate low-spin fields without sourcing the higher-spin fields. We show that BPST (Belavin-Polyakov-Schwartz-Tyupkin) instanton is an exact solution of Chiral Higher Spin Gravity, i.e. it is also a solution of the holograph… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 33 pages