Showing 151–200 of 861 results for author: Dai, Y

  1. arXiv:2309.09426  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG eess.SP

    Joint Demosaicing and Denoising with Double Deep Image Priors

    Authors: Taihui Li, Anish Lahiri, Yutong Dai, Owen Mayer

    Abstract: Demosaicing and denoising of RAW images are crucial steps in the processing pipeline of modern digital cameras. As only a third of the color information required to produce a digital image is captured by the camera sensor, the process of demosaicing is inherently ill-posed. The presence of noise further exacerbates this problem. Performing these two steps sequentially may distort the content of th…

    Submitted 17 September, 2023; originally announced September 2023.

  2. arXiv:2309.08622  [pdf, other]

    cs.IR cs.AI

    Representation Learning in Low-rank Slate-based Recommender Systems

    Authors: Yijia Dai, Wen Sun

    Abstract: Reinforcement learning (RL) in recommendation systems offers the potential to optimize recommendations for long-term user engagement. However, the environment often involves large state and action spaces, which makes it hard to efficiently learn and explore. In this work, we propose a sample-efficient representation learning algorithm, using the standard slate recommendation setup, to treat this a…

    Submitted 18 September, 2023; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: in MFPL, ICML 2023

  3. arXiv:2309.08348  [pdf, other]

    eess.AS cs.SD

    The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

    Authors: Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao

    Abstract: Previous Multimodal Information based Speech Processing (MISP) challenges mainly focused on audio-visual speech recognition (AVSR) with commendable success. However, the most advanced back-end recognition systems often hit performance limits due to the complex acoustic environments. This has prompted a shift in focus towards the Audio-Visual Target Speaker Extraction (AVTSE) task for the MISP 2023…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 5 pages, 4 figures

  4. arXiv:2309.04953  [pdf, ps, other]

    cond-mat.stat-mech

    Extracting the number of type-B Goldstone modes and the dynamical critical exponent for a type of scale-invariant states

    Authors: Huan-Qiang Zhou, Yan-Wei Dai, Qian-Qian Shi, Ian P. McCulloch, Murray T. Batchelor

    Abstract: A generic scheme is proposed to perform a finite-entanglement scaling analysis for scale-invariant states, which appear to be highly degenerate ground states arising from spontaneous symmetry breaking with type-B Goldstone modes. This allows us to extract the number of type-B Goldstone modes and the dynamical critical exponent, in combination with a finite block-size scaling analysis, from numeric…

    Submitted 30 November, 2023; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: 14 pages, 24 figures, 11 tables. Minor changes

  5. arXiv:2309.03559  [pdf, other]

    cs.CL

    An Anchor Learning Approach for Citation Field Learning

    Authors: Zilin Yuan, Borun Chen, Yimeng Dai, Yinghui Li, Hai-Tao Zheng, Rui Zhang

    Abstract: Citation field learning is to segment a citation string into fields of interest such as author, title, and venue. Extracting such fields from citations is crucial for citation indexing, researcher profile analysis, etc. User-generated resources like academic homepages and Curriculum Vitae provide rich citation field information. However, extracting fields from these resources is challenging due t…

    Submitted 14 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: accepted by ICASSP2024

  6. arXiv:2309.03490  [pdf, other]

    math.PR

    Lipschitz Transport Maps via the Follmer Flow

    Authors: Yin Dai, Yuan Gao, Jian Huang, Yuling Jiao, Lican Kang, Jin Liu

    Abstract: Inspired by the construction of the Föllmer process, we construct a unit-time flow on the Euclidean space, termed the Föllmer flow, whose flow map at time 1 pushes forward a standard Gaussian measure onto a general target measure. We study the well-posedness of the Föllmer flow and establish the Lipschitz property of the flow map at time 1. We apply the Lipschitz mapping to several rich clas…

    Submitted 7 September, 2023; originally announced September 2023.

  7. arXiv:2309.03126  [pdf, other]

    cs.CL

    Everyone Deserves A Reward: Learning Customized Human Preferences

    Authors: Pengyu Cheng, Jiawen Xie, Ke Bai, Yong Dai, Nan Du

    Abstract: Reward models (RMs) are essential for aligning large language models (LLMs) with human preferences to improve interaction quality. However, the real world is pluralistic, which leads to diversified human preferences with respect to different religions, politics, cultures, etc. Moreover, each individual can have their unique preferences on various topics. Neglecting the diversity of human preferenc…

    Submitted 15 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  8. arXiv:2309.02043  [pdf, other]

    cs.CV

    Decomposed Guided Dynamic Filters for Efficient RGB-Guided Depth Completion

    Authors: Yufei Wang, Yuxin Mao, Qi Liu, Yuchao Dai

    Abstract: RGB-guided depth completion aims at predicting dense depth maps from sparse depth measurements and corresponding RGB images, where how to effectively and efficiently exploit the multi-modal information is a key issue. Guided dynamic filters, which generate spatially-variant depth-wise separable convolutional filters from RGB features to guide depth features, have been proven to be effective in thi…

    Submitted 5 September, 2023; originally announced September 2023.

  9. arXiv:2308.14032  [pdf, ps, other]

    hep-ph

    $ρ$-meson longitudinal leading-twist distribution amplitude revisited and the $D\to ρ$ semileptonic decay

    Authors: Tao Zhong, Ya-Hong Dai, Hai-Bing Fu

    Abstract: Motivated by our previous work [Phys. Rev. D \textbf{104}, no.1, 016021 (2021)] on pionic leading-twist distribution amplitude (DA), we revisit $ρ$-meson leading-twist longitudinal DA $φ_{2;ρ}^\|(x,μ)$ in this paper. A model proposed by Chang based on the Dyson-Schwinger equations (DSEs) is adopted to describe the behavior of $φ_{2;ρ}^\|(x,μ)$. On the other hand, the $ξ$-moments of…

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: 9 pages, 3 figures

  10. arXiv:2308.13774  [pdf, other]

    cs.CV cs.IR cs.MM

    Central Similarity Multi-View Hashing for Multimedia Retrieval

    Authors: Jian Zhu, Wen Cheng, Yu Cui, Chang Tang, Yuyang Dai, Yong Li, Lingfang Zeng

    Abstract: Hash representation learning of multi-view heterogeneous data is the key to improving the accuracy of multimedia retrieval. However, existing methods utilize local similarity and fall short of deeply fusing the multi-view features, resulting in poor retrieval accuracy. Current methods only use local similarity to train their model. These methods ignore global similarity. Furthermore, most recent w…

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: accepted by the Asia Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data (APWeb-WAIM2023)

  11. arXiv:2308.13191  [pdf, other]

    cs.CL cs.AI

    Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers

    Authors: Jiawen Xie, Pengyu Cheng, Xiao Liang, Yong Dai, Nan Du

    Abstract: Although dominant in natural language processing, transformer-based models remain challenged by the task of long-sequence processing, because the computational cost of self-attention operations in transformers swells quadratically with the input sequence length. To alleviate the complexity of long-sequence processing, we propose a simple framework to enable the off-the-shelf pre-trained transformer…

    Submitted 25 August, 2023; originally announced August 2023.

  12. arXiv:2308.11925  [pdf, other]

    math.OC cs.LG math.NA

    Solving Elliptic Optimal Control Problems via Neural Networks and Optimality System

    Authors: Yongcheng Dai, Bangti Jin, Ramesh Sau, Zhi Zhou

    Abstract: In this work, we investigate a neural network based solver for optimal control problems (without / with box constraint) for linear and semilinear second-order elliptic problems. It utilizes a coupled system derived from the first-order optimality system of the optimal control problem, and employs deep neural networks to represent the solutions to the reduced system. We present an error analysis of…

    Submitted 8 May, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 26 pages

  13. arXiv:2308.10705  [pdf, other]

    cs.CV cs.AI

    Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling

    Authors: Haorui Ji, Hui Deng, Yuchao Dai, Hongdong Li

    Abstract: Most of the previous 3D human pose estimation work relied on the powerful memory capability of the network to obtain suitable 2D-3D mappings from the training data. Few works have studied the modeling of human posture deformation in motion. In this paper, we propose a new modeling method for human pose deformations and design an accompanying diffusion-based motion prior. Inspired by the field of n…

    Submitted 18 August, 2023; originally announced August 2023.

  14. arXiv:2308.09064  [pdf, other]

    astro-ph.GA

    The Lyman Continuum Escape Fraction of Star-forming Galaxies at $2.4\lesssim z\lesssim3.7$ from UVCANDELS

    Authors: Xin Wang, Harry I. Teplitz, Brent M. Smith, Rogier A. Windhorst, Marc Rafelski, Vihang Mehta, Anahita Alavi, Gabriel Brammer, James Colbert, Norman Grogin, Nimish P. Hathi, Anton M. Koekemoer, Laura Prichard, Claudia Scarlata, Ben Sunnquist, Pablo Arrabal Haro, Christopher Conselice, Eric Gawiser, Yicheng Guo, Matthew Hayes, Rolf A. Jansen, Zhiyuan Ji, Ray A. Lucas, Robert O'Connell, Brant Robertson , et al. (52 additional authors not shown)

    Abstract: The UltraViolet Imaging of the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey Fields (UVCANDELS) survey is a Hubble Space Telescope (HST) Cycle-26 Treasury Program, allocated in total 164 orbits of primary Wide-Field Camera 3 Ultraviolet and Visible light F275W imaging with coordinated parallel Advanced Camera for Surveys F435W imaging, on four of the five premier extragalactic sur…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 33 pages, 21 figures, and 5 tables. Resubmitted after addressing the referee report

  15. arXiv:2308.08488  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder

    Authors: Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee

    Abstract: In recent research, slight performance improvement is observed from automatic speech recognition systems to audio-visual speech recognition systems in the end-to-end framework with low-quality videos. Unmatching convergence rates and specialized input representations between audio and visual modalities are considered to cause the problem. In this paper, we propose two novel techniques to improve a…

    Submitted 8 March, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: 6 pages, 2 figures, published in ICME2023

  16. arXiv:2308.08288  [pdf, other]

    cs.CV

    Improving Audio-Visual Segmentation with Bidirectional Generation

    Authors: Dawei Hao, Yuxin Mao, Bowen He, Xiaodong Han, Yuchao Dai, Yiran Zhong

    Abstract: The aim of audio-visual segmentation (AVS) is to precisely differentiate audible objects within videos down to the pixel level. Traditional approaches often tackle this challenge by combining information from various modalities, where the contribution of each modality is implicitly or explicitly modeled. Nevertheless, the interconnections between different modalities tend to be overlooked in audio…

    Submitted 19 December, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: AAAI Camera Ready. Dawei Hao and Yuxin Mao contribute equally to this paper. Yiran Zhong is the corresponding author. The code will be released at https://github.com/OpenNLPLab/AVS-bidirectional

  17. arXiv:2308.04413  [pdf, other]

    cs.CV

    Digging into Depth Priors for Outdoor Neural Radiance Fields

    Authors: Chen Wang, Jiadai Sun, Lina Liu, Chenming Wu, Zhelun Shen, Dayan Wu, Yuchao Dai, Liangjun Zhang

    Abstract: Neural Radiance Fields (NeRF) have demonstrated impressive performance in vision and graphics tasks, such as novel view synthesis and immersive reality. However, the shape-radiance ambiguity of radiance fields remains a challenge, especially in the sparse viewpoints setting. Recent work resorts to integrating depth priors into outdoor NeRF training to alleviate the issue. However, the criteria for…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM MM 2023. Project Page: https://cwchenwang.github.io/outdoor-nerf-depth

  18. arXiv:2308.02809  [pdf]

    physics.class-ph physics.app-ph

    3D front tip fields in creeping solids under constraint effects: a higher-order asymptotic solution

    Authors: Weichen Kong, Yanwei Dai, Yinghua Liu

    Abstract: As one of the most important topics studied in creep fracture mechanics, mechanics fields at three-dimensional (3D) sharp V-notches and crack tip have drawn tremendous attentions. With many years efforts on constraint theory developed in creeping solids, there still seems dense fog on how in-plane and out-of-plane constraint effects are interacted for 3D sharp V-notch and crack in creeping solids.…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: 56 pages, 25 figures

  19. UV-Bright Star-Forming Clumps and Their Host Galaxies in UVCANDELS at 0.5 $\leq$ z $\leq$ 1

    Authors: Alec Martin, Yicheng Guo, Xin Wang, Anton M. Koekemoer, Marc Rafelski, Harry I. Teplitz, Rogier A. Windhorst, Anahita Alavi, Norman A. Grogin, Laura Prichard, Ben Sunnquist, Daniel Ceverino, Nima Chartab, Christopher J. Conselice, Y. Sophia Dai, Avishai Dekel, Johnathan P. Gardner, Eric Gawiser, Nimish P. Hathi, Matthew J. Hayes, Rolf A. Jansen, Zhiyuan Ji, David C. Koo, Ray A. Lucas, Nir Mandelker , et al. (10 additional authors not shown)

    Abstract: Giant star-forming clumps are a prominent feature of star-forming galaxies (SFGs) and contain important clues on galaxy formation and evolution. However, basic demographics of clumps and their host galaxies remain uncertain. Using the HST/WFC3 F275W images from the Ultraviolet Imaging of the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (UVCANDELS), we detect and analyze giant sta…

    Submitted 2 October, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: 21 pages, 13 figures, accepted for publication in ApJ

    Journal ref: ApJ 955 106 (2023)

  20. arXiv:2307.16579  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Contrastive Conditional Latent Diffusion for Audio-visual Segmentation

    Authors: Yuxin Mao, Jing Zhang, Mochu Xiang, Yunqiu Lv, Yiran Zhong, Yuchao Dai

    Abstract: We propose a latent diffusion model with contrastive learning for audio-visual segmentation (AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional generation task, where audio is defined as the conditional variable for sound producer(s) segmentation. With our new interpretation, it is especially necessary to model the correlation between audio and the final segme…

    Submitted 31 July, 2023; originally announced July 2023.

  21. arXiv:2307.16572  [pdf, other]

    cs.CV

    Transferable Attack for Semantic Segmentation

    Authors: Mengqi He, Jing Zhang, Zhaoyuan Yang, Mingyi He, Nick Barnes, Yuchao Dai

    Abstract: We analyze the performance of semantic segmentation models w.r.t. adversarial attacks, and observe that the adversarial examples generated from a source model fail to attack the target models, i.e., the conventional attack methods, such as PGD and FGSM, do not transfer well to target models, making it necessary to study the transferable attacks, especially transferable attacks for semantic segmentation.…

    Submitted 21 August, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: Source code is available at: https://github.com/anucvers/TASS

  22. arXiv:2307.16509  [pdf, other]

    cs.CV

    Digging Into Uncertainty-based Pseudo-label for Robust Stereo Matching

    Authors: Zhelun Shen, Xibin Song, Yuchao Dai, Dingfu Zhou, Zhibo Rao, Liangjun Zhang

    Abstract: Due to the domain differences and unbalanced disparity distribution across multiple datasets, current stereo matching approaches are commonly limited to a specific dataset and generalize poorly to others. Such domain shift issue is usually addressed by substantial adaptation on costly target-domain ground-truth data, which cannot be easily obtained in practical settings. In this paper, we propose…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Accepted by TPAMI

  23. arXiv:2307.15429  [pdf, other]

    cs.LG cs.AI cs.CV

    Improvable Gap Balancing for Multi-Task Learning

    Authors: Yanqi Dai, Nanyi Fei, Zhiwu Lu

    Abstract: In multi-task learning (MTL), gradient balancing has recently attracted more research interest than loss balancing since it often leads to better performance. However, loss balancing is much more efficient than gradient balancing, and thus it is still worth further exploration in MTL. Note that prior studies typically ignore that there exist varying improvable gaps across multiple tasks, where the…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted for the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023)

  24. arXiv:2307.09929  [pdf, other]

    cs.CV

    Measuring and Modeling Uncertainty Degree for Monocular Depth Estimation

    Authors: Mochu Xiang, Jing Zhang, Nick Barnes, Yuchao Dai

    Abstract: Effectively measuring and modeling the reliability of a trained model is essential to the real-world deployment of monocular depth estimation (MDE) models. However, the intrinsic ill-posedness and ordinal-sensitive nature of MDE pose major challenges to the estimation of uncertainty degree of the trained models. On the one hand, utilizing current uncertainty modeling methods may increase memory co…

    Submitted 19 July, 2023; originally announced July 2023.

  25. arXiv:2307.09270  [pdf, other]

    cs.CL

    Linearized Relative Positional Encoding

    Authors: Zhen Qin, Weixuan Sun, Kaiyue Lu, Hui Deng, Dongxu Li, Xiaodong Han, Yuchao Dai, Lingpeng Kong, Yiran Zhong

    Abstract: Relative positional encoding is widely used in vanilla and linear transformers to represent positional information. However, existing encoding methods of a vanilla transformer are not always directly applicable to a linear transformer, because the latter requires a decomposition of the query and key representations into separate kernel functions. Nevertheless, principles for designing encoding met…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: Reviewed by TMLR, decision pending. Yiran Zhong is the corresponding author. Code is available at https://github.com/OpenNLPLab/Lrpe

  26. Human Body Digital Twin: A Master Plan

    Authors: Chenyu Tang, Wentian Yi, Edoardo Occhipinti, Yanning Dai, Shuo Gao, Luigi G. Occhipinti

    Abstract: A human body digital twin (DT) is a virtual representation of an individual's physiological state, created using real-time data from sensors and medical test devices, with the purpose of simulating, predicting, and optimizing health outcomes through advanced analytics and simulations. The human body DT has the potential to revolutionize healthcare and wellness, but its responsible and effective im…

    Submitted 12 September, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: 3 figures, 2 boxes

  27. arXiv:2307.04651  [pdf, other]

    cs.CV

    Joint Salient Object Detection and Camouflaged Object Detection via Uncertainty-aware Learning

    Authors: Aixuan Li, Jing Zhang, Yunqiu Lv, Tong Zhang, Yiran Zhong, Mingyi He, Yuchao Dai

    Abstract: Salient objects attract human attention and usually stand out clearly from their surroundings. In contrast, camouflaged objects share similar colors or textures with the environment. In this case, salient objects are typically non-camouflaged, and camouflaged objects are usually not salient. Due to this inherent contradictory attribute, we introduce an uncertainty-aware learning pipeline to extens…

    Submitted 10 July, 2023; originally announced July 2023.

  28. arXiv:2307.03376  [pdf, other]

    cs.CV

    Weakly-supervised Contrastive Learning for Unsupervised Object Discovery

    Authors: Yunqiu Lv, Jing Zhang, Nick Barnes, Yuchao Dai

    Abstract: Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of objects from the background within a scene without relying on labeled datasets, which benefits the task of bounding-box-level localization and pixel-level segmentation. This task is promising due to its ability to discover objects in a generic manner. We roughly categorise existing techniques into two main…

    Submitted 7 July, 2023; originally announced July 2023.

  29. arXiv:2307.02950  [pdf, ps, other]

    cond-mat.supr-con cond-mat.str-el

    Electronic correlations and partial gap in the bilayer nickelate La$_{3}$Ni$_{2}$O$_{7}$

    Authors: Zhe Liu, Mengwu Huo, Jie Li, Qing Li, Yuecong Liu, Yaomin Dai, Xiaoxiang Zhou, Jiahao Hao, Yi Lu, Meng Wang, Hai-Hu Wen

    Abstract: The discovery of superconductivity with a critical temperature of about 80~K in La$_{3}$Ni$_{2}$O$_{7}$ single crystals under pressure has received enormous attention. La$_{3}$Ni$_{2}$O$_{7}$ is not superconducting under ambient pressure but exhibits a transition at $T^{\ast} \simeq 115$~K. Understanding the electronic correlations and charge dynamics is an important step towards the origin of sup…

    Submitted 2 April, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 26 pages, 4 figures, Comments are welcome and appreciated

  30. arXiv:2307.02335  [pdf, other]

    astro-ph.GA

    The Classification of Galaxy Morphology in H-band of COSMOS-DASH Field: a combination-based machine learning clustering model

    Authors: Yao Dai, Jun Xu, Jie Song, Guanwen Fang, Chichun Zhou, Shuo Ba, Yizhou Gu, Zesen Lin, Xu Kong

    Abstract: By applying our previously developed two-step scheme for galaxy morphology classification, we present a catalog of galaxy morphology for H-band selected massive galaxies in the COSMOS-DASH field, which includes 17292 galaxies with stellar mass $M_{\star}>10^{10}~M_{\odot}$ at $0.5<z<2.5$. The classification scheme is designed to provide a complete morphology classification for galaxies via a combi…

    Submitted 6 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 13 pages, 10 figures, accepted by ApJS

  31. arXiv:2306.16176  [pdf, other]

    cs.CL

    SkillNet-X: A Multilingual Multitask Model with Sparsely Activated Skills

    Authors: Zhangyin Feng, Yong Dai, Fan Zhang, Duyu Tang, Xiaocheng Feng, Shuangzhi Wu, Bing Qin, Yunbo Cao, Shuming Shi

    Abstract: Traditional multitask learning methods basically can only exploit common knowledge in task- or language-wise, which lose either cross-language or cross-task knowledge. This paper proposes a general multilingual multitask model, named SkillNet-X, which enables a single model to tackle many different tasks from different languages. To this end, we define several language-specific skills and task-spe…

    Submitted 28 June, 2023; originally announced June 2023.

  32. arXiv:2306.15247  [pdf, ps, other]

    cs.IT eess.SP math.OC

    Towards Efficient Optimal Large-Scale Network Slicing: A Decomposition Approach

    Authors: Wei-Kun Chen, Zheyu Wu, Rui-Jin Zhang, Ya-Feng Liu, Yu-Hong Dai, Zhi-Quan Luo

    Abstract: This paper considers the network slicing (NS) problem which attempts to map multiple customized virtual network requests to a common shared network infrastructure and allocate network resources to meet diverse service requirements. This paper proposes an efficient decomposition algorithm for globally solving the large-scale NP-hard NS problem. The proposed algorithm decomposes the hard NS problem…

    Submitted 14 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 11 figures, submitted for possible publication; for the conference version, see arXiv:2306.15247v1

  33. arXiv:2306.15032  [pdf, other]

    stat.ME

    DMseg: a Python algorithm for de novo detection of differentially or variably methylated regions

    Authors: Xiaoyu Wang, Ming Yu, William Grady, Ziding Feng, Wei Sun, James Y Dai

    Abstract: Detecting and assessing statistical significance of differentially methylated regions (DMRs) is a fundamental task in methylome association studies. While the average differential methylation in different phenotype groups has been the inferential focus, methylation changes in chromosomal regions may also present as differential variability, i.e., variably methylated regions (VMRs). Testing statist…

    Submitted 26 June, 2023; originally announced June 2023.

  34. arXiv:2306.14826  [pdf, other]

    stat.ME

    Incorporating increased variability in testing for cancer DNA methylation

    Authors: James Y. Dai, Heng Chen, Xiaoyu Wang, Wei Sun, Ying Huang, William M. Grady, Ziding Feng

    Abstract: Cancer development is associated with aberrant DNA methylation, including increased stochastic variability. Statistical tests for discovering cancer methylation biomarkers have focused on changes in mean methylation. To improve the power of detection, we propose to incorporate increased variability in testing for cancer differential methylation by two joint constrained tests: one for differential…

    Submitted 26 June, 2023; originally announced June 2023.

  35. arXiv:2306.11231  [pdf, other]

    astro-ph.GA

    Deep HI Mapping of Stephan's Quintet and Its Neighborhood

    Authors: Cheng Cheng, Cong Kevin Xu, P. N. Appleton, P. -A. Duc, N. -Y. Tang, Y. S. Dai, J. -S. Huang, U. Lisenfeld, F. Renaud, Chuan He, Hai-Cheng Feng

    Abstract: We carried out deep mapping observations of the atomic hydrogen (HI) 21 cm line emission in a field centered on the famous galaxy group Stephan's Quintet (SQ), using the Five-hundred-meter Aperture Spherical Telescope (FAST) equipped with the 19-Beam Receiver. The final data cube reaches an HI column density sensitivity of $5 σ= 2.1\times 10^{17}$ cm$^{-2}$ per 20 km s$^{-1}$ channel with an angul…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 20 pages, 5 figures, Accepted by ApJ

  36. arXiv:2306.06877  [pdf, other]

    cs.CV

    Boosting Breast Ultrasound Video Classification by the Guidance of Keyframe Feature Centers

    Authors: AnLan Sun, Zhao Zhang, Meng Lei, Yuting Dai, Dong Wang, Liwei Wang

    Abstract: Breast ultrasound videos contain richer information than ultrasound images, therefore it is more meaningful to develop video models for this diagnosis task. However, the collection of ultrasound video datasets is much harder. In this paper, we explore the feasibility of enhancing the performance of ultrasound video classification using the static image dataset. To this end, we propose KGA-Net and…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Medical Image Computing and Computer-Assisted Intervention 2023

  37. arXiv:2306.04236  [pdf, other]

    cs.CV eess.IV

    Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yihang Luo, Chen Change Loy

    Abstract: Artificial lights commonly leave strong lens flare artifacts on the images captured at night, degrading both the visual quality and performance of vision algorithms. Existing flare removal approaches mainly focus on removing daytime flares and fail in nighttime cases. Nighttime flare removal is challenging due to the unique luminance and spectrum of artificial lights, as well as the diverse patter…

    Submitted 7 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Extension of arXiv:2210.06570; Project page at https://ykdai.github.io/projects/Flare7K

  38. arXiv:2306.03630  [pdf, other]

    cs.CV

    Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

    Authors: Aixuan Li, Yuxin Mao, Jing Zhang, Yuchao Dai

    Abstract: In this paper, we present a weakly-supervised RGB-D salient object detection model via scribble supervision. Specifically, as a multimodal learning task, we focus on effective multimodal representation learning via inter-modal mutual information regularization. In particular, following the principle of disentangled representation learning, we introduce a mutual information upper bound with a mutua…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: IEEE Transactions on Circuits and Systems for Video Technology 2023

  39. arXiv:2306.02610  [pdf, other]

    astro-ph.EP astro-ph.GA astro-ph.SR

    Understanding the Planetary Formation and Evolution in Star Clusters(UPiC)-I: Evidence of Hot Giant Exoplanets Formation Timescales

    Authors: Yuan-Zhe Dai, Hui-Gen Liu, Jia-Yi Yang, Ji-Lin Zhou

    Abstract: Planets in young star clusters could shed light on planet formation and evolution since star clusters can provide accurate age estimation. However, the number of transiting planets detected in clusters was only $\sim 30$, too small for statistical analysis. Thanks to the unprecedented high-precision astrometric data provided by Gaia DR2 and Gaia DR3, many new Open Clusters(OCs) and comoving groups…

    Submitted 6 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 22 pages, 11 figures, 2 tables, accepted for publication in AJ

    Journal ref: The Astronomical Journal, Year 2023, Volume 166, Number 6

  40. arXiv:2305.15287  [pdf, other]

    cs.LG cs.AI stat.ML

    The Crucial Role of Normalization in Sharpness-Aware Minimization

    Authors: Yan Dai, Kwangjun Ahn, Suvrit Sra

    Abstract: Sharpness-Aware Minimization (SAM) is a recently proposed gradient-based optimizer (Foret et al., ICLR 2021) that greatly improves the prediction performance of deep neural networks. Consequently, there has been a surge of interest in explaining its empirical success. We focus, in particular, on understanding the role played by normalization, a key component of the SAM updates. We theoretically an…

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 30 pages, Published in 37th Neural Information Processing Systems (NeurIPS 2023)

  41. An Atypical Plateau-like Extreme-ultraviolet Late-phase Solar Flare Driven by the Non-radial Eruption of a Magnetic Flux Rope

    Authors: Yuehong Chen, Yu Dai, Mingde Ding

    Abstract: Recent observations in extreme-ultraviolet (EUV) wavelengths reveal an EUV late phase in some solar flares, which is characterized by a second peak in the warm coronal emissions (about 3 MK) occurring several tens of minutes to a few hours after the corresponding main flare peak. We aim to clarify the physical origin of an atypical plateau-like EUV late phase in an X1.8-class solar flare occurring…

    Submitted 20 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by A&A

    Journal ref: A&A 675, A147 (2023)

  42. arXiv:2305.14895  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite

    Authors: Z. X. Ling, X. J. Sun, C. Zhang, S. L. Sun, G. Jin, S. N. Zhang, X. F. Zhang, J. B. Chang, F. S. Chen, Y. F. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, Z. D. Li, P. R. Liu, Y. H. Lv, X. H. Ma, Y. J. Tang, C. B. Wang, R. J. Xie, Y. L. Xue, A. L. Yan, et al. (101 additional authors not shown)

    Abstract: The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (Fo…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by RAA

  43. arXiv:2305.13770  [pdf, other]

    cs.CV eess.IV

    MIPI 2023 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, Jinwei Gu

    Abstract: Developing and integrating advanced image sensors with novel algorithms in camera systems are prevalent with the increasing demand for computational photography and imaging on mobile platforms. However, the lack of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: CVPR 2023 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2023/

  44. arXiv:2305.13413  [pdf, other]

    cs.CL

    Syntactic Knowledge via Graph Attention with BERT in Machine Translation

    Authors: Yuqian Dai, Serge Sharoff, Marc de Kamps

    Abstract: Although the Transformer model can effectively acquire context features via a self-attention mechanism, deeper syntactic knowledge is still not effectively modeled. To alleviate the above problem, we propose Syntactic knowledge via Graph attention with BERT (SGB) in Machine Translation (MT) scenarios. Graph Attention Network (GAT) and BERT jointly represent syntactic dependency feature as explicit…

    Submitted 22 May, 2023; originally announced May 2023.

  45. arXiv:2305.13403  [pdf, other]

    cs.CL

    GATology for Linguistics: What Syntactic Dependencies It Knows

    Authors: Yuqian Dai, Serge Sharoff, Marc de Kamps

    Abstract: Graph Attention Network (GAT) is a graph neural network which is one of the strategies for modeling and representing explicit syntactic knowledge and can work with pre-trained models, such as BERT, in downstream tasks. Currently, there is still a lack of investigation into how GAT learns syntactic knowledge from the perspective of model structure. As one of the strategies for modeling explicit syn…

    Submitted 22 May, 2023; originally announced May 2023.

  46. arXiv:2305.13040  [pdf, other]

    cs.CL cs.AI

    SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents

    Authors: Shuzheng Si, Wentao Ma, Haoyu Gao, Yuchuan Wu, Ting-En Lin, Yinpei Dai, Hangyu Li, Rui Yan, Fei Huang, Yongbin Li

    Abstract: Task-oriented dialogue (TOD) models have made significant progress in recent years. However, previous studies primarily focus on datasets written by annotators, which has resulted in a gap between academic research and real-world spoken conversation scenarios. While several small-scale spoken TOD datasets are proposed to address robustness issues such as ASR errors, they ignore the unique challeng…

    Submitted 12 March, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  47. Fraction of Clumpy Star-Forming Galaxies at $0.5\leq z\leq 3$ in UVCANDELS: Dependence on Stellar Mass and Environment

    Authors: Zahra Sattari, Bahram Mobasher, Nima Chartab, Daniel D. Kelson, Harry I. Teplitz, Marc Rafelski, Norman A. Grogin, Anton M. Koekemoer, Xin Wang, Rogier A. Windhorst, Anahita Alavi, Laura Prichard, Ben Sunnquist, Jonathan P. Gardner, Eric Gawiser, Nimish P. Hathi, Matthew J. Hayes, Zhiyuan Ji, Vihang Mehta, Brant E. Robertson, Claudia Scarlata, L. Y. Aaron Yung, Christopher J. Conselice, Y. Sophia Dai, Yicheng Guo, et al. (3 additional authors not shown)

    Abstract: High-resolution imaging of galaxies in rest-frame UV has revealed the existence of giant star-forming clumps prevalent in high redshift galaxies. Studying these sub-structures provides important information about their formation and evolution and informs theoretical galaxy evolution models. We present a new method to identify clumps in galaxies' high-resolution rest-frame UV images. Using imaging…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 16 pages, 11 figures, 2 tables, accepted for publication in ApJ

  48. arXiv:2305.06557  [pdf, other]

    cs.CL cs.AI cs.LG

    Long-Tailed Question Answering in an Open World

    Authors: Yi Dai, Hao Lang, Yinhe Zheng, Fei Huang, Yongbin Li

    Abstract: Real-world data often have an open long-tailed distribution, and building a unified QA model supporting various tasks is vital for practical QA applications. However, it is non-trivial to extend previous QA approaches since they either require access to seen tasks of adequate samples or do not explicitly model samples from unseen tasks. In this paper, we define Open Long-Tailed QA (OLTQA) as learn…

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: ACL2023 Main Track Long Paper

  49. arXiv:2305.06555  [pdf, other]

    cs.CL cs.AI cs.LG

    Domain Incremental Lifelong Learning in an Open World

    Authors: Yi Dai, Hao Lang, Yinhe Zheng, Bowen Yu, Fei Huang, Yongbin Li

    Abstract: Lifelong learning (LL) is an important ability for NLP models to learn new tasks continuously. Architecture-based approaches are reported to be effective implementations for LL models. However, it is non-trivial to extend previous approaches to domain incremental LL scenarios since they either require access to task identities in the testing phase or cannot handle samples from unseen tasks. In thi…

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: ACL2023 Findings Long Paper. arXiv admin note: substantial text overlap with arXiv:2208.14602

  50. arXiv:2305.06103  [pdf, ps, other]

    cond-mat.supr-con cond-mat.mtrl-sci

    Pressure-induced color change arising from transformation between intra- and inter-band transitions in LuH$_{2\pm x}$N$_{y}$

    Authors: Zhe Liu, Yingjie Zhang, Shenyang Huang, Xue Ming, Qing Li, Chenghao Pan, Yaomin Dai, Xiaoxiang Zhou, Xiyu Zhu, Hugen Yan, Hai-Hu Wen

    Abstract: The pressure-induced color change in the nitrogen-doped lutetium hydride has triggered extensive discussions about the underlying physics. Here, we study the optical response of LuH$_{2 \pm x}$N$_{y}$ in a broad frequency range at ambient pressure and its evolution with pressure in the visible spectral range. The broad-band optical spectra at ambient pressure reveal a Drude component associated wi…

    Submitted 30 January, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: 20 pages, 4 figures. Comments are welcome and appreciated

    Journal ref: Sci. China Phys. Mech. Astron. 67, 227411 (2024)