Skip to main content

Showing 51–100 of 1,438 results for author: Guo, Z

.
  1. arXiv:2406.00279  [pdf

    eess.IV cs.CV

    Hybrid attention structure preserving network for reconstruction of under-sampled OCT images

    Authors: Zezhao Guo, Zhanfang Zhao

    Abstract: Optical coherence tomography (OCT) is a non-invasive, high-resolution imaging technology that provides cross-sectional images of tissues. Dense acquisition of A-scans along the fast axis is required to obtain high digital resolution images. However, the dense acquisition will increase the acquisition time, causing the discomfort of patients. In addition, the longer acquisition time may lead to mot… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  2. arXiv:2405.21071  [pdf, other

    astro-ph.SR

    A Multi-wavelength, Multi-epoch Monitoring Campaign of Accretion Variability in T Tauri Stars from the ODYSSEUS Survey. II. Photometric Light Curves

    Authors: John Wendeborn, Catherine C. Espaillat, Thanawuth Thanathibodee, Connor E. Robinson, Caeley V. Pittman, Nuria Calvet, Ágnes Kóspál, Konstantin N. Grankin, Fredrick M. Walter, Zhen Guo, Jochen Eislöffel

    Abstract: Classical T Tauri Stars (CTTSs) are young, low-mass stars which accrete material from their surrounding protoplanetary disk. To better understand accretion variability, we conducted a multi-epoch, multi-wavelength photometric monitoring campaign of four CTTSs: TW Hya, RU Lup, BP Tau, and GM Aur, in 2021 and 2022, contemporaneous with HST UV and optical spectra. We find that all four targets displa… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 28 pages, 12 figures

  3. arXiv:2405.21038  [pdf, other

    astro-ph.SR

    A Multi-wavelength, Multi-epoch Monitoring Campaign of Accretion Variability in T Tauri Stars from the ODYSSEUS Survey. I. HST FUV and NUV Spectra

    Authors: John Wendeborn, Catherine C. Espaillat, Sophia Lopez, Thanawuth Thanathibodee, Connor E. Robinson, Caeley V. Pittman, Nuria Calvet, Nicole Flors, Fredrick M. Walter, Ágnes Kóspál, Konstantin N. Grankin, Ignacio Mendigutía, Hans Moritz Günther, Jochen Eislöffel, Zhen Guo, Kevin France, Eleonora Fiorellino, William J. Fischer, Péter Ábrahám, Gregory J. Herczeg

    Abstract: The Classical T Tauri Star (CTTS) stage is a critical phase of the star and planet formation process. In an effort to better understand the mass accretion process, which can dictate further stellar evolution and planet formation, a multi-epoch, multi-wavelength photometric and spectroscopic monitoring campaign of four CTTSs (TW Hya, RU Lup, BP Tau, and GM Aur) was carried out in 2021 and 2022/2023… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 37 pages, 14 figures

  4. arXiv:2405.19732  [pdf, other

    cs.CV cs.CL cs.LG

    Two Optimizers Are Better Than One: LLM Catalyst Empowers Gradient-Based Optimization for Prompt Tuning

    Authors: Zixian Guo, Ming Liu, Zhilong Ji, **feng Bai, Yiwen Guo, Wangmeng Zuo

    Abstract: Learning a skill generally relies on both practical experience by doer and insightful high-level guidance by instructor. Will this strategy also work well for solving complex non-convex optimization problems? Here, a common gradient-based optimizer acts like a disciplined doer, making locally optimal update at each step. Recent methods utilize large language models (LLMs) to optimize solutions for… ▽ More

    Submitted 6 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  5. arXiv:2405.18727  [pdf, other

    cs.CL cs.AI cs.IR

    CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control

    Authors: Huanshuo Liu, Hao Zhang, Zhijiang Guo, Kuicai Dong, Xiangyang Li, Yi Quan Lee, Cong Zhang, Yong Liu

    Abstract: Retrieval-augmented generation (RAG) has emerged as a promising solution for mitigating hallucinations of large language models (LLMs) with retrieved external knowledge. Adaptive RAG enhances this approach by dynamically assessing the retrieval necessity, aiming to balance external and internal knowledge usage. However, existing adaptive RAG methods primarily realize retrieval on demand by relying… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 9 tables

  6. arXiv:2405.18523  [pdf, other

    cs.CV cs.AI

    TripletMix: Triplet Data Augmentation for 3D Understanding

    Authors: Jiaze Wang, Yi Wang, Ziyu Guo, Renrui Zhang, Donghao Zhou, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng

    Abstract: Data augmentation has proven to be a vital tool for enhancing the generalization capabilities of deep learning models, especially in the context of 3D vision where traditional datasets are often limited. Despite previous advancements, existing methods primarily cater to unimodal data scenarios, leaving a gap in the augmentation of multimodal triplet data, which integrates text, images, and point c… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.18216  [pdf, other

    cs.SE

    A Survey on Modern Code Review: Progresses, Challenges and Opportunities

    Authors: Zezhou Yang, Cuiyun Gao, Zhaoqiang Guo, Zhenhao Li, Kui Liu, Xin Xia, Yuming Zhou

    Abstract: Over the past decade, modern code review (MCR) has been deemed as a crucial practice of software quality assurance, which is applied to improve software quality and transfer development knowledge within a software team. Despite its importance, MCR is often a complicated and time-consuming activity for practitioners. In recent years, many studies that are dedicated to the comprehension and the impr… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 62 pages

  8. arXiv:2405.18132  [pdf, other

    cs.CV

    EG4D: Explicit Generation of 4D Object without Score Distillation

    Authors: Qi Sun, Zhiyang Guo, Ziyu Wan, **g Nathan Yan, Shengming Yin, Wengang Zhou, **g Liao, Houqiang Li

    Abstract: In recent years, the increasing demand for dynamic 3D assets in design and gaming applications has given rise to powerful generative pipelines capable of synthesizing high-quality 4D objects. Previous methods generally rely on score distillation sampling (SDS) algorithm to infer the unseen views and motion of 4D objects, thus leading to unsatisfactory results with defects like over-saturation and… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  9. arXiv:2405.17420  [pdf, other

    cs.LG

    Survival of the Fittest Representation: A Case Study with Modular Addition

    Authors: Xiaoman Delores Ding, Zifan Carl Guo, Eric J. Michaud, Ziming Liu, Max Tegmark

    Abstract: When a neural network can learn multiple distinct algorithms to solve a task, how does it "choose" between them during training? To approach this question, we take inspiration from ecology: when multiple species coexist, they eventually reach an equilibrium where some survive while others die out. Analogously, we suggest that a neural network at initialization contains many solutions (representati… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  10. arXiv:2405.16980  [pdf, other

    cs.CV eess.IV

    DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking

    Authors: Hongtao Wang, Rongyu Feng, Liangyi Wu, Mutian Liu, Yinuo Cui, Chunxia Zhang, Zhenbo Guo

    Abstract: In seismic exploration, identifying the first break (FB) is a critical component in establishing subsurface velocity models. Various automatic picking techniques based on deep neural networks have been developed to expedite this procedure. The most popular class is using semantic segmentation networks to pick on a shot gather called 2-dimensional (2-D) picking. Generally, 2-D segmentation-based pi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  11. arXiv:2405.16952  [pdf, other

    eess.AS

    A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition

    Authors: Zilu Guo, Qing Wang, Jun Du, Jia Pan, Qing-Feng Liu, Chin-Hui

    Abstract: In this paper, we propose a variance-preserving interpolation framework to improve diffusion models for single-channel speech enhancement (SE) and automatic speech recognition (ASR). This new variance-preserving interpolation diffusion model (VPIDM) approach requires only 25 iterative steps and obviates the need for a corrector, an essential element in the existing variance-exploding interpolation… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2405.16802  [pdf, other

    cs.CL cs.LG

    AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation

    Authors: Jianqiao Lu, Zhiyang Dou, Hongru Wang, Zeyu Cao, Jianbo Dai, Yingjia Wan, Yinya Huang, Zhijiang Guo

    Abstract: In this work, we propose a novel method named \textbf{Auto}mated Process Labeling via \textbf{C}onfidence \textbf{V}ariation (\textbf{\textsc{AutoCV}}) to enhance the reasoning capabilities of large language models (LLMs) by automatically annotating the reasoning steps. Our approach begins by training a verification model on the correctness of final answers, enabling it to generate automatic proce… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: 20 pages, 1 figure, 13 tables

  13. arXiv:2405.15863  [pdf, other

    cs.SD cs.AI eess.AS

    Quality-aware Masked Diffusion Transformer for Enhanced Music Generation

    Authors: Chang Li, Ruoyu Wang, Lijuan Liu, Jun Du, Yixuan Sun, Zilu Guo, Zhenrong Zhang, Yuan Jiang

    Abstract: In recent years, diffusion-based text-to-music (TTM) generation has gained prominence, offering a novel approach to synthesizing musical content from textual descriptions. Achieving high accuracy and diversity in this generation process requires extensive, high-quality data, which often constitutes only a fraction of available datasets. Within open-source datasets, the prevalence of issues like mi… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  14. arXiv:2405.15412  [pdf, other

    physics.ao-ph cs.AI cs.LG

    ORCA: A Global Ocean Emulator for Multi-year to Decadal Predictions

    Authors: Zijie Guo, Pumeng Lyu, Fenghua Ling, **g-Jia Luo, Niklas Boers, Wanli Ouyang, Lei Bai

    Abstract: Ocean dynamics plays a crucial role in driving global weather and climate patterns. Accurate and efficient modeling of ocean dynamics is essential for improved understanding of complex ocean circulation and processes, for predicting climate variations and their associated teleconnections, and for addressing the challenges of climate change. While great efforts have been made to improve numerical O… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  15. arXiv:2405.15189  [pdf, other

    cs.SE cs.CL

    SOAP: Enhancing Efficiency of Generated Code via Self-Optimization

    Authors: Dong Huang, Jianbo Dai, Han Weng, Puzhen Wu, Yuhao Qing, Jie M. Zhang, Heming Cui, Zhijiang Guo

    Abstract: Large language models (LLMs) have shown remarkable progress in code generation, but their generated code often suffers from inefficiency, resulting in longer execution times and higher memory consumption. To address this issue, we propose Self Optimization based on OverheAd Profile (SOAP), a self-optimization framework that utilizes execution overhead profiles to improve the efficiency of LLM-gene… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 31 pages, 18 figures, and 8 tables

  16. arXiv:2405.13710  [pdf, other

    eess.IV cs.CV cs.LG

    Optimizing Lymphocyte Detection in Breast Cancer Whole Slide Imaging through Data-Centric Strategies

    Authors: Amine Marzouki, Zhuxian Guo, Qinghe Zeng, Camille Kurtz, Nicolas Loménie

    Abstract: Efficient and precise quantification of lymphocytes in histopathology slides is imperative for the characterization of the tumor microenvironment and immunotherapy response insights. We developed a data-centric optimization pipeline that attain great lymphocyte detection performance using an off-the-shelf YOLOv5 model, without any architectural modifications. Our contribution that rely on strategi… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  17. arXiv:2405.13614  [pdf, ps, other

    math.DG

    On Relative Tractor Bundles

    Authors: Andreas Cap, Zhangwen Guo, Michal Wasilewicz

    Abstract: This article contributes to the relative BGG-machinery for parabolic geometries. Starting from a relative tractor bundle, this machinery constructs a sequence of differential operators that are naturally associated to the geometry in question. In many situations of interest, it is known that this sequence provides a resolution of a sheaf that can locally be realized as a pullback from a local leaf… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 21 pages, Comments are welcome

    MSC Class: primary: 58J10; secondary: 53C07; 53C15; 58J60; 58J70

  18. arXiv:2405.13532  [pdf, other

    cs.CV

    What Makes Good Few-shot Examples for Vision-Language Models?

    Authors: Zhaojun Guo, **ghui Lu, Xue**g Liu, Rui Zhao, ZhenXing Qian, Fei Tan

    Abstract: Despite the notable advancements achieved by leveraging pre-trained vision-language (VL) models through few-shot tuning for downstream tasks, our detailed empirical study highlights a significant dependence of few-shot learning outcomes on the careful selection of training examples - a facet that has been previously overlooked in research. In this study, we delve into devising more effective strat… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 8 pages, 4 figures

  19. arXiv:2405.12069  [pdf, other

    cs.CV

    Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture War**

    Authors: Tianhao Wu, **g Yang, Zhilin Guo, **gyi Wan, Fangcheng Zhong, Cengiz Oztireli

    Abstract: By equip** the most recent 3D Gaussian Splatting representation with head 3D morphable models (3DMM), existing methods manage to create head avatars with high fidelity. However, most existing methods only reconstruct a head without the body, substantially limiting their application scenarios. We found that naively applying Gaussians to model the clothed chest and shoulders tends to result in blu… ▽ More

    Submitted 21 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Project Page: https://gaussian-head-shoulders.netlify.app/

  20. arXiv:2405.11682  [pdf, other

    cs.CV cs.RO

    FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention

    Authors: Ziang Guo, Zakhar Yagudin, Selamawit Asfaw, Artem Lykov, Dzmitry Tsetserukou

    Abstract: Camera, LiDAR and radar are common perception sensors for autonomous driving tasks. Robust prediction of 3D object detection is optimally based on the fusion of these sensors. To exploit their abilities wisely remains a challenge because each of these sensors has its own characteristics. In this paper, we propose FADet, a multi-sensor 3D detection network, which specifically studies the characteri… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE

  21. arXiv:2405.11532  [pdf, other

    eess.SP

    Non-Invasive Monitoring of Vital Signs in Calves Using Thermal Imaging Technology

    Authors: Ehsan Sadeghi, Zinan Guo, Alessandro Chiumento, Paul Havinga

    Abstract: This study presents a non-invasive method using thermal imaging to estimate heart and respiration rates in calves, avoiding the stress from wearables. Using Kernelised Correlation Filters (KCF) for movement tracking and advanced signal processing, we targeted one ROI for respiration and four for heart rate based on their thermal correlation. Achieving Mean Absolute Percentage Errors (MAPE) of 3.08… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  22. arXiv:2405.11430  [pdf, other

    cs.CL

    MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation

    Authors: Jianbo Dai, Jianqiao Lu, Yunlong Feng, Rongju Ruan, Ming Cheng, Haochen Tan, Zhijiang Guo

    Abstract: Recent advancements in large language models (LLMs) have greatly improved code generation, specifically at the function level. For instance, GPT-4 has achieved an 88.4% pass rate on HumanEval. However, this draws into question the adequacy of existing benchmarks in thoroughly assessing function-level code generation capabilities. Our study analyzed two common benchmarks, HumanEval and MBPP, and fo… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 39 pages, dataset and code are available at https://github.com/SparksofAGI/MHPP

  23. arXiv:2405.10877  [pdf, other

    cs.LG cs.AI

    WEITS: A Wavelet-enhanced residual framework for interpretable time series forecasting

    Authors: Ziyou Guo, Yan Sun, Tieru Wu

    Abstract: Time series (TS) forecasting has been an unprecedentedly popular problem in recent years, with ubiquitous applications in both scientific and business fields. Various approaches have been introduced to time series analysis, including both statistical approaches and deep neural networks. Although neural network approaches have illustrated stronger ability of representation than statistical methods,… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.09488 by other authors

  24. arXiv:2405.10277  [pdf, ps, other

    cs.CC

    Hilbert Functions and Low-Degree Randomness Extractors

    Authors: Alexander Golovnev, Zeyu Guo, Pooya Hatami, Satyajeet Nagargoje, Chao Yan

    Abstract: For $S\subseteq \mathbb{F}^n$, consider the linear space of restrictions of degree-$d$ polynomials to $S$. The Hilbert function of $S$, denoted $\mathrm{h}_S(d,\mathbb{F})$, is the dimension of this space. We obtain a tight lower bound on the smallest value of the Hilbert function of subsets $S$ of arbitrary finite grids in $\mathbb{F}^n$ with a fixed size $|S|$. We achieve this by proving that th… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  25. arXiv:2405.08981  [pdf, other

    cs.HC cs.CV cs.LG

    Impact of Design Decisions in Scanpath Modeling

    Authors: Parvin Emami, Yue Jiang, Zixin Guo, Luis A. Leiva

    Abstract: Modeling visual saliency in graphical user interfaces (GUIs) allows to understand how people perceive GUI designs and what elements attract their attention. One aspect that is often overlooked is the fact that computational models depend on a series of design parameters that are not straightforward to decide. We systematically analyze how different design parameters affect scanpath evaluation metr… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 16 pages

  26. arXiv:2405.08591  [pdf, ps, other

    hep-ph astro-ph.HE

    Degeneracy Enhancement of Neutron-Antineutron Oscillation in Neutron Star

    Authors: Xuan-Ye Fu, Shao-Feng Ge, Zi-Yang Guo, Qi-Heng Wang

    Abstract: We explore the fermion oscillation in a degenerate environment. The direct consequence is introducing a Pauli blocking factor $1 - f_i$, where $f_i$ is the phase space distribution function, for each intermediate mass eigenstate during propagation. It is then much easier for a state with larger existing fraction or density to oscillate into other states with less degeneracy while the reversed proc… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 6 pages, 2 figures

  27. arXiv:2405.08448  [pdf, other

    cs.LG cs.AI

    Understanding the performance gap between online and offline alignment algorithms

    Authors: Yunhao Tang, Daniel Zhaohan Guo, Zeyu Zheng, Daniele Calandriello, Yuan Cao, Eugene Tarassov, Rémi Munos, Bernardo Ávila Pires, Michal Valko, Yong Cheng, Will Dabney

    Abstract: Reinforcement learning from human feedback (RLHF) is the canonical framework for large language model alignment. However, rising popularity in offline alignment algorithms challenge the need for on-policy sampling in RLHF. Within the context of reward over-optimization, we start with an opening set of experiments that demonstrate the clear advantage of online methods over offline methods. This pro… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  28. arXiv:2405.07638  [pdf, other

    cs.NI cs.AI cs.CR

    DoLLM: How Large Language Models Understanding Network Flow Data to Detect Carpet Bombing DDoS

    Authors: Qingyang Li, Yihang Zhang, Zhidong Jia, Yannan Hu, Lei Zhang, Jianrong Zhang, Yongming Xu, Yong Cui, Zongming Guo, Xinggong Zhang

    Abstract: It is an interesting question Can and How Large Language Models (LLMs) understand non-language network data, and help us detect unknown malicious flows. This paper takes Carpet Bombing as a case study and shows how to exploit LLMs' powerful capability in the networking area. Carpet Bombing is a new DDoS attack that has dramatically increased in recent years, significantly threatening network infra… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  29. arXiv:2405.07072  [pdf, other

    cs.SI

    Selecting focused digital cohorts from social media using the metric backbone of biomedical knowledge graphs

    Authors: Ziqi Guo, Jack Felag, Jordan C. Rozum, Rion Brattig Correia, Luis M. Rocha

    Abstract: The abundance of social media data allows researchers to construct large digital cohorts to study the interplay between human behavior and medical treatment. Identifying the users most relevant to a specific health problem is, however, a challenge in that social media sites vary in the generality of their discourse. While X (formerly Twitter), Instagram, and Facebook cater to wide ranging topics,… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  30. arXiv:2405.06763  [pdf, other

    stat.ME

    Post-selection inference for causal effects after causal discovery

    Authors: Ting-Hsuan Chang, Zijian Guo, Daniel Malinsky

    Abstract: Algorithms for constraint-based causal discovery select graphical causal models among a space of possible candidates (e.g., all directed acyclic graphs) by executing a sequence of conditional independence tests. These may be used to inform the estimation of causal effects (e.g., average treatment effects) when there is uncertainty about which covariates ought to be adjusted for, or which variables… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  31. arXiv:2405.06041  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Gate Tunable Asymmetric Ozone Adsorption on Graphene

    Authors: Zhen Qi, Wanlei Li, Jun Cheng, Zhongxin Guo, Chenglong Li, Shang Wang, Zuoquan Tan, Zhiting Gao, Yongchao Wang, Zichen Lian, Shanshan Chen, Yonglin He, Zhiyong Wang, Yapei Wang, **song Zhang, Yayu Wang, Peng Cai

    Abstract: Molecular adsorption is pivotal in device fabrication and material synthesis for quantum technology. However, elucidating the behavior of physisorption poses technical challenges. Here graphene with ultrahigh sensitivity was utilized to detect ozone adsorption at cryogenic temperatures. Significant hole do** observed in graphene indicates a strong interaction between ozone and graphene. Interest… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  32. arXiv:2405.05885  [pdf, other

    cs.RO

    Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes

    Authors: Ziang Guo, Artem Lykov, Zakhar Yagudin, Mikhail Konenkov, Dzmitry Tsetserukou

    Abstract: Recent research about Large Language Model based autonomous driving solutions shows a promising picture in planning and control fields. However, heavy computational resources and hallucinations of Large Language Models continue to hinder the tasks of predicting precise trajectories and instructing control signals. To address this problem, we propose Co-driver, a novel autonomous driving assistant… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: The paper is submitted to the IEEE conference

  33. arXiv:2405.05317  [pdf, other

    astro-ph.GA

    First detection of CO isotopologues in a high-redshift main-sequence galaxy: evidence of a top-heavy stellar initial mass function

    Authors: Ziyi Guo, Zhi-Yu Zhang, Zhiqiang Yan, Eda Gjergo, Allison Man, R. J. Ivison, Xiaoting Fu, Yong Shi

    Abstract: Recent observations and theories have presented a strong challenge to the universality of the stellar initial mass function (IMF) in extreme environments. A notable example has been found for starburst conditions, where evidence favours a top-heavy IMF, i.e. there is a bias toward massive stars compared to the IMF that is responsible for the stellar mass function and elemental abundances observed… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 15 pages, 8 figures, accepted by ApJ

  34. arXiv:2405.05229  [pdf, other

    cs.IR cs.DL

    myAURA: Personalized health library for epilepsy management via knowledge graph sparsification and visualization

    Authors: Rion Brattig Correia, Jordan C. Rozum, Leonard Cross, Jack Felag, Michael Gallant, Ziqi Guo, Bruce W. Herr II, Aehong Min, Deborah Stungis Rocha, Xuan Wang, Katy Börner, Wendy Miller, Luis M. Rocha

    Abstract: Objective: We report the development of the patient-centered myAURA application and suite of methods designed to aid epilepsy patients, caregivers, and researchers in making decisions about care and self-management. Materials and Methods: myAURA rests on the federation of an unprecedented collection of heterogeneous data resources relevant to epilepsy, such as biomedical databases, social media,… ▽ More

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  35. arXiv:2405.02322  [pdf

    stat.AP

    Towards Causal Interpretation of Sexual Orientation in Regression Analysis: Applications and Challenges

    Authors: Junjie Lu, Zhongyi Guo, David H. Rehkopf

    Abstract: This study presents an approach to analyze health disparities in Sexual and Gender Minority (SGM) populations, with a focus on the role of social support levels as an example to allow causal interpretations of regression models. We advocate for precisely defining the exposure variable and incorporating mediators into analyses, to address the limitations of comparing counterfactual outcomes solely… ▽ More

    Submitted 21 April, 2024; originally announced May 2024.

  36. arXiv:2405.01943  [pdf, other

    cs.CL cs.AI cs.LG

    Dependency-Aware Semi-Structured Sparsity: Declining Roles of Outliers in Pruning GLU-based LLMs

    Authors: Zhiyu Guo, Hidetaka Kamigaito, Taro Wanatnabe

    Abstract: The rapid growth in the scale of Large Language Models (LLMs) has led to significant computational and memory costs, making model compression techniques such as network pruning increasingly crucial for their efficient deployment. Recent LLMs such as LLaMA2 and Mistral have adopted GLU-based MLP architectures. However, current LLM pruning strategies are primarily based on insights from older LLM ar… ▽ More

    Submitted 20 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  37. arXiv:2405.00630  [pdf, other

    cs.CV

    Depth Priors in Removal Neural Radiance Fields

    Authors: Zhihao Guo, Peng Wang

    Abstract: Neural Radiance Fields (NeRF) have achieved impressive results in 3D reconstruction and novel view generation. A significant challenge within NeRF involves editing reconstructed 3D scenes, such as object removal, which demands consistency across multiple views and the synthesis of high-quality perspectives. Previous studies have integrated depth priors, typically sourced from LiDAR or sparse depth… ▽ More

    Submitted 3 July, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: 17 pages

    MSC Class: 68T40; 68T07; 68T45 ACM Class: I.4.5

  38. arXiv:2405.00236  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    STT: Stateful Tracking with Transformers for Autonomous Driving

    Authors: Longlong **g, Ruichi Yu, Xu Chen, Zhengli Zhao, Shiwei Sheng, Colin Graber, Qi Chen, Qinru Li, Shangxuan Wu, Han Deng, Sang** Lee, Chris Sweeney, Qiurui He, Wei-Chih Hung, Tong He, Xingyi Zhou, Farshid Moussavi, Zijian Guo, Yin Zhou, Mingxing Tan, Weilong Yang, Congcong Li

    Abstract: Tracking objects in three-dimensional space is critical for autonomous driving. To ensure safety while driving, the tracker must be able to reliably track objects across frames and accurately estimate their states such as velocity and acceleration in the present. Existing works frequently focus on the association task while either neglecting the model performance on state estimation or deploying c… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: ICRA 2024

  39. arXiv:2404.19484  [pdf, other

    cs.LG cs.AI cs.CL

    More Compute Is What You Need

    Authors: Zhen Guo

    Abstract: Large language model pre-training has become increasingly expensive, with most practitioners relying on scaling laws to allocate compute budgets for model size and training tokens, commonly referred to as Compute-Optimal or Chinchilla Optimal. In this paper, we hypothesize a new scaling law that suggests model performance depends mostly on the amount of compute spent for transformer-based models,… ▽ More

    Submitted 1 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  40. arXiv:2404.19245  [pdf, other

    cs.CL cs.AI

    HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

    Authors: Chunlin Tian, Zhan Shi, Zhijiang Guo, Li Li, Chengzhong Xu

    Abstract: Adapting Large Language Models (LLMs) to new tasks through fine-tuning has been made more efficient by the introduction of Parameter-Efficient Fine-Tuning (PEFT) techniques, such as LoRA. However, these methods often underperform compared to full fine-tuning, particularly in scenarios involving complex datasets. This issue becomes even more pronounced in complex domains, highlighting the need for… ▽ More

    Submitted 23 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures

  41. arXiv:2404.18146  [pdf, other

    cond-mat.mes-hall

    Tailoring coercive fields and the Curie temperature via proximity coupling in WSe$_2$/Fe$_3$GeTe$_2$ van der Waals heterostructures

    Authors: Guodong Ma, Renjun Du, Fuzhuo Lian, Song Bao, Zi**g Guo, Xiaofan Cai, **gkuan Xiao, Yaqing Han, Di Zhang, Siqi Jiang, Jiabei Huang, Xinglong Wu, Alexander S. Mayorov, **sheng Wen, Lei Wang, Geliang Yu

    Abstract: Hybrid structures consisting of two-dimensional (2D) magnets and semiconductors have exhibited extensive functionalities in spintronics and opto-spintronics. In this work, we have fabricated WSe$_2$/Fe$_3$GeTe$_2$ van der Waals (vdW) heterostructures and investigated the proximity effects on 2D magnetism. Through reflective magnetic circular dichroism (RMCD), we have observed a temperature-depende… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  42. arXiv:2404.18045  [pdf, other

    cond-mat.mes-hall

    Blood Works for Graphene Production

    Authors: Xiaofan Cai, Ming Li, Chao Chen, Renjun Du, Zi**g Guo, ** Wang, Guodong Ma, Xinglong Wu, Zhiyuan Wang, Yaqing Han, Fuzhuo Lian, **gkuan Xiao, Siqi Jiang, Lei Wang, Alexander S. Mayorov, Libo Gao, Kostya S. Novoselov, Geliang Yu

    Abstract: Blood, a ubiquitous and fundamental carbohydrate material composed of plasma, red blood cells, white blood cells, and platelets, has been playing an important role in biology, life science, history, and religious study, while graphene has garnered significant attention due to its exceptional properties and extensive range of potential applications. Achieving environmentally friendly, cost-effectiv… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  43. arXiv:2404.17667  [pdf, other

    eess.SP cs.LG

    SiamQuality: A ConvNet-Based Foundation Model for Imperfect Physiological Signals

    Authors: Cheng Ding, Zhicheng Guo, Zhaoliang Chen, Randall J Lee, Cynthia Rudin, Xiao Hu

    Abstract: Foundation models, especially those using transformers as backbones, have gained significant popularity, particularly in language and language-vision tasks. However, large foundation models are typically trained on high-quality data, which poses a significant challenge, given the prevalence of poor-quality real-world data. This challenge is more pronounced for develo** foundation models for phys… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  44. arXiv:2404.16812  [pdf, other

    cs.DC

    ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs

    Authors: Xinning Hui, Yuanchao Xu, Zhishan Guo, Xipeng Shen

    Abstract: Recent years have witnessed increasing interest in machine learning inferences on serverless computing for its auto-scaling and cost effective properties. Existing serverless computing, however, lacks effective job scheduling methods to handle the schedule space dramatically expanded by GPU sharing, task batching, and inter-task relations. Prior solutions have dodged the issue by neglecting some i… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in the 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC'24)

  45. arXiv:2404.16022  [pdf, other

    cs.CV

    PuLID: Pure and Lightning ID Customization via Contrastive Alignment

    Authors: Zinan Guo, Yanze Wu, Zhuowei Chen, Lang Chen, Qian He

    Abstract: We propose Pure and Lightning ID customization (PuLID), a novel tuning-free ID customization method for text-to-image generation. By incorporating a Lightning T2I branch with a standard diffusion one, PuLID introduces both contrastive alignment loss and accurate ID loss, minimizing disruption to the original model and ensuring high ID fidelity. Experiments show that PuLID achieves superior perform… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Tech Report. Codes and models will be available at https://github.com/ToTheBeginning/PuLID

  46. arXiv:2404.14918  [pdf, ps, other

    math.AP

    Existence of weak solutions for a class of non-divergent parabolic equations with variable exponent

    Authors: **gfeng Shao, Zhichang Guo, Zhongxiang Zhou

    Abstract: A doubly degenerate parabolic equation in non-divergent form with variable growth is investigated in this paper. In suitable spaces, we prove the existence of weak solutions of the equation for cases $1\leq m < 2$ and $m\geq 2$ in different ways. And we establish the non-expansion of support of the solution for the problem.

    Submitted 23 April, 2024; originally announced April 2024.

    MSC Class: 35D30; 35K59

  47. arXiv:2404.14719  [pdf, other

    cs.CR

    Source Code Vulnerability Detection: Combining Code Language Models and Code Property Graphs

    Authors: Ruitong Liu, Yanbin Wang, Haitao Xu, Bin Liu, Jianguo Sun, Zhenhao Guo, Wenrui Ma

    Abstract: Currently, deep learning successfully applies to code vulnerability detection by learning from code sequences or property graphs. However, sequence-based methods often overlook essential code attributes such as syntax, control flow, and data dependencies, whereas graph-based approaches might underestimate the semantics of code and face challenges in capturing long-distance contextual information.… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures

  48. arXiv:2404.13779  [pdf, other

    cs.CL cs.LG

    Automated Text Mining of Experimental Methodologies from Biomedical Literature

    Authors: Ziqing Guo

    Abstract: Biomedical literature is a rapidly expanding field of science and technology. Classification of biomedical texts is an essential part of biomedicine research, especially in the field of biology. This work proposes the fine-tuned DistilBERT, a methodology-specific, pre-trained generative classification language model for mining biomedicine texts. The model has proven its effectiveness in linguistic… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  49. arXiv:2404.13230  [pdf, other

    cs.IT math.CO

    Random Gabidulin Codes Achieve List Decoding Capacity in the Rank Metric

    Authors: Zeyu Guo, Chen Yuan, Zihan Zhang

    Abstract: Gabidulin codes, serving as the rank-metric counterpart of Reed-Solomon codes, constitute an important class of maximum rank distance (MRD) codes. However, unlike the fruitful positive results about the list decoding of Reed-Solomon codes, results concerning the list decodability of Gabidulin codes in the rank metric are all negative so far. For example, in contrast to Reed-Solomon codes, which ar… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  50. arXiv:2404.12364  [pdf, ps, other

    math.AP

    On the well-posedness of the KP-I equation

    Authors: Zihua Guo, Luc Molinet

    Abstract: We revisit the local well-posedness for the KP-I equation. We obtain unconditional local well-posedness in $H^{s,0}({\mathbb R}^2)$ for $s>3/4$ and unconditional global well-posedness in the energy space. We also prove the global existence of perturbations with finite energy of non decaying smooth global solutions.

    Submitted 18 April, 2024; originally announced April 2024.

    MSC Class: Primary: 35A02; 35E15; 35Q53; Secondary: 35B45; 35D30