Showing 51–100 of 1,070 results for author: Zhao, T

  1. arXiv:2404.04575  [pdf, other]

    cs.LG cs.AI math.OC

    To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

    Authors: Zi-Hao Qiu, Siqi Guo, Mao Xu, Tuo Zhao, Lijun Zhang, Tianbao Yang

    Abstract: The temperature parameter plays a profound role during training and/or inference with large foundation models (LFMs) such as large language models (LLMs) and CLIP models. Particularly, it adjusts the logits in the softmax function in LLMs, which is crucial for next token generation, and it scales the similarities in the contrastive loss for training CLIP models. A significant question remains: Is…

    Submitted 16 June, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: 41 pages, 10 figures, accepted by ICML 2024

  2. arXiv:2404.02511  [pdf, other]

    math.OC cs.LG

    Stochastic Constrained Decentralized Optimization for Machine Learning with Fewer Data Oracles: a Gradient Sliding Approach

    Authors: Hoang Huy Nguyen, Yan Li, Tuo Zhao

    Abstract: In modern decentralized applications, ensuring communication efficiency and privacy for the users are the key challenges. In order to train machine-learning models, the algorithm has to communicate to the data center and sample data for its gradient computation, thus exposing the data and increasing the communication cost. This gives rise to the need for a decentralized optimization algorithm that…

    Submitted 3 April, 2024; originally announced April 2024.

  3. arXiv:2404.01925  [pdf, other]

    cs.CV cs.AI

    Improving Bird's Eye View Semantic Segmentation by Task Decomposition

    Authors: Tianhao Zhao, Yongcan Chen, Yu Wu, Tianyang Liu, Bo Du, Peilun Xiao, Shi Qiu, Hongda Yang, Guozhen Li, Yi Yang, Yutian Lin

    Abstract: Semantic segmentation in bird's eye view (BEV) plays a crucial role in autonomous driving. Previous methods usually follow an end-to-end pipeline, directly predicting the BEV segmentation map from monocular RGB inputs. However, a challenge arises when the RGB inputs and BEV targets are from distinct perspectives, making the direct point-to-point prediction hard to optimize. In this paper, we decompo…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  4. arXiv:2404.01657  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG eess.AS

    Release of Pre-Trained Models for the Japanese Language

    Authors: Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki, Koh Mitsuda

    Abstract: AI democratization aims to create a world in which the average person can utilize AI techniques. To achieve this goal, numerous research institutes have attempted to make their results accessible to the public. In particular, large pre-trained models trained on large-scale data have shown unprecedented potential, and their release has had a significant impact. However, most of the released models…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 9 pages, 1 figure, 5 tables, accepted for LREC-COLING 2024. Models are publicly available at https://huggingface.co/rinna

  5. arXiv:2403.20248  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Gate-tunable quantum acoustoelectric transport in graphene

    Authors: Yicheng Mou, Haonan Chen, Jiaqi Liu, Qing Lan, Jiayu Wang, Chuanxin Zhang, Yuxiang Wang, Jiaming Gu, Tuoyu Zhao, Xue Jiang, Wu Shi, Cheng Zhang

    Abstract: Transport probes the motion of quasiparticles in response to external excitations. Apart from the well-known electric and thermoelectric transport, acoustoelectric transport induced by traveling acoustic waves has been rarely explored. Here, by adopting hybrid nanodevices integrated with piezoelectric substrates, we establish a simple design of acoustoelectric transport with gate tunability. We…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 16 pages, 5 figures

  6. arXiv:2403.18295  [pdf, other]

    cs.CL

    Dual Instruction Tuning with Large Language Models for Mathematical Reasoning

    Authors: Yongwei Zhou, Tiejun Zhao

    Abstract: Recent advancements highlight the success of instruction tuning with large language models (LLMs) utilizing Chain-of-Thought (CoT) data for mathematical reasoning tasks. Despite the fine-tuned LLMs, challenges persist, such as incorrect, missing, and redundant steps in CoT generation leading to inaccuracies in answer predictions. To alleviate this problem, we propose a dual instruction tuning stra…

    Submitted 27 March, 2024; originally announced March 2024.

  7. arXiv:2403.18280  [pdf, other]

    cs.IR

    Improving Out-of-Vocabulary Handling in Recommendation Systems

    Authors: William Shiao, Mingxuan Ju, Zhichun Guo, Xin Chen, Evangelos Papalexakis, Tong Zhao, Neil Shah, Yozen Liu

    Abstract: Recommendation systems (RS) are an increasingly relevant area for both academic and industry researchers, given their widespread impact on the daily online experiences of billions of users. One common issue in real RS is the cold-start problem, where users and items may not contain enough information to produce high-quality recommendations. This work focuses on a complementary problem: recommendin…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 11 pages, 6 figures

  8. arXiv:2403.16379  [pdf, other]

    cs.CV

    FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models

    Authors: Lin Zhao, Tianchen Zhao, Zinan Lin, Xuefei Ning, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: In recent years, there has been significant progress in the development of text-to-image generative models. Evaluating the quality of the generative models is one essential step in the development process. Unfortunately, the evaluation process could consume a significant amount of computational resources, making the required periodic evaluation of model performance (e.g., monitoring training progr…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: The paper is accepted by CVPR 2024

  9. arXiv:2403.15761  [pdf, other]

    quant-ph

    Phase estimation via coherent and photon-catalyzed squeezed vacuum states

    Authors: Zekun Zhao, Qingqian Kang, Huan Zhang, Teng Zhao, Cunjin Liu, Liyun Hu

    Abstract: Research on enhancing measurement accuracy through the use of non-Gaussian states has garnered increasing attention. In this study, we propose a scheme to input the coherent state mixed with photon-catalyzed squeezed vacuum state into the Mach-Zehnder interferometer to enhance phase measurement accuracy. The findings demonstrate that photon catalysis, particularly multi-photon catal…

    Submitted 23 March, 2024; originally announced March 2024.

  10. arXiv:2403.12945  [pdf, other]

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  11. arXiv:2403.12910  [pdf, other]

    cs.RO cs.AI cs.LG

    Yell At Your Robot: Improving On-the-Fly from Language Corrections

    Authors: Lucy Xiaoyang Shi, Zheyuan Hu, Tony Z. Zhao, Archit Sharma, Karl Pertsch, Jianlan Luo, Sergey Levine, Chelsea Finn

    Abstract: Hierarchical policies that combine language and low-level control have been shown to perform impressively long-horizon robotic tasks, by leveraging either zero-shot high-level planners like pretrained language and vision-language models (LLMs/VLMs) or models trained on annotated robotic demonstrations. However, for complex and dexterous skills, attaining high success rates on long-horizon tasks st…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://yay-robot.github.io/

  12. arXiv:2403.11789  [pdf, other]

    cs.CV

    EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

    Authors: Wenhua Wu, Qi Wang, Guangming Wang, Junping Wang, Tiankun Zhao, Yang Liu, Dongchao Gao, Zhe Liu, Hesheng Wang

    Abstract: Road surface reconstruction plays a vital role in autonomous driving systems, enabling road lane perception and high-precision mapping. Recently, neural implicit encoding has achieved remarkable results in scene representation, particularly in the realistic rendering of scene textures. However, it faces challenges in directly representing geometric information for large-scale scenes. To address th…

    Submitted 18 March, 2024; originally announced March 2024.

  13. arXiv:2403.11423  [pdf, other]

    cs.CV

    VmambaIR: Visual State Space Model for Image Restoration

    Authors: Yuan Shi, Bin Xia, Xiaoyu Jin, Xing Wang, Tianyu Zhao, Xin Xia, Xuefeng Xiao, Wenming Yang

    Abstract: Image restoration is a critical task in low-level computer vision, aiming to restore high-quality images from degraded inputs. Various models, such as convolutional neural networks (CNNs), generative adversarial networks (GANs), transformers, and diffusion models (DMs), have been employed to address this problem with significant impact. However, CNNs have limitations in capturing long-range depend…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 23 pages

  14. arXiv:2403.11084  [pdf]

    physics.optics physics.app-ph

    High Performance Graphene Integrated Photonics Platform Enabled by Gold-assisted Transfer

    Authors: Xiaoxuan Wu, Zhengyi Cao, Tianxiang Zhao, Yun Wu, Zhonghui Li, Spyros Doukas, Elefterios Lidorikis, Yu Xue, Liu Liu, Omid Ghaebi, Giancarlo Soavi, Junpeng Lv, Zhenghua Ni, Junjia Wang

    Abstract: Graphene is promising for nanoscale, efficient, ultra-fast photo- and opto-electronic devices because of its remarkable electrical and optical properties, such as fast electron relaxation and heat dissipation. Here, we realize a high-performance graphene integrated photonics platform enabled by gold-assisted transfer. Thanks to our optimized transfer technique, we fabricate and demonstrate (1) a mic…

    Submitted 17 March, 2024; originally announced March 2024.

  15. arXiv:2403.10841  [pdf, other]

    eess.SY

    Extended Kalman Filtering for Recursive Online Discrete-Time Inverse Optimal Control

    Authors: Tian Zhao, Timothy L. Molloy

    Abstract: We formulate the discrete-time inverse optimal control problem of inferring unknown parameters in the objective function of an optimal control problem from measurements of optimal states and controls as a nonlinear filtering problem. This formulation enables us to propose a novel extended Kalman filter (EKF) for solving inverse optimal control problems in a computationally efficient recursive onli…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 7 pages, 2 figures, accepted for presentation at 2024 American Control Conference

  16. arXiv:2403.06892  [pdf, other]

    cs.CV cs.CL

    Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head

    Authors: Tiancheng Zhao, Peng Liu, Xuan He, Lu Zhang, Kyusong Lee

    Abstract: End-to-end transformer-based detectors (DETRs) have shown exceptional performance in both closed-set and open-vocabulary object detection (OVD) tasks through the integration of language modalities. However, their demanding computational requirements have hindered their practical application in real-time object detection (OD) scenarios. In this paper, we scrutinize the limitations of two leading mo…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Preprint

  17. arXiv:2403.06554  [pdf, ps, other]

    math.AP

    Unconditional deep-water limit of the intermediate long wave equation in low-regularity

    Authors: Justin Forlano, Guopeng Li, Tengfei Zhao

    Abstract: In this paper, we establish the unconditional deep-water limit of the intermediate long wave equation (ILW) to the Benjamin-Ono equation (BO) in low-regularity Sobolev spaces on both the real line and the circle. Our main tool is new unconditional uniqueness results for ILW in $H^s$ when $s_0<s\leq \frac 14$ on the line and $s_0<s< \frac 12$ on the circle, where…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 27 pages

    MSC Class: 35Q53; 35A02; 76B55

  18. arXiv:2403.05527  [pdf, other]

    cs.LG cs.AI cs.CL

    GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

    Authors: Hao Kang, Qingru Zhang, Souvik Kundu, Geonhwa Jeong, Zaoxing Liu, Tushar Krishna, Tuo Zhao

    Abstract: Key-value (KV) caching has become the de facto technique for accelerating generation speed in large language model (LLM) inference. However, the growing cache demand with increasing sequence length has transformed LLM inference into a memory-bound problem, significantly constraining the system throughput. Existing methods rely on dropping unimportant tokens or quantizing all entries uniformly. Such methods…

    Submitted 11 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  19. arXiv:2403.04222  [pdf, other]

    cs.CL

    Self-Evaluation of Large Language Model based on Glass-box Features

    Authors: Hui Huang, Yingqi Qu, Jing Liu, Muyun Yang, Tiejun Zhao

    Abstract: The proliferation of open-source Large Language Models (LLMs) underscores the pressing need for evaluation methods. Existing works primarily rely on external evaluators, focusing on training and prompting strategies. However, a crucial aspect - model-aware glass-box features - is overlooked. In this study, we explore the utility of glass-box features under the scenario of self-evaluation, namely a…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: work in progress

  20. arXiv:2403.02839  [pdf, other]

    cs.CL

    On the Limitations of Fine-tuned Judge Models for LLM Evaluation

    Authors: Hui Huang, Yingqi Qu, Hongli Zhou, Jing Liu, Muyun Yang, Bing Xu, Tiejun Zhao

    Abstract: Recently, there has been a growing trend of utilizing Large Language Models (LLMs) to evaluate the quality of other LLMs. Many studies have employed proprietary closed-source models, especially GPT-4, as the evaluator. Alternatively, other works have fine-tuned judge models based on open-source LLMs as the evaluator. While the fine-tuned judge models are claimed to achieve comparable evaluation capab…

    Submitted 17 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  21. arXiv:2403.01083  [pdf, other]

    cs.CV

    Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images

    Authors: Shufan Pei, Junhong Lin, Wenxi Liu, Tiesong Zhao, Chia-Wen Lin

    Abstract: In addition to low light, night images suffer degradation from light effects (e.g., glare and floodlight). However, existing nighttime visibility enhancement methods generally focus on low-light regions, neglecting or even amplifying the light effects. To address this issue, we propose an Adaptive Multi-scale Fusion network (AMFusion) with infrared and visible images, which designs fusion ru…

    Submitted 1 March, 2024; originally announced March 2024.

  22. arXiv:2402.18865  [pdf, other]

    cs.LG cs.AI cs.CL

    Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning

    Authors: Weijieying Ren, Xinlong Li, Lei Wang, Tianxiang Zhao, Wei Qin

    Abstract: Existing research has shown that large language models (LLMs) exhibit remarkable performance in language understanding and generation. However, when LLMs are continuously fine-tuned on complex and diverse domain-specific downstream tasks, the inference performance on historical tasks decreases dramatically, which is known as a catastrophic forgetting problem. A trade-off needs to be kept between l…

    Submitted 29 February, 2024; originally announced February 2024.

  23. arXiv:2402.17680  [pdf, other]

    cs.CV

    MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning

    Authors: Huiyu Xiong, Lanxiao Wang, Heqian Qiu, Taijin Zhao, Benliu Qiu, Hongliang Li

    Abstract: To address the problem of catastrophic forgetting due to the invisibility of old categories in sequential input, existing work based on relatively simple categorization tasks has made some progress. In contrast, video captioning is a more complex task in the multimodal scenario, which has not been explored in the field of incremental learning. After identifying this stability-plasticity problem when a…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages

  24. arXiv:2402.11129  [pdf, other]

    cs.CL

    BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

    Authors: Haoyu Wang, Tuo Zhao, Jing Gao

    Abstract: Retrieval-augmented Large Language Models (LLMs) offer substantial benefits in enhancing performance across knowledge-intensive scenarios. However, these methods often face challenges with complex inputs and encounter difficulties due to noisy knowledge retrieval, notably hindering model effectiveness. To address this issue, we introduce BlendFilter, a novel approach that elevates retrieval-augmen…

    Submitted 16 February, 2024; originally announced February 2024.

  25. arXiv:2402.09711  [pdf, other]

    cs.LG cs.SI

    Node Duplication Improves Cold-start Link Prediction

    Authors: Zhichun Guo, Tong Zhao, Yozen Liu, Kaiwen Dong, William Shiao, Neil Shah, Nitesh V. Chawla

    Abstract: Graph Neural Networks (GNNs) are prominent in graph machine learning and have shown state-of-the-art performance in Link Prediction (LP) tasks. Nonetheless, recent studies show that GNNs struggle to produce good results on low-degree nodes despite their overall strong performance. In practical applications of LP, like recommendation systems, improving performance on low-degree nodes is critical, a…

    Submitted 15 February, 2024; originally announced February 2024.

  26. arXiv:2402.08931  [pdf, other]

    cs.CV

    Depth-aware Volume Attention for Texture-less Stereo Matching

    Authors: Tong Zhao, Mingyu Ding, Wei Zhan, Masayoshi Tomizuka, Yintao Wei

    Abstract: Stereo matching plays a crucial role in 3D perception and scenario understanding. Despite the proliferation of promising methods, addressing texture-less and texture-repetitive conditions remains challenging due to the insufficient availability of rich geometric and semantic information. In this paper, we propose a lightweight volume refinement scheme to tackle the texture deterioration in practic…

    Submitted 26 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 figures

  27. Disambiguated Node Classification with Graph Neural Networks

    Authors: Tianxiang Zhao, Xiang Zhang, Suhang Wang

    Abstract: Graph Neural Networks (GNNs) have demonstrated significant success in learning from graph-structured data across various domains. Despite their great success, one critical challenge is often overlooked by existing works, i.e., the learning of message propagation that can generalize effectively to underrepresented graph regions. These minority regions often exhibit irregular homophily/heterophil…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted by WebConf (WWW) 2024

  28. arXiv:2402.08170  [pdf, other]

    cs.LG cs.AI

    LLaGA: Large Language and Graph Assistant

    Authors: Runjin Chen, Tong Zhao, Ajay Jaiswal, Neil Shah, Zhangyang Wang

    Abstract: Graph Neural Networks (GNNs) have empowered the advance in graph-structured data analysis. Recently, the rise of Large Language Models (LLMs) like GPT-4 has heralded a new era in deep learning. However, their application to graph data poses distinct challenges due to the inherent difficulty of translating graph structures to language. To this end, we introduce the Large Language and Graph Assistan…

    Submitted 11 April, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  29. arXiv:2402.07721  [pdf, other]

    cs.LG cs.CL

    LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation

    Authors: Hongyun Zhou, Xiangyu Lu, Wang Xu, Conghui Zhu, Tiejun Zhao, Muyun Yang

    Abstract: Low-Rank Adaptation (LoRA) is currently the most commonly used parameter-efficient fine-tuning (PEFT) method; it introduces auxiliary parameters for each layer to fine-tune the pre-trained model under limited computing resources. However, it still faces resource consumption challenges during training when scaling up to larger models. Most previous studies have tackled this issue by using pruning t…

    Submitted 18 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 15 pages, 12 figures

  30. arXiv:2402.07268  [pdf, other]

    q-bio.GN cs.AI cs.LG

    Highly Accurate Disease Diagnosis and Highly Reproducible Biomarker Identification with PathFormer

    Authors: Zehao Dong, Qihang Zhao, Philip R. O. Payne, Michael A Province, Carlos Cruchaga, Muhan Zhang, Tianyu Zhao, Yixin Chen, Fuhai Li

    Abstract: Biomarker identification is critical for precise disease diagnosis and understanding disease pathogenesis in omics data analysis, like using fold change and regression analysis. Graph neural networks (GNNs) have been the dominant deep learning model for analyzing graph-structured data. However, we found two major limitations of existing GNNs in omics data analysis, i.e., limited-prediction (diagno…

    Submitted 11 February, 2024; originally announced February 2024.

  31. arXiv:2402.07167  [pdf, other]

    cs.AI

    Large-Language-Model Empowered Dose Volume Histogram Prediction for Intensity Modulated Radiotherapy

    Authors: Zehao Dong, Yixin Chen, Hiram Gay, Yao Hao, Geoffrey D. Hugo, Pamela Samson, Tianyu Zhao

    Abstract: Treatment planning is currently a patient-specific, time-consuming, and resource-demanding task in radiotherapy. Dose-volume histogram (DVH) prediction plays a critical role in automating this process. The geometric relationship between DVHs in radiotherapy plans and organs-at-risk (OAR) and planning target volume (PTV) has been well established. This study explores the potential of deep learning…

    Submitted 11 February, 2024; originally announced February 2024.

  32. arXiv:2402.05355  [pdf, other]

    cs.CY cs.AI

    A Survey on Safe Multi-Modal Learning System

    Authors: Tianyi Zhao, Liangliang Zhang, Yao Ma, Lu Cheng

    Abstract: In the rapidly evolving landscape of artificial intelligence, multimodal learning systems (MMLS) have gained traction for their ability to process and integrate information from diverse modality inputs. Their expanding use in vital sectors such as healthcare has made safety assurance a critical concern. However, the absence of systematic research into their safety is a significant barrier to progr…

    Submitted 1 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  33. arXiv:2402.02825  [pdf, other]

    gr-qc astro-ph.IM

    GWAI: Harnessing Artificial Intelligence for Enhancing Gravitational Wave Data Analysis

    Authors: Tianyu Zhao, Yue Zhou, Ruijun Shi, Zhoujian Cao, Zhixiang Ren

    Abstract: Gravitational wave (GW) astronomy has opened new frontiers in understanding the cosmos, while the integration of artificial intelligence (AI) in science promises to revolutionize data analysis methodologies. However, a significant gap exists, as there is currently no dedicated platform that enables scientists to develop, test, and evaluate AI algorithms efficiently. To address this gap, we introdu…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

  34. arXiv:2402.02775  [pdf]

    physics.optics eess.IV physics.bio-ph

    Instant square lattice structured illumination microscopy: an optimal strategy towards photon-saving and real-time super-resolution observation

    Authors: Tianyu Zhao, Zhaojun Wang, Manming Shu, Jingxiang Zhang, Yansheng Liang, Shaowei Wang, Ming Lei

    Abstract: Over the past decade, structured illumination microscopy (SIM) has found its niche in super-resolution (SR) microscopy due to its fast imaging speed and low excitation intensity. However, due to the significantly higher light dose compared to wide-field microscopy and the time-consuming post-processing procedures, long-term, real-time, super-resolution observation of living cells is still out of r…

    Submitted 5 February, 2024; originally announced February 2024.

  35. arXiv:2402.02216  [pdf, other]

    cs.LG

    Position: Graph Foundation Models are Already Here

    Authors: Haitao Mao, Zhikai Chen, Wenzhuo Tang, Jianan Zhao, Yao Ma, Tong Zhao, Neil Shah, Mikhail Galkin, Jiliang Tang

    Abstract: Graph Foundation Models (GFMs) are emerging as a significant research topic in the graph domain, aiming to develop graph models trained on extensive and diverse data to enhance their applicability across various tasks and domains. Developing GFMs presents unique challenges over traditional Graph Neural Networks (GNNs), which are typically trained from scratch for specific tasks on particular datas…

    Submitted 30 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: 23 pages, 2 figures

  36. arXiv:2402.02054  [pdf, other]

    cs.LG cs.AI

    Neural Scaling Laws on Graphs

    Authors: Jingzhe Liu, Haitao Mao, Zhikai Chen, Tong Zhao, Neil Shah, Jiliang Tang

    Abstract: Deep graph models (e.g., graph neural networks and graph transformers) have become important techniques for leveraging knowledge across various types of graphs. Yet, the scaling properties of deep graph models have not been systematically investigated, casting doubt on the feasibility of achieving large graph models through enlarging the model and dataset sizes. In this work, we delve into neural…

    Submitted 9 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  37. DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature Learning

    Authors: Chao Liu, Ting Zhao, Nenggan Zheng

    Abstract: Curvilinear structures, which include line-like continuous objects, are fundamental geometrical elements in image-based applications. Reconstructing these structures from images constitutes a pivotal research area in computer vision. However, the complex topology and ambiguous image evidence render this process a challenging task. In this paper, we introduce DeepBranchTracer, a novel method that l…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 figures, AAAI 2024 accepted

  38. arXiv:2402.01076  [pdf, other]

    cs.LG

    DoseGNN: Improving the Performance of Deep Learning Models in Adaptive Dose-Volume Histogram Prediction through Graph Neural Networks

    Authors: Zehao Dong, Yixin Chen, Tianyu Zhao

    Abstract: Dose-Volume Histogram (DVH) prediction is fundamental in radiation therapy, facilitating treatment planning, dose evaluation, plan comparison, etc. It helps to increase the ability to deliver precise and effective radiation treatments while managing potential toxicities to healthy tissues as needed to reduce the risk of complications. This paper extends recently disclosed research findings pr…

    Submitted 1 February, 2024; originally announced February 2024.

  39. arXiv:2401.16375  [pdf, other]

    cs.CV

    Spot the Error: Non-autoregressive Graphic Layout Generation with Wireframe Locator

    Authors: Jieru Lin, Danqing Huang, Tiejun Zhao, Dechen Zhan, Chin-Yew Lin

    Abstract: Layout generation is a critical step in graphic design to achieve meaningful compositions of elements. Most previous works view it as a sequence generation problem by concatenating element attribute tokens (i.e., category, size, position). So far the autoregressive approach (AR) has achieved promising results, but is still limited in global context modeling and suffers from error propagation since…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: accepted by AAAI24

  40. arXiv:2401.13942  [pdf, other]

    cs.CV

    StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models

    Authors: Mohan Zhou, Yalong Bai, Qing Yang, Tiejun Zhao

    Abstract: The ability to fine-tune generative models for text-to-image generation tasks is crucial, particularly facing the complexity involved in accurately interpreting and visualizing textual inputs. While LoRA is efficient for language model adaptation, it often falls short in text-to-image tasks due to the intricate demands of image generation, such as accommodating a broad spectrum of styles and nuanc…

    Submitted 10 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 11 pages, 11 figures

  41. arXiv:2401.10731  [pdf, other]

    cs.CV

    Removal and Selection: Improving RGB-Infrared Object Detection via Coarse-to-Fine Fusion

    Authors: Tianyi Zhao, Maoxun Yuan, Feng Jiang, Nan Wang, Xingxing Wei

    Abstract: Object detection in visible (RGB) and infrared (IR) images has been widely applied in recent years. Leveraging the complementary characteristics of RGB and IR images, the object detector provides reliable and robust object localization from day to night. Most existing fusion strategies directly input RGB and IR images into deep neural networks, leading to inferior detection performance. However, t…

    Submitted 7 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 11 pages, 11 figures

  42. Distribution Consistency based Self-Training for Graph Neural Networks with Sparse Labels

    Authors: Fali Wang, Tianxiang Zhao, Suhang Wang

    Abstract: Few-shot node classification poses a significant challenge for Graph Neural Networks (GNNs) due to insufficient supervision and potential distribution shifts between labeled and unlabeled nodes. Self-training has emerged as a widely popular framework to leverage the abundance of unlabeled data, which expands the training set by assigning pseudo-labels to selected unlabeled nodes. Efforts have been…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted by WSDM 2024

    ACM Class: F.2.2; I.2.7

  43. arXiv:2401.09660  [pdf]

    stat.AP

    Data-Driven Assessment of the County-Level Breast Cancer Incidence in the United States: Impacts of Modifiable and Non-Modifiable Factors

    Authors: Tingting Zhao, Qing Han, Jinfeng Zhang

    Abstract: Female breast cancer (FBC) incidence rate (IR) varies greatly by counties across the United States (US). Factors responsible for such high spatial disparities are not well understood, making it challenging to design effective intervention strategies. We predicted FBC IRs using prevailing machine learning techniques for 1,754 US counties with a female population over 10,000. Outlier counties with t…

    Submitted 17 January, 2024; originally announced January 2024.

  44. arXiv:2401.08347  [pdf, other]

    nucl-th astro-ph.GA gr-qc

    Exploring Radial Oscillations in Slow Stable and Hybrid Neutron Stars

    Authors: Sayantan Ghosh, Sailesh Ranjan Mohanty, Tianqi Zhao, Bharat Kumar

    Abstract: In the era of gravitational wave astronomy, radial oscillations hold significant potential for not only uncovering the microphysics behind the internal structure but also investigating the stability of neutron stars (NSs). We start by constructing families of static NSs following nucleonic, quarkyonic, and hybrid equations of state and then subject them to radial perturbations in order to explore…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Comments and suggestions are welcome

  45. Quantum Random Number Generation Based on Phase Reconstruction

    Authors: Jialiang Li, Zitao Huang, Chunlin Yu, Jiajie Wu, Tongge Zhao, Xiangwei Zhu, Shihai Sun

    Abstract: Quantum random number generator (QRNG) utilizes the intrinsic randomness of quantum systems to generate completely unpredictable and genuine random numbers, finding wide applications across many fields. QRNGs relying on the phase noise of a laser have attracted considerable attention due to their straightforward system architecture and high random number generation rates. However, traditional phas…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 11 pages. Submitted to Optics Express, and any comment is welcome

    Journal ref: Optics Express, Vol. 32, No. 4, 2024

  46. arXiv:2401.07654  [pdf, other]

    cs.CV

    Foundation Models for Biomedical Image Segmentation: A Survey

    Authors: Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

    Abstract: Recent advancements in biomedical image analysis have been significantly driven by the Segment Anything Model (SAM). This transformative technology, originally developed for general-purpose computer vision, has found rapid application in medical image processing. Within the last year, marked by over 100 publications, SAM has demonstrated its prowess in zero-shot learning adaptations for medical im…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 22 pages, 4 figures, 7 tables

  47. arXiv:2401.06789  [pdf]

    cs.IR cs.AI cs.CL cs.LG

    Information Retrieval and Classification of Real-Time Multi-Source Hurricane Evacuation Notices

    Authors: Tingting Zhao, Shubo Tian, Jordan Daly, Melissa Geiger, Minna Jia, Jinfeng Zhang

    Abstract: For an approaching disaster, the tracking of time-sensitive critical information such as hurricane evacuation notices is challenging in the United States. These notices are issued and distributed rapidly by numerous local authorities that may spread across multiple states. They often undergo frequent updates and are distributed through diverse online portals lacking standard formats. In this study…

    Submitted 7 January, 2024; originally announced January 2024.

  48. arXiv:2401.06457  [pdf]

    econ.GN

    Analysis of the Impact of Central bank Digital Currency on the Demand for Transactional Currency

    Authors: Ruimin Song, Tiantian Zhao, Chunhui Zhou

    Abstract: This paper takes the development of Central bank digital currencies as a perspective, introduces it into the Baumol-Tobin money demand theoretical framework, establishes the transactional money demand model under Central bank Digital Currency, and qualitatively analyzes the influence mechanism of Central bank digital currencies on transactional money demand; meanwhile, quarterly data from 2010-202…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Central bank digital currencies; transactional money demand; ARDL model. arXiv admin note: text overlap with arXiv:2310.07326

  49. arXiv:2401.05871  [pdf, other]

    cs.CL

    Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph Networks

    Authors: Yahui Fu, Haiyue Song, Tianyu Zhao, Tatsuya Kawahara

    Abstract: Personality recognition is useful for enhancing robots' ability to tailor user-adaptive responses, thus fostering rich human-robot interactions. One of the challenges in this task is a limited number of speakers in existing dialogue corpora, which hampers the development of robust, speaker-independent personality recognition models. Additionally, accurately modeling both the interdependencies amon…

    Submitted 8 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted for presentation at International Workshop on Spoken Dialogue Systems Technology 2024 (IWSDS 2024)

  50. arXiv:2401.02236  [pdf, other]

    cs.LG

    U-Mixer: An Unet-Mixer Architecture with Stationarity Correction for Time Series Forecasting

    Authors: Xiang Ma, Xuemei Li, Lexin Fang, Tianlong Zhao, Caiming Zhang

    Abstract: Time series forecasting is a crucial task in various domains. Caused by factors such as trends, seasonality, or irregular fluctuations, time series often exhibits non-stationary. It obstructs stable feature propagation through deep layers, disrupts feature distributions, and complicates learning data distribution changes. As a result, many existing models struggle to capture the underlying pattern…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024