Skip to main content

Showing 1–50 of 239 results for author: Liang, H

Searching in archive cs. Search in all archives.
.
  1. SmartAxe: Detecting Cross-Chain Vulnerabilities in Bridge Smart Contracts via Fine-Grained Static Analysis

    Authors: Zeqin Liao, Yuhong Nan, Henglong Liang, Sicheng Hao, Juan Zhai, Jia**g Wu, Zibin Zheng

    Abstract: With the increasing popularity of blockchain, different blockchain platforms coexist in the ecosystem (e.g., Ethereum, BNB, EOSIO, etc.), which prompts the high demand for cross-chain communication. Cross-chain bridge is a specific type of decentralized application for asset exchange across different blockchain platforms. Securing the smart contracts of cross-chain bridges is in urgent need, as th… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Journal ref: The ACM International Conference on the Foundations of Software Engineering 2024

  2. arXiv:2406.14724  [pdf, other

    cs.SE

    An Exploratory Mixed-Methods Study on General Data Protection Regulation (GDPR) Compliance in Open-Source Software

    Authors: Lucas Franke, Huayu Liang, Sahar Farzanehpour, Aaron Brantly, James C. Davis, Chris Brown

    Abstract: Background: Governments worldwide are considering data privacy regulations. These laws, e.g. the European Union's General Data Protection Regulation (GDPR), require software developers to meet privacy-related requirements when interacting with users' data. Prior research describes the impact of such laws on software development, but only for commercial software. Open-source software is commonly in… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: In the proceedings of the 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM'24)

  3. arXiv:2406.10324  [pdf, other

    cs.CV cs.LG

    L4GM: Large 4D Gaussian Reconstruction Model

    Authors: Jiawei Ren, Kevin Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng, Karsten Kreis, Ziwei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling

    Abstract: We present L4GM, the first 4D Large Reconstruction Model that produces animated objects from a single-view video input -- in a single feed-forward pass that takes only a second. Key to our success is a novel dataset of multiview videos containing curated, rendered animated objects from Objaverse. This dataset depicts 44K diverse objects with 110K animations rendered in 48 viewpoints, resulting in… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Project page: https://research.nvidia.com/labs/toronto-ai/l4gm

  4. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, **ming Guo, Xiaolin Chen, **gcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.08782  [pdf, other

    eess.IV cs.CV

    Hybrid Spatial-spectral Neural Network for Hyperspectral Image Denoising

    Authors: Hao Liang, Chengjie, Kun Li, Xin Tian

    Abstract: Hyperspectral image (HSI) denoising is an essential procedure for HSI applications. Unfortunately, the existing Transformer-based methods mainly focus on non-local modeling, neglecting the importance of locality in image denoising. Moreover, deep learning methods employ complex spectral learning mechanisms, thus introducing large computation costs. To address these problems, we propose a hybrid… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.06199  [pdf

    cs.CY cs.AI

    Implications for Governance in Public Perceptions of Societal-scale AI Risks

    Authors: Ross Gruetzemacher, Toby D. Pilditch, Huigang Liang, Christy Manning, Vael Gates, David Moss, James W. B. Elsey, Willem W. A. Sleegers, Kyle Kilian

    Abstract: Amid growing concerns over AI's societal risks--ranging from civilizational collapse to misinformation and systemic bias--this study explores the perceptions of AI experts and the general US registered voters on the likelihood and impact of 18 specific AI risks, alongside their policy preferences for managing these risks. While both groups favor international oversight over national or corporate g… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages, 18 page supplementary materials

  7. arXiv:2406.04685  [pdf, other

    eess.SY cs.NI

    Statistical QoS Provisioning Architecture for 6G Satellite-Terrestrial Integrated Networks

    Authors: **gqing Wang, Wenchi Cheng, Wei Zhang, Hui Liang

    Abstract: The emergence of massive ultra-reliable and low latency communications (mURLLC) as a category of time/reliability-sensitive service over 6G networks has received considerable research attention, which has presented unprecedented challenges. As one of the key enablers for 6G, satellite-terrestrial integrated networks (STIN) have been developed to offer more expansive connectivity and comprehensive… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  8. arXiv:2406.03865  [pdf, other

    cs.CV cs.AI

    Semantic Similarity Score for Measuring Visual Similarity at Semantic Level

    Authors: Senran Fan, Zhicheng Bao, Chen Dong, Haotai Liang, Xiaodong Xu, ** Zhang

    Abstract: Semantic communication, as a revolutionary communication architecture, is considered a promising novel communication paradigm. Unlike traditional symbol-based error-free communication systems, semantic-based visual communication systems extract, compress, transmit, and reconstruct images at the semantic level. However, widely used image similarity evaluation metrics, whether pixel-based MSE or PSN… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  9. arXiv:2405.17531  [pdf, other

    cs.CV

    Evolutive Rendering Models

    Authors: Fangneng Zhan, Hanxue Liang, Yifan Wang, Michael Niemeyer, Michael Oechsle, Adam Kortylewski, Cengiz Oztireli, Gordon Wetzstein, Christian Theobalt

    Abstract: The landscape of computer graphics has undergone significant transformations with the recent advances of differentiable rendering models. These rendering models often rely on heuristic designs that may not fully align with the final rendering objectives. We address this gap by pioneering \textit{evolutive rendering models}, a methodology where rendering models possess the ability to evolve and ada… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Project page: https://fnzhan.com/Evolutive-Rendering-Models/

  10. arXiv:2405.16645  [pdf, other

    cs.CV

    Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models

    Authors: Hanwen Liang, Yuyang Yin, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei

    Abstract: The availability of large-scale multimodal datasets and advancements in diffusion models have significantly accelerated progress in 4D content generation. Most prior approaches rely on multiple image or video diffusion models, utilizing score distillation sampling for optimization or generating pseudo novel views for direct supervision. However, these methods are hindered by slow optimization spee… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Project page: https://vita-group.github.io/Diffusion4D

  11. arXiv:2405.16640  [pdf, other

    cs.AI cs.CL cs.CV cs.MM

    A Survey of Multimodal Large Language Model from A Data-centric Perspective

    Authors: Tianyi Bai, Hao Liang, Binwang Wan, Ling Yang, Bozhou Li, Yifan Wang, Bin Cui, Conghui He, Binhang Yuan, Wentao Zhang

    Abstract: Human beings perceive the world through diverse senses such as sight, smell, hearing, and touch. Similarly, multimodal large language models (MLLMs) enhance the capabilities of traditional large language models by integrating and processing data from multiple modalities including text, vision, audio, video, and 3D environments. Data plays a pivotal role in the development and refinement of these m… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  12. arXiv:2405.15119  [pdf, other

    cs.LG stat.ML

    Bayesian Optimization of Functions over Node Subsets in Graphs

    Authors: Huidong Liang, Xingchen Wan, Xiaowen Dong

    Abstract: We address the problem of optimizing over functions defined on node subsets in a graph. The optimization of such functions is often a non-trivial task given their combinatorial, black-box and expensive-to-evaluate nature. Although various algorithms have been introduced in the literature, most are either task-specific or computationally inefficient and only utilize information about the graph stru… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 26 pages with 20 figures

  13. arXiv:2405.12063  [pdf, other

    cs.CL

    CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models

    Authors: Tong Zhang, Peixin Qin, Yang Deng, Chen Huang, Wenqiang Lei, Junhong Liu, Dingnan **, Hongru Liang, Tat-Seng Chua

    Abstract: Large language models (LLMs) are increasingly used to meet user information needs, but their effectiveness in dealing with user queries that contain various types of ambiguity remains unknown, ultimately risking user trust and satisfaction. To this end, we introduce CLAMBER, a benchmark for evaluating LLMs using a well-organized taxonomy. Building upon the taxonomy, we construct ~12K high-quality… ▽ More

    Submitted 1 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024. Camera Ready. Our dataset is available at https://github.com/zt991211/CLAMBER

  14. arXiv:2405.09114  [pdf, other

    cs.CV

    SOEDiff: Efficient Distillation for Small Object Editing

    Authors: Qihe Pan, Zicheng Wang, Zhen Zhao, Yiming Wu, Sifan Long, Haoran Liang, Ronghua Liang

    Abstract: In this paper, we delve into a new task known as small object editing (SOE), which focuses on text-based image inpainting within a constrained, small-sized area. Despite the remarkable success have been achieved by current image inpainting approaches, their application to the SOE task generally results in failure cases such as Object Missing, Text-Image Mismatch, and Distortion. These failures ste… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  15. arXiv:2405.05160  [pdf, other

    cs.LG cs.AI cs.CV

    Selective Classification Under Distribution Shifts

    Authors: Hengyue Liang, Le Peng, Ju Sun

    Abstract: In selective classification (SC), a classifier abstains from making predictions that are likely to be wrong to avoid excessive errors. To deploy imperfect classifiers -- imperfect either due to intrinsic statistical noise of data or for robustness issue of the classifier or beyond -- in high-stakes scenarios, SC appears to be an attractive and necessary path to follow. Despite decades of research… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Total 25 pages (14 pages for main body); preprint for journal submission

  16. arXiv:2404.11887  [pdf, other

    cs.AR

    EN-TensorCore: Advancing TensorCores Performance through Encoder-Based Methodology

    Authors: Qizhe Wu, Yuchen Gui, Zhichen Zeng, Xiaotian Wang, Huawen Liang, Xi **

    Abstract: Tensor computations, with matrix multiplication being the primary operation, serve as the fundamental basis for data analysis, physics, machine learning, and deep learning. As the scale and complexity of data continue to grow rapidly, the demand for tensor computations has also increased significantly. To meet this demand, several research institutions have started develo** dedicated hardware fo… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 7 pages, 6 figures

  17. arXiv:2404.11171  [pdf, other

    cs.LG cs.AI eess.SP

    Personalized Heart Disease Detection via ECG Digital Twin Generation

    Authors: Yaojun Hu, **tai Chen, Lianting Hu, Dantong Li, Jiahuan Yan, Haochao Ying, Huiying Liang, Jian Wu

    Abstract: Heart diseases rank among the leading causes of global mortality, demonstrating a crucial need for early diagnosis and intervention. Most traditional electrocardiogram (ECG) based automated diagnosis methods are trained at population level, neglecting the customization of personalized ECGs to enhance individual healthcare management. A potential solution to address this limitation is to employ dig… ▽ More

    Submitted 11 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  18. arXiv:2404.03361  [pdf, other

    cs.CL

    nicolay-r at SemEval-2024 Task 3: Using Flan-T5 for Reasoning Emotion Cause in Conversations with Chain-of-Thought on Emotion States

    Authors: Nicolay Rusnachenko, Huizhi Liang

    Abstract: Emotion expression is one of the essential traits of conversations. It may be self-related or caused by another speaker. The variety of reasons may serve as a source of the further emotion causes: conversation history, speaker's emotional state, etc. Inspired by the most recent advances in Chain-of-Thought, in this work, we exploit the existing three-hop reasoning approach (THOR) to perform large… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Ranked 3rd-4th place (F1-proportional) and 5th place (F1-strict) in SemEval'24 Task 3, Subtask 1, to appear in SemEval-2024 proceedings

  19. arXiv:2404.03164  [pdf, ps, other

    cs.IR cs.AI cs.LG

    Does Knowledge Graph Really Matter for Recommender Systems?

    Authors: Haonan Zhang, Dongxia Wang, Zhu Sun, Yanhui Li, Youcheng Sun, Huizhi Liang, Wenhai Wang

    Abstract: Recommender systems (RSs) are designed to provide personalized recommendations to users. Recently, knowledge graphs (KGs) have been widely introduced in RSs to improve recommendation accuracy. In this study, however, we demonstrate that RSs do not necessarily perform worse even if the KG is downgraded to the user-item interaction graph only (or removed). We propose an evaluation framework KG4RecEv… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  20. arXiv:2403.20284  [pdf, other

    cs.CL cs.LG

    LayerNorm: A key component in parameter-efficient fine-tuning

    Authors: Taha ValizadehAslani, Hualou Liang

    Abstract: Fine-tuning a pre-trained model, such as Bidirectional Encoder Representations from Transformers (BERT), has been proven to be an effective method for solving many natural language processing (NLP) tasks. However, due to the large number of parameters in many state-of-the-art NLP models, including BERT, the process of fine-tuning is computationally expensive. One attractive solution to this issue… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  21. arXiv:2403.17847  [pdf, other

    cs.LG cs.AI

    Climate Downscaling: A Deep-Learning Based Super-resolution Model of Precipitation Data with Attention Block and Skip Connections

    Authors: Chia-Hao Chiang, Zheng-Han Huang, Liwen Liu, Hsin-Chien Liang, Yi-Chi Wang, Wan-Ling Tseng, Chao Wang, Che-Ta Chen, Ko-Chih Wang

    Abstract: Human activities accelerate consumption of fossil fuels and produce greenhouse gases, resulting in urgent issues today: global warming and the climate change. These indirectly cause severe natural disasters, plenty of lives suffering and huge losses of agricultural properties. To mitigate impacts on our lands, scientists are develo** renewable, reusable, and clean energies and climatologists are… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  22. arXiv:2403.16993  [pdf, other

    cs.CV

    Comp4D: LLM-Guided Compositional 4D Scene Generation

    Authors: Dejia Xu, Hanwen Liang, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Plataniotis, Zhangyang Wang

    Abstract: Recent advancements in diffusion models for 2D and 3D content creation have sparked a surge of interest in generating 4D content. However, the scarcity of 3D scene datasets constrains current methodologies to primarily object-centric generation. To overcome this limitation, we present Comp4D, a novel framework for Compositional 4D Generation. Unlike conventional methods that generate a singular 4D… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://vita-group.github.io/Comp4D/

  23. arXiv:2403.15736  [pdf, other

    cs.CL

    LLMs Instruct LLMs:An Extraction and Editing Method

    Authors: Xin Zhang, Tianjie Ju, Huijia Liang, Ying Fu, Qin Zhang

    Abstract: The interest in updating Large Language Models (LLMs) without retraining from scratch is substantial, yet it comes with some challenges.This is especially true for situations demanding complex reasoning with limited samples, a scenario we refer to as the Paucity-Constrained Complex Reasoning Adaptation for LLMs (PCRA-LLM).Traditional methods like Low-Rank Adaptation (LoRA) and Retrieval-Augmented… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Working in progress

  24. Data Cubes in Hand: A Design Space of Tangible Cubes for Visualizing 3D Spatio-Temporal Data in Mixed Reality

    Authors: Shuqi He, Haonan Yao, Luyan Jiang, Kaiwen Li, Nan Xiang, Yue Li, Hai-Ning Liang, Lingyun Yu

    Abstract: Tangible interfaces in mixed reality (MR) environments allow for intuitive data interactions. Tangible cubes, with their rich interaction affordances, high maneuverability, and stable structure, are particularly well-suited for exploring multi-dimensional data types. However, the design potential of these cubes is underexplored. This study introduces a design space for tangible cubes in MR, focusi… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  25. arXiv:2403.06769  [pdf, other

    cs.CL

    Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning

    Authors: Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua

    Abstract: We investigate non-collaborative dialogue agents, which are expected to engage in strategic conversations with diverse users, for securing a mutual agreement that leans favorably towards the system's objectives. This poses two main challenges for existing dialogue agents: 1) The inability to integrate user-specific characteristics into the strategic planning, and 2) The difficulty of training stra… ▽ More

    Submitted 6 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: V2: 20 pages, 8 figures, and 20 tables

  26. arXiv:2402.14299  [pdf, other

    cs.RO cs.AI

    We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity

    Authors: Miao Xin, Zhongrui You, Zihan Zhang, Taoran Jiang, Tingjia Xu, Haotian Liang, Guo**g Ge, Yuchen Ji, Shentong Mo, Jian Cheng

    Abstract: We present SpaceAgents-1, a system for learning human and multi-robot collaboration (HMRC) strategies under microgravity conditions. Future space exploration requires humans to work together with robots. However, acquiring proficient robot skills and adept collaboration under microgravity conditions poses significant challenges within ground laboratories. To address this issue, we develop a microg… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  27. arXiv:2402.09456  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Optimistic Thompson Sampling for No-Regret Learning in Unknown Games

    Authors: Yingru Li, Liangqi Liu, Wenqiang Pu, Hao Liang, Zhi-Quan Luo

    Abstract: This work tackles the complexities of multi-player scenarios in \emph{unknown games}, where the primary challenge lies in navigating the uncertainty of the environment through bandit feedback alongside strategic decision-making. We introduce Thompson Sampling (TS)-based algorithms that exploit the information of opponents' actions and reward structures, leading to a substantial reduction in experi… ▽ More

    Submitted 24 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  28. arXiv:2402.05798  [pdf, other

    cs.HC

    Visual Harmony: Text-Visual Interplay in Circular Infographics

    Authors: Shuqi He, Yuqing Chen, Yuxin Xia, Yichun Li, Hai-Ning Liang, Lingyun Yu

    Abstract: Infographics are visual representations designed for efficient and effective communication of data and knowledge. One crucial aspect of infographic design is the interplay between text and visual elements, particularly in circular visualizations where the textual descriptions can either be embedded within the graphics or placed adjacent to the visual representation. While several studies have exam… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  29. arXiv:2402.02423  [pdf, other

    cs.LG cs.AI cs.HC cs.RO

    Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

    Authors: Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, **yi Liu, Zhixin Feng, Kai Zhao, Yan Zheng

    Abstract: Reinforcement Learning with Human Feedback (RLHF) has received significant attention for performing tasks without the need for costly manual reward design by aligning human preferences. It is crucial to consider diverse human feedback types and various learning methods in different environments. However, quantifying progress in RLHF with diverse feedback is challenging due to the lack of standardi… ▽ More

    Submitted 25 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Published as a conference paper at ICLR 2024. The website is available at https://uni-rlhf.github.io/

  30. arXiv:2402.01380  [pdf, other

    cs.CV eess.IV

    Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization

    Authors: Zhiyu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Song

    Abstract: Volumetric videos, benefiting from immersive 3D realism and interactivity, hold vast potential for various applications, while the tremendous data volume poses significant challenges for compression. Recently, NeRF has demonstrated remarkable potential in volumetric video compression thanks to its simple representation and powerful 3D modeling capabilities, where a notable work is ReRF. However, R… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  31. arXiv:2401.15687  [pdf, other

    cs.CV cs.GR

    Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

    Authors: Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, **gyi Yu, Lan Xu

    Abstract: The synthesis of 3D facial animations from speech has garnered considerable attention. Due to the scarcity of high-quality 4D facial data and well-annotated abundant multi-modality labels, previous methods often suffer from limited realism and a lack of lexible conditioning. We address this challenge through a trilogy. We first introduce Generalized Neural Parametric Facial Asset (GNPFA), an effic… ▽ More

    Submitted 30 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Project Page: https://sites.google.com/view/media2face

  32. arXiv:2401.14629  [pdf, ps, other

    cs.SE cs.CY

    A First Look at the General Data Protection Regulation (GDPR) in Open-Source Software

    Authors: Lucas Franke, Huayu Liang, Aaron Brantly, James C Davis, Chris Brown

    Abstract: This poster describes work on the General Data Protection Regulation (GDPR) in open-source software. Although open-source software is commonly integrated into regulated software, and thus must be engineered or adapted for compliance, we do not know how such laws impact open-source software development. We surveyed open-source developers (N=47) to understand their experiences and perceptions of G… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 2 page extended abstract for ICSE-Poster 2024

  33. arXiv:2401.12326  [pdf, other

    cs.CL cs.AI

    Fine-tuning Large Language Models for Multigenerator, Multidomain, and Multilingual Machine-Generated Text Detection

    Authors: Feng Xiong, Thanet Markchom, Ziwei Zheng, Subin Jung, Varun Ojha, Huizhi Liang

    Abstract: SemEval-2024 Task 8 introduces the challenge of identifying machine-generated texts from diverse Large Language Models (LLMs) in various languages and domains. The task comprises three subtasks: binary classification in monolingual and multilingual (Subtask A), multi-class classification (Subtask B), and mixed text detection (Subtask C). This paper focuses on Subtask A & B. Each subtask is support… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  34. arXiv:2401.11782  [pdf, other

    physics.soc-ph cs.SI q-bio.PE

    Temporal Interaction and its Role in the Evolution of Cooperation

    Authors: Yujie He, Tianyu Ren, Xiao-Jun Zeng, Huawen Liang, Liukai Yu, Junjun Zheng

    Abstract: This research investigates the impact of dynamic interactions with time-varying topologies on the evolution of cooperative behaviours in social dilemmas. Traditional research has focused on deterministic rules governing pairwise interactions, yet the impact of interaction frequency and synchronicity on cooperation remains underexplored. Addressing this gap, our work introduces two temporal interac… ▽ More

    Submitted 5 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 10 pages, 9 figures

  35. arXiv:2401.00225  [pdf

    eess.AS cs.AI eess.SP

    Enhancing dysarthria speech feature representation with empirical mode decomposition and Walsh-Hadamard transform

    Authors: Ting Zhu, Shufei Duan, Camille Dingam, Huizhi Liang, Wei Zhang

    Abstract: Dysarthria speech contains the pathological characteristics of vocal tract and vocal fold, but so far, they have not yet been included in traditional acoustic feature sets. Moreover, the nonlinearity and non-stationarity of speech have been ignored. In this paper, we propose a feature enhancement algorithm for dysarthria speech called WHFEMD. It combines empirical mode decomposition (EMD) and fast… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  36. arXiv:2312.16057  [pdf, other

    cs.IT eess.SP

    Semantic Importance-Aware Based for Multi-User Communication Over MIMO Fading Channels

    Authors: Haotai Liang, Zhicheng Bao, Wannian An, Chen Dong, Xiaodong Xu

    Abstract: Semantic communication, as a novel communication paradigm, has attracted the interest of many scholars, with multi-user, multi-input multi-output (MIMO) scenarios being one of the critical contexts. This paper presents a semantic importance-aware based communication system (SIA-SC) over MIMO Rayleigh fading channels. Combining the semantic symbols' inequality and the equivalent subchannels of MIMO… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  37. arXiv:2312.08998  [pdf

    eess.AS cs.AI cs.SD eess.SP

    Design, construction and evaluation of emotional multimodal pathological speech database

    Authors: Ting Zhu, Shufei Duan, Huizhi Liang, Wei Zhang

    Abstract: The lack of an available emotion pathology database is one of the key obstacles in studying the emotion expression status of patients with dysarthria. The first Chinese multimodal emotional pathological speech database containing multi-perspective information is constructed in this paper. It includes 29 controls and 39 patients with different degrees of motor dysarthria, expressing happy, sad, ang… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  38. arXiv:2312.08985  [pdf, other

    cs.CV

    OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers

    Authors: Han Liang, Jiacheng Bao, Ruichi Zhang, Sihan Ren, Yuecheng Xu, Sibei Yang, Xin Chen, **gyi Yu, Lan Xu

    Abstract: We have recently seen tremendous progress in realistic text-to-motion generation. Yet, the existing methods often fail or produce implausible motions with unseen text inputs, which limits the applications. In this paper, we present OMG, a novel framework, which enables compelling motion generation from zero-shot open-vocabulary text prompts. Our key idea is to carefully tailor the pretrain-then-fi… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: accepted by CVPR 2024

  39. arXiv:2312.08224  [pdf, other

    cs.AI cs.LG

    GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time

    Authors: Haoran Ye, Jiarui Wang, Helan Liang, Zhiguang Cao, Yong Li, Fanzhang Li

    Abstract: The recent end-to-end neural solvers have shown promise for small-scale routing problems but suffered from limited real-time scaling-up performance. This paper proposes GLOP (Global and Local Optimization Policies), a unified hierarchical framework that efficiently scales toward large-scale routing problems. GLOP partitions large routing problems into Travelling Salesman Problems (TSPs) and TSPs i… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  40. arXiv:2311.10492  [pdf, other

    cs.CV

    A Relay System for Semantic Image Transmission based on Shared Feature Extraction and Hyperprior Entropy Compression

    Authors: Wannian An, Zhicheng Bao, Haotai Liang, Chen Dong, Xiaodong

    Abstract: Nowadays, the need for high-quality image reconstruction and restoration is more and more urgent. However, most image transmission systems may suffer from image quality degradation or transmission interruption in the face of interference such as channel noise and link fading. To solve this problem, a relay communication network for semantic image transmission based on shared feature extraction and… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  41. arXiv:2311.00267  [pdf, other

    cs.LG cs.AI

    Rethinking Decision Transformer via Hierarchical Reinforcement Learning

    Authors: Yi Ma, Chenjun Xiao, Hebin Liang, Jianye Hao

    Abstract: Decision Transformer (DT) is an innovative algorithm leveraging recent advances of the transformer architecture in reinforcement learning (RL). However, a notable limitation of DT is its reliance on recalling trajectories from datasets, losing the capability to seamlessly stitch sub-optimal trajectories together. In this work we introduce a general sequence modeling framework for studying sequenti… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  42. arXiv:2310.16276  [pdf, other

    physics.soc-ph cs.SI

    Complexity of Government response to Covid-19 pandemic: A perspective of coupled dynamics on information heterogeneity and epidemic outbreak

    Authors: Xiaoqi Zhang, Jie Fu, Sheng Hua, Han Liang, Zi-Ke Zhang

    Abstract: This study aims at modeling the universal failure in preventing the outbreak of COVID-19 via real-world data from the perspective of complexity and network science. Through formalizing information heterogeneity and government intervention in the coupled dynamics of epidemic and infodemic spreading; first, we find that information heterogeneity and its induced variation in human responses significa… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: This version contains the full-resolution figures for the paper DOI: 10.1007/s11071-023-08427-5

  43. arXiv:2310.14867  [pdf, other

    cs.HC

    Who's Watching Me?: Exploring the Impact of Audience Familiarity on Player Performance, Experience, and Exertion in Virtual Reality Exergames

    Authors: Zixuan Guo, Wenge Xu, Jialin Zhang, Hongyu Wang, Cheng-Hung Lo, Hai-Ning Liang

    Abstract: Familiarity with audiences plays a significant role in sha** individual performance and experience across various activities in everyday life. This study delves into the impact of familiarity with non-playable character (NPC) audiences on player performance and experience in virtual reality (VR) exergames. By manipulating of NPC appearance (face and body shape) and voice familiarity, we explored… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 10 pages, 5 figures, IEEE International Symposium on Mixed and Augmented Reality (ISMAR) 2023

  44. arXiv:2310.08935  [pdf, ps, other

    math.PR cs.GT

    Proof of a conjecture about Parrondo's paradox for two-armed slot machines

    Authors: Huai** Liang, Zeng**g Chen

    Abstract: The 1936 Mills Futurity slot machine had the feature that, if a player loses 10 times in a row, the 10 lost coins are returned. Ethier and Lee (2010) studied a generalized version of this machine, with 10 replaced by deterministic parameter J. They established the Parrondo effect for a hypothetical two-armed machine with the Futurity award. Specifically, arm A and arm B, played individually, are a… ▽ More

    Submitted 23 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 45 pages

    MSC Class: 60J10; 60F05

  45. arXiv:2310.07659  [pdf, other

    cs.CL

    Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue

    Authors: Lang Qin, Yao Zhang, Hongru Liang, Jun Wang, Zhenglu Yang

    Abstract: Accurate knowledge selection is critical in knowledge-grounded dialogue systems. Towards a closer look at it, we offer a novel perspective to organize existing literature, i.e., knowledge selection coupled with, after, and before generation. We focus on the third under-explored category of study, which can not only select knowledge accurately in advance, but has the advantage to reduce the learnin… ▽ More

    Submitted 20 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP2023 main conference

  46. arXiv:2309.14032  [pdf, other

    cs.NE cs.AI cs.LG

    DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization

    Authors: Haoran Ye, Jiarui Wang, Zhiguang Cao, Helan Liang, Yong Li

    Abstract: Ant Colony Optimization (ACO) is a meta-heuristic algorithm that has been successfully applied to various Combinatorial Optimization Problems (COPs). Traditionally, customizing ACO for a specific problem requires the expert design of knowledge-driven heuristics. In this paper, we propose DeepACO, a generic framework that leverages deep reinforcement learning to automate heuristic designs. DeepACO… ▽ More

    Submitted 4 November, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at NeurIPS 2023

  47. arXiv:2309.12849  [pdf, other

    cs.LG eess.SY

    DeepOPF-U: A Unified Deep Neural Network to Solve AC Optimal Power Flow in Multiple Networks

    Authors: Heng Liang, Changhong Zhao

    Abstract: The traditional machine learning models to solve optimal power flow (OPF) are mostly trained for a given power network and lack generalizability to today's power networks with varying topologies and growing plug-and-play distributed energy resources (DERs). In this paper, we propose DeepOPF-U, which uses one unified deep neural network (DNN) to solve alternating-current (AC) OPF problems in differ… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 3 pages, 2 figures

  48. arXiv:2309.07846  [pdf, other

    cs.CV

    MC-NeRF: Multi-Camera Neural Radiance Fields for Multi-Camera Image Acquisition Systems

    Authors: Yu Gao, Lutong Su, Hao Liang, Yufeng Yue, Yi Yang, Mengyin Fu

    Abstract: Neural Radiance Fields (NeRF) use multi-view images for 3D scene representation, demonstrating remarkable performance. As one of the primary sources of multi-view images, multi-camera systems encounter challenges such as varying intrinsic parameters and frequent pose changes. Most previous NeRF-based methods assume a unique camera and rarely consider multi-camera scenarios. Besides, some NeRF meth… ▽ More

    Submitted 22 March, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: This manuscript is currently under review

  49. arXiv:2309.03522  [pdf, other

    cs.HC

    AR.S.Space: An AR Casual Game for Social Engagement in Work Environments

    Authors: Boyuan Chen, Junkun Long, Wenxuan Zheng, Yuzheng Wu, Ziming Li, Yue Li, Hai-Ning Liang

    Abstract: In social situations, individuals often encounter communication challenges, particularly when adapting to new environments. While some studies have acknowledged the potential of AR social games to aid in effective socialization to some extent, little attention has been given to AR HMD-based games specifically designed to facilitate social interactions. In response, we propose AR.S.Space, an AR HMD… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 2023 ISMAR Student Competition

  50. arXiv:2308.16738  [pdf, other

    eess.IV cs.CV cs.LG

    SFUSNet: A Spatial-Frequency domain-based Multi-branch Network for diagnosis of Cervical Lymph Node Lesions in Ultrasound Images

    Authors: Yubiao Yue, Jun Xue, Haihua Liang, Bingchun Luo, Zhenzhang Li

    Abstract: Booming deep learning has substantially improved the diagnosis for diverse lesions in ultrasound images, but a conspicuous research gap concerning cervical lymph node lesions still remains. The objective of this work is to diagnose cervical lymph node lesions in ultrasound images by leveraging a deep learning model. To this end, we first collected 3392 cervical ultrasound images containing normal… ▽ More

    Submitted 4 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.