Skip to main content

Showing 1–50 of 146 results for author: Wei, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00030  [pdf, other

    cs.DC cs.PF

    On Orchestrating Parallel Broadcasts for Distributed Ledgers

    Authors: Peiyao Sheng, Chenyuan Wu, Dahlia Malkhi, Michael K. Reiter, Chrysoula Stathakopoulou, Michael Wei, Maofan Yin

    Abstract: This paper introduces and develops the concept of ``ticketing'', through which atomic broadcasts are orchestrated by nodes in a distributed system. The paper studies different ticketing regimes that allow parallelism, yet prevent slow nodes from hampering overall progress. It introduces a hybrid scheme which combines managed and unmanaged ticketing regimes, striking a balance between adaptivity an… ▽ More

    Submitted 17 May, 2024; originally announced July 2024.

  2. arXiv:2406.14123  [pdf

    cs.CY

    Map** AI Ethics Narratives: Evidence from Twitter Discourse Between 2015 and 2022

    Authors: Mengyi Wei, Puzhen Zhang, Chuan Chen, Dongsheng Chen, Chenyu Zuo, Liqiu Meng

    Abstract: Public participation is indispensable for an insightful understanding of the ethics issues raised by AI technologies. Twitter is selected in this paper to serve as an online public sphere for exploring discourse on AI ethics, facilitating broad and equitable public engagement in the development of AI technology. A research framework is proposed to demonstrate how to transform AI ethics-related dis… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 22 pages, 6 figures

  3. arXiv:2406.13445  [pdf, other

    cs.CV cs.AI

    Lost in UNet: Improving Infrared Small Target Detection by Underappreciated Local Features

    Authors: Wuzhou Quan, Wei Zhao, Weiming Wang, Haoran Xie, Fu Lee Wang, Mingqiang Wei

    Abstract: Many targets are often very small in infrared images due to the long-distance imaging meachnism. UNet and its variants, as popular detection backbone networks, downsample the local features early and cause the irreversible loss of these local features, leading to both the missed and false detection of small targets in infrared images. We propose HintU, a novel network to recover the local features… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.13007  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Night Photography Rendering

    Authors: Egor Ershov, Artyom Panshin, Oleg Karasev, Sergey Korchagin, Shepelev Lev, Alexandr Startsev, Daniil Vladimirov, Ekaterina Zaychenkova, Nikola Banić, Dmitrii Iarchuk, Maria Efimova, Radu Timofte, Arseniy Terekhin, Shuwei Yue, Yuyang Liu, Minchen Wei, Lu Xu, Chao Zhang, Yasi Wang, Furkan Kınlı, Doğa Yılmaz, Barış Özcan, Furkan Kıraç, Shuai Liu, **gyuan Xiao , et al. (25 additional authors not shown)

    Abstract: This paper presents a review of the NTIRE 2024 challenge on night photography rendering. The goal of the challenge was to find solutions that process raw camera images taken in nighttime conditions, and thereby produce a photo-quality output images in the standard RGB (sRGB) space. Unlike the previous year's competition, the challenge images were collected with a mobile phone and the speed of algo… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 10 figures

  5. arXiv:2406.12214  [pdf, other

    cs.RO cs.CV

    Is Your HD Map Constructor Reliable under Sensor Corruptions?

    Authors: Xiaoshuai Hao, Mengchuan Wei, Yifan Yang, Haimei Zhao, Hui Zhang, Yi Zhou, Qiang Wang, Weiming Li, Lingdong Kong, **g Zhang

    Abstract: Driving systems often rely on high-definition (HD) maps for precise environmental information, which is crucial for planning and navigation. While current HD map constructors perform well under ideal conditions, their resilience to real-world challenges, \eg, adverse weather and sensor failures, is not well understood, raising safety concerns. This work introduces MapBench, the first comprehensive… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: project url: https://mapbench.github.io/

  6. arXiv:2406.12161  [pdf, other

    cs.CY cs.CR cs.HC cs.SI

    Understanding Help-Seeking and Help-Giving on Social Media for Image-Based Sexual Abuse

    Authors: Miranda Wei, Sunny Consolvo, Patrick Gage Kelley, Tadayoshi Kohno, Tara Matthews, Sarah Meiklejohn, Franziska Roesner, Renee Shelby, Kurt Thomas, Rebecca Umbach

    Abstract: Image-based sexual abuse (IBSA), like other forms of technology-facilitated abuse, is a growing threat to people's digital safety. Attacks include unwanted solicitations for sexually explicit images, extorting people under threat of leaking their images, or purposefully leaking images to enact revenge or exert control. In this paper, we explore how people seek and receive help for IBSA on social m… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 18 pages, 4 figures, 8 tables, 103 references

    ACM Class: K.4.2; H.4.3; J.4

    Journal ref: Proceedings of the 33rd USENIX Security Symposium (USENIX Security 2024)

  7. arXiv:2406.09178  [pdf, other

    cs.RO

    AutomaChef: A Physics-informed Demonstration-guided Learning Framework for Granular Material Manipulation

    Authors: Minglun Wei, Xintong Yang, Yu-Kun Lai, Seyed Amir Tafrishi, Ze Ji

    Abstract: Due to the complex physical properties of granular materials, research on robot learning for manipulating such materials predominantly either disregards the consideration of their physical characteristics or uses surrogate models to approximate their physical properties. Learning to manipulate granular materials based on physical information obtained through precise modelling remains an unsolved p… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 8 pages

  8. arXiv:2406.08308  [pdf, other

    cs.GR

    FSH: 3D Representation via Fibonacci Spherical Harmonics

    Authors: Zikuan Li, Anyi Huang, Wenru Jia, Qiaoyun Wu, Mingqiang Wei, Jun Wang

    Abstract: Spherical harmonics are a favorable technique for 3D representation, employing a frequency-based approach through the spherical harmonic transform (SHT). Typically, SHT is performed using equiangular sampling grids. However, these grids are non-uniform on spherical surfaces and exhibit local anisotropy, a common limitation in existing spherical harmonic decomposition methods. This paper proposes a… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  9. arXiv:2406.06541  [pdf, other

    cs.AR

    Global and Local Attention-based Inception U-Net for Static IR Drop Estimation

    Authors: Yilu Chen, Zhijie Cai, Min Wei, Zhifeng Lin, Jianli Chen

    Abstract: Static IR drop analysis is a fundamental and critical task in chip design since the IR drop will significantly affect the design's functionality, performance, and reliability. However, the process of IR drop analysis can be time-consuming, potentially taking several hours. Furthermore, in the process of fixing violations, it is frequently imperative to do IR drop analysis iteratively, hence exacer… ▽ More

    Submitted 27 April, 2024; originally announced June 2024.

    Comments: 7 pages, 8 figures

  10. arXiv:2406.06016  [pdf, other

    cs.LG

    EpiLearn: A Python Library for Machine Learning in Epidemic Modeling

    Authors: Zewen Liu, Yunxiao Li, Mingyang Wei, Guancheng Wan, Max S. Y. Lau, Wei **

    Abstract: EpiLearn is a Python toolkit developed for modeling, simulating, and analyzing epidemic data. Although there exist several packages that also deal with epidemic modeling, they are often restricted to mechanistic models or traditional statistical tools. As machine learning continues to shape the world, the gap between these packages and the latest models has become larger. To bridge the gap and ins… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  11. arXiv:2406.05520  [pdf, other

    cs.CY

    "Violation of my body:" Perceptions of AI-generated non-consensual (intimate) imagery

    Authors: Natalie Grace Brigham, Miranda Wei, Tadayoshi Kohno, Elissa M. Redmiles

    Abstract: AI technology has enabled the creation of deepfakes: hyper-realistic synthetic media. We surveyed 315 individuals in the U.S. on their views regarding the hypothetical non-consensual creation of deepfakes depicting them, including deepfakes portraying sexual acts. Respondents indicated strong opposition to creating and, even more so, sharing non-consensually created synthetic content, especially i… ▽ More

    Submitted 16 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Journal ref: Proceedings of the 20th Symposium on Usable Privacy and Security (SOUPS 2024)

  12. arXiv:2405.15228  [pdf, other

    cs.LG cs.CV

    Learning from True-False Labels via Multi-modal Prompt Retrieving

    Authors: Zhongnian Li, **ghao Xu, Peng Ying, Meng Wei, Tongfeng Sun, Xinzheng Xu

    Abstract: Weakly supervised learning has recently achieved considerable success in reducing annotation costs and label noise. Unfortunately, existing weakly supervised learning methods are short of ability in generating reliable labels via pre-trained vision-language models (VLMs). In this paper, we propose a novel weakly supervised labeling setting, namely True-False Labels (TFLs) which can achieve high ac… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 15 pages, 4 figures

  13. arXiv:2405.12971  [pdf, other

    cs.CV

    BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

    Authors: Theodore Zhao, Yu Gu, Jianwei Yang, Naoto Usuyama, Ho Hin Lee, Tristan Naumann, Jianfeng Gao, Angela Crabtree, Jacob Abel, Christine Moung-Wen, Brian Piening, Carlo Bifulco, Mu Wei, Hoifung Poon, Sheng Wang

    Abstract: Biomedical image analysis is fundamental for biomedical discovery in cell biology, pathology, radiology, and many other biomedical domains. Holistic image analysis comprises interdependent subtasks such as segmentation, detection, and recognition of relevant objects. Here, we propose BiomedParse, a biomedical foundation model for imaging parsing that can jointly conduct segmentation, detection, an… ▽ More

    Submitted 4 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Project page: https://aka.ms/biomedparse-project

  14. arXiv:2405.10567  [pdf, other

    cs.CV

    Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track

    Authors: Xiaoshuai Hao, Yifan Yang, Hui Zhang, Mengchuan Wei, Yi Zhou, Haimei Zhao, **g Zhang

    Abstract: In this report, we describe the technical details of our submission to the 2024 RoboDrive Challenge Robust Map Segmentation Track. The Robust Map Segmentation track focuses on the segmentation of complex driving scene elements in BEV maps under varied driving conditions. Semantic map segmentation provides abundant and precise static environmental information crucial for autonomous driving systems'… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: ICRA 2024 RoboDrive Challenge Robust Map Segmentation Track 3rd Place Technical Report. arXiv admin note: text overlap with arXiv:2205.09743 by other authors

  15. arXiv:2405.09083  [pdf, other

    cs.CV

    RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing

    Authors: Jiamei Xiong, Xuefeng Yan, Yongzhen Wang, Wei Zhao, Xiao-** Zhang, Mingqiang Wei

    Abstract: Haze severely degrades the visual quality of remote sensing images and hampers the performance of automotive navigation, intelligent monitoring, and urban management. The emerging denoising diffusion probabilistic model (DDPM) exhibits the significant potential for dense haze removal with its strong generation ability. Since remote sensing images contain extensive small-scale texture structures, i… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  16. arXiv:2405.08816  [pdf, other

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang , et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  17. arXiv:2404.10187  [pdf, other

    cs.CR cs.CY cs.HC

    SoK (or SoLK?): On the Quantitative Study of Sociodemographic Factors and Computer Security Behaviors

    Authors: Miranda Wei, Jaron Mink, Yael Eiger, Tadayoshi Kohno, Elissa M. Redmiles, Franziska Roesner

    Abstract: Researchers are increasingly exploring how gender, culture, and other sociodemographic factors correlate with user computer security and privacy behaviors. To more holistically understand relationships between these factors and behaviors, we make two contributions. First, we broadly survey existing scholarship on sociodemographics and secure behavior (151 papers) before conducting a focused litera… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 20 pages, 1 figure, 8 tables

    Journal ref: Proceedings of the 33rd USENIX Security Symposium (USENIX Security 2024)

  18. arXiv:2404.04586  [pdf, other

    cs.CV

    PIE: Physics-inspired Low-light Enhancement

    Authors: Dong Liang, Zhengyan Xu, Ling Li, Mingqiang Wei, Songcan Chen

    Abstract: In this paper, we propose a physics-inspired contrastive learning paradigm for low-light enhancement, called PIE. PIE primarily addresses three issues: (i) To resolve the problem of existing learning-based methods often training a LLE model with strict pixel-correspondence image pairs, we eliminate the need for pixel-correspondence paired training data and instead train with unpaired images. (ii)… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2112.06451

  19. arXiv:2403.16482  [pdf, other

    cs.LG

    Determined Multi-Label Learning via Similarity-Based Prompt

    Authors: Meng Wei, Zhongnian Li, Peng Ying, Yong Zhou, Xinzheng Xu

    Abstract: In multi-label classification, each training instance is associated with multiple class labels simultaneously. Unfortunately, collecting the fully precise class labels for each training instance is time- and labor-consuming for real-world applications. To alleviate this problem, a novel labeling setting termed \textit{Determined Multi-Label Learning} (DMLL) is proposed, aiming to effectively allev… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures

  20. arXiv:2403.16469  [pdf, other

    cs.LG cs.CV

    Learning from Reduced Labels for Long-Tailed Data

    Authors: Meng Wei, Zhongnian Li, Yong Zhou, Xinzheng Xu

    Abstract: Long-tailed data is prevalent in real-world classification tasks and heavily relies on supervised information, which makes the annotation process exceptionally labor-intensive and time-consuming. Unfortunately, despite being a common approach to mitigate labeling costs, existing weakly supervised learning methods struggle to adequately preserve supervised information for tail samples, resulting in… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 3 figures

  21. arXiv:2403.09062  [pdf

    eess.IV cs.CV

    TBI Image/Text (TBI-IT): Comprehensive Text and Image Datasets for Traumatic Brain Injury Research

    Authors: Jie Li, Jiaying Wen, Tongxin Yang, Fenglin Cai, Miao Wei, Zhiwei Zhang, Li Jiang

    Abstract: In this paper, we introduce a new dataset in the medical field of Traumatic Brain Injury (TBI), called TBI-IT, which includes both electronic medical records (EMRs) and head CT images. This dataset is designed to enhance the accuracy of artificial intelligence in the diagnosis and treatment of TBI. This dataset, built upon the foundation of standard text and image data, incorporates specific annot… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2401.15934

  22. arXiv:2403.08002  [pdf, other

    cs.CL cs.CV

    Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation

    Authors: Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Akshay Chaudhari, Serena Yeung-Levy, Curtis P. Langlotz , et al. (2 additional authors not shown)

    Abstract: The scaling laws and extraordinary performance of large foundation models motivate the development and utilization of such models in biomedicine. However, despite early promising results on some biomedical benchmarks, there are still major challenges that need to be addressed before these models can be used in real-world clinics. Frontier general-domain models such as GPT-4V still have significant… ▽ More

    Submitted 26 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  23. arXiv:2403.06728  [pdf, other

    cs.CV

    Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

    Authors: Zijian Zhou, Miao**g Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren

    Abstract: Radiology report generation (RRG) has attracted significant attention due to its potential to reduce the workload of radiologists. Current RRG approaches are still unsatisfactory against clinical standards. This paper introduces a novel RRG method, \textbf{LM-RRG}, that integrates large models (LMs) with clinical quality reinforcement learning to generate accurate and comprehensive chest X-ray rad… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  24. arXiv:2403.04443  [pdf, other

    cs.CV

    FriendNet: Detection-Friendly Dehazing Network

    Authors: Yihua Fan, Yongzhen Wang, Mingqiang Wei, Fu Lee Wang, Haoran Xie

    Abstract: Adverse weather conditions often impair the quality of captured images, inevitably inducing cutting-edge object detection models for advanced driver assistance systems (ADAS) and autonomous driving. In this paper, we raise an intriguing question: can the combination of image restoration and object detection enhance detection performance in adverse weather conditions? To answer it, we propose an ef… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 13 pages, 8 figures, 6 tables

  25. arXiv:2402.16866  [pdf, other

    cs.IT cs.AI

    Computation Rate Maximization for Wireless Powered Edge Computing With Multi-User Cooperation

    Authors: Yang Li, Xing Zhang, Bo Lei, Qianying Zhao, Min Wei, Zheyan Qu, Wenbo Wang

    Abstract: The combination of mobile edge computing (MEC) and radio frequency-based wireless power transfer (WPT) presents a promising technique for providing sustainable energy supply and computing services at the network edge. This study considers a wireless-powered mobile edge computing system that includes a hybrid access point (HAP) equipped with a computing unit and multiple Internet of Things (IoT) de… ▽ More

    Submitted 22 January, 2024; originally announced February 2024.

    Comments: Accepted to IEEE Open Journal of the Communications Society

  26. arXiv:2401.15934  [pdf, other

    cs.CV

    HICH Image/Text (HICH-IT): Comprehensive Text and Image Datasets for Hypertensive Intracerebral Hemorrhage Research

    Authors: Jie Li, Yulong Xia, Tongxin Yang, Fenglin Cai, Miao Wei, Zhiwei Zhang, Li Jiang

    Abstract: In this paper, we introduce a new dataset in the medical field of hypertensive intracerebral hemorrhage (HICH), called HICH-IT, which includes both electronic medical records (EMRs) and head CT images. This dataset is designed to enhance the accuracy of artificial intelligence in the diagnosis and treatment of HICH. This dataset, built upon the foundation of standard text and image data, incorpora… ▽ More

    Submitted 5 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  27. arXiv:2401.12453  [pdf, other

    cs.CY cs.HC

    "The teachers are confused as well": A Multiple-Stakeholder Ethics Discussion on Large Language Models in Computing Education

    Authors: Kyrie Zhixuan Zhou, Zachary Kilhoffer, Madelyn Rose Sanfilippo, Ted Underwood, Ece Gumusel, Mengyi Wei, Abhinav Choudhry, **jun Xiong

    Abstract: Large Language Models (LLMs) are advancing quickly and impacting people's lives for better or worse. In higher education, concerns have emerged such as students' misuse of LLMs and degraded education outcomes. To unpack the ethical concerns of LLMs for higher education, we conducted a case study consisting of stakeholder interviews (n=20) in higher education computer science. We found that student… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  28. arXiv:2401.07654  [pdf, other

    cs.CV

    Foundation Models for Biomedical Image Segmentation: A Survey

    Authors: Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

    Abstract: Recent advancements in biomedical image analysis have been significantly driven by the Segment Anything Model (SAM). This transformative technology, originally developed for general-purpose computer vision, has found rapid application in medical image processing. Within the last year, marked by over 100 publications, SAM has demonstrated its prowess in zero-shot learning adaptations for medical im… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 22 pages, 4 figures, 7 tables

  29. arXiv:2312.12743  [pdf, other

    cs.CV

    PointeNet: A Lightweight Framework for Effective and Efficient Point Cloud Analysis

    Authors: Lipeng Gu, Xuefeng Yan, Liangliang Nan, Dingkun Zhu, Honghua Chen, Weiming Wang, Mingqiang Wei

    Abstract: Current methodologies in point cloud analysis predominantly explore 3D geometries, often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these approaches inevitably lead to a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  30. arXiv:2312.12717  [pdf, other

    cs.IT cs.LG

    DoDo-Code: a Deep Levenshtein Distance Embedding-based Code for IDS Channel and DNA Storage

    Authors: Alan J. X. Guo, Sihan Sun, Xiang Wei, Mengyi Wei, Xin Chen

    Abstract: Recently, DNA storage has emerged as a promising data storage solution, offering significant advantages in storage density, maintenance cost efficiency, and parallel replication capability. Mathematically, the DNA storage pipeline can be viewed as an insertion, deletion, and substitution (IDS) channel. Because of the mathematical terra incognita of the Levenshtein distance, designing an IDS-correc… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  31. arXiv:2312.10623  [pdf, other

    cs.IR cs.SE

    A Survey on Query-based API Recommendation

    Authors: Moshi Wei, Nima Shiri Harzevili, Alvine Boaye Belle, Junjie Wang, Lin Shi, **qiu Yang, Song Wang, Ming Zhen, Jiang

    Abstract: Application Programming Interfaces (APIs) are designed to help developers build software more effectively. Recommending the right APIs for specific tasks has gained increasing attention among researchers and developers in recent years. To comprehensively understand this research domain, we have surveyed to analyze API recommendation studies published in the last 10 years. Our study begins with an… ▽ More

    Submitted 26 January, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

  32. Levenshtein Distance Embedding with Poisson Regression for DNA Storage

    Authors: Xiang Wei, Alan J. X. Guo, Sihan Sun, Mengyi Wei, Wei Yu

    Abstract: Efficient computation or approximation of Levenshtein distance, a widely-used metric for evaluating sequence similarity, has attracted significant attention with the emergence of DNA storage and other biological applications. Sequence embedding, which maps Levenshtein distance to a conventional distance between embedding vectors, has emerged as a promising solution. In this paper, a novel neural n… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, (2024) 38(14), 15796-15804

  33. arXiv:2312.04891  [pdf, other

    cs.CV

    Cross-BERT for Point Cloud Pretraining

    Authors: Xin Li, Peng Li, Zeyong Wei, Zhe Zhu, Mingqiang Wei, Junhui Hou, Liangliang Nan, **g Qin, Haoran Xie, Fu Lee Wang

    Abstract: Introducing BERT into cross-modal settings raises difficulties in its optimization for handling multiple modalities. Both the BERT architecture and training objective need to be adapted to incorporate and model information from different modalities. In this paper, we address these challenges by exploring the implicit semantic and geometric correlations between 2D and 3D data of the same objects/sc… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  34. arXiv:2312.00739  [pdf, other

    cs.CV

    Adversarial Score Distillation: When score distillation meets GAN

    Authors: Min Wei, **gkai Zhou, Junyao Sun, Xuesong Zhang

    Abstract: Existing score distillation methods are sensitive to classifier-free guidance (CFG) scale: manifested as over-smoothness or instability at small CFG scales, while over-saturation at large ones. To explain and analyze these issues, we revisit the derivation of Score Distillation Sampling (SDS) and decipher existing score distillation with the Wasserstein Generative Adversarial Network (WGAN) paradi… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  35. arXiv:2311.16540  [pdf, other

    cs.LG cs.DC cs.NI

    Communication Efficiency Optimization of Federated Learning for Computing and Network Convergence of 6G Networks

    Authors: Yizhuo Cai, Bo Lei, Qianying Zhao, **g Peng, Min Wei, Yushun Zhang, Xing Zhang

    Abstract: Federated learning effectively addresses issues such as data privacy by collaborating across participating devices to train global models. However, factors such as network topology and device computing power can affect its training or communication process in complex network environments. A new network architecture and paradigm with computing-measurable, perceptible, distributable, dispatchable, a… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 13 pages, 11 figures, accepted by Frontiers of Information Technology & Electronic Engineering

  36. arXiv:2311.11773  [pdf, other

    cs.CV

    Practical cross-sensor color constancy using a dual-map** strategy

    Authors: Shuwei Yue, Minchen Wei

    Abstract: Deep Neural Networks (DNNs) have been widely used for illumination estimation, which is time-consuming and requires sensor-specific data collection. Our proposed method uses a dual-map** strategy and only requires a simple white point from a test sensor under a D65 condition. This allows us to derive a map** matrix, enabling the reconstructions of image data and illuminants. In the second mapp… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  37. arXiv:2311.01659  [pdf, other

    cs.CV

    Efficient Cloud Pipelines for Neural Radiance Fields

    Authors: Derek Jacoby, Donglin Xu, Weder Ribas, Minyi Xu, Ting Liu, Vishwanath Jayaraman, Mengdi Wei, Emma De Blois, Yvonne Coady

    Abstract: Since their introduction in 2020, Neural Radiance Fields (NeRFs) have taken the computer vision community by storm. They provide a multi-view representation of a scene or object that is ideal for eXtended Reality (XR) applications and for creative endeavors such as virtual production, as well as change detection operations in geospatial analytics. The computational cost of these generative AI mode… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  38. arXiv:2310.18532  [pdf, other

    cs.SE

    SkipAnalyzer: A Tool for Static Code Analysis with Large Language Models

    Authors: Mohammad Mahdi Mohajer, Reem Aleithan, Nima Shiri Harzevili, Moshi Wei, Alvine Boaye Belle, Hung Viet Pham, Song Wang

    Abstract: We introduce SkipAnalyzer, a large language model (LLM)-powered tool for static code analysis. SkipAnalyzer has three components: 1) an LLM-based static bug detector that scans source code and reports specific types of bugs, 2) an LLM-based false-positive filter that can identify false-positive bugs in the results of static bug detectors (e.g., the result of step 1) to improve detection accuracy,… ▽ More

    Submitted 17 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

  39. arXiv:2310.07856  [pdf, ps, other

    cs.CL cs.SE

    Assessing Evaluation Metrics for Neural Test Oracle Generation

    Authors: Jiho Shin, Hadi Hemmati, Moshi Wei, Song Wang

    Abstract: In this work, we revisit existing oracle generation studies plus ChatGPT to empirically investigate the current standing of their performance in both NLG-based and test adequacy metrics. Specifically, we train and run four state-of-the-art test oracle generation models on five NLG-based and two test adequacy metrics for our analysis. We apply two different correlation analyses between these two di… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 10 pages + reference

  40. arXiv:2310.05107  [pdf, other

    cs.CV

    OV-PARTS: Towards Open-Vocabulary Part Segmentation

    Authors: Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang

    Abstract: Segmenting and recognizing diverse object parts is a crucial ability in applications spanning various computer vision and robotic tasks. While significant progress has been made in object-level Open-Vocabulary Semantic Segmentation (OVSS), i.e., segmenting objects with arbitrary text, the corresponding part-level research poses additional challenges. Firstly, part segmentation inherently involves… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS Dataset and Benchmark Track 2023

  41. arXiv:2310.01994  [pdf, other

    cs.CV

    Understanding Masked Autoencoders From a Local Contrastive Perspective

    Authors: Xiaoyu Yue, Lei Bai, Meng Wei, Jiangmiao Pang, Xihui Liu, Lu** Zhou, Wanli Ouyang

    Abstract: Masked AutoEncoder (MAE) has revolutionized the field of self-supervised learning with its simple yet effective masking and reconstruction strategies. However, despite achieving state-of-the-art performance across various downstream vision tasks, the underlying mechanisms that drive MAE's efficacy are less well-explored compared to the canonical contrastive learning paradigm. In this paper, we fir… ▽ More

    Submitted 8 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  42. arXiv:2309.07495  [pdf, other

    cs.CV cs.AI

    HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods

    Authors: Yongyuan Li, Xiuyuan Qin, Chao Liang, Mingqiang Wei

    Abstract: Talking Face Generation (TFG) aims to reconstruct facial movements to achieve high natural lip movements from audio and facial features that are under potential connections. Existing TFG methods have made significant advancements to produce natural and realistic images. However, most work rarely takes visual quality into consideration. It is challenging to ensure lip synchronization while avoiding… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 15pages, 6 figures, PRCV2023

  43. arXiv:2309.02787  [pdf, other

    cs.LG cs.NI

    Dynamic Encoding and Decoding of Information for Split Learning in Mobile-Edge Computing: Leveraging Information Bottleneck Theory

    Authors: Omar Alhussein, Moshi Wei, Arashmid Akhavain

    Abstract: Split learning is a privacy-preserving distributed learning paradigm in which an ML model (e.g., a neural network) is split into two parts (i.e., an encoder and a decoder). The encoder shares so-called latent representation, rather than raw data, for model training. In mobile-edge computing, network functions (such as traffic forecasting) can be trained via split learning where an encoder resides… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted to Proc. IEEE Globecom 2023

  44. arXiv:2309.02111  [pdf, other

    cs.AR cs.ET

    HW/SW Codesign for Robust and Efficient Binarized SNNs by Capacitor Minimization

    Authors: Mikail Yayla, Simon Thomann, Ming-Liang Wei, Chia-Lin Yang, Jian-Jia Chen, Hussam Amrouch

    Abstract: Using accelerators based on analog computing is an efficient way to process the immensely large workloads in Neural Networks (NNs). One example of an analog computing scheme for NNs is Integrate-and-Fire (IF) Spiking Neural Networks (SNNs). However, to achieve high inference accuracy in IF-SNNs, the analog hardware needs to represent current-based multiply-accumulate (MAC) levels as spike times, f… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 9 pages, 9 figures

  45. arXiv:2308.05232  [pdf, other

    cs.CV cs.LG

    SegMatch: A semi-supervised learning method for surgical instrument segmentation

    Authors: Meng Wei, Charlie Budd, Luis C. Garcia-Peraza-Herrera, Reuben Dorent, Miao**g Shi, Tom Vercauteren

    Abstract: Surgical instrument segmentation is recognised as a key enabler to provide advanced surgical assistance and improve computer assisted interventions. In this work, we propose SegMatch, a semi supervised learning method to reduce the need for expensive annotation for laparoscopic and robotic surgical images. SegMatch builds on FixMatch, a widespread semi supervised classification pipeline combining… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: preprint under review, 12 pages, 7 figures

  46. arXiv:2307.11958  [pdf, other

    cs.CV

    Pick the Best Pre-trained Model: Towards Transferability Estimation for Medical Image Segmentation

    Authors: Yuncheng Yang, Meng Wei, Junjun He, Jie Yang, ** Ye, Yun Gu

    Abstract: Transfer learning is a critical technique in training deep neural networks for the challenging medical image segmentation task that requires enormous resources. With the abundance of medical image data, many research institutions release models trained on various datasets that can form a huge pool of candidate source models to choose from. Hence, it's vital to estimate the source models' transfera… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: MICCAI2023(Early Accepted)

  47. arXiv:2307.08984  [pdf, other

    cs.CV

    In Defense of Clip-based Video Relation Detection

    Authors: Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Roger Zimmermann

    Abstract: Video Visual Relation Detection (VidVRD) aims to detect visual relationship triplets in videos using spatial bounding boxes and temporal boundaries. Existing VidVRD methods can be broadly categorized into bottom-up and top-down paradigms, depending on their approach to classifying relations. Bottom-up methods follow a clip-based approach where they classify relations of short clip tubelet pairs an… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  48. arXiv:2307.08492  [pdf, other

    cs.CV

    SVDFormer: Complementing Point Cloud via Self-view Augmentation and Self-structure Dual-generator

    Authors: Zhe Zhu, Honghua Chen, Xing He, Weiming Wang, **g Qin, Mingqiang Wei

    Abstract: In this paper, we propose a novel network, SVDFormer, to tackle two specific challenges in point cloud completion: understanding faithful global shapes from incomplete point clouds and generating high-accuracy local structures. Current methods either perceive shape patterns using only 3D coordinates or import extra images with well-calibrated intrinsic parameters to guide the geometry estimation o… ▽ More

    Submitted 12 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023

  49. arXiv:2307.06439  [pdf, other

    cs.CL cs.AI

    Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

    Authors: Yu Gu, Sheng Zhang, Naoto Usuyama, Yonas Woldesenbet, Cliff Wong, Praneeth Sanapathi, Mu Wei, Naveen Valluri, Erika Strandberg, Tristan Naumann, Hoifung Poon

    Abstract: Large language models (LLMs), such as GPT-4, have demonstrated remarkable capabilities across a wide range of tasks, including health applications. In this paper, we study how LLMs can be used to scale biomedical knowledge curation. We find that while LLMs already possess decent competency in structuring biomedical text, by distillation into a task-specific student model through self-supervised le… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  50. arXiv:2307.00404  [pdf, other

    cs.SE

    Automatic Unit Test Generation for Deep Learning Frameworks based on API Knowledge

    Authors: Arunkaleeshwaran Narayanan, Nima Shiri harzevili, Junjie Wang, Lin Shi, Moshi Wei, Song Wang

    Abstract: Many automatic unit test generation tools that can generate unit test cases with high coverage over a program have been proposed. However, most of these tools are ineffective on deep learning (DL) frameworks due to the fact that many of deep learning APIs expect inputs that follow specific API knowledge. To fill this gap, we propose MUTester to generate unit test cases for APIs of deep learning fr… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.