Showing 1–50 of 115 results for author: Yutian

Searching in archive cs.
  1. arXiv:2406.18547  [pdf]

    eess.IV cs.CV

    Enhancing Medical Imaging with GANs Synthesizing Realistic Images from Limited Data

    Authors: Yinqiu Feng, Bo Zhang, Lingxi Xiao, Yutian Yang, Tana Gegen, Zexi Chen

    Abstract: In this research, we introduce an innovative method for synthesizing medical images using generative adversarial networks (GANs). Our proposed GANs method demonstrates the capability to produce realistic synthetic images even when trained on a limited quantity of real medical image data, showcasing commendable generalization prowess. To achieve this, we devised a generator and discriminator networ…

    Submitted 22 May, 2024; originally announced June 2024.

  2. arXiv:2406.18546  [pdf]

    cs.CV cs.AI

    Application of Multimodal Fusion Deep Learning Model in Disease Recognition

    Authors: Xiaoyi Liu, Hongjie Qiu, Muqing Li, Zhou Yu, Yutian Yang, Yafeng Yan

    Abstract: This paper introduces an innovative multi-modal fusion deep learning approach to overcome the drawbacks of traditional single-modal recognition techniques. These drawbacks include incomplete information and limited diagnostic accuracy. During the feature extraction stage, cutting-edge deep learning models including convolutional neural networks (CNN), recurrent neural networks (RNN), and transform…

    Submitted 22 May, 2024; originally announced June 2024.

  3. arXiv:2406.16981  [pdf]

    eess.IV cs.AI cs.LG eess.SP

    Research on Feature Extraction Data Processing System For MRI of Brain Diseases Based on Computer Deep Learning

    Authors: Lingxi Xiao, Jinxin Hu, Yutian Yang, Yinqiu Feng, Zichao Li, Zexi Chen

    Abstract: Most of the existing wavelet image processing techniques are carried out in the form of single-scale reconstruction and multiple iterations. However, processing high-quality fMRI data presents problems such as mixed noise and excessive computation time. This project proposes the use of matrix operations by combining mixed noise elimination methods with wavelet analysis to replace traditional itera…

    Submitted 23 June, 2024; originally announced June 2024.

  4. arXiv:2406.13205  [pdf]

    eess.IV cs.CV

    Application of Computer Deep Learning Model in Diagnosis of Pulmonary Nodules

    Authors: Yutian Yang, Hongjie Qiu, Yulu Gong, Xiaoyi Liu, Yang Lin, Muqing Li

    Abstract: The 3D simulation model of the lung was established by using the reconstruction method. A computer aided pulmonary nodule detection model was constructed. The process iterates over the images to refine the lung nodule recognition model based on neural networks. It is integrated with 3D virtual modeling technology to improve the interactivity of the system, so as to achieve intelligent recognition…

    Submitted 19 June, 2024; originally announced June 2024.

    MSC Class: 68T10; 92C50

  5. arXiv:2406.12757  [pdf, other]

    cs.CV

    MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning

    Authors: Shuo Xu, Sai Wang, Xinyue Hu, Yutian Lin, Bo Du, Yu Wu

    Abstract: Compositional Zero-Shot Learning (CZSL) aims to learn semantic primitives (attributes and objects) from seen compositions and recognize unseen attribute-object compositions. Existing CZSL datasets focus on single attributes, neglecting the fact that objects naturally exhibit multiple interrelated attributes. Real-world objects often possess multiple interrelated attributes, and current datasets' n…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 13 pages, 5 figures

  6. arXiv:2406.10054  [pdf, other]

    cs.SE cs.CR

    SmartOracle: Generating Smart Contract Oracle via Fine-Grained Invariant Detection

    Authors: Jianzhong Su, Jiachi Chen, Zhiyuan Fang, Xingwei Lin, Yutian Tang, Zibin Zheng

    Abstract: As decentralized applications (DApps) proliferate, the increased complexity and usage of smart contracts have heightened their susceptibility to security incidents and financial losses. Although various vulnerability detection tools have been developed to mitigate these issues, they often suffer poor performance in detecting vulnerabilities, as they either rely on simplistic and general-purpose or…

    Submitted 14 June, 2024; originally announced June 2024.

  7. arXiv:2406.05962  [pdf, other]

    cs.DC cs.DB

    Data Caching for Enterprise-Grade Petabyte-Scale OLAP

    Authors: Chunxu Tang, Bin Fan, Jing Zhao, Chen Liang, Yi Wang, Beinan Wang, Ziyue Qiu, Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, Jianjian Xie, Yutian, Sun, Yao Li, Yangjun Zhang, Ke Wang, Mingmin Chen

    Abstract: With the exponential growth of data and evolving use cases, petabyte-scale OLAP data platforms are increasingly adopting a model that decouples compute from storage. This shift, evident in organizations like Uber and Meta, introduces operational challenges including massive, read-heavy I/O traffic with potential throttling, as well as skewed and fragmented data access patterns. Addressing these ch…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted to the USENIX Annual Technical Conference (USENIX ATC) 2024

  8. arXiv:2405.16121  [pdf]

    cs.HC

    Design and Implementation of an Emotion Analysis System Based on EEG Signals

    Authors: Zhang Yutian, Huang Shan, Zhang Jianing, Fan Ci'en

    Abstract: Traditional brain-computer systems are complex and expensive, and emotion classification algorithms lack representations of the intrinsic relationships between different channels of electroencephalogram (EEG) signals. There is still room for improvement in accuracy. To lower the research barrier for EEG and harness the rich information embedded in multi-channel EEG, we propose and implement a sim…

    Submitted 25 May, 2024; originally announced May 2024.

  9. arXiv:2405.03547  [pdf, other]

    cs.LG cs.AI cs.NE

    Position: Leverage Foundational Models for Black-Box Optimization

    Authors: Xingyou Song, Yingtao Tian, Robert Tjarko Lange, Chansoo Lee, Yujin Tang, Yutian Chen

    Abstract: Undeniably, Large Language Models (LLMs) have stirred an extraordinary wave of innovation in the machine learning research domain, resulting in substantial impact across diverse fields such as reinforcement learning, robotics, and computer vision. Their incorporation has been rapid and transformative, marking a significant paradigm shift in the field of machine learning research. However, the fiel…

    Submitted 9 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: International Conference on Machine Learning (ICML) 2024

  10. arXiv:2404.07839  [pdf, other]

    cs.LG cs.AI cs.CL

    RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

    Authors: Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, et al. (37 additional authors not shown)

    Abstract: We introduce RecurrentGemma, an open language model which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state, which reduces memory use and enables efficient inference on long sequences. We provide a pre-trained model with 2B non-embedding parameters, and an instruction tuned var…

    Submitted 11 April, 2024; originally announced April 2024.

  11. arXiv:2404.05976  [pdf, other]

    cs.LG eess.SY stat.ME

    A Cyber Manufacturing IoT System for Adaptive Machine Learning Model Deployment by Interactive Causality Enabled Self-Labeling

    Authors: Yutian Ren, Yuqi He, Xuyin Zhang, Aaron Yen, G. P. Li

    Abstract: Machine Learning (ML) has been demonstrated to improve productivity in many manufacturing applications. To host these ML applications, several software and Industrial Internet of Things (IIoT) systems have been proposed for manufacturing applications to deploy ML applications and provide real-time intelligence. Recently, an interactive causality enabled self-labeling method has been proposed to ad…

    Submitted 8 April, 2024; originally announced April 2024.

  12. arXiv:2404.05809  [pdf, other]

    cs.LG cs.AI stat.ME

    Self-Labeling in Multivariate Causality and Quantification for Adaptive Machine Learning

    Authors: Yutian Ren, Aaron Haohua Yen, G. P. Li

    Abstract: Adaptive machine learning (ML) aims to allow ML models to adapt to ever-changing environments with potential concept drift after model deployment. Traditionally, adaptive ML requires a new dataset to be manually labeled to tailor deployed models to altered data distributions. Recently, an interactive causality based self-labeling method was proposed to autonomously associate causally related data…

    Submitted 8 April, 2024; originally announced April 2024.

  13. arXiv:2404.04997  [pdf, other]

    cs.LG cs.AI cs.CL

    Adapting LLMs for Efficient Context Processing through Soft Prompt Compression

    Authors: Cangqing Wang, Yutian Yang, Ruisi Li, Dan Sun, Ruicong Cai, Yuzhu Zhang, Chengqian Fu, Lillian Floyd

    Abstract: The rapid advancement of Large Language Models (LLMs) has inaugurated a transformative epoch in natural language processing, fostering unprecedented proficiency in text generation, comprehension, and contextual scrutiny. Nevertheless, effectively handling extensive contexts, crucial for myriad applications, poses a formidable obstacle owing to the intrinsic constraints of the models' context windo…

    Submitted 18 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by the 2024 International Conference on Image Processing and Computer Applications (IPCA 2024)

  14. arXiv:2404.01925  [pdf, other]

    cs.CV cs.AI

    Improving Bird's Eye View Semantic Segmentation by Task Decomposition

    Authors: Tianhao Zhao, Yongcan Chen, Yu Wu, Tianyang Liu, Bo Du, Peilun Xiao, Shi Qiu, Hongda Yang, Guozhen Li, Yi Yang, Yutian Lin

    Abstract: Semantic segmentation in bird's eye view (BEV) plays a crucial role in autonomous driving. Previous methods usually follow an end-to-end pipeline, directly predicting the BEV segmentation map from monocular RGB inputs. However, the challenge arises when the RGB inputs and BEV targets from distinct perspectives, making the direct point-to-point predicting hard to optimize. In this paper, we decompo…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  15. arXiv:2403.11461  [pdf, other]

    cs.RO

    VIHE: Virtual In-Hand Eye Transformer for 3D Robotic Manipulation

    Authors: Weiyao Wang, Yutian Lei, Shiyu Jin, Gregory D. Hager, Liangjun Zhang

    Abstract: In this work, we introduce the Virtual In-Hand Eye Transformer (VIHE), a novel method designed to enhance 3D manipulation capabilities through action-aware view rendering. VIHE autoregressively refines actions in multiple stages by conditioning on rendered views posed from action predictions in the earlier stages. These virtual in-hand views provide a strong inductive bias for effectively recogniz…

    Submitted 18 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  16. arXiv:2403.06420  [pdf, other]

    cs.RO cs.AI cs.HC cs.LG

    RLingua: Improving Reinforcement Learning Sample Efficiency in Robotic Manipulations With Large Language Models

    Authors: Liangliang Chen, Yutian Lei, Shiyu Jin, Ying Zhang, Liangjun Zhang

    Abstract: Reinforcement learning (RL) has demonstrated its capability in solving various tasks but is notorious for its low sample efficiency. In this paper, we propose RLingua, a framework that can leverage the internal knowledge of large language models (LLMs) to reduce the sample complexity of RL in robotic manipulations. To this end, we first present a method for extracting the prior knowledge of LLMs b…

    Submitted 19 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  17. arXiv:2403.01756  [pdf, other]

    cs.CV

    Attention Guidance Mechanism for Handwritten Mathematical Expression Recognition

    Authors: Yutian Liu, Wenjun Ke, Jianguo Wei

    Abstract: Handwritten mathematical expression recognition (HMER) is challenging in image-to-text tasks due to the complex layouts of mathematical expressions and suffers from problems including over-parsing and under-parsing. To solve these, previous HMER methods improve the attention mechanism by utilizing historical alignment information. However, this approach has limitations in addressing under-parsing…

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  18. arXiv:2402.19427  [pdf, other]

    cs.LG cs.CL

    Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

    Authors: Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

    Abstract: Recurrent neural networks (RNNs) have fast inference and scale efficiently on long sequences, but they are difficult to train and hard to scale. We propose Hawk, an RNN with gated linear recurrences, and Griffin, a hybrid model that mixes gated linear recurrences with local attention. Hawk exceeds the reported performance of Mamba on downstream tasks, while Griffin matches the performance of Llama…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 25 pages, 11 figures

  19. arXiv:2402.15078  [pdf, other]

    cs.SE

    LLM-CompDroid: Repairing Configuration Compatibility Bugs in Android Apps with Pre-trained Large Language Models

    Authors: Zhijie Liu, Yutian Tang, Meiyun Li, Xin Jin, Yunfei Long, Liang Feng Zhang, Xiapu Luo

    Abstract: XML configurations are integral to the Android development framework, particularly in the realm of UI display. However, these configurations can introduce compatibility issues (bugs), resulting in divergent visual outcomes and system crashes across various Android API versions (levels). In this study, we systematically investigate LLM-based approaches for detecting and repairing configuration comp…

    Submitted 22 February, 2024; originally announced February 2024.

  20. arXiv:2402.14547  [pdf, other]

    cs.LG cs.AI cs.CL cs.DB

    OmniPred: Language Models as Universal Regressors

    Authors: Xingyou Song, Oscar Li, Chansoo Lee, Bangding Yang, Daiyi Peng, Sagi Perel, Yutian Chen

    Abstract: Over the broad landscape of experimental design, regression has been a powerful tool to accurately predict the outcome metrics of a system or model given a set of parameters, but has been traditionally restricted to methods which are only applicable to a specific task. In this paper, we propose OmniPred, a framework for training language models as universal end-to-end regressors over $(x,y)$ evalu…

    Submitted 4 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 24 pages, 10 figures. Code can be found in https://github.com/google-research/optformer/tree/main/optformer/omnipred

  21. arXiv:2402.11957  [pdf, other]

    cs.CV

    Event-Based Motion Magnification

    Authors: Yutian Chen, Shi Guo, Fangzheng Yu, Feng Zhang, Jinwei Gu, Tianfan Xue

    Abstract: Detecting and magnifying imperceptible high-frequency motions in real-world scenarios has substantial implications for industrial and medical applications. These motions are characterized by small amplitudes and high frequencies. Traditional motion magnification methods rely on costly high-speed cameras or active light sources, which limit the scope of their applications. In this work, we propose…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Project Page: https://openimaginglab.github.io/emm/

  22. arXiv:2402.06798  [pdf, other]

    cs.RO

    Reasoning Grasping via Multimodal Large Language Model

    Authors: Shiyu Jin, Jinxuan Xu, Yutian Lei, Liangjun Zhang

    Abstract: Despite significant progress in robotic systems for operation within human-centric environments, existing models still heavily rely on explicit human commands to identify and manipulate specific objects. This limits their effectiveness in environments where understanding and acting on implicit human intentions are crucial. In this study, we introduce a novel task: reasoning grasping, where robots…

    Submitted 25 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  23. arXiv:2402.01156  [pdf, other]

    cs.SE

    An Empirical Study on Low Code Programming using Traditional vs Large Language Model Support

    Authors: Yongkun Liu, Jiachi Chen, Tingting Bi, John Grundy, Yanlin Wang, Jianxing Yu, Ting Chen, Yutian Tang, Zibin Zheng

    Abstract: Low-code programming (LCP) refers to programming using models at higher levels of abstraction, resulting in less manual and more efficient programming, and reduced learning effort for amateur developers. Many LCP tools have rapidly evolved and have benefited from the concepts of visual programming languages (VPLs) and programming by demonstration (PBD). With huge increase in interest in using larg…

    Submitted 6 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  24. Ambush from All Sides: Understanding Security Threats in Open-Source Software CI/CD Pipelines

    Authors: Ziyue Pan, Wenbo Shen, Xingkai Wang, Yutian Yang, Rui Chang, Yao Liu, Chengwei Liu, Yang Liu, Kui Ren

    Abstract: The continuous integration and continuous deployment (CI/CD) pipelines are widely adopted on Internet hosting platforms, such as GitHub. With the popularity, the CI/CD pipeline faces various security threats. However, current CI/CD pipelines suffer from malicious code and severe vulnerabilities. Even worse, people have not been fully aware of its attack surfaces and the corresponding impacts. Th…

    Submitted 31 January, 2024; originally announced January 2024.

    Journal ref: IEEE Transactions on Dependable and Secure Computing (Volume: 21, Issue: 1, Jan.-Feb. 2024)

  25. arXiv:2401.11396  [pdf, other]

    cs.LG cs.CV

    Visual Imitation Learning with Calibrated Contrastive Representation

    Authors: Yunke Wang, Linwei Tao, Bo Du, Yutian Lin, Chang Xu

    Abstract: Adversarial Imitation Learning (AIL) allows the agent to reproduce expert behavior with low-dimensional states and actions. However, challenges arise in handling visual states due to their less distinguishable representation compared to low-dimensional proprioceptive features. While existing methods resort to adopt complex network architectures or separate the process of learning representation an…

    Submitted 20 January, 2024; originally announced January 2024.

  26. arXiv:2401.08525  [pdf, other]

    cs.AI cs.CV cs.LG cs.RO

    GATS: Gather-Attend-Scatter

    Authors: Konrad Zolna, Serkan Cabi, Yutian Chen, Eric Lau, Claudio Fantacci, Jurgis Pasukonis, Jost Tobias Springenberg, Sergio Gomez Colmenarejo

    Abstract: As the AI community increasingly adopts large-scale models, it is crucial to develop general and flexible tools to integrate them. We introduce Gather-Attend-Scatter (GATS), a novel module that enables seamless combination of pretrained foundation models, both trainable and frozen, into larger multimodal networks. GATS empowers AI systems to process and generate information across multiple modalit…

    Submitted 16 January, 2024; originally announced January 2024.

  27. arXiv:2312.09439  [pdf, other]

    eess.SP cs.RO eess.SY

    Smart Roads: Roadside Perception, Vehicle-Road Cooperation and Business Model

    Authors: Rui Chen, Lu Gao, Yutian Liu, Yong Liang Guan, Yan Zhang

    Abstract: Smart roads have become an essential component of intelligent transportation systems (ITS). The roadside perception technology, a critical aspect of smart roads, utilizes various sensors, roadside units (RSUs), and edge computing devices to gather real-time traffic data for vehicle-road cooperation. However, the full potential of smart roads in improving the safety and efficiency of autonomous veh…

    Submitted 19 October, 2023; originally announced December 2023.

  28. arXiv:2311.16030  [pdf, other]

    cs.AI cs.LG math.OC

    Machine Learning-Enhanced Aircraft Landing Scheduling under Uncertainties

    Authors: Yutian Pang, Peng Zhao, Jueming Hu, Yongming Liu

    Abstract: This paper addresses aircraft delays, emphasizing their impact on safety and financial losses. To mitigate these issues, an innovative machine learning (ML)-enhanced landing scheduling methodology is proposed, aiming to improve automation and safety. Analyzing flight arrival delay scenarios reveals strong multimodal distributions and clusters in arrival flight time durations. A multi-stage conditi…

    Submitted 27 November, 2023; originally announced November 2023.

  29. Token Prediction as Implicit Classification to Identify LLM-Generated Text

    Authors: Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj

    Abstract: This paper introduces a novel approach for identifying the possible large language models (LLMs) involved in text generation. Instead of adding an additional classification layer to a base LM, we reframe the classification task as a next-token prediction task and directly fine-tune the base LM to perform it. We utilize the Text-to-Text Transfer Transformer (T5) model as the backbone for our experi…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023, Main Conference

  30. arXiv:2310.12168  [pdf, other]

    cs.LG cs.AI cs.CV

    RK-core: An Established Methodology for Exploring the Hierarchical Structure within Datasets

    Authors: Yao Lu, Yutian Huang, Jiaqi Nie, Zuohui Chen, Qi Xuan

    Abstract: Recently, the field of machine learning has undergone a transition from model-centric to data-centric. The advancements in diverse learning tasks have been propelled by the accumulation of more extensive datasets, subsequently facilitating the training of larger models on these datasets. However, these datasets remain relatively under-explored. To this end, we introduce a pioneering approach known…

    Submitted 10 October, 2023; originally announced October 2023.

  31. arXiv:2310.09461  [pdf, other]

    cs.CV cs.MM cs.RO

    MAC: ModAlity Calibration for Object Detection

    Authors: Yutian Lei, Jun Liu, Dong Huang

    Abstract: The flourishing success of Deep Neural Networks (DNNs) on RGB-input perception tasks has opened unbounded possibilities for non-RGB-input perception tasks, such as object detection from wireless signals, lidar scans, and infrared images. Compared to the matured development pipeline of RGB-input (source modality) models, developing non-RGB-input (target-modality) models from scratch poses excessive…

    Submitted 13 October, 2023; originally announced October 2023.

  32. arXiv:2310.04874  [pdf, other]

    cs.RO cs.AI

    AirIMU: Learning Uncertainty Propagation for Inertial Odometry

    Authors: Yuheng Qiu, Chen Wang, Can Xu, Yutian Chen, Xunfei Zhou, Youjie Xia, Sebastian Scherer

    Abstract: Inertial odometry (IO) using strap-down inertial measurement units (IMUs) is critical in many robotic applications where precise orientation and position tracking are essential. Prior kinematic motion model-based IO methods often use a simplified linearized IMU noise model and thus usually encounter difficulties in modeling non-deterministic errors arising from environmental disturbances and mecha…

    Submitted 15 May, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

  33. arXiv:2309.13035  [pdf, other]

    cs.RO

    PyPose v0.6: The Imperative Programming Interface for Robotics

    Authors: Zitong Zhan, Xiangfu Li, Qihang Li, Haonan He, Abhinav Pandey, Haitao Xiao, Yangmengfei Xu, Xiangyu Chen, Kuan Xu, Kun Cao, Zhipeng Zhao, Zihan Wang, Huan Xu, Zihang Fang, Yutian Chen, Wentao Wang, Xu Fang, Yi Du, Tianhao Wu, Xiao Lin, Yuheng Qiu, Fan Yang, Jingnan Shi, Shaoshu Su, Yiren Lu, et al. (11 additional authors not shown)

    Abstract: PyPose is an open-source library for robot learning. It combines a learning-based approach with physics-based optimization, which enables seamless end-to-end robot learning. It has been used in many tasks due to its meticulously designed application programming interface (API) and efficient implementation. From its initial launch in early 2022, PyPose has experienced significant enhancements, inco…

    Submitted 22 September, 2023; originally announced September 2023.

  34. arXiv:2309.07518  [pdf, other]

    cs.SE

    Coverage Goal Selector for Combining Multiple Criteria in Search-Based Unit Test Generation

    Authors: Zhichao Zhou, Yuming Zhou, Chunrong Fang, Zhenyu Chen, Xiapu Luo, Jingzhu He, Yutian Tang

    Abstract: Unit testing is critical to the software development process, ensuring the correctness of basic programming units in a program (e.g., a method). Search-based software testing (SBST) is an automated approach to generating test cases. SBST generates test cases with genetic algorithms by specifying the coverage criterion (e.g., branch coverage). However, a good test suite must have different properti…

    Submitted 4 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.04096

  35. arXiv:2309.03036  [pdf, other]

    cs.SD cs.AI eess.AS

    An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially Spoofed Audio Detection

    Authors: Yuankun Xie, Haonan Cheng, Yutian Wang, Long Ye

    Abstract: Partially spoofed audio detection is a challenging task, lying in the need to accurately locate the authenticity of audio at the frame level. To address this issue, we propose a fine-grained partially spoofed audio detection method, namely Temporal Deepfake Location (TDL), which can effectively capture information of both features and locations. Specifically, our approach involves two novel parts:…

    Submitted 21 November, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  36. arXiv:2308.13707  [pdf]

    cs.SE

    Human-in-the-loop online just-in-time software defect prediction

    Authors: Xutong Liu, Yufei Zhou, Yutian Tang, Junyan Qian, Yuming Zhou

    Abstract: Online Just-In-Time Software Defect Prediction (O-JIT-SDP) uses an online model to predict whether a new software change will introduce a bug or not. However, existing studies neglect the interaction of Software Quality Assurance (SQA) staff with the model, which may miss the opportunity to improve the prediction accuracy through the feedback from SQA staff. To tackle this problem, we propose Huma…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 16 pages, 10 figures

  37. arXiv:2308.05137  [pdf, other]

    cs.CV

    Discrepancy-based Active Learning for Weakly Supervised Bleeding Segmentation in Wireless Capsule Endoscopy Images

    Authors: Fan Bai, Xiaohan Xing, Yutian Shen, Han Ma, Max Q. -H. Meng

    Abstract: Weakly supervised methods, such as class activation maps (CAM) based, have been applied to achieve bleeding segmentation with low annotation efforts in Wireless Capsule Endoscopy (WCE) images. However, the CAM labels tend to be extremely noisy, and there is an irreparable gap between CAM labels and ground truths for medical images. This paper proposes a new Discrepancy-basEd Active Learning (DEAL)…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: accepted by MICCAI 2022

  38. arXiv:2308.04838  [pdf, other]

    cs.SE

    No Need to Lift a Finger Anymore? Assessing the Quality of Code Generation by ChatGPT

    Authors: Zhijie Liu, Yutian Tang, Xiapu Luo, Yuming Zhou, Liang Feng Zhang

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities across various NLP tasks. Additionally, LLMs are also highly valuable in supporting software engineering tasks, particularly in the field of code generation. Automatic code generation is a process of automatically generating source code or executable code based on given specifications or requirements, improving developer produc…

    Submitted 13 April, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  39. arXiv:2307.10559  [pdf, other]

    cs.LG cs.AI cs.HC

    Air Traffic Controller Workload Level Prediction using Conformalized Dynamical Graph Learning

    Authors: Yutian Pang, Jueming Hu, Christopher S. Lieber, Nancy J. Cooke, Yongming Liu

    Abstract: Air traffic control (ATC) is a safety-critical service system that demands constant attention from ground air traffic controllers (ATCos) to maintain daily aviation operations. The workload of the ATCos can have negative effects on operational safety and airspace usage. To avoid overloading and ensure an acceptable workload level for the ATCos, it is important to predict the ATCos' workload accura…

    Submitted 22 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  40. arXiv:2307.01216  [pdf, other]

    cs.CL

    Discovering Patterns of Definitions and Methods from Scientific Documents

    Authors: Yutian Sun, Hai Zhuge

    Abstract: The difficulties of automatic extraction of definitions and methods from scientific documents lie in two aspects: (1) the complexity and diversity of natural language texts, which requests an analysis method to support the discovery of pattern; and, (2) a complete definition or method represented by a scientific paper is usually distributed within text, therefore an effective approach should not o…

    Submitted 1 July, 2023; originally announced July 2023.

  41. arXiv:2307.00588  [pdf, other]

    cs.SE

    ChatGPT vs SBST: A Comparative Assessment of Unit Test Suite Generation

    Authors: Yutian Tang, Zhijie Liu, Zhichao Zhou, Xiapu Luo

    Abstract: Recent advancements in large language models (LLMs) have demonstrated exceptional success in a wide range of general domain tasks, such as question answering and following instructions. Moreover, LLMs have shown potential in various software engineering applications. In this study, we present a systematic comparison of test suites generated by the ChatGPT LLM and the state-of-the-art SBST tool Evo…

    Submitted 2 July, 2023; originally announced July 2023.

  42. arXiv:2306.09800  [pdf, other]

    cs.LG cs.RO

    $\pi2\text{vec}$: Policy Representations with Successor Features

    Authors: Gianluca Scarpellini, Ksenia Konyushkova, Claudio Fantacci, Tom Le Paine, Yutian Chen, Misha Denil

    Abstract: This paper describes $\pi2\text{vec}$, a method for representing behaviors of black box policies as feature vectors. The policy representations capture how the statistics of foundation model features change in response to the policy behavior in a task agnostic way, and can be trained from offline data, allowing them to be used in offline policy selection. This work provides a key piece of a recipe…

    Submitted 24 January, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted paper at ICLR2024

  43. arXiv:2306.08368  [pdf, other]

    cs.CL cs.AI

    T5-SR: A Unified Seq-to-Seq Decoding Strategy for Semantic Parsing

    Authors: Yuntao Li, Zhenpeng Su, Yutian Li, Hanchu Zhang, Sirui Wang, Wei Wu, Yan Zhang

    Abstract: Translating natural language queries into SQLs in a seq2seq manner has attracted much attention recently. However, compared with abstract-syntactic-tree-based SQL generation, seq2seq semantic parsers face much more challenges, including poor quality on schematical information prediction and poor semantic coherence between natural language queries and SQLs. This paper analyses the above difficultie…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted by ICASSP2023

  44. arXiv:2306.00595  [pdf, other]

    cs.CV

    Revisit Weakly-Supervised Audio-Visual Video Parsing from the Language Perspective

    Authors: Yingying Fan, Yu Wu, Bo Du, Yutian Lin

    Abstract: We focus on the weakly-supervised audio-visual video parsing task (AVVP), which aims to identify and locate all the events in audio/visual modalities. Previous works only concentrate on video-level overall label denoising across modalities, but overlook the segment-level label noise, where adjacent video segments (i.e., 1-second video clips) may contain different events. However, recognizing event…

    Submitted 27 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023

  45. arXiv:2305.07969  [pdf, other]

    cs.CL

    GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content

    Authors: Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj

    Abstract: This paper presents a novel approach for detecting ChatGPT-generated vs. human-written text using language models. To this end, we first collected and released a pre-processed dataset named OpenGPTText, which consists of rephrased content generated using ChatGPT. We then designed, implemented, and trained two different models for text classification, using Robustly Optimized BERT Pretraining Appro…

    Submitted 17 May, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

  46. arXiv:2304.10244  [pdf, other]

    cs.CV

    Omni Aggregation Networks for Lightweight Image Super-Resolution

    Authors: Hang Wang, Xuanhong Chen, Bingbing Ni, Yutian Liu, **fan Liu

    Abstract: While lightweight ViT framework has made tremendous progress in image super-resolution, its uni-dimensional self-attention modeling, as well as homogeneous aggregation scheme, limit its effective receptive field (ERF) to include more comprehensive interactions from both spatial and channel dimensions. To tackle these drawbacks, this work proposes two enhanced components under a new Omni-SR archite…

    Submitted 24 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR2023. Code is available at https://github.com/Francis0625/Omni-SR

  47. arXiv:2304.03995  [pdf, other]

    cs.NE cs.LG

    Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization

    Authors: Robert Tjarko Lange, Tom Schaul, Yutian Chen, Chris Lu, Tom Zahavy, Valentin Dalibard, Sebastian Flennerhag

    Abstract: Genetic algorithms constitute a family of black-box optimization algorithms, which take inspiration from the principles of biological evolution. While they provide a general-purpose tool for optimization, their particular instantiations can be heuristic and motivated by loose biological intuition. In this work we explore a fundamentally different approach: Given a sufficiently flexible parametriza…

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: 14 pages, 31 figures

  48. arXiv:2303.10406  [pdf, other]

    cs.CV cs.AI cs.LG

    3DQD: Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process

    Authors: Yuhan Li, Yishun Dou, Xuanhong Chen, Bingbing Ni, Yilin Sun, Yutian Liu, Fuzhen Wang

    Abstract: We develop a generalized 3D shape generation prior model, tailored for multiple 3D tasks including unconditional shape generation, point cloud completion, and cross-modality shape generation, etc. On one hand, to precisely capture local fine detailed shape information, a vector quantized variational autoencoder (VQ-VAE) is utilized to index local geometry from a compactly learned codebook based on…

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023

  49. arXiv:2302.11732  [pdf, other]

    cs.SE

    On Code Reuse from StackOverflow: An Exploratory Study on Jupyter Notebook

    Authors: Mingke Yang, Yuming Zhou, Bixin Li, Yutian Tang

    Abstract: Jupyter Notebook is a popular tool among data analysts and scientists for working with data. It provides a way to combine code, documentation, and visualizations in a single, interactive environment, facilitating code reuse. While code reuse can improve programming efficiency, it can also decrease readability, security, and overall performance. We conduct a large-scale exploratory study of code re…

    Submitted 22 February, 2023; originally announced February 2023.

  50. arXiv:2302.08212  [pdf, other]

    cs.CV

    Visible-Infrared Person Re-Identification via Patch-Mixed Cross-Modality Learning

    Authors: Zhihao Qian, Yutian Lin, Bo Du

    Abstract: Visible-infrared person re-identification (VI-ReID) aims to retrieve images of the same pedestrian from different modalities, where the challenges lie in the significant modality discrepancy. To alleviate the modality gap, recent methods generate intermediate images by GANs, grayscaling, or mixup strategies. However, these methods could introduce extra data distribution, and the semantic correspon…

    Submitted 30 April, 2024; v1 submitted 16 February, 2023; originally announced February 2023.