Skip to main content

Showing 1–50 of 189 results for author: Yang, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01595  [pdf, other

    cs.LG cs.CY cs.SE

    Fairpriori: Improving Biased Subgroup Discovery for Deep Neural Network Fairness

    Authors: Kacy Zhou, Jiawen Wen, Nan Yang, Dong Yuan, Qinghua Lu, Huaming Chen

    Abstract: While deep learning has become a core functional module of most software systems, concerns regarding the fairness of ML predictions have emerged as a significant issue that affects prediction results due to discrimination. Intersectional bias, which disproportionately affects members of subgroups, is a prime example of this. For instance, a machine learning model might exhibit bias against darker-… ▽ More

    Submitted 24 June, 2024; originally announced July 2024.

    Comments: 11 pages

  2. arXiv:2406.18573  [pdf

    cs.CV cs.CY cs.GR

    Generating grid maps via the snake model

    Authors: Zhiwei Wei, Nai Yang, Wenjia Xu, Su Ding

    Abstract: The grid map, often referred to as the tile map, stands as a vital tool in geospatial visualization, possessing unique attributes that differentiate it from more commonly known techniques such as choropleths and cartograms. It transforms geographic regions into grids, which requires the displacement of both region centroids and boundary nodes to establish a coherent grid arrangement. However, exis… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 Pages, 8 Figures

    Journal ref: Transactions in GIS, 2024, 1-19

  3. arXiv:2406.18102  [pdf

    eess.IV cs.CV

    A Lung Nodule Dataset with Histopathology-based Cancer Type Annotation

    Authors: Muwei Jian, Hongyu Chen, Zaiyong Zhang, Nan Yang, Haorang Zhang, Lifu Ma, Wen**g Xu, Huixiang Zhi

    Abstract: Recently, Computer-Aided Diagnosis (CAD) systems have emerged as indispensable tools in clinical diagnostic workflows, significantly alleviating the burden on radiologists. Nevertheless, despite their integration into clinical settings, CAD systems encounter limitations. Specifically, while CAD systems can achieve high performance in the detection of lung nodules, they face challenges in accuratel… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  4. arXiv:2406.10224  [pdf, other

    cs.CV

    EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models

    Authors: Julian Straub, Daniel DeTone, Tianwei Shen, Nan Yang, Chris Sweeney, Richard Newcombe

    Abstract: The advent of wearable computers enables a new source of context for AI that is embedded in egocentric sensor data. This new egocentric data comes equipped with fine-grained 3D location information and thus presents the opportunity for a novel class of spatial foundation models that are rooted in 3D space. To measure progress on what we term Egocentric Foundation Models (EFMs) we establish EFM3D,… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  5. arXiv:2406.00707  [pdf, other

    cs.RO

    QUADFormer: Learning-based Detection of Cyber Attacks in Quadrotor UAVs

    Authors: Pengyu Wang, Zhaohua Yang, Nachuan Yang, Zikai Wang, Jialu Li, Fan Zhang, Chaoqun Wang, Jiankun Wang, Max Q. -H. Meng, Ling Shi

    Abstract: Safety-critical intelligent cyber-physical systems, such as quadrotor unmanned aerial vehicles (UAVs), are vulnerable to different types of cyber attacks, and the absence of timely and accurate attack detection can lead to severe consequences. When UAVs are engaged in large outdoor maneuvering flights, their system constitutes highly nonlinear dynamics that include non-Gaussian noises. Therefore,… ▽ More

    Submitted 14 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  6. arXiv:2404.18084  [pdf, other

    cs.NI

    Age-minimal Multicast by Graph Attention Reinforcement Learning

    Authors: Yanning Zhang, Guocheng Liao, Shengbin Cao, Ning Yang, Meng Zhang

    Abstract: Age of Information (AoI) is an emerging metric used to assess the timeliness of information, gaining research interest in real-time multicast applications such as video streaming and metaverse platforms. In this paper, we consider a dynamic multicast network with energy constraints, where our objective is to minimize the expected time-average AoI through energy-constrained multicast routing and sc… ▽ More

    Submitted 31 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  7. arXiv:2404.14856  [pdf, other

    cs.IR

    Cross-Domain Causal Preference Learning for Out-of-Distribution Recommendation

    Authors: Zhuhang Li, Ning Yang

    Abstract: Recommender systems use users' historical interactions to learn their preferences and deliver personalized recommendations from a vast array of candidate items. Current recommender systems primarily rely on the assumption that the training and testing datasets have identical distributions, which may not hold true in reality. In fact, the distribution shift between training and testing datasets oft… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 16 pages, 5 figures, accepted by DASFAA2024

  8. arXiv:2404.14238  [pdf, other

    cs.NI cs.AI

    Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories

    Authors: Ning Yang, Shuo Chen, Haijun Zhang, Randall Berry

    Abstract: Mobile Edge Computing (MEC) broadens the scope of computation and storage beyond the central network, incorporating edge nodes close to end devices. This expansion facilitates the implementation of large-scale "connected things" within edge networks. The advent of applications necessitating real-time, high-quality service presents several challenges, such as low latency, high data rate, reliabilit… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The paper is accepted by IEEE Communications Surveys and Tutorials (COMST)

  9. arXiv:2404.13600  [pdf, other

    cs.RO

    Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments

    Authors: Zirui Wang, Chen Yao, Yangtao Ge, Guowei Shi, Ningbo Yang, Zheng Zhu, Kewei Dong, Hexiang Wei, Zhenzhong Jia, **g Wu

    Abstract: So far, planetary surface exploration depends on various mobile robot platforms. The autonomous navigation and decision-making of these mobile robots in complex terrains largely rely on their terrain-aware perception, localization and map** capabilities. In this paper we release the TAIL-Plus dataset, a new challenging dataset in deformable granular environments for planetary exploration robots,… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024

  10. arXiv:2404.12096  [pdf, other

    cs.CL cs.LG

    LongEmbed: Extending Embedding Models for Long Context Retrieval

    Authors: Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

    Abstract: Embedding models play a pivot role in modern NLP applications such as IR and RAG. While the context limit of LLMs has been pushed beyond 1 million tokens, embedding models are still confined to a narrow context window not exceeding 8k tokens, refrained from application scenarios requiring long inputs such as legal contracts. This paper explores context window extension of existing embedding models… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Fix results for Nomic

  11. arXiv:2404.11999  [pdf, other

    cs.CL cs.AI

    Token-level Direct Preference Optimization

    Authors: Yongcheng Zeng, Guoqing Liu, Weiyu Ma, Ning Yang, Haifeng Zhang, Jun Wang

    Abstract: Fine-tuning pre-trained Large Language Models (LLMs) is essential to align them with human values and intentions. This process often utilizes methods like pairwise comparisons and KL divergence against a reference LLM, focusing on the evaluation of full answers generated by the models. However, the generation of these responses occurs in a token level, following a sequential, auto-regressive fashi… ▽ More

    Submitted 27 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  12. arXiv:2404.11916  [pdf, other

    cs.CL cs.AI

    SKIP: Skill-Localized Prompt Tuning for Inference Speed Boost-Up

    Authors: Nakyeong Yang, Junseok Kim, Jiwon Moon, Yunah Jang, Kyomin Jung

    Abstract: Prompt-tuning methods have shown comparable performance as parameter-efficient fine-tuning (PEFT) methods in various natural language understanding tasks. However, existing prompt tuning methods still utilize the entire model architecture; thus, they fail to accelerate inference speed in the application. In this paper, we propose a novel approach called SKIll-localized Prompt tuning (SKIP), which… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 6 pages

  13. arXiv:2404.09324  [pdf, other

    cs.MA

    Correlated Mean Field Imitation Learning

    Authors: Zhiyu Zhao, Ning Yang, Xue Yan, Haifeng Zhang, Jun Wang, Yaodong Yang

    Abstract: We investigate multi-agent imitation learning (IL) within the framework of mean field games (MFGs), considering the presence of time-varying correlated signals. Existing MFG IL algorithms assume demonstrations are sampled from Mean Field Nash Equilibria (MFNE), limiting their adaptability to real-world scenarios. For example, in the traffic network equilibrium influenced by public routing recommen… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 23 pages

  14. Adaptive Fair Representation Learning for Personalized Fairness in Recommendations via Information Alignment

    Authors: Xinyu Zhu, Lilin Zhang, Ning Yang

    Abstract: Personalized fairness in recommendations has been attracting increasing attention from researchers. The existing works often treat a fairness requirement, represented as a collection of sensitive attributes, as a hyper-parameter, and pursue extreme fairness by completely removing information of sensitive attributes from the learned fair embedding, which suffer from two challenges: huge training co… ▽ More

    Submitted 12 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by SIGIR '24

  15. arXiv:2403.16875  [pdf, other

    cs.RO

    TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments

    Authors: Chen Yao, Yangtao Ge, Guowei Shi, Zirui Wang, Ningbo Yang, Zheng Zhu, Hexiang Wei, Yuntian Zhao, **g Wu, Zhenzhong Jia

    Abstract: Terrain-aware perception holds the potential to improve the robustness and accuracy of autonomous robot navigation in the wilds, thereby facilitating effective off-road traversals. However, the lack of multi-modal perception across various motion patterns hinders the solutions of Simultaneous Localization And Map** (SLAM), especially when confronting non-geometric hazards in demanding landscapes… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE Robotics and Automation Letters

  16. arXiv:2403.11202  [pdf, other

    cs.AR cs.AI cs.PL

    Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework

    Authors: Kaiyan Chang, Kun Wang, Nan Yang, Ying Wang, Dantong **, Wenlong Zhu, Zhirong Chen, Cangyuan Li, Hao Yan, Yunhao Zhou, Zhuoliang Zhao, Yuan Cheng, Yudong Pan, Yiqi Liu, Mengdi Wang, Shengwen Liang, yinhe han, Huawei Li, Xiaowei Li

    Abstract: Recent advances in large language models have demonstrated their potential for automated generation of hardware description language (HDL) code from high-level prompts. Researchers have utilized fine-tuning to enhance the ability of these large language models (LLMs) in the field of Chip Design. However, the lack of Verilog data hinders further improvement in the quality of Verilog generation by L… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by DAC 2024; please note that this is not the final camera-ready version

  17. arXiv:2402.09906  [pdf, other

    cs.CL cs.AI cs.LG

    Generative Representational Instruction Tuning

    Authors: Niklas Muennighoff, Hong** Su, Liang Wang, Nan Yang, Furu Wei, Tao Yu, Amanpreet Singh, Douwe Kiela

    Abstract: All text-based language problems can be reduced to either generation or embedding. Current models only perform well at one or the other. We introduce generative representational instruction tuning (GRIT) whereby a large language model is trained to handle both generative and embedding tasks by distinguishing between them through instructions. Compared to other open models, our resulting GritLM 7B… ▽ More

    Submitted 17 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 66 pages (16 main), 25 figures, 34 tables

  18. arXiv:2402.05672  [pdf, other

    cs.CL cs.IR

    Multilingual E5 Text Embeddings: A Technical Report

    Authors: Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei

    Abstract: This technical report presents the training methodology and evaluation results of the open-source multilingual E5 text embedding models, released in mid-2023. Three embedding models of different sizes (small / base / large) are provided, offering a balance between the inference efficiency and embedding quality. The training procedure adheres to the English E5 model recipe, involving contrastive pr… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 6 pages

  19. arXiv:2402.01330  [pdf, other

    cs.NI

    Video Semantic Communication with Major Object Extraction and Contextual Video Encoding

    Authors: Haopeng Li, Haonan Tong, Sihua Wang, Nuocheng Yang, Zhaohui Yang, Changchuan Yin

    Abstract: This paper studies an end-to-end video semantic communication system for massive communication. In the considered system, the transmitter must continuously send the video to the receiver to facilitate character reconstruction in immersive applications, such as interactive video conference. However, transmitting the original video information with substantial amounts of data poses a challenge to th… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 6 pages, 9 figures, accepted by IEEE WCNC wksp 2024

  20. arXiv:2401.11219  [pdf, ps, other

    cs.IT

    On the Information Leakage Performance of Secure Finite Blocklength Transmissions over Rayleigh Fading Channels

    Authors: Milad Tatar Mamaghani, Xiangyun Zhou, Nan Yang, A. Lee Swindlehurst, H. Vincent Poor

    Abstract: This paper presents a secrecy performance study of a wiretap communication system with finite blocklength (FBL) transmissions over Rayleigh fading channels, based on the definition of an average information leakage (AIL) metric. We evaluate the exact and closed-form approximate AIL performance, assuming that only statistical channel state information (CSI) of the eavesdrop** link is available. T… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures. Accepted for presentation at the 2024 IEEE International Conference on Communications (CT Symposium), 9 - 13 June 2024, Denver, CO United States. Note: An extended version of this work is available as arXiv:2308.13184

  21. arXiv:2401.09500  [pdf, other

    q-bio.NC cs.LG cs.NE

    MorphGrower: A Synchronized Layer-by-layer Growing Approach for Plausible Neuronal Morphology Generation

    Authors: Nianzu Yang, Kaipeng Zeng, Haotian Lu, Yexin Wu, Zexin Yuan, Danni Chen, Shengdian Jiang, Jiaxiang Wu, Yimin Wang, Junchi Yan

    Abstract: Neuronal morphology is essential for studying brain functioning and understanding neurodegenerative disorders. As acquiring real-world morphology data is expensive, computational approaches for morphology generation have been studied. Traditional methods heavily rely on expert-set rules and parameter tuning, making it difficult to generalize across different types of morphologies. Recently, MorphV… ▽ More

    Submitted 27 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  22. arXiv:2401.07329  [pdf, other

    cs.NE

    Attention-based UNet enabled Lightweight Image Semantic Communication System over Internet of Things

    Authors: Guoxin Ma, Haonan Tong, Nuocheng Yang, Changchuan Yin

    Abstract: This paper studies the problem of the lightweight image semantic communication system that is deployed on Internet of Things (IoT) devices. In the considered system model, devices must use semantic communication techniques to support user behavior recognition in ultimate video service with high data transmission efficiency. However, it is computationally expensive for IoT devices to deploy semanti… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 6 pages, 6 figures, accepted by IEEE WCNC 2024

  23. arXiv:2401.03228  [pdf, other

    stat.ML cs.LG

    Reflected Schrödinger Bridge for Constrained Generative Modeling

    Authors: Wei Deng, Yu Chen, Nicole Tianjiao Yang, Hengrong Du, Qi Feng, Ricky T. Q. Chen

    Abstract: Diffusion models have become the go-to method for large-scale generative models in real-world applications. These applications often involve data distributions confined within bounded domains, typically requiring ad-hoc thresholding techniques for boundary enforcement. Reflected diffusion models (Lou23) aim to enhance generalizability by generating the data distribution through a backward process… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  24. arXiv:2401.00368  [pdf, other

    cs.CL cs.IR

    Improving Text Embeddings with Large Language Models

    Authors: Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei

    Abstract: In this paper, we introduce a novel and simple method for obtaining high-quality text embeddings using only synthetic data and less than 1k training steps. Unlike existing methods that often depend on multi-stage intermediate pre-training with billions of weakly-supervised text pairs, followed by fine-tuning with a few labeled datasets, our method does not require building complex training pipelin… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted by ACL 2024

  25. arXiv:2312.14457  [pdf, other

    cs.RO cs.CV

    QUAR-VLA: Vision-Language-Action Model for Quadruped Robots

    Authors: Pengxiang Ding, Han Zhao, Wenjie Zhang, Wenxuan Song, Ningxi Yang, Donglin Wang

    Abstract: The important manifestation of robot intelligence is the ability to naturally interact and autonomously make decisions. Traditional approaches to robot control often compartmentalize perception, planning, and decision-making, simplifying system design but limiting the synergy between different information streams. This compartmentalization poses challenges in achieving seamless autonomous reasonin… ▽ More

    Submitted 16 June, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  26. arXiv:2312.00372  [pdf, other

    cs.IR cs.CL

    Event-driven Real-time Retrieval in Web Search

    Authors: Nan Yang, Shusen Zhang, Yannan Zhang, Xiaoling Bai, Hualong Deng, Tianhua Zhou, ** Ma

    Abstract: Information retrieval in real-time search presents unique challenges distinct from those encountered in classical web search. These challenges are particularly pronounced due to the rapid change of user search intent, which is influenced by the occurrence and evolution of breaking news events, such as earthquakes, elections, and wars. Previous dense retrieval methods, which primarily focused on st… ▽ More

    Submitted 4 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

  27. arXiv:2311.15208  [pdf, other

    cs.CL cs.AI

    LongStory: Coherent, Complete and Length Controlled Long story Generation

    Authors: Kyeongman Park, Nakyeong Yang, Kyomin Jung

    Abstract: A human author can write any length of story without losing coherence. Also, they always bring the story to a proper ending, an ability that current language models lack. In this work, we present the LongStory for coherent, complete, and length-controlled long story generation. LongStory introduces two novel methodologies: (1) the long and short-term contexts weight calibrator (CWC) and (2) long s… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  28. arXiv:2311.09627  [pdf, other

    cs.AI cs.CL cs.LG

    Mitigating Biases for Instruction-following Language Models via Bias Neurons Elimination

    Authors: Nakyeong Yang, Taegwan Kang, Jungkyu Choi, Honglak Lee, Kyomin Jung

    Abstract: Instruction-following language models often show undesirable biases. These undesirable biases may be accelerated in the real-world usage of language models, where a wide range of instructions is used through zero-shot example prompting. To solve this problem, we first define the bias neuron, which significantly affects biased outputs, and prove its existence empirically. Furthermore, we propose a… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: accepted to ACL 2024

  29. arXiv:2311.00436  [pdf, other

    cs.CV

    Enhancing Traffic Object Detection in Variable Illumination with RGB-Event Fusion

    Authors: Zhanwen Liu, Nan Yang, Yang Wang, Yuke Li, Xiangmo Zhao, Fei-Yue Wang

    Abstract: Traffic object detection under variable illumination is challenging due to the information loss caused by the limited dynamic range of conventional frame-based cameras. To address this issue, we introduce bio-inspired event cameras and propose a novel Structure-aware Fusion Network (SFNet) that extracts sharp and complete object structures from the event stream to compensate for the lost informati… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 13 pages, 10 figures

  30. arXiv:2310.14587  [pdf, other

    cs.IR cs.CL

    Large Search Model: Redefining Search Stack in the Era of LLMs

    Authors: Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei

    Abstract: Modern search engines are built on a stack of different components, including query understanding, retrieval, multi-stage ranking, and question answering, among others. These components are often optimized and deployed independently. In this paper, we introduce a novel conceptual framework called large search model, which redefines the conventional search stack by unifying search tasks with one la… ▽ More

    Submitted 2 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: SIGIR Forum, Vol. 57 No. 2 - December 2023

  31. arXiv:2310.08319  [pdf, other

    cs.IR

    Fine-Tuning LLaMA for Multi-Stage Text Retrieval

    Authors: Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin

    Abstract: The effectiveness of multi-stage text retrieval has been solidly demonstrated since before the era of pre-trained language models. However, most existing studies utilize models that predate recent advances in large language models (LLMs). This study seeks to explore potential improvements that state-of-the-art LLMs can bring. We conduct a comprehensive study, fine-tuning the latest LLaMA model bot… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  32. arXiv:2310.05142  [pdf, ps, other

    cs.IT eess.SP

    Secure Short-Packet Transmission with Aerial Relaying: Blocklength and Trajectory Co-Design

    Authors: Milad Tatar Mamaghani, Xiangyun Zhou, Nan Yang, A. Lee Swindlehurst

    Abstract: In this paper, we propose a secure short-packet communication (SPC) system involving an unmanned aerial vehicle (UAV)-aided relay in the presence of a terrestrial passive eavesdropper. The considered system, which is applicable to various next-generation Internet-of-Things (IoT) networks, exploits a UAV as a mobile relay, facilitating the reliable and secure exchange of intermittent short packets… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 7 pages, 5 figures, 1 table, Accepted by IEEE Global Communications Conference, 4-8 December 2023, Kuala Lumpur, Malaysia. An extended version of this work is available arXiv:2307.07227

  33. arXiv:2309.16701  [pdf, other

    cs.CV cs.AI cs.CL

    Is it Really Negative? Evaluating Natural Language Video Localization Performance on Multiple Reliable Videos Pool

    Authors: Nakyeong Yang, Minsung Kim, Seunghyun Yoon, Joongbo Shin, Kyomin Jung

    Abstract: With the explosion of multimedia content in recent years, Video Corpus Moment Retrieval (VCMR), which aims to detect a video moment that matches a given natural language query from multiple videos, has become a critical problem. However, existing VCMR studies have a significant limitation since they have regarded all videos not paired with a specific query as negative, neglecting the possibility o… ▽ More

    Submitted 18 March, 2024; v1 submitted 15 August, 2023; originally announced September 2023.

    Comments: 15 pages, 10 figures

  34. arXiv:2309.15324  [pdf, other

    cs.CR

    DefectHunter: A Novel LLM-Driven Boosted-Conformer-based Code Vulnerability Detection Mechanism

    Authors: ** Wang, Zishan Huang, Hengli Liu, Nianyi Yang, Yinhao Xiao

    Abstract: One of the most pressing threats to computing systems is software vulnerabilities, which can compromise both hardware and software components. Existing methods for vulnerability detection remain suboptimal. Traditional techniques are both time-consuming and labor-intensive, while machine-learning-based approaches often underperform when applied to complex datasets, due to their inability to captur… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  35. Privacy-Preserving Quantum Two-Party Geometric Intersection

    Authors: Wen-Jie Liu, Yong Xu, James C. N. Yang, Wen-Bin Yu, Lian-Hua Chi

    Abstract: Privacy-preserving computational geometry is the research area on the intersection of the domains of secure multi-party computation (SMC) and computational geometry. As an important field, the privacy-preserving geometric intersection (PGI) problem is when each of the multiple parties has a private geometric graph and seeks to determine whether their graphs intersect or not without revealing their… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Journal ref: CMC: Computers, Materials & Continua, 2019. 60(3): p. 1237-1250

  36. arXiv:2309.10400  [pdf, other

    cs.CL cs.LG

    PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

    Authors: Dawei Zhu, Nan Yang, Liang Wang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

    Abstract: Large Language Models (LLMs) are trained with a pre-defined context length, restricting their use in scenarios requiring long inputs. Previous efforts for adapting LLMs to a longer length usually requires fine-tuning with this target length (Full-length fine-tuning), suffering intensive training cost. To decouple train length from target length for efficient context window extension, we propose Po… ▽ More

    Submitted 21 February, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: ICLR 2024

  37. arXiv:2309.09324  [pdf, other

    cs.CY cs.CL

    How People Perceive The Dynamic Zero-COVID Policy: A Retrospective Analysis From The Perspective of Appraisal Theory

    Authors: Na Yang, Kyrie Zhixuan Zhou, Yunzhe Li

    Abstract: The Dynamic Zero-COVID Policy in China spanned three years and diverse emotional responses have been observed at different times. In this paper, we retrospectively analyzed public sentiments and perceptions of the policy, especially regarding how they evolved over time, and how they related to people's lived experiences. Through sentiment analysis of 2,358 collected Weibo posts, we identified four… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

  38. arXiv:2308.13561  [pdf, other

    cs.HC cs.CV

    Project Aria: A New Tool for Egocentric Multi-Modal AI Research

    Authors: Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, Cheng Peng, Chris Sweeney, Cole Wilson, Dan Barnes, Daniel DeTone, David Caruso, Derek Valleroy, Dinesh Ginjupalli, Duncan Frost, Edward Miller, Elias Mueggler, Evgeniy Oleinik, Fan Zhang, Guruprasad Somasundaram, Gustavo Solaira , et al. (49 additional authors not shown)

    Abstract: Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, mul… ▽ More

    Submitted 1 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  39. Performance Analysis of Finite Blocklength Transmissions Over Wiretap Fading Channels: An Average Information Leakage Perspective

    Authors: Milad Tatar Mamaghani, Xiangyun Zhou, Nan Yang, A. Lee Swindlehurst, H. Vincent Poor

    Abstract: Physical-layer security (PLS) is a promising technique to complement more traditional means of communication security in beyond-5G wireless networks. However, studies of PLS are often based on ideal assumptions such as infinite coding blocklengths or perfect knowledge of the wiretap link's channel state information (CSI). In this work, we study the performance of finite blocklength (FBL) transmiss… ▽ More

    Submitted 13 May, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: To appear in IEEE Transactions on Wireless Communications. Note: This is an extended version of our work to be presented at the 2024 IEEE ICC (arXiv:2401.11219)

  40. arXiv:2307.14346  [pdf, other

    cs.NI cs.AI cs.LG

    Multi-objective Deep Reinforcement Learning for Mobile Edge Computing

    Authors: Ning Yang, Junrui Wen, Meng Zhang, Ming Tang

    Abstract: Mobile edge computing (MEC) is essential for next-generation mobile network applications that prioritize various performance metrics, including delays and energy consumption. However, conventional single-objective scheduling solutions cannot be directly applied to practical systems in which the preferences of these applications (i.e., the weights of different objectives) are often unknown or chall… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Received by IEEE WiOpt 2023

  41. arXiv:2307.14202  [pdf, ps, other

    cs.ET

    Heterogeneous Receptors - Based Molecule Harvesting in MC: Analysis for ISI Mitigation and Energy Efficiency

    Authors: Xinyu Huang, Yu Huang, Miaowen Wen, Nan Yang, Robert Schober

    Abstract: This paper investigates a spherical transmitter (TX) with a membrane covered by heterogeneous receptors of varying sizes and arbitrary locations for molecular communication (MC), where molecules are encapsulated within vesicles and released from the TX through membrane fusion. Assuming continuous vesicle generation at the TX and a transparent receiver (RX), we calculate the molecule release rate,… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 30 pages, 9 figures, Submitted to IEEE journals for possible publication. arXiv admin note: substantial text overlap with arXiv:2211.14603

  42. arXiv:2307.12594  [pdf

    physics.app-ph cs.LG

    Optimized data collection and analysis process for studying solar-thermal desalination by machine learning

    Authors: Guilong Peng, Senshan Sun, Yangjun Qin, Zhenwei Xu, Juxin Du, Swellam W. sharshir, A. W. Kandel, A. E. Kabeel, Nuo Yang

    Abstract: An effective interdisciplinary study between machine learning and solar-thermal desalination requires a sufficiently large and well-analyzed experimental datasets. This study develops a modified dataset collection and analysis process for studying solar-thermal desalination by machine learning. Based on the optimized water condensation and collection process, the proposed experimental method colle… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  43. Secure Short-Packet Communications via UAV-Enabled Mobile Relaying: Joint Resource Optimization and 3D Trajectory Design

    Authors: Milad Tatar Mamaghani, Xiangyun Zhou, Nan Yang, A. Lee Swindlehurst

    Abstract: Short-packet communication (SPC) and unmanned aerial vehicles (UAVs) are anticipated to play crucial roles in the development of 5G-and-beyond wireless networks and the Internet of Things (IoT). In this paper, we propose a secure SPC system, where a UAV serves as a mobile decode-and-forward (DF) relay, periodically receiving and relaying small data packets from a remote IoT device to its receiver… ▽ More

    Submitted 29 December, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 14 double-column pages, 8 figures. To appear in IEEE Transactions on Wireless Communications. This is an extended version of our work presented at the 2023 IEEE GlobeCom arXiv:2310.05142

  44. arXiv:2307.07164  [pdf, other

    cs.CL cs.IR

    Learning to Retrieve In-Context Examples for Large Language Models

    Authors: Liang Wang, Nan Yang, Furu Wei

    Abstract: Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the quality of the selected examples. In this paper, we propose a novel framework to iteratively train dense retrievers that can identify high-quality in-context example… ▽ More

    Submitted 26 January, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted by EACL 2024

  45. arXiv:2307.01366  [pdf, other

    cs.AI cs.NI

    Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach

    Authors: Shuo Chen, Ning Yang, Meng Zhang, Jun Wang

    Abstract: Exploiting the computational heterogeneity of mobile devices and edge nodes, mobile edge computation (MEC) provides an efficient approach to achieving real-time applications that are sensitive to information freshness, by offloading tasks from mobile devices to edge nodes. We use the metric Age-of-Information (AoI) to evaluate information freshness. An efficient solution to minimize the AoI for th… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  46. arXiv:2306.15222  [pdf, other

    cs.CL cs.AI cs.IR

    Learning to Rank in Generative Retrieval

    Authors: Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

    Abstract: Generative retrieval stands out as a promising new paradigm in text retrieval that aims to generate identifier strings of relevant passages as the retrieval target. This generative paradigm taps into powerful generative language models, distinct from traditional sparse or dense retrieval methods. However, only learning to generate is insufficient for generative retrieval. Generative retrieval lear… ▽ More

    Submitted 16 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: AAAI 2024

  47. arXiv:2305.16675  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Multiview Identifiers Enhanced Generative Retrieval

    Authors: Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

    Abstract: Instead of simply matching a query to pre-existing passages, generative retrieval generates identifier strings of passages as the retrieval target. At a cost, the identifier must be distinctive enough to represent a passage. Current approaches use either a numeric ID or a text piece (such as a title or substrings) as the identifier. However, these identifiers cannot cover a passage's content well.… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Main Conference

  48. Incremental Dense Reconstruction from Monocular Video with Guided Sparse Feature Volume Fusion

    Authors: Xingxing Zuo, Nan Yang, Nathaniel Merrill, Binbin Xu, Stefan Leutenegger

    Abstract: Incrementally recovering 3D dense structures from monocular videos is of paramount importance since it enables various robotics and AR applications. Feature volumes have recently been shown to enable efficient and accurate incremental dense reconstruction without the need to first estimate depth, but they are not able to achieve as high of a resolution as depth-based methods due to the large memor… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 8 pages, 5 figures, RA-L 2023

  49. arXiv:2305.07247  [pdf, other

    cs.LG

    Provably Convergent Schrödinger Bridge with Applications to Probabilistic Time Series Imputation

    Authors: Yu Chen, Wei Deng, Shikai Fang, Fengpei Li, Nicole Tianjiao Yang, Yikai Zhang, Kashif Rasul, Shandian Zhe, Anderson Schneider, Yuriy Nevmyvaka

    Abstract: The Schrödinger bridge problem (SBP) is gaining increasing attention in generative modeling and showing promising potential even in comparison with the score-based generative models (SGMs). SBP can be interpreted as an entropy-regularized optimal transport problem, which conducts projections onto every other marginal alternatingly. However, in practice, only approximated projections are accessible… ▽ More

    Submitted 10 September, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted by ICML 2023

  50. arXiv:2304.12550  [pdf, other

    cs.LG cs.AI

    Combining Adversaries with Anti-adversaries in Training

    Authors: Xiaoling Zhou, Nan Yang, Ou Wu

    Abstract: Adversarial training is an effective learning technique to improve the robustness of deep neural networks. In this study, the influence of adversarial training on deep learning models in terms of fairness, robustness, and generalization is theoretically investigated under more general perturbation scope that different samples can have different perturbation directions (the adversarial and anti-adv… ▽ More

    Submitted 18 May, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 8 pages, 5 figures

    Journal ref: AAAI2023