Skip to main content

Showing 1–50 of 108 results for author: Lin, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14024  [pdf, other

    cs.CL

    LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

    Authors: Bofei Gao, Zefan Cai, Runxin Xu, Peiyi Wang, Ce Zheng, Runji Lin, Keming Lu, Junyang Lin, Chang Zhou, Wen Xiao, Junjie Hu, Tianyu Liu, Baobao Chang

    Abstract: Mathematical verfier achieves success in mathematical reasoning tasks by validating the correctness of solutions. However, existing verifiers are trained with binary classification labels, which are not informative enough for the model to accurately assess the solutions. To mitigate the aforementioned insufficiency of binary labels, we introduce step-wise natural language feedbacks as rationale la… ▽ More

    Submitted 30 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages

  2. arXiv:2405.19139  [pdf, other

    cs.CL cs.AI

    DGRC: An Effective Fine-tuning Framework for Distractor Generation in Chinese Multi-choice Reading Comprehension

    Authors: Runfeng Lin, Dacheng Xu, Huijiang Wang, Zebiao Chen, Yating Wang, Shouqiang Liu

    Abstract: When evaluating a learner's knowledge proficiency, the multiple-choice question is an efficient and widely used format in standardized tests. Nevertheless, generating these questions, particularly plausible distractors (incorrect options), poses a considerable challenge. Generally, the distractor generation can be classified into cloze-style distractor generation (CDG) and natural questions distra… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.18172  [pdf, other

    cs.CV cs.AI cs.LG

    AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario

    Authors: Yuhan Li, Hao Zhou, Wenxiang Shang, Ran Lin, Xuanhong Chen, Bingbing Ni

    Abstract: While image-based virtual try-on has made significant strides, emerging approaches still fall short of delivering high-fidelity and robust fitting images across various scenarios, as their models suffer from issues of ill-fitted garment styles and quality degrading during the training process, not to mention the lack of support for various combinations of attire. Therefore, we first propose a ligh… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Project website: https://colorful-liyu.github.io/anyfit-page/

  4. arXiv:2405.17953  [pdf, other

    cs.DS cs.CC

    Graph Threading with Turn Costs

    Authors: Erik D. Demaine, Yael Kirkpatrick, Rebecca Lin

    Abstract: How should we thread a single string through a set of tubes so that pulling the string taut self-assembles the tubes into a desired graph? While prior work [ITCS 2024] solves this problem with the goal of minimizing the length of string, we study here the objective of minimizing the total turn cost. The frictional force required to pull the string through the tubes grows exponentially with the tot… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 18 pages; 10 figures

    ACM Class: G.2.2; F.2.2

  5. arXiv:2405.17931  [pdf, other

    cs.CL cs.LG

    Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

    Authors: Keming Lu, Bowen Yu, Fei Huang, Yang Fan, Runji Lin, Chang Zhou

    Abstract: Effectively aligning Large Language Models (LLMs) with human-centric values while preventing the degradation of abilities acquired through Pre-training and Supervised Fine-tuning (SFT) poses a central challenge in Reinforcement Learning from Human Feedback (RLHF). In this paper, we first discover that interpolating RLHF and SFT model parameters can adjust the trade-off between human preference and… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2405.16262  [pdf, other

    cs.LG

    Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency

    Authors: Runqi Lin, Chaojian Yu, Bo Han, Hang Su, Tongliang Liu

    Abstract: Catastrophic overfitting (CO) presents a significant challenge in single-step adversarial training (AT), manifesting as highly distorted deep neural networks (DNNs) that are vulnerable to multi-step adversarial attacks. However, the underlying factors that lead to the distortion of decision boundaries remain unclear. In this work, we delve into the specific changes within different DNN layers and… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  7. arXiv:2405.07623  [pdf, other

    cs.CL

    COBias and Debias: Minimizing Language Model Pairwise Accuracy Bias via Nonlinear Integer Programming

    Authors: Ruixi Lin, Yang You

    Abstract: For language model classification, would you prefer having only one workable class or having every class working? The latter makes more practical uses. Especially for large language models (LLMs), the fact that they achieve a fair overall accuracy by in-context learning (ICL) obscures a large difference in individual class accuracies. In this work, we uncover and tackle language models' imbalance… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  8. arXiv:2404.08154  [pdf, other

    cs.LG

    Eliminating Catastrophic Overfitting Via Abnormal Adversarial Examples Regularization

    Authors: Runqi Lin, Chaojian Yu, Tongliang Liu

    Abstract: Single-step adversarial training (SSAT) has demonstrated the potential to achieve both efficiency and robustness. However, SSAT suffers from catastrophic overfitting (CO), a phenomenon that leads to a severely distorted classifier, making it vulnerable to multi-step adversarial attacks. In this work, we observe that some adversarial examples generated on the SSAT-trained network exhibit anomalous… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  9. arXiv:2404.03121  [pdf

    cs.CV q-bio.NC

    Utilizing Computer Vision for Continuous Monitoring of Vaccine Side Effects in Experimental Mice

    Authors: Chuang Li, Shuai Shao, Willian Mikason, Rubing Lin, Yantong Liu

    Abstract: The demand for improved efficiency and accuracy in vaccine safety assessments is increasing. Here, we explore the application of computer vision technologies to automate the monitoring of experimental mice for potential side effects after vaccine administration. Traditional observation methods are labor-intensive and lack the capability for continuous monitoring. By deploying a computer vision sys… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 1 figure

  10. arXiv:2404.02823  [pdf, other

    cs.CL cs.AI cs.LG

    Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

    Authors: Haoran Sun, Lixin Liu, Junjie Li, Fengyu Wang, Baohua Dong, Ran Lin, Ruohui Huang

    Abstract: The ability of large language models (LLMs) to follow instructions is crucial to real-world applications. Despite recent advances, several studies have highlighted that LLMs struggle when faced with challenging instructions, especially those that include complex constraints, hindering their effectiveness in various tasks. To address this challenge, we introduce Conifer, a novel instruction tuning… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  11. arXiv:2404.00247  [pdf, ps, other

    eess.SY cs.AI cs.LG

    Facilitating Reinforcement Learning for Process Control Using Transfer Learning: Perspectives

    Authors: Runze Lin, Junghui Chen, Lei Xie, Hongye Su, Biao Huang

    Abstract: This paper provides insights into deep reinforcement learning (DRL) for process control from the perspective of transfer learning. We analyze the challenges of applying DRL in the field of process industries and the necessity of introducing transfer learning. Furthermore, recommendations and prospects are provided for future research directions on how transfer learning can be integrated with DRL t… ▽ More

    Submitted 1 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Final Version of Asian Control Conference (ASCC 2024)

  12. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important step** stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  13. arXiv:2403.02408  [pdf, other

    eess.IV cs.CV

    A Spatio-temporal Aligned SUNet Model for Low-light Video Enhancement

    Authors: Ruirui Lin, Nantheera Anantrasirichai, Alexandra Malyugina, David Bull

    Abstract: Distortions caused by low-light conditions are not only visually unpleasant but also degrade the performance of computer vision tasks. The restoration and enhancement have proven to be highly beneficial. However, there are only a limited number of enhancement methods explicitly designed for videos acquired in low-light conditions. We propose a Spatio-Temporal Aligned SUNet (STA-SUNet) model using… ▽ More

    Submitted 9 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  14. arXiv:2403.02075  [pdf, other

    cs.CV

    DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction

    Authors: Weiyi Lv, Yuhang Huang, Ning Zhang, Ruei-Sung Lin, Mei Han, Dan Zeng

    Abstract: In Multiple Object Tracking, objects often exhibit non-linear motion of acceleration and deceleration, with irregular direction changes. Tacking-by-detection (TBD) trackers with Kalman Filter motion prediction work well in pedestrian-dominant scenarios but fall short in complex situations when multiple objects perform non-linear and diverse motion simultaneously. To tackle the complex non-linear m… ▽ More

    Submitted 20 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: CVPR2024

  15. arXiv:2402.10884  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Multi-modal preference alignment remedies regression of visual instruction tuning on language model

    Authors: Shengzhi Li, Rongyu Lin, Shichao Pei

    Abstract: In production, multi-modal large language models (MLLMs) are expected to support multi-turn queries of interchanging image and text modalities. However, the current MLLMs trained with visual-question-answering (VQA) datasets could suffer from degradation, as VQA datasets lack the diversity and complexity of the original text instruction datasets which the underlying language model had been trained… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  16. arXiv:2402.04356  [pdf, other

    cs.SD cs.CV eess.AS

    Bidirectional Autoregressive Diffusion Model for Dance Generation

    Authors: Canyu Zhang, Youbao Tang, Ning Zhang, Ruei-Sung Lin, Mei Han, **g Xiao, Song Wang

    Abstract: Dance serves as a powerful medium for expressing human emotions, but the lifelike generation of dance is still a considerable challenge. Recently, diffusion models have showcased remarkable generative abilities across various domains. They hold promise for human motion generation due to their adaptable many-to-many nature. Nonetheless, current diffusion-based motion generation models often create… ▽ More

    Submitted 22 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  17. arXiv:2402.01970  [pdf, other

    cs.CV

    BVI-Lowlight: Fully Registered Benchmark Dataset for Low-Light Video Enhancement

    Authors: Nantheera Anantrasirichai, Ruirui Lin, Alexandra Malyugina, David Bull

    Abstract: Low-light videos often exhibit spatiotemporal incoherent noise, leading to poor visibility and compromised performance across various computer vision applications. One significant challenge in enhancing such content using modern technologies is the scarcity of training data. This paper introduces a novel low-light video dataset, consisting of 40 scenes captured in various motion scenarios under tw… ▽ More

    Submitted 25 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  18. arXiv:2401.12383  [pdf, other

    cs.CR math.NT

    A New Class of Algorithms for Finding Short Vectors in Lattices Lifted from Co-dimension $k$ Codes

    Authors: Robert Lin, Peter W. Shor

    Abstract: We introduce a new class of algorithms for finding a short vector in lattices defined by codes of co-dimension $k$ over $\mathbb{Z}_P^d$, where $P$ is prime. The co-dimension $1$ case is solved by exploiting the packing properties of the projections mod $P$ of an initial set of non-lattice vectors onto a single dual codeword. The technical tools we introduce are sorting of the projections followed… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  19. arXiv:2312.14773  [pdf, other

    eess.IV cs.CV physics.med-ph

    Cross-Age and Cross-Site Domain Shift Impacts on Deep Learning-Based White Matter Fiber Estimation in Newborn and Baby Brains

    Authors: Rizhong Lin, Ali Gholipour, Jean-Philippe Thiran, Davood Karimi, Hamza Kebiri, Meritxell Bach Cuadra

    Abstract: Deep learning models have shown great promise in estimating tissue microstructure from limited diffusion magnetic resonance imaging data. However, these models face domain shift challenges when test and train data are from different scanners and protocols, or when the models are applied to data with inherent variations such as the develo** brains of infants and children scanned at various ages.… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 5 pages, 5 figures, submitted to ISBI 2024

  20. arXiv:2312.12021  [pdf, other

    cs.CL cs.AI

    Synergistic Anchored Contrastive Pre-training for Few-Shot Relation Extraction

    Authors: Da Luo, Yanglei Gan, Rui Hou, Run Lin, Qiao Liu, Yuxiang Cai, Wannian Gao

    Abstract: Few-shot Relation Extraction (FSRE) aims to extract relational facts from a sparse set of labeled corpora. Recent studies have shown promising results in FSRE by employing Pre-trained Language Models (PLMs) within the framework of supervised contrastive learning, which considers both instances and label facts. However, how to effectively harness massive instance-label pairs to encompass the learne… ▽ More

    Submitted 11 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  21. arXiv:2312.11865  [pdf, other

    cs.AI

    Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach

    Authors: Weiyu Ma, Qirui Mi, Yongcheng Zeng, Xue Yan, Yuqiao Wu, Runji Lin, Haifeng Zhang, Jun Wang

    Abstract: StarCraft II is a challenging benchmark for AI agents due to the necessity of both precise micro level operations and strategic macro awareness. Previous works, such as Alphastar and SCC, achieve impressive performance on tackling StarCraft II , however, still exhibit deficiencies in long term strategic planning and strategy interpretability. Emerging large language model (LLM) agents, such as Voy… ▽ More

    Submitted 17 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  22. arXiv:2312.11671  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Language-Model Agents on Realistic Autonomous Tasks

    Authors: Megan Kinniment, Lucas Jun Koba Sato, Haoxing Du, Brian Goodrich, Max Hasin, Lawrence Chan, Luke Harold Miles, Tao R. Lin, Hjalmar Wijk, Joel Burget, Aaron Ho, Elizabeth Barnes, Paul Christiano

    Abstract: In this report, we explore the ability of language model agents to acquire resources, create copies of themselves, and adapt to novel challenges they encounter in the wild. We refer to this cluster of capabilities as "autonomous replication and adaptation" or ARA. We believe that systems capable of ARA could have wide-reaching and hard-to-anticipate consequences, and that measuring and forecasting… ▽ More

    Submitted 4 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 14 pages

  23. arXiv:2312.09922  [pdf, other

    cs.CV cs.AI

    A Unifying Tensor View for Lightweight CNNs

    Authors: Jason Chun Lok Li, Rui Lin, Jiajun Zhou, Edmund Yin Mun Lam, Ngai Wong

    Abstract: Despite the decomposition of convolutional kernels for lightweight CNNs being well studied, existing works that rely on tensor network diagrams or hyperdimensional abstraction lack geometry intuition. This work devises a new perspective by linking a 3D-reshaped kernel tensor to its various slice-wise and rank-1 decompositions, permitting a straightforward connection between various tensor approxim… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 4 pages, 3 figures, accepted in 2023 IEEE 15th International Conference on ASIC (ASICON 2023)

  24. arXiv:2312.01126  [pdf, other

    cs.IT eess.SP

    BER Analysis of SCMA-OFDM Systems in the Presence of Carrier Frequency Offset

    Authors: Haibo Liu, Qu Luo, Zilong Liu, Shan Luo, Pei Xiao, Rong** Lin

    Abstract: Sparse code multiple access (SCMA) building upon orthogonal frequency division multiplexing (OFDM) is a promising wireless technology for supporting massive connectivity in future machine-type communication networks. However, the sensitivity of OFDM to carrier frequency offset (CFO) poses a major challenge because it leads to orthogonality loss and incurs intercarrier interference (ICI). In this p… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  25. arXiv:2311.08692  [pdf, other

    cs.CL cs.LG

    Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

    Authors: Keming Lu, Hongyi Yuan, Runji Lin, Junyang Lin, Zheng Yuan, Chang Zhou, **gren Zhou

    Abstract: The complementary potential of Large Language Models (LLM) assumes off-the-shelf LLMs have heterogeneous expertise in a wide range of domains and tasks so that an ensemble of LLMs can achieve consistently better performance. Existing ensemble methods for LLMs mainly focus on reward model ranking of outputs, leading to significant computation overhead. To combat this issue, we revisit the complemen… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  26. arXiv:2311.08125  [pdf, other

    cs.LG

    Lite it fly: An All-Deformable-Butterfly Network

    Authors: Rui Lin, Jason Chun Lok Li, Jiajun Zhou, Binxiao Huang, Jie Ran, Ngai Wong

    Abstract: Most deep neural networks (DNNs) consist fundamentally of convolutional and/or fully connected layers, wherein the linear transform can be cast as the product between a filter matrix and a data matrix obtained by arranging feature tensors into columns. The lately proposed deformable butterfly (DeBut) decomposes the filter matrix into generalized, butterflylike factors, thus achieving network compr… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures, accepted as a brief paper in IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  27. arXiv:2310.11535  [pdf, other

    eess.IV cs.CV

    Learning Lens Blur Fields

    Authors: Esther Y. H. Lin, Zhecheng Wang, Rebecca Lin, Daniel Miau, Florian Kainz, Jiawen Chen, Xuaner Cecilia Zhang, David B. Lindell, Kiriakos N. Kutulakos

    Abstract: Optical blur is an inherent property of any lens system and is challenging to model in modern cameras because of their complex optical elements. To tackle this challenge, we introduce a high-dimensional neural representation of blur$-$$\textit{the lens blur field}$$-$and a practical method for acquiring it. The lens blur field is a multilayer perceptron (MLP) designed to (1) accurately capture var… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  28. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  29. arXiv:2310.08847  [pdf, other

    cs.LG

    On the Over-Memorization During Natural, Robust and Catastrophic Overfitting

    Authors: Runqi Lin, Chaojian Yu, Bo Han, Tongliang Liu

    Abstract: Overfitting negatively impacts the generalization ability of deep neural networks (DNNs) in both natural and adversarial training. Existing methods struggle to consistently address different types of overfitting, typically designing strategies that focus separately on either natural or adversarial patterns. In this work, we adopt a unified perspective by solely focusing on natural patterns to expl… ▽ More

    Submitted 11 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  30. arXiv:2310.08439  [pdf, other

    physics.comp-ph cs.DC

    TensorMD: Scalable Tensor-Diagram based Machine Learning Interatomic Potential on Heterogeneous Many-Core Processors

    Authors: Xin Chen, Yucheng Ouyang, Xin Chen, Zhenchuan Chen, Rongfen Lin, Xingyu Gao, Lifang Wang, Fang Li, Yin Liu, Honghui Shang, Haifeng Song

    Abstract: Molecular dynamics simulations have emerged as a potent tool for investigating the physical properties and kinetic behaviors of materials at the atomic scale, particularly in extreme conditions. Ab initio accuracy is now achievable with machine learning based interatomic potentials. With recent advancements in high-performance computing, highly accurate and large-scale simulations become feasible.… ▽ More

    Submitted 12 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  31. arXiv:2309.16609  [pdf, other

    cs.CL

    Qwen Technical Report

    Authors: **ze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, Xiaodong Deng, Yang Fan, Wenbin Ge, Yu Han, Fei Huang, Binyuan Hui, Luo Ji, Mei Li, Junyang Lin, Runji Lin, Dayiheng Liu, Gao Liu, Chengqiang Lu, Keming Lu, Jianxin Ma, Rui Men, Xingzhang Ren, Xuancheng Ren, Chuanqi Tan, Sinan Tan , et al. (23 additional authors not shown)

    Abstract: Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans. In this work, we introduce Qwen, the first installment of our large language model series. Qwen is a comprehensive language model series that encompasses distinct models with varying parameter counts. It includes Q… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 59 pages, 5 figures

  32. arXiv:2309.14541  [pdf

    stat.ML cs.CR cs.LG

    Cluster-based Method for Eavesdrop** Identification and Localization in Optical Links

    Authors: Haokun Song, Rui Lin, Andrea Sgambelluri, Filippo Cugini, Yajie Li, Jie Zhang, Paolo Monti

    Abstract: We propose a cluster-based method to detect and locate eavesdrop** events in optical line systems characterized by small power losses. Our findings indicate that detecting such subtle losses from eavesdrop** can be accomplished solely through optical performance monitoring (OPM) data collected at the receiver. On the other hand, the localization of such events can be effectively achieved by le… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 4 pages, 6 figures, Asia Communications and Photonics Conference (ACP) 2023

  33. Graph Threading

    Authors: Erik D. Demaine, Yael Kirkpatrick, Rebecca Lin

    Abstract: Inspired by artistic practices such as beadwork and himmeli, we study the problem of threading a single string through a set of tubes, so that pulling the string forms a desired graph. More precisely, given a connected graph (where edges represent tubes and vertices represent junctions where they meet), we give a polynomial-time algorithm to find a minimum-length closed walk (representing a thread… ▽ More

    Submitted 28 May, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 19 pages, 6 figures

    ACM Class: G.2.2; F.2.2

  34. arXiv:2309.03660  [pdf, other

    cs.CR

    Learning from Limited Heterogeneous Training Data: Meta-Learning for Unsupervised Zero-Day Web Attack Detection across Web Domains

    Authors: Peiyang Li, Ye Wang, Qi Li, Zhuotao Liu, Ke Xu, Ju Ren, Zhiying Liu, Ruilin Lin

    Abstract: Recently unsupervised machine learning based systems have been developed to detect zero-day Web attacks, which can effectively enhance existing Web Application Firewalls (WAFs). However, prior arts only consider detecting attacks on specific domains by training particular detection models for the domains. These systems require a large amount of training data, which causes a long period of time for… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  35. arXiv:2308.07074  [pdf, other

    cs.CL cs.AI cs.LG

    #InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

    Authors: Keming Lu, Hongyi Yuan, Zheng Yuan, Runji Lin, Junyang Lin, Chuanqi Tan, Chang Zhou, **gren Zhou

    Abstract: Foundation language models obtain the instruction-following ability through supervised fine-tuning (SFT). Diversity and complexity are considered critical factors of a successful SFT dataset, while their definitions remain obscure and lack quantitative analyses. In this work, we propose InsTag, an open-set fine-grained tagger, to tag samples within SFT datasets based on semantics and intentions an… ▽ More

    Submitted 15 August, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  36. arXiv:2308.02765  [pdf

    eess.SY cs.AI

    Surrogate Empowered Sim2Real Transfer of Deep Reinforcement Learning for ORC Superheat Control

    Authors: Runze Lin, Yangyang Luo, Xialai Wu, Junghui Chen, Biao Huang, Lei Xie, Hongye Su

    Abstract: The Organic Rankine Cycle (ORC) is widely used in industrial waste heat recovery due to its simple structure and easy maintenance. However, in the context of smart manufacturing in the process industry, traditional model-based optimization control methods are unable to adapt to the varying operating conditions of the ORC system or sudden changes in operating modes. Deep reinforcement learning (DRL… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  37. A Message Passing Detection based Affine Frequency Division Multiplexing Communication System

    Authors: Lifan Wu, Shan Luo, Dongxiao Song, Fan Yang, Rong** Lin

    Abstract: The next generation of wireless communication technology is anticipated to address the communication reliability challenges encountered in high-speed mobile communication scenarios. An Orthogonal Time Frequency Space (OTFS) system has been introduced as a solution that effectively mitigates these issues. However, OTFS is associated with relatively high pilot overhead and multiuser multiplexing ove… ▽ More

    Submitted 30 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

    Comments: 8 pages, 7 figures

  38. arXiv:2306.14262  [pdf, other

    cs.CV

    A Spectral Perspective towards Understanding and Improving Adversarial Robustness

    Authors: Binxiao Huang, Rui Lin, Chaofan Tao, Ngai Wong

    Abstract: Deep neural networks (DNNs) are incredibly vulnerable to crafted, imperceptible adversarial perturbations. While adversarial training (AT) has proven to be an effective defense approach, the AT mechanism for robustness improvement is not fully understood. This work investigates AT from a spectral perspective, adding new insights to the design of effective defenses. In particular, we show that AT i… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

  39. arXiv:2306.13991  [pdf, other

    cs.LG math.OC

    Kernel Support Vector Machine Classifiers with the $\ell_0$-Norm Hinge Loss

    Authors: Rongrong Lin, Yingjia Yao, Yulan Liu

    Abstract: Support Vector Machine (SVM) has been one of the most successful machine learning techniques for binary classification problems. The key idea is to maximize the margin from the data to the hyperplane subject to correct classification on training samples. The commonly used hinge loss and its variations are sensitive to label noise, and unstable for resampling due to its unboundedness. This paper is… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  40. arXiv:2306.13945  [pdf, other

    cs.LG cs.AI cs.MA

    Large Sequence Models for Sequential Decision-Making: A Survey

    Authors: Muning Wen, Runji Lin, Han**g Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang

    Abstract: Transformer architectures have facilitated the development of large-scale and general-purpose sequence models for prediction tasks in natural language processing and computer vision, e.g., GPT-3 and Swin Transformer. Although originally designed for prediction problems, it is natural to inquire about their suitability for sequential decision-making and reinforcement learning problems, which are ty… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 25 pages, 4 figures, 2 tables

  41. arXiv:2306.01016  [pdf, other

    cs.CL cs.AI cs.CV cs.LG cs.MM

    PV2TEA: Patching Visual Modality to Textual-Established Information Extraction

    Authors: Hejie Cui, Rongmei Lin, Nasser Zalmout, Chenwei Zhang, **gbo Shang, Carl Yang, Xian Li

    Abstract: Information extraction, e.g., attribute value extraction, has been extensively studied and formulated based only on text. However, many attributes can benefit from image-based extraction, like color, shape, pattern, among others. The visual modality has long been underutilized, mainly due to multimodal annotation difficulty. In this paper, we aim to patch the visual modality to the textual-establi… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Findings

  42. arXiv:2304.01234  [pdf, other

    astro-ph.SR astro-ph.EP cs.LG physics.plasm-ph physics.space-ph

    Prediction of solar wind speed by applying convolutional neural network to potential field source surface (PFSS) magnetograms

    Authors: Rong Lin, Zhekai Luo, Jiansen He, Lun Xie, Chuanpeng Hou, Shuwei Chen

    Abstract: An accurate solar wind speed model is important for space weather predictions, catastrophic event warnings, and other issues concerning solar wind - magnetosphere interaction. In this work, we construct a model based on convolutional neural network (CNN) and Potential Field Source Surface (PFSS) magnetograms, considering a solar wind source surface of $R_{\rm SS}=2.5R_\odot$, aiming to predict the… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  43. arXiv:2303.16898  [pdf, other

    cs.RO

    Bagging by Learning to Singulate Layers Using Interactive Perception

    Authors: Lawrence Yunliang Chen, Baiyu Shi, Roy Lin, Daniel Seita, Ayah Ahmad, Richard Cheng, Thomas Kollar, David Held, Ken Goldberg

    Abstract: Many fabric handling and 2D deformable material tasks in homes and industry require singulating layers of material such as opening a bag or arranging garments for sewing. In contrast to methods requiring specialized sensing or end effectors, we use only visual observations with ordinary parallel jaw grippers. We propose SLIP: Singulating Layers using Interactive Perception, and apply SLIP to the t… ▽ More

    Submitted 1 September, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: IROS 2023

  44. arXiv:2301.07322  [pdf, other

    cs.CV

    HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose Estimation

    Authors: Xiaoye Qian, Youbao Tang, Ning Zhang, Mei Han, **g Xiao, Ming-Chun Huang, Ruei-Sung Lin

    Abstract: Transformer-based approaches have been successfully proposed for 3D human pose estimation (HPE) from 2D pose sequence and achieved state-of-the-art (SOTA) performance. However, current SOTAs have difficulties in modeling spatial-temporal correlations of joints at different levels simultaneously. This is due to the poses' spatial-temporal complexity. Poses move at various speeds temporarily with va… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: The first two authors have equal contribution

  45. arXiv:2301.01471  [pdf, other

    cs.GR cs.CG

    Freeform Islamic Geometric Patterns

    Authors: Rebecca Lin, Craig S. Kaplan

    Abstract: Islamic geometric patterns are a rich and venerable ornamental tradition. Many classic designs feature periodic arrangements of rosettes: star shapes surrounded by rings of hexagonal petals. We present a new technique for generating 'freeform' compositions of rosettes: finite designs that freely mix rosettes of unusual sizes while retaining the aesthetics of traditional patterns. We use a circle p… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: 20 pages, 21 figures

  46. arXiv:2212.12732  [pdf, other

    cs.CV

    Frequency Regularization for Improving Adversarial Robustness

    Authors: Binxiao Huang, Chaofan Tao, Rui Lin, Ngai Wong

    Abstract: Deep neural networks are incredibly vulnerable to crafted, human-imperceptible adversarial perturbations. Although adversarial training (AT) has proven to be an effective defense approach, we find that the AT-trained models heavily rely on the input low-frequency content for judgment, accounting for the low standard accuracy. To close the large gap between the standard and robust accuracies during… ▽ More

    Submitted 24 December, 2022; originally announced December 2022.

    Comments: accepted by AAAI 2023 workshop

  47. arXiv:2211.10960  [pdf, other

    cs.CV

    CoCoNet: Coupled Contrastive Learning Network with Multi-level Feature Ensemble for Multi-modality Image Fusion

    Authors: **yuan Liu, Runjia Lin, Guanyao Wu, Risheng Liu, Zhongxuan Luo, Xin Fan

    Abstract: Infrared and visible image fusion targets to provide an informative image by combining complementary information from different sensors. Existing learning-based fusion approaches attempt to construct various loss functions to preserve complementary features, while neglecting to discover the inter-relationship between the two modalities, leading to redundant or even invalid information on the fusio… ▽ More

    Submitted 14 October, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

  48. arXiv:2211.08016  [pdf, other

    cs.LG cs.AI

    Contextual Transformer for Offline Meta Reinforcement Learning

    Authors: Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang

    Abstract: The pretrain-finetuning paradigm in large-scale sequence models has made significant progress in natural language processing and computer vision tasks. However, such a paradigm is still hindered by several challenges in Reinforcement Learning (RL), including the lack of self-supervised pretraining algorithms based on offline data and efficient fine-tuning/prompt-tuning over unseen downstream tasks… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted by Foundation Models for Decision Making Workshop at Neural Information Processing Systems, 2022

  49. arXiv:2211.05299  [pdf, other

    cs.CV

    Prior-enhanced Temporal Action Localization using Subject-aware Spatial Attention

    Authors: Yifan Liu, Youbao Tang, Ning Zhang, Ruei-Sung Lin, Haoqian Wang

    Abstract: Temporal action localization (TAL) aims to detect the boundary and identify the class of each action instance in a long untrimmed video. Current approaches treat video frames homogeneously, and tend to give background and key objects excessive attention. This limits their sensitivity to localize action boundaries. To this end, we propose a prior-enhanced temporal action localization method (PETAL)… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 5 pages, 2 figures

  50. Recovering Sign Bits of DCT Coefficients in Digital Images as an Optimization Problem

    Authors: Ruiyuan Lin, Sheng Liu, Jun Jiang, Shujun Li, Chengqing Li, C. -C. Jay Kuo

    Abstract: Recovering unknown, missing, damaged, distorted, or lost information in DCT coefficients is a common task in multiple applications of digital image processing, including image compression, selective image encryption, and image communication. This paper investigates the recovery of sign bits in DCT coefficients of digital images, by proposing two different approximation methods to solve a mixed int… ▽ More

    Submitted 8 January, 2024; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: 22 pages, 8 figures

    MSC Class: 68P30

    Journal ref: Journal of Visual Communication and Image Representation, vol. 98, art. no. 104045, 2024