Skip to main content

Showing 51–100 of 271 results for author: Yin, M

.
  1. arXiv:2310.08824  [pdf, other

    cs.HC stat.ML

    Confounding-Robust Policy Improvement with Human-AI Teams

    Authors: Ruijiang Gao, Mingzhang Yin

    Abstract: Human-AI collaboration has the potential to transform various domains by leveraging the complementary strengths of human experts and Artificial Intelligence (AI) systems. However, unobserved confounding can undermine the effectiveness of this collaboration, leading to biased and unreliable outcomes. In this paper, we propose a novel solution to address unobserved confounding in human-AI collaborat… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 24 pages

  2. arXiv:2310.07849  [pdf, other

    cs.CL cs.AI

    Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations

    Authors: Zhuoyan Li, Hangxiao Zhu, Zhuoran Lu, Ming Yin

    Abstract: The collection and curation of high-quality training data is crucial for develo** text classification models with superior performance, but it is often associated with significant costs and time investment. Researchers have recently explored using large language models (LLMs) to generate synthetic datasets as an alternative approach. However, the effectiveness of the LLM-generated synthetic data… ▽ More

    Submitted 12 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  3. arXiv:2310.06335  [pdf, other

    cs.DC

    BBCA-CHAIN: Low Latency, High Throughput BFT Consensus on a DAG

    Authors: Dahlia Malkhi, Chrysoula Stathakopoulou, Maofan Yin

    Abstract: This paper presents a partially synchronous BFT consensus protocol powered by BBCA, a lightly modified Byzantine Consistent Broadcast (BCB) primitive. BBCA provides a Complete-Adopt semantic through an added probing interface to allow either aborting the broadcast by correct nodes or exclusively, adopting the message consistently in case of a potential delivery. It does not introduce any extra typ… ▽ More

    Submitted 24 May, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  4. arXiv:2309.16251  [pdf, other

    cs.HC

    The effect of 3D stereopsis and hand-tool alignment on learning effectiveness and skill transfer of a VR-based simulator for dental training

    Authors: Maximilian Kaluschke, Myat Su Yin, Peter Haddawy, Siriwan Suebnukarn, Gabriel Zachmann

    Abstract: Dental simulators gained prevalence in recent years. Important aspects distinguishing VR hardware configurations are 3D stereoscopic rendering and visual alignment of the user's hands with the virtual tools. New dental simulators are often evaluated without analysing the impact of these simulation aspects. In this paper, we seek to determine the impact of 3D stereoscopic rendering and of hand-tool… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 26 pages, 15 figures, Accepted at online journal PLoS ONE

    MSC Class: 62A86 (Primary) 62H30 (Secondary) ACM Class: J.3; G.3

  5. arXiv:2309.15964  [pdf, ps, other

    math.CO

    Enumerating pattern-avoiding permutations by leading terms

    Authors: Ömer Eğecioğlu, Collier Gaiser, Mei Yin

    Abstract: The number of 123-avoiding permutation on $\{1,2,\ldots,n\}$ with a fixed leading terms is counted by the ballot numbers. The same holds for $132$-avoiding permutations. These results were proved by Miner and Pak using the Robinson-Schensted-Knuth (RSK) correspondence to connect permutations with Dyck paths. In this paper, we first provide an alternate proof of these enumeration results via a dire… ▽ More

    Submitted 24 June, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: 23 pages, Journal of Combinatorics (forthcoming)

    MSC Class: 05A05; 05A15

  6. arXiv:2308.08858  [pdf, ps, other

    cs.LG cs.AI cs.GT stat.ML

    Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games

    Authors: Songtao Feng, Ming Yin, Yu-Xiang Wang, **g Yang, Yingbin Liang

    Abstract: The problem of two-player zero-sum Markov games has recently attracted increasing interests in theoretical studies of multi-agent reinforcement learning (RL). In particular, for finite-horizon episodic Markov decision processes (MDPs), it has been shown that model-based algorithms can find an $ε$-optimal Nash Equilibrium (NE) with the sample complexity of $O(H^3SAB/ε^2)$, which is optimal in the d… ▽ More

    Submitted 5 June, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

  7. arXiv:2308.02665  [pdf, other

    cs.AI

    Let's Give a Voice to Conversational Agents in Virtual Reality

    Authors: Michele Yin, Gabriel Roccabruna, Abhinav Azad, Giuseppe Riccardi

    Abstract: The dialogue experience with conversational agents can be greatly enhanced with multimodal and immersive interactions in virtual reality. In this work, we present an open-source architecture with the goal of simplifying the development of conversational agents operating in virtual environments. The architecture offers the possibility of plugging in conversational agents of different domains and ad… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  8. arXiv:2306.14757  [pdf, other

    cs.DC

    BBCA-LEDGER: High Throughput Consensus meets Low Latency

    Authors: Chrysoula Stathakopoulou, Michael Wei, Maofan Yin, Hongbo Zhang, Dahlia Malkhi

    Abstract: This paper presents BBCA-LEDGER, a Byzantine log replication technology for partially synchronous networks enabling blocks to be broadcast in parallel, such that each broadcast is finalized independently and instantaneously into an individual slot in the log. Every finalized broadcast is eventually committed to the total ordering, so that all network bandwidth has utility in disseminating blocks.… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  9. arXiv:2306.14063  [pdf, other

    cs.LG cs.AI

    Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data

    Authors: Sunil Madhow, Dan Qiao, Ming Yin, Yu-Xiang Wang

    Abstract: Develo** theoretical guarantees on the sample complexity of offline RL methods is an important step towards making data-hungry RL algorithms practically viable. Currently, most results hinge on unrealistic assumptions about the data distribution -- namely that it comprises a set of i.i.d. trajectories collected by a single logging policy. We consider a more general setting where the dataset may… ▽ More

    Submitted 30 April, 2024; v1 submitted 24 June, 2023; originally announced June 2023.

  10. arXiv:2306.08681  [pdf, ps, other

    math.CO math.PR

    Some enumerative properties of parking functions

    Authors: Richard P. Stanley, Mei Yin

    Abstract: A parking function is a sequence $(a_1,\dots, a_n)$ of positive integers such that if $b_1\leq\cdots\leq b_n$ is the increasing rearrangement of $a_1,\dots,a_n$, then $b_i\leq i$ for $1\leq i\leq n$. In this paper we obtain some new results on the enumeration of parking functions. We will consider the joint distribution of several sets of statistics on parking functions. The distribution of most o… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 30 pages, 4 figures, 1 table

    MSC Class: 05A15; 60C05; 05A19

  11. arXiv:2306.07992  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Securing Visually-Aware Recommender Systems: An Adversarial Image Reconstruction and Detection Framework

    Authors: Minglei Yin, Bin Liu, Neil Zhenqiang Gong, Xin Li

    Abstract: With rich visual data, such as images, becoming readily associated with items, visually-aware recommendation systems (VARS) have been widely used in different applications. Recent studies have shown that VARS are vulnerable to item-image adversarial attacks, which add human-imperceptible perturbations to the clean images associated with those items. Attacks on VARS pose new security challenges to… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

  12. arXiv:2306.06766  [pdf, other

    cs.RO cs.LG

    Zero-Shot Wireless Indoor Navigation through Physics-Informed Reinforcement Learning

    Authors: Mingsheng Yin, Tao Li, Haozhe Lei, Yaqi Hu, Sundeep Rangan, Quanyan Zhu

    Abstract: The growing focus on indoor robot navigation utilizing wireless signals has stemmed from the capability of these signals to capture high-resolution angular and temporal measurements. Prior heuristic-based methods, based on radio frequency propagation, are intuitive and generalizable across simple scenarios, yet fail to navigate in complex environments. On the other hand, end-to-end (e2e) deep rein… ▽ More

    Submitted 15 September, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: 16 pages, 13 figures, 4 tables

  13. arXiv:2306.00861  [pdf, ps, other

    cs.LG stat.ML

    Non-stationary Reinforcement Learning under General Function Approximation

    Authors: Songtao Feng, Ming Yin, Ruiquan Huang, Yu-Xiang Wang, **g Yang, Yingbin Liang

    Abstract: General function approximation is a powerful tool to handle large state and action spaces in a broad range of reinforcement learning (RL) scenarios. However, theoretical understanding of non-stationary MDPs with general function approximation is still limited. In this paper, we make the first such an attempt. We first propose a new complexity metric called dynamic Bellman Eluder (DBE) dimension fo… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  14. arXiv:2305.17235  [pdf, other

    cs.CV cs.AI

    COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models

    Authors: **qi Xiao, Miao Yin, Yu Gong, Xiao Zang, Jian Ren, Bo Yuan

    Abstract: Attention-based vision models, such as Vision Transformer (ViT) and its variants, have shown promising performance in various computer vision tasks. However, these emerging architectures suffer from large model sizes and high computational costs, calling for efficient model compression solutions. To date, pruning ViTs has been well studied, while other compression strategies that have been widely… ▽ More

    Submitted 9 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: ICML 2023 Poster

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:38125-38136, 2023

  15. arXiv:2305.13556  [pdf, other

    cs.DC

    Lessons from HotStuff

    Authors: Dahlia Malkhi, Maofan Yin

    Abstract: This article will take you on a journey to the core of blockchains, their Byzantine consensus engine, where HotStuff emerged as a new algorithmic foundation for the classical Byzantine generals consensus problem. The first part of the article underscores the theoretical advances HotStuff enabled, including several models in which HotStuff-based solutions closed problems which were opened for dec… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  16. arXiv:2305.12524  [pdf, other

    cs.CL cs.AI

    TheoremQA: A Theorem-driven Question Answering dataset

    Authors: Wenhu Chen, Ming Yin, Max Ku, Pan Lu, Yixin Wan, Xueguang Ma, Jianyu Xu, Xinyi Wang, Tony Xia

    Abstract: The recent LLMs like GPT-4 and PaLM-2 have made tremendous progress in solving fundamental math problems like GSM8K by achieving over 90% accuracy. However, their capabilities to solve more challenging math problems which require domain-specific knowledge (i.e. theorem) have yet to be investigated. In this paper, we introduce TheoremQA, the first theorem-driven question-answering dataset designed… ▽ More

    Submitted 5 December, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted to Main Conference of EMNLP 2023

  17. arXiv:2305.11800  [pdf, ps, other

    math.CO math.PR

    Moments of Colored Permutation Statistics on Conjugacy Classes

    Authors: Jesse Campion Loth, Michael Levet, Kevin Liu, Sheila Sundaram, Mei Yin

    Abstract: In this paper, we consider the moments of statistics on conjugacy classes of the colored permutation groups $\mathfrak{S}_{n,r}=\mathbb{Z}_r\wr \mathfrak{S}_n$. We first show that any fixed moment coincides on all conjugacy classes where all cycles have sufficiently long length. Additionally, for permutation statistics that can be realized via a process we call symmetric extensions, these moments… ▽ More

    Submitted 27 December, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    MSC Class: 05A05; 05E05; 60C05

  18. arXiv:2305.08398  [pdf, ps, other

    math.AP

    Blow-up phenomena for a class of extensible beam equations

    Authors: Gongwei Liu, Mengyun Yin, Suxia Xia

    Abstract: In this paper, we investigate the initial boundary value problem of the following nonlinear extensible beam equation with nonlinear dam** term $$u_{t t}+Δ^2 u-M\left(\|\nabla u\|^2\right) Δu-Δu_t+\left|u_t\right|^{r-1} u_t=|u|^{p-1} u$$ which was considered by Yang et al. (Advanced Nonlinear Studies 2022; 22:436-468). We consider the problem with the nonlinear dam** and establish the finite ti… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  19. arXiv:2305.05094  [pdf, other

    cs.CL cs.HC

    Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections

    Authors: Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar, Ming Yin, Dan Goldwasser

    Abstract: Experts across diverse disciplines are often interested in making sense of large text collections. Traditionally, this challenge is approached either by noisy unsupervised techniques such as topic models, or by following a manual theme discovery process. In this paper, we expand the definition of a theme to account for more than just a word distribution, and include generalized concepts deemed rel… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL: ACL 2023

  20. A Generative Modeling Framework for Inferring Families of Biomechanical Constitutive Laws in Data-Sparse Regimes

    Authors: Minglang Yin, Zongren Zou, Enrui Zhang, Cristina Cavinato, Jay D. Humphrey, George Em Karniadakis

    Abstract: Quantifying biomechanical properties of the human vasculature could deepen our understanding of cardiovascular diseases. Standard nonlinear regression in constitutive modeling requires considerable high-quality data and an explicit form of the constitutive model as prior knowledge. By contrast, we propose a novel approach that combines generative deep learning with Bayesian inference to efficientl… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  21. Neutral Atom Quantum Computing Hardware: Performance and End-User Perspective

    Authors: Karen Wintersperger, Florian Dommert, Thomas Ehmer, Andrey Hoursanov, Johannes Klepsch, Wolfgang Mauerer, Georg Reuber, Thomas Strohm, Ming Yin, Sebastian Luber

    Abstract: We present an industrial end-user perspective on the current state of quantum computing hardware for one specific technological approach, the neutral atom platform. Our aim is to assist developers in understanding the impact of the specific properties of these devices on the effectiveness of algorithm execution. Based on discussions with different vendors and recent literature, we discuss the perf… ▽ More

    Submitted 15 September, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Journal ref: EPJ Quantum Technology 10, 32 (2023)

  22. arXiv:2304.05318  [pdf, ps, other

    math.CO math.PR

    Sampling planar tanglegrams and pairs of disjoint triangulations

    Authors: Alexander E. Black, Kevin Liu, Alex Mcdonough, Garrett Nelson, Michael C. Wigal, Mei Yin, Youngho Yoo

    Abstract: A tanglegram consists of two rooted binary trees and a perfect matching between their leaves, and a planar tanglegram is one that admits a layout with no crossings. We show that the problem of generating planar tanglegrams uniformly at random reduces to the corresponding problem for irreducible planar tanglegram layouts, which are known to be in bijection with pairs of disjoint triangulations of a… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 16 pages, 8 figures

    MSC Class: 05C05; 05C30

  23. arXiv:2304.02379  [pdf, other

    math.OC

    A Dual System-Level Parameterization for Identification from Closed-Loop Data

    Authors: Amber Srivastava, Mingzhou Yin, Andrea Iannelli, Roy S. Smith

    Abstract: This work presents a dual system-level parameterization (D-SLP) method for closed-loop system identification. The recent system-level synthesis framework parameterizes all stabilizing controllers via linear constraints on closed-loop response functions, known as system-level parameters. It was demonstrated that several structural, locality, and communication constraints on the controller can be po… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  24. arXiv:2303.15116  [pdf

    cs.CL

    An ontology-aided, natural language-based approach for multi-constraint BIM model querying

    Authors: Mengtian Yin, Llewellyn Tang, Chris Webster, Shen Xu, Xiongyi Li, Huaquan Ying

    Abstract: Being able to efficiently retrieve the required building information is critical for construction project stakeholders to carry out their engineering and management activities. Natural language interface (NLI) systems are emerging as a time and cost-effective way to query Building Information Models (BIMs). However, the existing methods cannot logically combine different constraints to perform fin… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  25. arXiv:2303.09842  [pdf, ps, other

    eess.SY stat.ML

    Error Bounds for Kernel-Based Linear System Identification with Unknown Hyperparameters

    Authors: Mingzhou Yin, Roy S. Smith

    Abstract: The kernel-based method has been successfully applied in linear system identification using stable kernel designs. From a Gaussian process perspective, it automatically provides probabilistic error bounds for the identified models from the posterior covariance, which are useful in robust and stochastic control. However, the error bounds require knowledge of the true hyperparameters in the kernel d… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  26. arXiv:2303.03739  [pdf, other

    cs.RO

    Path Planning Under Uncertainty to Localize mmWave Sources

    Authors: Kai Pfeiffer, Yuze Jia, Mingsheng Yin, Akshaj Kumar Veldanda, Yaqi Hu, Amee Trivedi, Jeff Zhang, Siddharth Garg, Elza Erkip, Sundeep Rangan, Ludovic Righetti

    Abstract: In this paper, we study a navigation problem where a mobile robot needs to locate a mmWave wireless signal. Using the directionality properties of the signal, we propose an estimation and path planning algorithm that can efficiently navigate in cluttered indoor environments. We formulate Extended Kalman filters for emitter location estimation in cases where the signal is received in line-of-sight… ▽ More

    Submitted 8 March, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  27. arXiv:2302.13252  [pdf, other

    cs.LG

    No-Regret Linear Bandits beyond Realizability

    Authors: Chong Liu, Ming Yin, Yu-Xiang Wang

    Abstract: We study linear bandits when the underlying reward function is not linear. Existing work relies on a uniform misspecification parameter $ε$ that measures the sup-norm error of the best linear approximation. This results in an unavoidable linear regret whenever $ε> 0$. We describe a more natural model of misspecification which only requires the approximation error at each input $x$ to be proportion… ▽ More

    Submitted 19 July, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

    Journal ref: Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence, PMLR 216:1294-1303, 2023

  28. arXiv:2302.12456  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs

    Authors: Dan Qiao, Ming Yin, Yu-Xiang Wang

    Abstract: In many real-life reinforcement learning (RL) problems, deploying new policies is costly. In those scenarios, algorithms must solve exploration (which requires adaptivity) while switching the deployed policy sparsely (which limits adaptivity). In this paper, we go beyond the existing state-of-the-art on this problem that focused on linear Markov Decision Processes (MDPs) by considering linear Bell… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 25 pages

  29. arXiv:2301.13364  [pdf, other

    cs.IR

    A Counterfactual Collaborative Session-based Recommender System

    Authors: Wenzhuo Song, Shou** Wang, Yan Wang, Kunpeng Liu, Xueyan Liu, Minghao Yin

    Abstract: Most session-based recommender systems (SBRSs) focus on extracting information from the observed items in the current session of a user to predict a next item, ignoring the causes outside the session (called outer-session causes, OSCs) that influence the user's selection of items. However, these causes widely exist in the real world, and few studies have investigated their role in SBRSs. In this w… ▽ More

    Submitted 6 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: accepted by the ACM WebConf 2023

  30. Bi-AM-RRT*: A Fast and Efficient Sampling-Based Motion Planning Algorithm in Dynamic Environments

    Authors: Ying Zhang, Heyong Wang, Maoliang Yin, Jiankun Wang, Changchun Hua

    Abstract: The efficiency of sampling-based motion planning brings wide application in autonomous mobile robots. The conventional rapidly exploring random tree (RRT) algorithm and its variants have gained significant successes, but there are still challenges for the optimal motion planning of mobile robots in dynamic environments. In this paper, based on Bidirectional RRT and the use of an assisting metric (… ▽ More

    Submitted 30 April, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Submitted to IEEE Transactions on Intelligent Vehicles

    Journal ref: IEEE Transactions on Intelligent Vehicles, 2023

  31. arXiv:2301.09422  [pdf, other

    cs.LG cs.AI cs.CV

    HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks

    Authors: **qi Xiao, Chengming Zhang, Yu Gong, Miao Yin, Yang Sui, Lizhi Xiang, Dingwen Tao, Bo Yuan

    Abstract: Low-rank compression is an important model compression strategy for obtaining compact neural network models. In general, because the rank values directly determine the model complexity and model accuracy, proper selection of layer-wise rank is very critical and desired. To date, though many low-rank compression approaches, either selecting the ranks in a manual or automatic way, have been proposed… ▽ More

    Submitted 1 February, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: AAAI-23

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence. 37, 9 (Jun. 2023), 10464-10472

  32. arXiv:2301.05809  [pdf, other

    cs.HC cs.AI cs.LG

    Who Should I Trust: AI or Myself? Leveraging Human and AI Correctness Likelihood to Promote Appropriate Trust in AI-Assisted Decision-Making

    Authors: Shuai Ma, Ying Lei, Xinru Wang, Chengbo Zheng, Chuhan Shi, Ming Yin, Xiaojuan Ma

    Abstract: In AI-assisted decision-making, it is critical for human decision-makers to know when to trust AI and when to trust themselves. However, prior studies calibrated human trust only based on AI confidence indicating AI's correctness likelihood (CL) but ignored humans' CL, hindering optimal team decision-making. To mitigate this gap, we proposed to promote humans' appropriate trust based on the CL of… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  33. arXiv:2301.05345  [pdf, other

    cs.AI cs.CV

    GOHSP: A Unified Framework of Graph and Optimization-based Heterogeneous Structured Pruning for Vision Transformer

    Authors: Miao Yin, Burak Uzkent, Yilin Shen, Hongxia **, Bo Yuan

    Abstract: The recently proposed Vision transformers (ViTs) have shown very impressive empirical performance in various computer vision tasks, and they are viewed as an important type of foundation model. However, ViTs are typically constructed with large-scale sizes, which then severely hinder their potential deployment in many practical resources-constrained applications. To mitigate this challenging probl… ▽ More

    Submitted 6 February, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: This manuscript was accepted to AAAI 2023 Main Track

  34. arXiv:2301.01593  [pdf, other

    cs.CY cs.AI cs.LG

    Multi-View MOOC Quality Evaluation via Information-Aware Graph Representation Learning

    Authors: Lu Jiang, Yibin Wang, Jianan Wang, Pengyang Wang, Minghao Yin

    Abstract: In this paper, we study the problem of MOOC quality evaluation which is essential for improving the course materials, promoting students' learning efficiency, and benefiting user services. While achieving promising performances, current works still suffer from the complicated interactions and relationships of entities in MOOC platforms. To tackle the challenges, we formulate the problem as a cours… ▽ More

    Submitted 1 January, 2023; originally announced January 2023.

  35. arXiv:2301.00898  [pdf, ps, other

    math.CO math.PR

    Permutation Statistics in Conjugacy Classes of the Symmetric Group

    Authors: Jesse Campion Loth, Michael Levet, Kevin Liu, Eric Nathan Stucky, Sheila Sundaram, Mei Yin

    Abstract: We introduce the notion of a weighted inversion statistic on the symmetric group, and examine its distribution on each conjugacy class. Our work generalizes the study of several common permutation statistics, including the number of inversions, the number of descents, the major index, and the number of excedances. As a consequence, we obtain explicit formulas for the first moments of several stati… ▽ More

    Submitted 17 May, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: We would also like to express our gratitude to Yan Zhuang for kindly alerting us to the arXiv paper of Hamaker and Rhoades (arXiv:2206.06567), after seeing the first version of the present paper. We also thank Zach Hamaker for taking the time to explain the results of the Hamaker--Rhoades paper and its overlap with the present work

    MSC Class: 05A05; 05E05; 60C05

  36. arXiv:2212.13026  [pdf, other

    q-bio.NC

    Network analysis on cortical morphometry in first-episode schizophrenia

    Authors: Mowen Yin, Weikai Huang, Zhichao Liang, Quanying Liu, Xiaoying Tang

    Abstract: First-episode schizophrenia (FES) results in abnormality of brain connectivity at different levels. Despite some successful findings on functional and structural connectivity of FES, relatively few studies have been focused on morphological connectivity, which may provide a potential biomarker for FES. In this study, we aim to investigate cortical morphological connectivity in FES. T1-weighted mag… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

  37. arXiv:2212.12767  [pdf, other

    stat.ML cs.LG

    Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning

    Authors: Yanan Xiao, Minyu Liu, Zichen Zhang, Lu Jiang, Minghao Yin, Jianan Wang

    Abstract: Traffic flow prediction is an important part of smart transportation. The goal is to predict future traffic conditions based on historical data recorded by sensors and the traffic network. As the city continues to build, parts of the transportation network will be added or modified. How to accurately predict expanding and evolving long-term streaming networks is of great significance. To this end,… ▽ More

    Submitted 24 December, 2022; originally announced December 2022.

  38. arXiv:2212.11858  [pdf, other

    eess.SP

    Multi-Frequency Channel Modeling for Millimeter Wave and THz Wireless Communication via Generative Adversarial Networks

    Authors: Yaqi Hu, Mingsheng Yin, William Xia, Sundeep Rangan, Marco Mezzavilla

    Abstract: Modern cellular systems rely increasingly on simultaneous communication in multiple discontinuous bands for macro-diversity and increased bandwidth. Multi-frequency communication is particularly crucial in the millimeter wave (mmWave) and Terahertz (THz) frequencies, as these bands are often coupled with lower frequencies for robustness. Evaluation of these systems requires statistical models that… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted by 2022 Asilomar Conference on Signals, Systems, and Computers

  39. arXiv:2212.09563  [pdf, other

    cs.CL

    Source-Free Domain Adaptation for Question Answering with Masked Self-training

    Authors: M. Yin, B. Wang, Y. Dong, C. Ling

    Abstract: Most previous unsupervised domain adaptation (UDA) methods for question answering(QA) require access to source domain data while fine-tuning the model for the target domain. Source domain data may, however, contain sensitive information and may be restricted. In this study, we investigate a more challenging setting, source-free UDA, in which we have only the pretrained source model and target doma… ▽ More

    Submitted 17 March, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

  40. arXiv:2212.02046  [pdf, other

    cs.CV

    Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for Video Recognition with Hierarchical Tucker Tensor Decomposition

    Authors: Yu Gong, Miao Yin, Lingyi Huang, Chunhua Deng, Yang Sui, Bo Yuan

    Abstract: Long short-term memory (LSTM) is a type of powerful deep neural network that has been widely used in many sequence analysis and modeling applications. However, the large model size problem of LSTM networks make their practical deployment still very challenging, especially for the video recognition tasks that require high-dimensional input data. Aiming to overcome this limitation and fully unlock t… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: TC 2022

  41. arXiv:2212.01957  [pdf, other

    cs.CV

    CSTAR: Towards Compact and STructured Deep Neural Networks with Adversarial Robustness

    Authors: Huy Phan, Miao Yin, Yang Sui, Bo Yuan, Saman Zonouz

    Abstract: Model compression and model defense for deep neural networks (DNNs) have been extensively and individually studied. Considering the co-importance of model compactness and robustness in practical applications, several prior works have explored to improve the adversarial robustness of the sparse neural networks. However, the structured sparse models obtained by the exiting works suffer severe perfor… ▽ More

    Submitted 17 February, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: AAAI-23

  42. arXiv:2211.15956  [pdf, other

    cs.LG cs.AI

    Offline Reinforcement Learning with Closed-Form Policy Improvement Operators

    Authors: Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang

    Abstract: Behavior constrained policy optimization has been demonstrated to be a successful paradigm for tackling Offline Reinforcement Learning. By exploiting historical transitions, a policy is trained to maximize a learned value function while constrained by the behavior policy to avoid a significant distributional shift. In this paper, we propose our closed-form policy improvement operators. We make a n… ▽ More

    Submitted 22 July, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted at ICML 2023

  43. arXiv:2211.13208  [pdf, other

    cs.LG

    On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation

    Authors: Thanh Nguyen-Tang, Ming Yin, Sunil Gupta, Svetha Venkatesh, Raman Arora

    Abstract: Sample-efficient offline reinforcement learning (RL) with linear function approximation has recently been studied extensively. Much of prior work has yielded the minimax-optimal bound of $\tilde{\mathcal{O}}(\frac{1}{\sqrt{K}})$, with $K$ being the number of episodes in the offline data. In this work, we seek to understand instance-dependent bounds for offline RL with function approximation. We pr… ▽ More

    Submitted 27 January, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: AAAI'23

  44. TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition

    Authors: Lizhi Xiang, Miao Yin, Chengming Zhang, Aravind Sukumaran-Rajam, P. Sadayappan, Bo Yuan, Dingwen Tao

    Abstract: Tucker decomposition is one of the SOTA CNN model compression techniques. However, unlike the FLOPs reduction, we observe very limited inference time reduction with Tucker-compressed models using existing GPU software such as cuDNN. To this end, we propose an efficient end-to-end framework that can generate highly accurate and compact CNN models via Tucker decomposition and optimized inference cod… ▽ More

    Submitted 4 January, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 14 pages, 9 figures, 3 tables, accepted by PPoPP '23

  45. arXiv:2211.00536  [pdf, other

    math.CO math.PR

    Probabilistic Parking Functions

    Authors: Irfan Durmić, Alex Han, Pamela E. Harris, Rodrigo Ribeiro, Mei Yin

    Abstract: We consider the notion of classical parking functions by introducing randomness and a new parking protocol, as inspired by the work presented in the paper ``Parking Functions: Choose your own adventure,'' (arXiv:2001.04817) by Carlson, Christensen, Harris, Jones, and Rodríguez. Among our results, we prove that the probability of obtaining a parking function, from a length $n$ preference vector, is… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 22 pages, 3 figures, 4 tables

  46. arXiv:2210.07815  [pdf, other

    cs.IR cs.LG

    Intra-session Context-aware Feed Recommendation in Live Systems

    Authors: Luo Ji, Gao Liu, Mingyang Yin, Hongxia Yang

    Abstract: Feed recommendation allows users to constantly browse items until feel uninterested and leave the session, which differs from traditional recommendation scenarios. Within a session, user's decision to continue browsing or not substantially affects occurrences of later clicks. However, such type of exposure bias is generally ignored or not explicitly modeled in most feed recommendation studies. In… ▽ More

    Submitted 11 January, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures, CIKM 2022 short paper

  47. arXiv:2210.03918  [pdf, other

    cs.AI

    Finding and Exploring Promising Search Space for the 0-1 Multidimensional Knapsack Problem

    Authors: Jitao Xu, Hongbo Li, Minghao Yin

    Abstract: The 0-1 Multidimensional Knapsack Problem (MKP) is a classical NP-hard combinatorial optimization problem with many engineering applications. In this paper, we propose a novel algorithm combining evolutionary computation with the exact algorithm to solve the 0-1 MKP. It maintains a set of solutions and utilizes the information from the population to extract good partial assignments. To find high-q… ▽ More

    Submitted 26 May, 2024; v1 submitted 8 October, 2022; originally announced October 2022.

  48. arXiv:2210.00750  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient

    Authors: Ming Yin, Mengdi Wang, Yu-Xiang Wang

    Abstract: Offline reinforcement learning, which aims at optimizing sequential decision-making strategies with historical data, has been extensively applied in real-life applications. State-Of-The-Art algorithms usually leverage powerful function approximators (e.g. neural networks) to alleviate the sample complexity hurdle for better empirical performances. Despite the successes, a more systematic understan… ▽ More

    Submitted 23 November, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

  49. arXiv:2208.11287  [pdf, other

    cs.RO cs.LG

    Robot Motion Planning as Video Prediction: A Spatio-Temporal Neural Network-based Motion Planner

    Authors: Xiao Zang, Miao Yin, Lingyi Huang, **g** Yu, Saman Zonouz, Bo Yuan

    Abstract: Neural network (NN)-based methods have emerged as an attractive approach for robot motion planning due to strong learning capabilities of NN models and their inherently high parallelism. Despite the current development in this direction, the efficient capture and processing of important sequential and spatial information, in a direct and simultaneous way, is still relatively under-explored. To ove… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted in IROS 2022

  50. arXiv:2208.06124  [pdf, other

    cs.LG stat.ML

    Gradient Estimation for Binary Latent Variables via Gradient Variance Clip**

    Authors: Russell Z. Kunes, Mingzhang Yin, Max Land, Doron Haviv, Dana Pe'er, Simon Tavaré

    Abstract: Gradient estimation is often necessary for fitting generative models with discrete latent variables, in contexts such as reinforcement learning and variational autoencoder (VAE) training. The DisARM estimator (Yin et al. 2020; Dong, Mnih, and Tucker 2020) achieves state of the art gradient variance for Bernoulli latent variable models in many contexts. However, DisARM and other estimators have pot… ▽ More

    Submitted 12 August, 2022; originally announced August 2022.