Showing 1–9 of 9 results for author: Gai, Y

Searching in archive eess.
  1. arXiv:2211.02443  [pdf]

    cs.RO eess.SY

    Robotic Assembly Control Reconfiguration Based on Transfer Reinforcement Learning for Objects with Different Geometric Features

    Authors: Yuhang Gai, Bing Wang, Jiwen Zhang, Dan Wu, Ken Chen

    Abstract: Robotic force-based compliance control is a preferred approach to achieve high-precision assembly tasks. When the geometric features of assembly objects are asymmetric or irregular, reinforcement learning (RL) agents are gradually incorporated into the compliance controller to adapt to complex force-pose mapping which is hard to model analytically. Since force-pose mapping is strongly dependent on…

    Submitted 4 November, 2022; originally announced November 2022.

  2. arXiv:2210.13255  [pdf]

    cs.RO eess.SY

    Local Connection Reinforcement Learning Method for Efficient Control of Robotic Peg-in-Hole Assembly

    Authors: Yuhang Gai, Jiwen Zhang, Dan Wu, Ken Chen

    Abstract: Traditional control methods of robotic peg-in-hole assembly rely on complex contact state analysis. Reinforcement learning (RL) is gradually becoming a preferred method of controlling robotic peg-in-hole assembly tasks. However, the training process of RL is quite time-consuming because RL methods are always globally connected, which means all state components are assumed to be the input of polici…

    Submitted 24 October, 2022; originally announced October 2022.

  3. arXiv:2206.14866  [pdf, other]

    eess.AS cs.HC

    iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre

    Authors: Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang, Tan Lee

    Abstract: The capability of generating speech with specific type of emotion is desired for many applications of human-computer interaction. Cross-speaker emotion transfer is a common approach to generating emotional speech when speech with emotion labels from target speakers is not available for model training. This paper presents a novel cross-speaker emotion transfer system, named iEmoTTS. The system is c…

    Submitted 4 January, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Submitted to IEEE Transactions on Audio, Speech, and Language Processing

  4. arXiv:2104.04078  [pdf]

    cs.LG eess.SY

    Progressive extension of reinforcement learning action dimension for asymmetric assembly tasks

    Authors: Yuhang Gai, Jiuming Guo, Dan Wu, Ken Chen

    Abstract: Reinforcement learning (RL) is always the preferred embodiment to construct the control strategy of complex tasks, like asymmetric assembly tasks. However, the convergence speed of reinforcement learning severely restricts its practical application. In this paper, the convergence is first accelerated by combining RL and compliance control. Then a completely innovative progressive extension of acti…

    Submitted 6 April, 2021; originally announced April 2021.

  5. arXiv:2103.16003  [pdf]

    eess.SY

    Feature-Based Compliance Control for Peg-in-Hole Assembly with Clearance or Interference Fit

    Authors: Yuhang Gai, Jiuming Guo, Dan Wu, Ken Chen

    Abstract: This paper aims at solving mass precise peg-in-hole assembly. First, a feature space and a response space are constructed according to the relative pose and equivalent forces and moments. Then the contact states are segmented in the feature space and the segmentation boundaries are mapped into the response space. Further, a feature-based compliance control (FBCC) algorithm is proposed based on bou…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: 10 pages, 14 figures

  6. arXiv:1109.2088  [pdf, ps, other]

    cs.LG cs.NI eess.SY math.OC math.PR

    Online Learning Algorithms for Stochastic Water-Filling

    Authors: Yi Gai, Bhaskar Krishnamachari

    Abstract: Water-filling is the term for the classic solution to the problem of allocating constrained power to a set of parallel channels to maximize the total data-rate. It is used widely in practice, for example, for power allocation to sub-carriers in multi-user OFDM systems such as WiMax. The classic water-filling algorithm is deterministic and requires perfect knowledge of the channel gain to noise rat…

    Submitted 9 September, 2011; originally announced September 2011.

  7. arXiv:1109.1552  [pdf, ps, other]

    cs.LG cs.NI eess.SY math.OC math.PR

    Efficient Online Learning for Opportunistic Spectrum Access

    Authors: Wenhan Dai, Yi Gai, Bhaskar Krishnamachari

    Abstract: The problem of opportunistic spectrum access in cognitive radio networks has been recently formulated as a non-Bayesian restless multi-armed bandit problem. In this problem, there are N arms (corresponding to channels) and one player (corresponding to a secondary user). The state of each arm evolves as a finite-state Markov chain with unknown parameters. At each time slot, the player can select K…

    Submitted 7 September, 2011; originally announced September 2011.

  8. arXiv:1109.1533  [pdf, ps, other]

    math.OC cs.LG cs.NI eess.SY math.PR

    The Non-Bayesian Restless Multi-Armed Bandit: A Case of Near-Logarithmic Strict Regret

    Authors: Wenhan Dai, Yi Gai, Bhaskar Krishnamachari, Qing Zhao

    Abstract: In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are $N$ arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A player seeks to activate $K \geq 1$ arms at each time in order to maximize the expected total reward obtained over multiple plays. RMAB is a challenging problem that is known to be PSPACE-hard in general. We consider in thi…

    Submitted 7 September, 2011; originally announced September 2011.

    Comments: arXiv admin note: significant text overlap with arXiv:1011.4752

  9. arXiv:1012.3005  [pdf, ps, other]

    math.OC cs.LG cs.NI eess.SY math.PR

    On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards

    Authors: Yi Gai, Bhaskar Krishnamachari, Mingyan Liu

    Abstract: We consider a combinatorial generalization of the classical multi-armed bandit problem that is defined as follows. There is a given bipartite graph of $M$ users and $N \geq M$ resources. For each user-resource pair $(i,j)$, there is an associated state that evolves as an aperiodic irreducible finite-state Markov chain with unknown parameters, with transitions occurring each time the particular use…

    Submitted 19 March, 2011; v1 submitted 14 December, 2010; originally announced December 2010.