Showing 1–42 of 42 results for author: Peng, B

Searching in archive eess.
  1. arXiv:2406.17338  [pdf, other

    eess.IV cs.CV cs.LG

    Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection

    Authors: Peng Huang, Shu Hu, Bo Peng, Jiashu Zhang, Xi Wu, Xin Wang

    Abstract: Current medical image classification efforts mainly aim for higher average performance, often neglecting the balance between different classes. This can lead to significant differences in recognition accuracy between classes and obvious recognition weaknesses. Without the support of massive data, deep learning faces challenges in fine-grained classification of fatty liver. In this paper, we propos… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024

  2. arXiv:2406.05652  [pdf, other

    eess.SP

    Distributed Combinatorial Optimization of Downlink User Assignment in mmWave Cell-free Massive MIMO Using Graph Neural Networks

    Authors: Bile Peng, Bihan Guo, Karl-Ludwig Besser, Luca Kunz, Ramprasad Raghunath, Anke Schmeink, Eduard A Jorswieck, Giuseppe Caire, H. Vincent Poor

    Abstract: Millimeter wave (mmWave) cell-free massive MIMO (CF mMIMO) is a promising solution for future wireless communications. However, its optimization is non-trivial due to the challenging channel characteristics. We show that mmWave CF mMIMO optimization is largely an assignment problem between access points (APs) and users due to the high path loss of mmWave channels, the limited output power of the a… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2404.16522  [pdf, other

    eess.IV cs.LG

    A Deep Learning-Driven Pipeline for Differentiating Hypertrophic Cardiomyopathy from Cardiac Amyloidosis Using 2D Multi-View Echocardiography

    Authors: Bo Peng, Xiaofeng Li, Xinyu Li, Zhenghan Wang, Hui Deng, Xiaoxian Luo, Lixue Yin, Hongmei Zhang

    Abstract: Hypertrophic cardiomyopathy (HCM) and cardiac amyloidosis (CA) are both heart conditions that can progress to heart failure if untreated. They exhibit similar echocardiographic characteristics, often leading to diagnostic challenges. This paper introduces a novel multi-view deep learning approach that utilizes 2D echocardiography for differentiating between HCM and CA. The method begins by classif… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  4. arXiv:2404.08549  [pdf

    eess.IV cs.CV physics.bio-ph

    Benchmarking the Cell Image Segmentation Models Robustness under the Microscope Optical Aberrations

    Authors: Boyuan Peng, Jiaju Chen, Qihui Ye, Minjiang Chen, Peiwu Qin, Chenggang Yan, Dongmei Yu, Zhenglin Chen

    Abstract: Cell segmentation is essential in biomedical research for analyzing cellular morphology and behavior. Deep learning methods, particularly convolutional neural networks (CNNs), have revolutionized cell segmentation by extracting intricate features from images. However, the robustness of these methods under microscope optical aberrations remains a critical challenge. This study comprehensively evalu… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  5. arXiv:2403.14172  [pdf

    eess.SY

    Lane level joint control of off-ramp and main line speed guidance on expressway in rainy weather

    Authors: Boyao Peng, Lexing Zhang, Enkai Li

    Abstract: In the upstream of the exit ramp of the expressway, the speed limit difference leads to a significant deceleration of the vehicle in the area adjacent to the off-ramp. The friction coefficient of the road surface decreases under rainy weather, and the above deceleration process can easily lead to sideslip and rollover of the vehicle. Dynamic speed guidance is an effective way to improve the status… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 103rd TRB Conference

    Report number: TRBAM-24-01802 MSC Class: 90-10 ACM Class: A.0

  6. arXiv:2403.04028  [pdf, other

    cs.IT eess.SP

    RISnet: A Domain-Knowledge Driven Neural Network Architecture for RIS Optimization with Mutual Coupling and Partial CSI

    Authors: Bile Peng, Karl-Ludwig Besser, Shanpu Shen, Finn Siegismund-Poschmann, Ramprasad Raghunath, Daniel Mittleman, Vahid Jamali, Eduard A. Jorswieck

    Abstract: Multiple access techniques are cornerstones of wireless communications. Their performance depends on the channel properties, which can be improved by reconfigurable intelligent surfaces (RISs). In this work, we jointly optimize MA precoding at the base station (BS) and RIS configuration. We tackle difficulties of mutual coupling between RIS elements, scalability to more than 1000 RIS elements, and… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 13 pages, 16 figures

  7. arXiv:2401.16889  [pdf, other

    cs.RO cs.AI eess.SY

    Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control

    Authors: Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: This paper presents a comprehensive study on using deep reinforcement learning (RL) to create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single locomotion skill, we develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing. Our RL-based controller incorporates a n… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  8. arXiv:2401.14281  [pdf, other

    eess.SP

    Energy-Efficient Power Allocation in Cell-Free Massive MIMO via Graph Neural Networks

    Authors: Ramprasad Raghunath, Bile Peng, Eduard A. Jorswieck

    Abstract: CF-mMIMO systems are a promising solution to enhance the performance in 6G wireless networks. Its distributed nature of the architecture makes it highly reliable, provides sufficient coverage and allows higher performance than cellular networks. EE is an important metric that reduces the operating costs and also better for the environment. In this work, we optimize the downlink EE performance with… ▽ More

    Submitted 9 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  9. arXiv:2312.10921  [pdf, other

    cs.CV cs.SD eess.AS

    AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

    Authors: Dongze Li, Kang Zhao, Wei Wang, Bo Peng, Yingya Zhang, Jing Dong, Tieniu Tan

    Abstract: Audio-driven talking head synthesis is a promising topic with wide applications in digital human, film making and virtual reality. Recent NeRF-based approaches have shown superiority in quality and fidelity compared to previous studies. However, when it comes to few-shot talking head generation, a practical scenario where only few seconds of talking video is available for one identity, two limitat… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  10. arXiv:2309.09565  [pdf, other

    eess.SP

    A Covariance Adaptive Student's t Based Kalman Filter

    Authors: Benyang Gong, Jiacheng He, Gang Wang, Bei Peng

    Abstract: In the classical Kalman filter (KF), the estimated state is a linear combination of the one-step predicted state and measurement state, and their confidence levels change when the prediction mean square error matrix and covariance matrix of measurement noise vary. The existing Student's t based Kalman filter (TKF) works similarly to the KF; both work well with impulse noise, but when it co… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  11. arXiv:2309.08088  [pdf, ps, other

    eess.SY

    Interactive Model Fusion-Based GM-PHD Filter

    Authors: Jiacheng He, Shan Zhong, Bei Peng, Gang Wang, Qizhen Wang

    Abstract: In multi-target tracking (MTT), non-Gaussian measurement noise from sensors can diminish the performance of the Gaussian-assumed Gaussian mixture probability hypothesis density (GM-PHD) filter. In this paper, an approach that transforms the MTT problem under non-Gaussian conditions into an MTT problem under Gaussian conditions is developed. Specifically, measurement noise with a non-Gaussian distr… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: conference

  12. arXiv:2307.01445  [pdf, ps, other

    eess.SP

    Distributed fusion filter over lossy wireless sensor networks with the presence of non-Gaussian noise

    Authors: Jiacheng He, Bei Peng, Zhenyu Feng, Xuemei Mao, Song Gao, Gang Wang

    Abstract: The information transmission between nodes in wireless sensor networks (WSNs) often causes packet loss due to denial-of-service (DoS) attacks, energy limitations, and environmental factors, and the information that is successfully transmitted can also be contaminated by non-Gaussian noise. The presence of these two factors poses a challenge for distributed state estimation (DSE) over WSNs. In thi… ▽ More

    Submitted 6 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

  13. arXiv:2307.01442  [pdf, ps, other

    cs.IT eess.SP

    Quantized criterion-based kernel recursive least squares adaptive filtering for time series prediction

    Authors: Jiacheng He, Gang Wang, Kun Zhang, Shan Zhong, Bei Peng

    Abstract: The robustness of the kernel recursive least square (KRLS) algorithm has recently been improved by combining them with more robust information-theoretic learning criteria, such as minimum error entropy (MEE) and generalized MEE (GMEE), which also improves the computational complexity of the KRLS-type algorithms to a certain extent. To reduce the computational load of the KRLS-type algorithms, the… ▽ More

    Submitted 6 September, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

  14. arXiv:2306.13564  [pdf, other

    cs.CV eess.IV

    Estimating Residential Solar Potential Using Aerial Data

    Authors: Ross Goroshin, Alex Wilson, Andrew Lamb, Betty Peng, Brandon Ewonus, Cornelius Ratsch, Jordan Raisher, Marisa Leung, Max Burq, Thomas Colthurst, William Rucklidge, Carl Elkin

    Abstract: Project Sunroof estimates the solar potential of residential buildings using high quality aerial data. That is, it estimates the potential solar energy (and associated financial savings) that can be captured by buildings if solar panels were to be installed on their roofs. Unfortunately its coverage is limited by the lack of high resolution digital surface map (DSM) data. We present a deep learnin… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Journal ref: ICLR 2023 - Tackling Climate Change with Machine Learning Workshop

  15. arXiv:2306.11476  [pdf, other

    eess.SP

    A Model Fusion Distributed Kalman Filter For Non-Gaussian Observation Noise

    Authors: Xuemei Mao, Gang Wang, Bei Peng, Jiacheng He, Kun Zhang, Song Gao

    Abstract: The distributed Kalman filter (DKF) has attracted extensive research as an information fusion method for wireless sensor networks (WSNs). The DKF in non-Gaussian environments is still a pressing problem. In this paper, we approximate the non-Gaussian noise as a Gaussian mixture model and estimate the parameters through the expectation-maximization algorithm. A DKF, called model fusion DKF (MFDKF… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  16. arXiv:2305.18875  [pdf, other

    eess.SY cs.LG cs.MA

    Centralised rehearsal of decentralised cooperation: Multi-agent reinforcement learning for the scalable coordination of residential energy flexibility

    Authors: Flora Charbonnier, Bei Peng, Thomas Morstyn, Malcolm McCulloch

    Abstract: This paper investigates how deep multi-agent reinforcement learning can enable the scalable and privacy-preserving coordination of residential energy flexibility. The coordination of distributed resources such as electric vehicles and heating will be critical to the successful integration of large shares of renewable energy in our electricity grid and, thus, to help mitigate climate change. The pr… ▽ More

    Submitted 5 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  17. arXiv:2305.00692  [pdf, other

    eess.SP cs.IT

    Non-Orthogonal Multiple Access Assisted by Reconfigurable Intelligent Surface Using Unsupervised Machine Learning

    Authors: Finn Siegismund-Poschmann, Bile Peng, Eduard A. Jorswieck

    Abstract: Nonorthogonal multiple access (NOMA) with multi-antenna base station (BS) is a promising technology for next-generation wireless communication, which has high potential in performance and user fairness. Since the performance of NOMA depends on the channel conditions, we can combine NOMA and reconfigurable intelligent surface (RIS), which is a large and passive antenna array and can optimize the wi… ▽ More

    Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  18. arXiv:2305.00667  [pdf, ps, other

    eess.SP cs.IT

    RISnet: A Scalable Approach for Reconfigurable Intelligent Surface Optimization with Partial CSI

    Authors: Bile Peng, Karl-Ludwig Besser, Ramprasad Raghunath, Vahid Jamali, Eduard A. Jorswieck

    Abstract: The reconfigurable intelligent surface (RIS) is a promising technology that enables wireless communication systems to achieve improved performance by intelligently manipulating wireless channels. In this paper, we consider the sum-rate maximization problem in a downlink multi-user multi-input-single-output (MISO) channel via space-division multiple access (SDMA). Two major challenges of this probl… ▽ More

    Submitted 18 August, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  19. arXiv:2302.09450  [pdf, other

    cs.RO cs.AI eess.SY

    Robust and Versatile Bipedal Jumping Control through Reinforcement Learning

    Authors: Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: This work aims to push the limits of agility for bipedal robots by enabling a torque-controlled bipedal robot to perform robust and versatile dynamic jumps in the real world. We present a reinforcement learning framework for training a robot to accomplish a large variety of jumping tasks, such as jumping to different locations and directions. To improve performance on these challenging tasks, we d… ▽ More

    Submitted 31 May, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: Accepted in Robotics: Science and Systems 2023 (RSS 2023). The accompanying video is at https://youtu.be/aAPSZ2QFB-E

  20. arXiv:2301.05867  [pdf, other

    eess.SP

    State Estimation of Wireless Sensor Networks in the Presence of Data Packet Drops and Non-Gaussian Noise

    Authors: Jiacheng He, Gang Wang, Xuemei Mao, Song Gao, Bei Peng

    Abstract: Distributed Kalman filter approaches based on the maximum correntropy criterion have recently demonstrated superior state estimation performance to that of conventional distributed Kalman filters for wireless sensor networks in the presence of non-Gaussian impulsive noise. However, these algorithms currently fail to take account of data packet drops. The present work addresses this issue by propos… ▽ More

    Submitted 3 September, 2023; v1 submitted 14 January, 2023; originally announced January 2023.

  21. arXiv:2301.05813  [pdf, ps, other

    eess.SP

    Minimum Error Entropy Rauch-Tung-Striebel Smoother

    Authors: Jiacheng He, Hongwei Wang, Gang Wang, Shan Zhong, Bei Peng

    Abstract: Outliers and impulsive disturbances often cause heavy-tailed distributions in practical applications, and these will degrade the performance of Gaussian approximation smoothing algorithms. To improve the robustness of the Rauch-Tung-Striebel (RTS) smoother against complicated non-Gaussian noises, a new RTS-smoother integrated with the minimum error entropy (MEE) criterion (MEE-RTS) is proposed for… ▽ More

    Submitted 2 February, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

  22. arXiv:2212.12329  [pdf, other

    eess.SP cs.LG

    Approaching Globally Optimal Energy Efficiency in Interference Networks via Machine Learning

    Authors: Bile Peng, Karl-Ludwig Besser, Ramprasad Raghunath, Eduard A. Jorswieck

    Abstract: This work presents a machine learning approach to optimize the energy efficiency (EE) in a multi-cell wireless network. This optimization problem is non-convex and its global optimum is difficult to find. In the literature, either simple but suboptimal approaches or optimal methods with high complexity and poor scalability are proposed. In contrast, we propose a machine learning framework to appro… ▽ More

    Submitted 14 December, 2023; v1 submitted 25 November, 2022; originally announced December 2022.

  23. arXiv:2212.02967  [pdf, other

    eess.SP

    RISnet: a Dedicated Scalable Neural Network Architecture for Optimization of Reconfigurable Intelligent Surfaces

    Authors: Bile Peng, Finn Siegismund-Poschmann, Eduard A. Jorswieck

    Abstract: The reconfigurable intelligent surface (RIS) is a promising technology for next-generation wireless communication. It comprises many passive antennas, which reflect signals from the transmitter to the receiver with adjusted phases without changing the amplitude. The large number of the antennas enables a huge potential of signal processing despite the simple functionality of a single antenna. Howe… ▽ More

    Submitted 15 January, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  24. arXiv:2210.04435  [pdf, other

    cs.RO cs.AI eess.SY

    Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning

    Authors: Xiaoyu Huang, Zhongyu Li, Yanzhen Xiang, Yiming Ni, Yufeng Chi, Yunhao Li, Lizhi Yang, Xue Bin Peng, Koushil Sreenath

    Abstract: We present a reinforcement learning (RL) framework that enables quadrupedal robots to perform soccer goalkeeping tasks in the real world. Soccer goalkeeping using quadrupeds is a challenging problem, that combines highly dynamic locomotion with precise and fast non-prehensile object (ball) manipulation. The robot needs to react to and intercept a potentially flying ball using dynamic locomotion ma… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally. Accompanying video is at https://youtu.be/iX6OgG67-ZQ

  25. arXiv:2208.01160  [pdf, other

    cs.RO cs.AI eess.SY

    Hierarchical Reinforcement Learning for Precise Soccer Shooting Skills using a Quadrupedal Robot

    Authors: Yandong Ji, Zhongyu Li, Yinan Sun, Xue Bin Peng, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: We address the problem of enabling quadrupedal robots to perform precise shooting skills in the real world using reinforcement learning. Developing algorithms to enable a legged robot to shoot a soccer ball to a given target is a challenging problem that combines robot motion control and planning into one task. To solve this problem, we need to consider the dynamics limitation and motion stability… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted to 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  26. arXiv:2207.00001  [pdf

    cs.CV eess.IV

    MultiEarth 2022 -- The Champion Solution for Image-to-Image Translation Challenge via Generation Models

    Authors: Yuchuan Gou, Bo Peng, Hongchen Liu, Hang Zhou, Jui-Hsin Lai

    Abstract: The MultiEarth 2022 Image-to-Image Translation challenge provides a well-constrained test bed for generating the corresponding RGB Sentinel-2 imagery with the given Sentinel-1 VV & VH imagery. In this challenge, we designed various generation models and found the SPADE [1] and pix2pixHD [2] models could perform our best results. In our self-evaluation, the SPADE-2 model with L1-loss can achieve 0.… ▽ More

    Submitted 17 June, 2022; originally announced July 2022.

    Comments: CVPR 2022, MultiEarth 2022, Image-to-Image translation, competition

  27. arXiv:2206.09488  [pdf, other

    eess.SP

    Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG

    Authors: Marjan Tajik, Mohammadreza Maleki, Nader Mokari, Mohammad Reza Javan, Hamid Saeedi, Bile Peng, Eduard A. Jorswieck

    Abstract: In this work, we adopt the emerging technology of mobile edge computing (MEC) in the Unmanned aerial vehicles (UAVs) for communication-computing systems, to optimize the age of information (AoI) in the network. We assume that tasks are processed jointly on UAVs and BS to enhance edge performance with limited connectivity and computing. Using UAVs and BS jointly with MEC can reduce AoI on the netwo… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

  28. arXiv:2202.06668  [pdf, other

    eess.SP cs.IT

    Resource allocation for reconfigurable intelligent surface aided broadcast channels

    Authors: Cong Sun, Xian Liu, Bile Peng, Eduard Jorswieck

    Abstract: A two-user downlink network aided by a reconfigurable intelligent surface is considered. The weighted sum signal to interference plus noise ratio maximization and the sum rate maximization models are presented, where the precoding vectors and the RIS matrix are jointly optimized. Since the optimization problem is non-convex and difficult, new approximation models are proposed. The upper bounds of… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  29. arXiv:2201.02834  [pdf, other

    eess.SP cs.LG

    Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

    Authors: Bile Peng, Jan-Aike Termöhlen, Cong Sun, Danping He, Ke Guan, Tim Fingscheidt, Eduard A. Jorswieck

    Abstract: Reconfigurable intelligent surface (RIS) is an emerging technology for future wireless communication systems. In this work, we consider downlink spatial multiplexing enabled by the RIS for weighted sum-rate (WSR) maximization. In the literature, most solutions use alternating gradient-based optimization, which has moderate performance, high complexity, and limited scalability. We propose to apply… ▽ More

    Submitted 21 September, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

  30. arXiv:2109.13322  [pdf, other

    physics.optics eess.SY physics.app-ph quant-ph

    Induced transparency: interference or polarization?

    Authors: Changqing Wang, Xuefeng Jiang, William R. Sweeney, Chia Wei Hsu, Yiming Liu, Guangming Zhao, Bo Peng, Mengzhen Zhang, Liang Jiang, A. Douglas Stone, Lan Yang

    Abstract: The polarization of optical fields is a crucial degree of freedom in the all-optical analogue of electromagnetically induced transparency (EIT). However, the physical origins of EIT and polarization induced phenomena have not been well distinguished, which can lead to confusion in associated applications such as slow light and optical/quantum storage. Here we study the polarization effects in vari… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 8 pages, 4 figures, 57 references. The published version can be found via URL: https://www.pnas.org/content/118/3/e2012982118

    Journal ref: Proceedings of the National Academy of Sciences Vol. 118 No. 3 e2012982118 (19 Jan 2021)

  31. arXiv:2109.03463  [pdf, ps, other

    eess.SP

    Generalized Minimum Error Entropy for Adaptive Filtering

    Authors: Jiacheng He, Gang Wang, Bei Peng, Zhenyu Feng, Kun Zhang

    Abstract: Error entropy is an important nonlinear similarity measure, and it has received increasing attention in many practical applications. The default kernel function of the error entropy criterion is the Gaussian kernel function, which, however, is not always the best choice. In our study, a novel concept, called generalized error entropy, utilizing the generalized Gaussian density (GGD) function as the kernel f… ▽ More

    Submitted 1 September, 2023; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 9 pages, 8 figures

  32. arXiv:2108.11623  [pdf, other

    cs.LG cs.RO eess.SY

    Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

    Authors: Baiyu Peng, Jingliang Duan, Jianyu Chen, Shengbo Eben Li, Genjin Xie, Congsheng Zhang, Yang Guan, Yao Mu, Enxin Sun

    Abstract: Safety is essential for reinforcement learning (RL) applied in the real world. Adding chance constraints (or probabilistic constraints) is a suitable way to enhance RL safety under uncertainty. Existing chance-constrained RL methods like the penalty methods and the Lagrangian methods either exhibit periodic oscillations or learn an over-conservative or unsafe policy. In this paper, we address thes… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

  33. arXiv:2103.14295  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots

    Authors: Zhongyu Li, Xuxin Cheng, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath

    Abstract: Developing robust walking controllers for bipedal robots is a challenging endeavor. Traditional model-based locomotion controllers require simplifying assumptions and careful modelling; any small errors can result in unstable control. To address these challenges for bipedal locomotion, we present a model-free reinforcement learning framework for training robust locomotion policies in simulation, w… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: To appear on 2021 International Conference on Robotics and Automation (ICRA 2021)

  34. arXiv:2102.08539  [pdf, other

    cs.LG cs.AI eess.SY

    Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning

    Authors: Baiyu Peng, Yao Mu, Jingliang Duan, Yang Guan, Shengbo Eben Li, Jianyu Chen

    Abstract: Safety is essential for reinforcement learning (RL) applied in real-world tasks like autonomous driving. Chance constraints which guarantee the satisfaction of state constraints at a high probability are suitable to represent the requirements in real-world environment with uncertainty. Existing chance constrained RL methods like the penalty method and the Lagrangian method either exhibit periodic… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  35. arXiv:2012.10716  [pdf, other

    cs.LG cs.AI eess.SY

    Model-Based Actor-Critic with Chance Constraint for Stochastic System

    Authors: Baiyu Peng, Yao Mu, Yang Guan, Shengbo Eben Li, Yuming Yin, Jianyu Chen

    Abstract: Safety is essential for reinforcement learning (RL) applied in real-world situations. Chance constraints are suitable to represent the safety requirements in stochastic systems. Previous chance-constrained RL methods usually have a low convergence rate, or only learn a conservative policy. In this paper, we propose a model-based chance constrained actor-critic (CCAC) algorithm which can efficientl… ▽ More

    Submitted 16 March, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

  36. arXiv:2003.00848  [pdf, other

    eess.SY cs.LG cs.RO stat.ML

    Mixed Reinforcement Learning with Additive Stochastic Uncertainty

    Authors: Yao Mu, Shengbo Eben Li, Chang Liu, Qi Sun, Bingbing Nie, Bo Cheng, Baiyu Peng

    Abstract: Reinforcement learning (RL) methods often rely on massive exploration data to search optimal policies, and suffer from poor sampling efficiency. This paper presents a mixed reinforcement learning (mixed RL) algorithm by simultaneously using dual representations of environmental dynamics to search the optimal policy with the purpose of improving both learning accuracy and training speed. The dual r… ▽ More

    Submitted 28 February, 2020; originally announced March 2020.

  37. arXiv:2002.07699  [pdf, other

    q-bio.QM cs.LG eess.IV q-bio.NC

    Cognitive Biomarker Prioritization in Alzheimer's Disease using Brain Morphometric Data

    Authors: Bo Peng, Xiaohui Yao, Shannon L. Risacher, Andrew J. Saykin, Li Shen, Xia Ning

    Abstract: Background: Cognitive assessments represent the most common clinical routine for the diagnosis of Alzheimer's Disease (AD). Given a large number of cognitive assessment tools and time-limited office visits, it is important to determine a proper set of cognitive tests for different subjects. Most current studies create guidelines of cognitive test selection for a targeted population, but they are no… ▽ More

    Submitted 12 November, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: This paper has been accepted by BMC MIDM

  38. arXiv:1911.04470  [pdf, other

    cs.CV cs.LG eess.IV

    Semi-Heterogeneous Three-Way Joint Embedding Network for Sketch-Based Image Retrieval

    Authors: Jianjun Lei, Yuxin Song, Bo Peng, Zhanyu Ma, Ling Shao, Yi-Zhe Song

    Abstract: Sketch-based image retrieval (SBIR) is a challenging task due to the large cross-domain gap between sketches and natural images. How to align abstract sketches and natural images into a common high-level semantic space remains a key problem in SBIR. In this paper, we propose a novel semi-heterogeneous three-way joint embedding network (Semi3-Net), which integrates three branches (a sketch branch,… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology

  39. arXiv:1911.03552  [pdf

    physics.optics eess.SY physics.class-ph quant-ph

    Electromagnetically induced transparency at a chiral exceptional point

    Authors: Changqing Wang, Xuefeng Jiang, Guangming Zhao, Mengzhen Zhang, Chia Wei Hsu, Bo Peng, A. Douglas Stone, Liang Jiang, Lan Yang

    Abstract: Electromagnetically induced transparency, as a quantum interference effect to eliminate optical absorption in an opaque medium, has found extensive applications in slow light generation, optical storage, frequency conversion, optical quantum memory as well as enhanced nonlinear interactions at the few-photon level in all kinds of systems. Recently, there have been great interests in exceptional po… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 22 pages, 4 figures, 44 references

    Journal ref: Nature Physics 16, 334-340 (2020)

  40. arXiv:1907.04700  [pdf, other

    eess.SP cs.IT eess.SY

    Cooperative Localization with Angular Measurements and Posterior Linearization

    Authors: Yibo Wu, Bile Peng, Henk Wymeersch, Gonzalo Seco-Granados, Anastasios Kakkavas, Mario H. Castañeda Garcia, Richard A. Stirling-Gallacher

    Abstract: The application of cooperative localization in vehicular networks is attractive to improve accuracy and coverage. Conventional distance measurements between vehicles are limited by the need for synchronization and provide no heading information of the vehicle. To address this, we present a cooperative localization algorithm using posterior linearization belief propagation (PLBP) utilizing angle-of… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: Submitted for possible publication to an IEEE conference

  41. arXiv:1904.09252  [pdf, ps, other

    eess.SP cs.IT

    Learning Physical-Layer Communication with Quantized Feedback

    Authors: Jinxiang Song, Bile Peng, Christian Häger, Henk Wymeersch, Anant Sahai

    Abstract: Data-driven optimization of transmitters and receivers can reveal new modulation and detection schemes and enable physical-layer communication over unknown channels. Previous work has shown that practical implementations of this approach require a feedback signal from the receiver to the transmitter. In this paper, we study the impact of quantized feedback in data-driven learning of physical-layer… ▽ More

    Submitted 4 November, 2019; v1 submitted 19 April, 2019; originally announced April 2019.

  42. Sim-to-Real Transfer of Robotic Control with Dynamics Randomization

    Authors: Xue Bin Peng, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel

    Abstract: Simulations are attractive environments for training agents as they provide an abundant source of data and alleviate certain safety concerns during the training process. But the behaviours developed by agents in simulation are often specific to the characteristics of the simulator. Due to modeling error, strategies that are successful in simulation may not transfer to their real world counterparts… ▽ More

    Submitted 2 March, 2018; v1 submitted 17 October, 2017; originally announced October 2017.