Search | arXiv e-print repository

Minimizing Sensor Allocation Cost for Crowdsensing On-street Parking Availability

Authors: Boyu Pang, Ruizhi Liao, Yinyu Ye

Abstract: In recent years, innovative roadside parking vacancy crowdsensing solutions have emerged as a cost-effective alternative to traditional methods, which can significantly reduce sensor installation and maintenance expenses. This crowdsensing scheme relies on vehicles equipped with sensors, such as buses and taxis, roaming around urban streets to detect on-street parking availability. Therefore, the accuracy of this scheme strongly depends on the vehicles' routes and the frequency of their passage through parking spots. This paper presents an integer programming-based optimal sensor allocation model to ensure the detection accuracy of the scheme while using the minimum number of sensing kits or probing vehicles. Moreover, a customized heuristic algorithm is proposed to hasten the solution process. Numerical simulations using the street dataset from San Francisco confirm the model's ability to reduce probing vehicle usage while ensuring detection accuracy. Thus, our approach represents an effective means of optimizing roadside parking detection in a crowdsensing way.

Submitted 12 October, 2023; originally announced October 2023.

arXiv:2307.15400 [pdf, other]

The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022

Authors: Li Zhang, Huan Zhao, Yue Li, Bowen Pang, Yannan Wang, Hongji Wang, Wei Rao, Qing Wang, Lei Xie

Abstract: This paper describes the FlySpeech speaker diarization system submitted to the second Multimodal Information Based Speech Processing (MISP) Challenge held in ICASSP 2022. We develop an end-to-end audio-visual speaker diarization (AVSD) system, which consists of a lip encoder, a speaker encoder, and an audio-visual decoder. Specifically, to mitigate the degradation of diarization performance caused by separate training, we jointly train the speaker encoder and the audio-visual decoder. In addition, we leverage the large-data pretrained speaker extractor to initialize the speaker encoder.

Submitted 28 July, 2023; originally announced July 2023.

arXiv:2210.14653 [pdf, other]

doi 10.1109/ISCSLP57327.2022.10037846

TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge

Authors: Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang, Qing Wang, Lei Xie

Abstract: This paper describes the TSUP team's submission to the ISCSLP 2022 conversational short-phrase speaker diarization (CSSD) challenge, which focuses on short-phrase conversations and introduces a new evaluation metric called conversational diarization error rate (CDER). In this challenge, we explore three typical kinds of speaker diarization systems: spectral clustering (SC) based diarization, target-speaker voice activity detection (TS-VAD), and end-to-end neural diarization (EEND). Our major findings are summarized as follows. First, the SC approach is favored over the other two approaches under the new CDER metric. Second, hyperparameter tuning is essential to CDER for all three types of speaker diarization systems. Specifically, CDER becomes smaller when the sub-segment length is set longer. Finally, multi-system fusion through DOVER-LAP worsens the CDER metric on the challenge data. Our submitted SC system ranked third in the challenge.

Submitted 26 October, 2022; originally announced October 2022.

arXiv:2210.00204 [pdf, ps, other]

Learning-Based Adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach

Authors: Leilei Cui, Bo Pang, Zhong-Ping Jiang

Abstract: This paper studies the adaptive optimal control problem for a class of linear time-delay systems described by delay differential equations (DDEs). A crucial strategy is to take advantage of recent developments in reinforcement learning and adaptive dynamic programming and develop novel methods to learn adaptive optimal controllers from finite samples of input and state data. In this paper, the data-driven policy iteration (PI) is proposed to solve the infinite-dimensional algebraic Riccati equation (ARE) iteratively in the absence of exact model knowledge. Interestingly, the proposed recursive PI algorithm is new in the present context of continuous-time time-delay systems, even when the model knowledge is assumed known. The efficacy of the proposed learning-based control methods is validated by means of practical applications arising from metal cutting and autonomous driving.

Submitted 1 October, 2022; originally announced October 2022.

Comments: 12 pages, 8 figures

arXiv:2107.12829 [pdf, other]

Conflict-Free Four-Dimensional Path Planning for Urban Air Mobility Considering Airspace Occupations

Authors: Wei Dai, Bizhao Pang, Kin Huat Low

Abstract: Urban air mobility (UAM) has attracted the attention of aircraft manufacturers, air navigation service providers and governments in recent years. Preventing conflicts among urban aircraft is crucial to UAM traffic safety, which is key to enabling large-scale UAM operation. Pre-flight conflict-free path planning can provide a strategic layer in the maintenance of safety performance, and thus becomes an important element in UAM. This paper aims at tackling the conflict-free path planning problem for UAM operation with a consideration of four-dimensional airspace management. First, we introduced and extended a four-dimensional airspace management concept, AirMatrix. On the basis of AirMatrix, we formulated the shortest flight time path planning problem considering resolution of conflicts with both static and dynamic obstacles. A Conflict-Free A-Star algorithm was developed for planning four-dimensional paths based on a first-come-first-served scheme. The algorithm contains a novel design of the heuristic function as well as a conflict detection and resolution strategy. A numerical experiment was carried out in the Jurong East area in Singapore, and the results show that the algorithm can generate paths resolving a significant number of potential conflicts in airspace utilization, with acceptable computational time and flight delay. The contributions of this study provide references for stakeholders to support the development of UAM.

Submitted 27 July, 2021; originally announced July 2021.

arXiv:2107.07788 [pdf, other]

Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

Authors: Bo Pang, Zhong-Ping Jiang

Abstract: This paper studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noises, using reinforcement learning techniques. Based on policy iteration, a novel off-policy reinforcement learning algorithm, named optimistic least-squares-based policy iteration, is proposed which is able to find iteratively near-optimal policies of the adaptive optimal stationary control problem directly from input/state data without explicitly identifying any system matrices, starting from an initial admissible control policy. The solutions given by the proposed optimistic least-squares-based policy iteration are proved to converge to a small neighborhood of the optimal solution with probability one, under mild conditions. The application of the proposed algorithm to a triple inverted pendulum example validates its feasibility and effectiveness.

Submitted 5 December, 2021; v1 submitted 16 July, 2021; originally announced July 2021.

Comments: 10 pages, 3 figures

arXiv:2107.06172 [pdf, other]

Arrhenius.jl: A Differentiable Combustion Simulation Package

Authors: Weiqi Ji, Xingyu Su, Bin Pang, Sean Joseph Cassady, Alison M. Ferris, Yujuan Li, Zhuyin Ren, Ronald Hanson, Sili Deng

Abstract: Combustion kinetic modeling is an integral part of combustion simulation, and extensive studies have been devoted to developing both high fidelity and computationally affordable models. Despite these efforts, modeling combustion kinetics is still challenging due to the demand for expert knowledge and optimization against experiments, as well as the lack of understanding of the associated uncertainties. Therefore, data-driven approaches that enable efficient discovery and calibration of kinetic models have received much attention in recent years, the core of which is the optimization based on big data. Differentiable programming is a promising approach for learning kinetic models from data by efficiently computing the gradient of objective functions to model parameters. However, it is often challenging to implement differentiable programming in practice. Therefore, it is still not available in widely utilized combustion simulation packages such as CHEMKIN and Cantera. Here, we present a differentiable combustion simulation package leveraging the eco-system in Julia, including DifferentialEquations.jl for solving differential equations, ForwardDiff.jl for auto-differentiation, and Flux.jl for incorporating neural network models into combustion simulations and optimizing neural network models using the state-of-the-art deep learning optimizers. We demonstrate the benefits of differentiable programming in efficient and accurate gradient computations, with applications in uncertainty quantification, kinetic model reduction, data assimilation, and model discovery.

Submitted 19 June, 2021; originally announced July 2021.

arXiv:2008.11592 [pdf, other]

Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation

Authors: Bo Pang, Zhong-Ping Jiang

Abstract: This paper studies the robustness of reinforcement learning algorithms to errors in the learning process. Specifically, we revisit the benchmark problem of discrete-time linear quadratic regulation (LQR) and study the long-standing open question: Under what conditions is the policy iteration method robustly stable from a dynamical systems perspective? Using advanced stability results in control theory, it is shown that policy iteration for LQR is inherently robust to small errors in the learning process and enjoys small-disturbance input-to-state stability: whenever the error in each iteration is bounded and small, the solutions of the policy iteration algorithm are also bounded, and, moreover, enter and stay in a small neighbourhood of the optimal LQR solution. As an application, a novel off-policy optimistic least-squares policy iteration for the LQR problem is proposed, when the system dynamics are subjected to additive stochastic disturbances. The proposed new results in robust reinforcement learning are validated by a numerical example.

Submitted 15 March, 2021; v1 submitted 25 August, 2020; originally announced August 2020.

Comments: arXiv admin note: text overlap with arXiv:2005.09528

arXiv:2005.09528 [pdf, other]

Robust Policy Iteration for Continuous-time Linear Quadratic Regulation

Authors: Bo Pang, Tao Bian, Zhong-Ping Jiang

Abstract: This paper studies the robustness of policy iteration in the context of the continuous-time infinite-horizon linear quadratic regulation (LQR) problem. It is shown that Kleinman's policy iteration algorithm is inherently robust to small disturbances and enjoys local input-to-state stability in the sense of Sontag. More precisely, whenever the disturbance-induced input term in each iteration is bounded and small, the solutions of the policy iteration algorithm are also bounded and enter a small neighborhood of the optimal solution of the LQR problem. Based on this result, an off-policy data-driven policy iteration algorithm for the LQR problem is shown to be robust when the system dynamics are subjected to small additive unknown bounded disturbances. The theoretical results are validated by a numerical example.

Submitted 31 August, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

arXiv:2005.02291 [pdf, other]

NTIRE 2020 Challenge on Video Quality Mapping: Methods and Results

Authors: Dario Fuoli, Zhiwu Huang, Martin Danelljan, Radu Timofte, Hua Wang, Longcun Jin, Dewei Su, Jing Liu, Jaehoon Lee, Michal Kudelski, Lukasz Bala, Dmitry Hrybov, Marcin Mozejko, Muchen Li, Siyao Li, Bo Pang, Cewu Lu, Chao Li, Dongliang He, Fu Li, Shilei Wen

Abstract: This paper reviews the NTIRE 2020 challenge on video quality mapping (VQM), which addresses the issues of quality mapping from source video domain to target video domain. The challenge includes both a supervised track (track 1) and a weakly-supervised track (track 2) for two benchmark datasets. In particular, track 1 offers a new Internet video benchmark, requiring algorithms to learn the map from more compressed videos to less compressed videos in a supervised training manner. In track 2, algorithms are required to learn the quality mapping from one device to another when their quality varies substantially and weakly-aligned video pairs are available. For track 1, in total 7 teams competed in the final test phase, demonstrating novel and effective solutions to the problem. For track 2, some existing methods are evaluated, showing promising solutions to the weakly-supervised video quality mapping problem.

Submitted 15 June, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

Comments: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops

arXiv:1901.08650 [pdf, other]

Adaptive Optimal Control of Linear Periodic Systems: An Off-Policy Value Iteration Approach

Authors: Bo Pang, Zhong-Ping Jiang

Abstract: This paper studies the infinite-horizon adaptive optimal control of continuous-time linear periodic (CTLP) systems. A novel value iteration (VI) based off-policy ADP algorithm is proposed for a general class of CTLP systems, so that approximate optimal solutions can be obtained directly from the collected data, without the exact knowledge of system dynamics. Under mild conditions, the proofs on uniform convergence of the proposed algorithm to the optimal solutions are given for both the model-based and model-free cases. The VI-based ADP algorithm is able to find suboptimal controllers without assuming the knowledge of an initial stabilizing controller. Application to the optimal control of a triple inverted pendulum subjected to a periodically varying load demonstrates the feasibility and effectiveness of the proposed method.

Submitted 19 January, 2020; v1 submitted 24 January, 2019; originally announced January 2019.

Comments: 9 pages, 2 figures

Showing 1–11 of 11 results for author: Pang, B