Search | arXiv e-print repository

Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks

Authors: Tianchen Yuan, Petros A. Ioannou

Abstract: Arterial traffic interacts with freeway traffic, yet the two are controlled independently. Arterial traffic signals do not take into account freeway traffic and how ramps control ingress traffic and have no control over egress traffic from the freeway. This often results in long queues in either direction that block ramps and spill over to arterial streets or freeway lanes. In this paper, we propo… ▽ More Arterial traffic interacts with freeway traffic, yet the two are controlled independently. Arterial traffic signals do not take into account freeway traffic and how ramps control ingress traffic and have no control over egress traffic from the freeway. This often results in long queues in either direction that block ramps and spill over to arterial streets or freeway lanes. In this paper, we propose an adaptive arterial traffic control strategy that combines traffic signal control (TSC) and dynamic speed offset (DSO) coordination using a Q-learning algorithm for a traffic network that involves a freeway segment and adjacent arterial streets. The TSC agent computes the signal cycle length and split based on observed intersection demands and adjacent freeway off-ramp queues. The DSO agent computes the relative offset and the recommended speeds of both ways between consecutive intersections based on their physical distance, intersection queues, and signal cycles. We evaluate the performance of the proposed arterial traffic control strategy using microscopic traffic simulations of an arterial corridor with seven intersections near the I-710 freeway. The proposed QL-based control significantly outperforms a fixed-time control and MAXBAND in terms of the travel time and the number of stops under low or moderate demands. In high-demand scenarios, the travel-time benefit provided by the QL-based control is reduced as it mitigates off-ramp and intersection queues, which is a necessary trade-off in our perspective. In addition, mutual benefit is obtained by implementing freeway and arterial traffic control simultaneously. △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: submitted to TR-C

arXiv:2404.01102 [pdf, other]

Diffusion based Zero-shot Medical Image-to-Image Translation for Cross Modality Segmentation

Authors: Zihao Wang, Yingyu Yang, Yuzhou Chen, Tingting Yuan, Maxime Sermesant, Herve Delingette, Ona Wu

Abstract: Cross-modality image segmentation aims to segment the target modalities using a method designed in the source modality. Deep generative models can translate the target modality images into the source modality, thus enabling cross-modality segmentation. However, a vast body of existing cross-modality image translation methods relies on supervised learning. In this work, we aim to address the challe… ▽ More Cross-modality image segmentation aims to segment the target modalities using a method designed in the source modality. Deep generative models can translate the target modality images into the source modality, thus enabling cross-modality segmentation. However, a vast body of existing cross-modality image translation methods relies on supervised learning. In this work, we aim to address the challenge of zero-shot learning-based image translation tasks (extreme scenarios in the target modality is unseen in the training phase). To leverage generative learning for zero-shot cross-modality image segmentation, we propose a novel unsupervised image translation method. The framework learns to translate the unseen source image to the target modality for image segmentation by leveraging the inherent statistical consistency between different modalities for diffusion guidance. Our framework captures identical cross-modality features in the statistical domain, offering diffusion guidance without relying on direct map**s between the source and target domains. This advantage allows our method to adapt to changing source domains without the need for retraining, making it highly practical when sufficient labeled source domain data is not available. The proposed framework is validated in zero-shot cross-modality image segmentation tasks through empirical comparisons with influential generative models, including adversarial-based and diffusion-based models. △ Less

Submitted 9 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: Neurips 2023 Diffusion Workshop

arXiv:2310.16748 [pdf, other]

Integrated Freeway Traffic Control Using Q-Learning with Adjacent Arterial Traffic Considerations

Authors: Tianchen Yuan, Petros A. Ioannou

Abstract: Numerous studies have shown the effectiveness of intelligent transportation system techniques such as variable speed limit (VSL), lane change (LC) control, and ramp metering (RM) in freeway traffic flow control. The integration of these techniques has the potential to further enhance the traffic operation efficiency of both freeway and adjacent arterial networks. In this regard, we propose a freew… ▽ More Numerous studies have shown the effectiveness of intelligent transportation system techniques such as variable speed limit (VSL), lane change (LC) control, and ramp metering (RM) in freeway traffic flow control. The integration of these techniques has the potential to further enhance the traffic operation efficiency of both freeway and adjacent arterial networks. In this regard, we propose a freeway traffic control (FTC) strategy that coordinates VSL, LC, RM actions using a Q-learning (QL) framework which takes into account arterial traffic characteristics. The signal timing and demands of adjacent arterial intersections are incorporated as state variables of the FTC agent. The FTC agent is initially trained offline using a single-section road network, and subsequently deployed online in a connected freeway and arterial simulation network for continuous learning. The arterial network is assumed to be regulated by a traffic-responsive signal control strategy based on a cycle length model. Microscopic simulations demonstrate that the fully-trained FTC agent provides significant reductions in freeway travel time and the number of stops in scenarios with traffic congestion. It clearly outperforms an uncoordinated FTC and a decentralized feedback control strategy. Even though the FTC agent does not control the arterial traffic signals, it leads to shorter average queue lengths at arterial intersections by taking into account the arterial traffic conditions in controlling freeway traffic. These results motivate a future research where the QL framework will also include the control of arterial traffic signals. △ Less

Submitted 19 May, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

Comments: 12 pages, 9 figures, 5 tables

arXiv:2303.17188 [pdf, ps, other]

doi 10.1109/TVT.2023.3263971

High-Performance Low-Complexity Hierarchical Frequency Synchronization for Distributed Massive MIMO-OFDMA Systems

Authors: Xiao-Yang Wang, Shaoshi Yang, Tian-Hao Yuan, Hou-Yu Zhai, Jianhua Zhang, Lajos Hanzo

Abstract: We propose a high-performance yet low-complexity hierarchical frequency synchronization scheme for orthogonal frequency-division multiple-access (OFDMA) aided distributed massive multi-input multi-output (MIMO) systems, where multi-ple carrier frequency offsets (CFOs) have to be estimated in the uplink. To solve this multi-CFO estimation problem efficiently, we classify the active antenna units (A… ▽ More We propose a high-performance yet low-complexity hierarchical frequency synchronization scheme for orthogonal frequency-division multiple-access (OFDMA) aided distributed massive multi-input multi-output (MIMO) systems, where multi-ple carrier frequency offsets (CFOs) have to be estimated in the uplink. To solve this multi-CFO estimation problem efficiently, we classify the active antenna units (AAUs) as the master and the slaves. Then, we split the scheme into two stages. During the first stage the distributed slave AAUs are synchronized with the master AAU, while the user equipment (UE) is synchronized with the closest slave AAU during the second stage. The mean square error (MSE) performance of our scheme is better than that of the representative state-of-the-art baseline schemes, while its computational complexity is substantially lower. △ Less

Submitted 30 March, 2023; originally announced March 2023.

Comments: 6 pages, 7 figures, accepted by IEEE Transactions on Vehicular Technology

arXiv:2211.03545 [pdf, other]

ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech

Authors: Xiaoran Fan, Chao Pang, Tian Yuan, He Bai, Renjie Zheng, Pengfei Zhu, Shuohuan Wang, Junkun Chen, Zeyu Chen, Liang Huang, Yu Sun, Hua Wu

Abstract: Speech representation learning has improved both speech understanding and speech synthesis tasks for single language. However, its ability in cross-lingual scenarios has not been explored. In this paper, we extend the pretraining method for cross-lingual multi-speaker speech synthesis tasks, including cross-lingual multi-speaker voice cloning and cross-lingual multi-speaker speech editing. We prop… ▽ More Speech representation learning has improved both speech understanding and speech synthesis tasks for single language. However, its ability in cross-lingual scenarios has not been explored. In this paper, we extend the pretraining method for cross-lingual multi-speaker speech synthesis tasks, including cross-lingual multi-speaker voice cloning and cross-lingual multi-speaker speech editing. We propose a speech-text joint pretraining framework, where we randomly mask the spectrogram and the phonemes given a speech example and its transcription. By learning to reconstruct the masked parts of the input in different languages, our model shows great improvements over speaker-embedding-based multi-speaker TTS methods. Moreover, our framework is end-to-end for both the training and the inference without any finetuning effort. In cross-lingual multi-speaker voice cloning and cross-lingual multi-speaker speech editing tasks, our experiments show that our model outperforms speaker-embedding-based multi-speaker TTS methods. △ Less

Submitted 4 December, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

arXiv:2205.12007 [pdf, other]

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

Authors: Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang

Abstract: PaddleSpeech is an open-source all-in-one speech toolkit. It aims at facilitating the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves co… ▽ More PaddleSpeech is an open-source all-in-one speech toolkit. It aims at facilitating the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly avaiable at https://github.com/PaddlePaddle/PaddleSpeech. △ Less

Submitted 20 May, 2022; originally announced May 2022.

arXiv:2111.07056 [pdf, other]

doi 10.1109/TITS.2022.3157516

Selection of the Speed Command Distance for Improved Performance of a Rule-Based VSL and Lane Change Control

Authors: Tianchen Yuan, Faisal Alasiri, Petros A. Ioannou

Abstract: Variable Speed Limit (VSL) control has been one of the most popular techniques with the potential of smoothing traffic flow, maximizing throughput at bottlenecks, and improving mobility and safety. Despite the substantial research efforts in the application of VSL control, few studies have looked into the effect of the VSL sign distance from the point of an accident or a bottleneck. In this paper,… ▽ More Variable Speed Limit (VSL) control has been one of the most popular techniques with the potential of smoothing traffic flow, maximizing throughput at bottlenecks, and improving mobility and safety. Despite the substantial research efforts in the application of VSL control, few studies have looked into the effect of the VSL sign distance from the point of an accident or a bottleneck. In this paper, we show that this distance has a significant impact on the effectiveness and performance of VSL control. We propose a rule-based VSL strategy that matches the outflow of the upstream VSL zone with the bottleneck capacity based on a multi-section Cell Transmission Model (CTM). Then, we consider the distance of the upstream VSL zone as a control variable and perform a comprehensive analysis of its impact on the performance of the closed-loop traffic control system based on the multi-section CTM. We develop a lower bound that this distance needs to satisfy in order to guarantee homogeneous traffic density across sections and reduce bottleneck congestion. The bound is verified analytically and demonstrated using microscopic simulation of traffic on I-710 in Southern California. The simulations are used to quantify the benefits on mobility, safety and emissions obtained by selecting the upstream VSL zone distance to satisfy the analytical lower bound. The developed lower bound is a design tool which can be used to tune and improve the performance of VSL controllers. △ Less

Submitted 2 February, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

Comments: 10 pages, 9 figures, 2 tables, submitted to T-ITS

arXiv:2012.15737 [pdf, other]

Design Of Two Stage CMOS Operational Amplifier in 180nm Technology

Authors: Tong Yuan, Qingyuan Fan

Abstract: In this paper a CMOS two stage operational amplifier has been presented which operates at 1.8 V power supply at 0.18 micron (i.e., 180 nm) technology and whose input is depended on Bias Current. The op-amp provides a gain of 63dB and a bandwidth of 140 kHz for a load of 1 pF. This op-amp has a Common Mode gain of -25 dB, an output slew rate of 32 $V / μs$, and a output voltage swing. The power con… ▽ More In this paper a CMOS two stage operational amplifier has been presented which operates at 1.8 V power supply at 0.18 micron (i.e., 180 nm) technology and whose input is depended on Bias Current. The op-amp provides a gain of 63dB and a bandwidth of 140 kHz for a load of 1 pF. This op-amp has a Common Mode gain of -25 dB, an output slew rate of 32 $V / μs$, and a output voltage swing. The power consumption for the op-amp is $300μW$. △ Less

Submitted 27 December, 2020; originally announced December 2020.

arXiv:2009.11256 [pdf, other]

Edge Intelligence Empowered UAVs for Automated Wind Farm Monitoring in Smart Grids

Authors: Hwei-Ming Chung, Sabita Maharjan, Yan Zhang, Frank Eliassen, Tingting Yuan

Abstract: With the exploitation of wind power, more turbines will be deployed at remote areas possibly with harsh working conditions (e.g., offshore wind farm). The adverse working environment may lead to massive operating and maintenance costs of turbines. Deploying unmanned aerial vehicles (UAVs) for turbine inspection is considered as a viable alternative to manual inspections. An important objective of… ▽ More With the exploitation of wind power, more turbines will be deployed at remote areas possibly with harsh working conditions (e.g., offshore wind farm). The adverse working environment may lead to massive operating and maintenance costs of turbines. Deploying unmanned aerial vehicles (UAVs) for turbine inspection is considered as a viable alternative to manual inspections. An important objective of automated UAV inspection is to minimize the flight time of the UAVs to inspect all the turbines. A first contribution of this paper is thus formulating an optimization problem to compute the optimal routes for turbine inspection satisfying the above goal. On the other hand, the limited computational capability on UAVs can be used to increase the power generation of wind turbine. Power generation from the turbines can be optimized by controlling the yaw angle of the turbines. Forecasting wind conditions such as wind speed and wind direction is crucial for solving both optimization problems. Therefore, UAVs can utilize their limited computational capability to perform wind forecasting. In this way, UAVs form edge intelligence in offshore wind farm. With the forecasted wind conditions, we design two algorithms to solve the formulated problems, and then evaluate the proposed methods with realworld data. The results reveal that the proposed methods offer an improvement of 44% of the power generation from the turbine compared to hour-ahead forecasting and 25% reduction of the flight time of the UAVs compared to the chosen baseline method. △ Less

Submitted 23 September, 2020; originally announced September 2020.

Comments: Accepted by IEEE Globecom 2020

arXiv:2008.01902 [pdf, other]

Integrated Traffic Simulation-Prediction System using Neural Networks with Application to the Los Angeles International Airport Road Network

Authors: Yihang Zhang, Aristotelis-Angelos Papadopoulos, Pengfei Chen, Faisal Alasiri, Tianchen Yuan, ** Zhou, Petros A. Ioannou

Abstract: Transportation networks are highly complex and the design of efficient traffic management systems is difficult due to lack of adequate measured data and accurate predictions of the traffic states. Traffic simulation models can capture the complex dynamics of transportation networks by using limited available traffic data and can help central traffic authorities in their decision-making, if appropr… ▽ More Transportation networks are highly complex and the design of efficient traffic management systems is difficult due to lack of adequate measured data and accurate predictions of the traffic states. Traffic simulation models can capture the complex dynamics of transportation networks by using limited available traffic data and can help central traffic authorities in their decision-making, if appropriate input is fed into the simulator. In this paper, we design an integrated simulation-prediction system which estimates the Origin-Destination (OD) matrix of a road network using only flow rate information and predicts the behavior of the road network in different simulation scenarios. The proposed system includes an optimization-based OD matrix generation method, a Neural Network (NN) model trained to predict OD matrices via the pattern of traffic flow and a microscopic traffic simulator with a Dynamic Traffic Assignment (DTA) scheme to predict the behavior of the transportation system. We test the proposed system on the road network of the central terminal area (CTA) of the Los Angeles International Airport (LAX), which demonstrates that the integrated traffic simulation-prediction system can be used to simulate the effects of several real world scenarios such as lane closures, curbside parking and other changes. The model is an effective tool for learning the impact and possible benefits of changes in the network and for analyzing scenarios at a very low cost without disrupting the network. △ Less

Submitted 4 August, 2020; originally announced August 2020.

Comments: 19 pages. Under review

arXiv:2002.07579 [pdf, other]

Modeling Cloud Reflectance Fields using Conditional Generative Adversarial Networks

Authors: Victor Schmidt, Mustafa Alghali, Kris Sankaran, Tianle Yuan, Yoshua Bengio

Abstract: We introduce a conditional Generative Adversarial Network (cGAN) approach to generate cloud reflectance fields (CRFs) conditioned on large scale meteorological variables such as sea surface temperature and relative humidity. We show that our trained model can generate realistic CRFs from the corresponding meteorological observations, which represents a step towards a data-driven framework for stoc… ▽ More We introduce a conditional Generative Adversarial Network (cGAN) approach to generate cloud reflectance fields (CRFs) conditioned on large scale meteorological variables such as sea surface temperature and relative humidity. We show that our trained model can generate realistic CRFs from the corresponding meteorological observations, which represents a step towards a data-driven framework for stochastic cloud parameterization. △ Less

Submitted 14 April, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

Comments: Code is available on Github: https://github.com/krisrs1128/clouds_dist

arXiv:1907.11210 [pdf, other]

HUGE2: a Highly Untangled Generative-model Engine for Edge-computing

Authors: Feng Shi, Ziheng Xu, Tao Yuan, Song-Chun Zhu

Abstract: As a type of prominent studies in deep learning, generative models have been widely investigated in research recently. Two research branches of the deep learning models, the Generative Networks (GANs, VAE) and the Semantic Segmentation, rely highly on the upsampling operations, especially the transposed convolution and the dilated convolution. However, these two types of convolutions are intrinsic… ▽ More As a type of prominent studies in deep learning, generative models have been widely investigated in research recently. Two research branches of the deep learning models, the Generative Networks (GANs, VAE) and the Semantic Segmentation, rely highly on the upsampling operations, especially the transposed convolution and the dilated convolution. However, these two types of convolutions are intrinsically different from standard convolution regarding the insertion of zeros in input feature maps or in kernels respectively. This distinct nature severely degrades the performance of the existing deep learning engine or frameworks, such as Darknet, Tensorflow, and PyTorch, which are mainly developed for the standard convolution. Another trend in deep learning realm is to deploy the model onto edge/ embedded devices, in which the memory resource is scarce. In this work, we propose a Highly Untangled Generative-model Engine for Edge-computing or HUGE2 for accelerating these two special convolutions on the edge-computing platform by decomposing the kernels and untangling these smaller convolutions by performing basic matrix multiplications. The methods we propose use much smaller memory footprint, hence much fewer memory accesses, and the data access patterns also dramatically increase the reusability of the data already fetched in caches, hence increasing the localities of caches. Our engine achieves a speedup of nearly 5x on embedded CPUs, and around 10x on embedded GPUs, and more than 50% reduction of memory access. △ Less

Submitted 8 May, 2021; v1 submitted 25 July, 2019; originally announced July 2019.

arXiv:1810.07377 [pdf, other]

XJTLUIndoorLoc: A New Fingerprinting Database for Indoor Localization and Trajectory Estimation Based on Wi-Fi RSS and Geomagnetic Field

Authors: Zhenghang Zhong, Zhe Tang, Xiangxing Li, Tiancheng Yuan, Yang Yang, Meng Wei, Yuanyuan Zhang, Renzhi Sheng, Naomi Grant, Chongfeng Ling, Xintao Huan, Kyeong Soo Kim, Sanghyuk Lee

Abstract: In this paper, we present a new location fingerprinting database comprised of Wi-Fi received signal strength (RSS) and geomagnetic field intensity measured with multiple devices at a multi-floor building in Xi'an Jiatong-Liverpool University, Suzhou, China. We also provide preliminary results of localization and trajectory estimation based on convolutional neural network (CNN) and long short-term… ▽ More In this paper, we present a new location fingerprinting database comprised of Wi-Fi received signal strength (RSS) and geomagnetic field intensity measured with multiple devices at a multi-floor building in Xi'an Jiatong-Liverpool University, Suzhou, China. We also provide preliminary results of localization and trajectory estimation based on convolutional neural network (CNN) and long short-term memory (LSTM) network with this database. For localization, we map RSS data for a reference point to an image-like, two-dimensional array and then apply CNN which is popular in image and video analysis and recognition. For trajectory estimation, we use a modified random way point model to efficiently generate continuous step traces imitating human walking and train a stacked two-layer LSTM network with the generated data to remember the changing pattern of geomagnetic field intensity against (x,y) coordinates. Experimental results demonstrate the usefulness of our new database and the feasibility of the CNN and LSTM-based localization and trajectory estimation with the database. △ Less

Submitted 16 October, 2018; originally announced October 2018.

Comments: 7 pages, 16 figures, 3rd International Workshop on GPU Computing and AI (GCA'18)

Showing 1–13 of 13 results for author: Yuan, T