Search | arXiv e-print repository

Sample Rate Independent Recurrent Neural Networks for Audio Effects Processing

Authors: Alistair Carson, Alec Wright, Jatin Chowdhury, Vesa Välimäki, Stefan Bilbao

Abstract: In recent years, machine learning approaches to modelling guitar amplifiers and effects pedals have been widely investigated and have become standard practice in some consumer products. In particular, recurrent neural networks (RNNs) are a popular choice for modelling non-linear devices such as vacuum tube amplifiers and distortion circuitry. One limitation of such models is that they are trained on audio at a specific sample rate and therefore give unreliable results when operating at another rate. Here, we investigate several methods of modifying RNN structures to make them approximately sample rate independent, with a focus on oversampling. In the case of integer oversampling, we demonstrate that a previously proposed delay-based approach provides high fidelity sample rate conversion whilst additionally reducing aliasing. For non-integer sample rate adjustment, we propose two novel methods and show that one of these, based on cubic Lagrange interpolation of a delay-line, provides a significant improvement over existing methods. To our knowledge, this work provides the first in-depth study into this problem.

Submitted 10 June, 2024; originally announced June 2024.

Comments: Accepted for publication in Proc. DAFx24, Guildford, UK, September 2024
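
The abstract above mentions cubic Lagrange interpolation of a delay-line. As a rough illustration of that general technique only, the sketch below computes third-order Lagrange fractional-delay coefficients and uses them to read a non-integer delay from the four most recent samples. The function names and the `history` layout are assumptions made for this example; how the paper applies the interpolation inside an RNN is not stated in the abstract and is not reproduced here.

```python
def lagrange_coeffs(delay, order=3):
    """Lagrange fractional-delay FIR coefficients.

    h[k] = product over m != k of (delay - m) / (k - m), for k = 0..order.
    With order=3 this is cubic interpolation over four samples.
    """
    coeffs = []
    for k in range(order + 1):
        c = 1.0
        for m in range(order + 1):
            if m != k:
                c *= (delay - m) / (k - m)
        coeffs.append(c)
    return coeffs


def read_delay_line(history, delay):
    """Approximate x[n - delay] from history, where history[k] holds x[n - k].

    Assumption for this sketch: history has exactly four entries
    (k = 0..3), so the interpolation is cubic.
    """
    h = lagrange_coeffs(delay, order=len(history) - 1)
    return sum(c * s for c, s in zip(h, history))


# Usage: samples of x(t) = t**3 at t = 5, 4, 3, 2, read at a delay of 1.5 samples.
history = [125.0, 64.0, 27.0, 8.0]
value = read_delay_line(history, 1.5)  # approximates x(3.5)
```

Cubic Lagrange interpolation reproduces polynomials up to degree three exactly, and is generally most accurate when the requested delay lies between the two middle taps (here, between 1 and 2 samples).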

arXiv:2312.14586 [pdf, other]

Noise Morphing for Audio Time Stretching

Authors: Eloi Moliner, Leonardo Fierro, Alec Wright, Matti Hämäläinen, Vesa Välimäki

Abstract: This letter introduces an innovative method to enhance the quality of audio time stretching by precisely decomposing a sound into sines, transients, and noise and by improving the processing of the latter component. While there are established methods for time-stretching sines and transients with high quality, the manipulation of noise or residual components has lacked robust solutions in prior research. The proposed method combines sound decomposition with previous techniques for audio spectral resynthesis. The time-stretched noise component is achieved by morphing its time-interpolated spectral magnitude with a white-noise excitation signal. This method stands out for its simplicity, efficiency, and audio quality. The results of a subjective experiment affirm the superiority of this approach over current state-of-the-art methods across all evaluated stretch factors. The proposed technique notably excels in extreme stretching scenarios, signifying a substantial elevation in performance. The proposed method holds promise for a wide range of applications in slow-motion media content, such as music or sports video production.

Submitted 22 December, 2023; originally announced December 2023.

Comments: submitted to IEEE Signal Processing Letters

arXiv:2309.09546 [pdf, other]

Training dynamic models using early exits for automatic speech recognition on resource-constrained devices

Authors: George August Wright, Umberto Cappellazzo, Salah Zaiem, Desh Raj, Lucas Ondel Yang, Daniele Falavigna, Mohamed Nabih Ali, Alessio Brutti

Abstract: The ability to dynamically adjust the computational load of neural models during inference is crucial for on-device processing scenarios characterised by limited and time-varying computational resources. A promising solution is presented by early-exit architectures, in which additional exit branches are appended to intermediate layers of the encoder. In self-attention models for automatic speech recognition (ASR), early-exit architectures enable the development of dynamic models capable of adapting their size and architecture to varying levels of computational resources and ASR performance demands. Previous research on early-exiting ASR models has relied on pre-trained self-supervised models, fine-tuned with an early-exit loss. In this paper, we undertake an experimental comparison between fine-tuning pre-trained backbones and training models from scratch with the early-exiting objective. Experiments conducted on public datasets reveal that early-exit models trained from scratch not only preserve performance when using fewer encoder layers but also exhibit enhanced task accuracy compared to single-exit or pre-trained models. Furthermore, we explore an exit selection strategy grounded in posterior probabilities as an alternative to the conventional frame-based entropy approach. Results provide insights into the training dynamics of early-exit architectures for ASR models, particularly the efficacy of training strategies and exit selection methods.

Submitted 22 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

Comments: Accepted at the ICASSP Workshop Self-supervision in Audio, Speech and Beyond 2024

arXiv:2308.08732 [pdf, ps, other]

Recursive Detection and Analysis of Nanoparticles in Scanning Electron Microscopy Images

Authors: Aidan S. Wright, Nathaniel P. Youmans, Enrique F. Valderrama Araya

Abstract: In this study, we present a computational framework tailored for the precise detection and comprehensive analysis of nanoparticles within scanning electron microscopy (SEM) images. The primary objective of this framework revolves around the accurate localization of nanoparticle coordinates, accompanied by secondary objectives encompassing the extraction of pertinent morphological attributes including area, orientation, brightness, and length. Constructed leveraging the robust image processing capabilities of Python, particularly harnessing libraries such as OpenCV, SciPy, and Scikit-Image, the framework employs an amalgamation of techniques, including thresholding, dilating, and eroding, to enhance the fidelity of image processing outcomes. The ensuing nanoparticle data is seamlessly integrated into the RStudio environment to facilitate meticulous post-processing analysis. This encompasses a comprehensive evaluation of model accuracy, discernment of feature distribution patterns, and the identification of intricate particle arrangements. The finalized framework exhibits high nanoparticle identification within the primary sample image and boasts 97% accuracy in detecting particles across five distinct test images drawn from a SEM nanoparticle dataset. Furthermore, the framework demonstrates the capability to discern nanoparticles of faint intensity, eluding manual labeling within the control group.

Submitted 16 August, 2023; originally announced August 2023.

Comments: 9 pages, 10 figures

ACM Class: I.4.7

arXiv:2305.16862 [pdf, other]

Neural modeling of magnetic tape recorders

Authors: Otto Mikkonen, Alec Wright, Eloi Moliner, Vesa Välimäki

Abstract: The sound of magnetic recording media, such as open-reel and cassette tape recorders, is still sought after by today's sound practitioners due to the imperfections embedded in the physics of the magnetic recording process. This paper proposes a method for digitally emulating this character using neural networks. The signal chain of the proposed system consists of three main components: the hysteretic nonlinearity and filtering jointly produced by the magnetic recording process as well as the record and playback amplifiers, the fluctuating delay originating from the tape transport, and the combined additive noise component from various electromagnetic origins. In our approach, the hysteretic nonlinear block is modeled using a recurrent neural network, while the delay trajectories and the noise component are generated using separate diffusion models, which employ U-net deep convolutional neural networks. According to the conducted objective evaluation, the proposed architecture faithfully captures the character of the magnetic tape recorder. The results of this study can be used to construct virtual replicas of vintage sound recording devices with applications in music production and audio antiquing tasks.

Submitted 26 May, 2023; originally announced May 2023.

Comments: Accepted to DAFx 2023. For accompanying web page, see http://research.spa.aalto.fi/publications/papers/dafx23-neural-tape/

arXiv:2211.16992 [pdf, other]

Extreme Audio Time Stretching Using Neural Synthesis

Authors: Leonardo Fierro, Alec Wright, Vesa Välimäki, Matti Hämäläinen

Abstract: A deep neural network solution for time-scale modification (TSM) focused on large stretching factors is proposed, targeting environmental sounds. Traditional TSM artifacts such as transient smearing, loss of presence, and phasiness are heavily accentuated and cause poor audio quality when the TSM factor is four or larger. The weakness of established TSM methods, often based on a phase vocoder structure, lies in the poor description and scaling of the transient and noise components, or nuances, of a sound. Our novel solution combines a sines-transients-noise decomposition with an independent WaveNet synthesizer to provide a better description of the noise component and an improved sound quality for large stretching factors. Results of a subjective listening test against four other TSM algorithms are reported, showing the proposed method to be often superior. The proposed method is stereo compatible and has a wide range of applications related to the slow motion of media content.

Submitted 30 November, 2022; originally announced November 2022.

Comments: Submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023 on Oct 27, 2022

arXiv:2211.00943 [pdf, other]

Adversarial Guitar Amplifier Modelling With Unpaired Data

Authors: Alec Wright, Vesa Välimäki, Lauri Juvela

Abstract: We propose an audio effects processing framework that learns to emulate a target electric guitar tone from a recording. We train a deep neural network using an adversarial approach, with the goal of transforming the timbre of a guitar into the timbre of another guitar after audio effects processing has been applied, for example, by a guitar amplifier. The model training requires no paired data, and the resulting model emulates the target timbre well whilst being capable of real-time processing on a modern personal computer. To verify our approach we present two experiments, one which carries out unpaired training using paired data, allowing us to monitor training via objective metrics, and another that uses fully unpaired data, corresponding to a realistic scenario where a user wants to emulate a guitar timbre only using audio data from a recording. Our listening test results confirm that the models are perceptually convincing.

Submitted 20 March, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

Comments: Accepted to ICASSP 2023

arXiv:2205.01897 [pdf, other]

Virtual Analog Modeling of Distortion Circuits Using Neural Ordinary Differential Equations

Authors: Jan Wilczek, Alec Wright, Vesa Välimäki, Emanuël Habets

Abstract: Recent research in deep learning has shown that neural networks can learn differential equations governing dynamical systems. In this paper, we adapt this concept to Virtual Analog (VA) modeling to learn the ordinary differential equations (ODEs) governing the first-order and the second-order diode clipper. The proposed models achieve performance comparable to state-of-the-art recurrent neural networks (RNNs) albeit using fewer parameters. We show that this approach does not require oversampling and allows the sampling rate to be increased after training has completed, which results in increased accuracy. Using a sophisticated numerical solver allows the accuracy to be increased at the cost of slower processing. ODEs learned this way do not require closed forms but are still physically interpretable.

Submitted 1 July, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

Comments: 8 pages, 10 figures, accepted for DAFx 2022 conference, for associated audio examples, see https://thewolfsound.com/publications/dafx2022/

arXiv:1911.08922 [pdf, other]

Perceptual Loss Function for Neural Modelling of Audio Systems

Authors: Alec Wright, Vesa Välimäki

Abstract: This work investigates alternate pre-emphasis filters used as part of the loss function during neural network training for nonlinear audio processing. In our previous work, the error-to-signal ratio loss function was used during network training, with a first-order highpass pre-emphasis filter applied to both the target signal and neural network output. This work considers more perceptually relevant pre-emphasis filters, which include lowpass filtering at high frequencies. We conducted listening tests to determine whether they offer an improvement to the quality of a neural network model of a guitar tube amplifier. Listening test results indicate that the use of an A-weighting pre-emphasis filter offers the best improvement among the tested filters. The proposed perceptual loss function improves the sound quality of neural network models in audio processing without affecting the computational cost.

Submitted 20 November, 2019; originally announced November 2019.

Comments: Submitted to ICASSP 2020
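
The abstract above describes an error-to-signal ratio (ESR) loss computed after a first-order highpass pre-emphasis filter is applied to both the target and the network output. A minimal sketch of that idea follows, in plain Python rather than a training framework. The filter coefficient of 0.95, the small `eps` term, and the function names are assumptions made for this example and are not taken from the abstract; the A-weighting filter the paper favours is not implemented here.

```python
def pre_emphasis(x, coeff=0.95):
    """First-order highpass filter: y[n] = x[n] - coeff * x[n-1].

    The sample before the start of the signal is treated as zero.
    coeff=0.95 is an assumed value for illustration.
    """
    return [x[n] - coeff * (x[n - 1] if n > 0 else 0.0) for n in range(len(x))]


def esr_loss(target, output, coeff=0.95, eps=1e-12):
    """Error-to-signal ratio on pre-emphasised signals.

    ESR = sum((t - o)^2) / sum(t^2), where t and o are the filtered
    target and output. eps (an assumption) guards against division by zero.
    """
    t = pre_emphasis(target, coeff)
    o = pre_emphasis(output, coeff)
    error_energy = sum((a - b) ** 2 for a, b in zip(t, o))
    signal_energy = sum(a * a for a in t)
    return error_energy / (signal_energy + eps)


# Usage: a perfect match gives zero loss; an all-zero output gives a loss near 1.
target = [1.0, 0.5, -0.25, 0.0]
perfect = esr_loss(target, target)
silent = esr_loss(target, [0.0] * len(target))
```

Because the filter is applied to both signals before the ratio is taken, errors at high frequencies are weighted more heavily than they would be in a plain ESR.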

arXiv:1909.06326 [pdf, other]

Automatic Hip Fracture Identification and Functional Subclassification with Deep Learning

Authors: Justin D Krogue, Kaiyang V Cheng, Kevin M Hwang, Paul Toogood, Eric G Meinberg, Erik J Geiger, Musa Zaid, Kevin C McGill, Rina Patel, Jae Ho Sohn, Alexandra Wright, Bryan F Darger, Kevin A Padrez, Eugene Ozhinsky, Sharmila Majumdar, Valentina Pedoia

Abstract: Purpose: Hip fractures are a common cause of morbidity and mortality. Automatic identification and classification of hip fractures using deep learning may improve outcomes by reducing diagnostic errors and decreasing time to operation. Methods: Hip and pelvic radiographs from 1118 studies were reviewed and 3034 hips were labeled via bounding boxes and classified as normal, displaced femoral neck fracture, nondisplaced femoral neck fracture, intertrochanteric fracture, previous ORIF, or previous arthroplasty. A deep learning-based object detection model was trained to automate the placement of the bounding boxes. A Densely Connected Convolutional Neural Network (DenseNet) was trained on a subset of the bounding box images, and its performance evaluated on a held out test set and by comparison on a 100-image subset to two groups of human observers: fellowship-trained radiologists and orthopaedists, and senior residents in emergency medicine, radiology, and orthopaedics. Results: The binary accuracy for fracture of our model was 93.8% (95% CI, 91.3-95.8%), with sensitivity of 92.7% (95% CI, 88.7-95.6%), and specificity 95.0% (95% CI, 91.5-97.3%). Multiclass classification accuracy was 90.4% (95% CI, 87.4-92.9%). When compared to human observers, our model achieved at least expert-level classification under all conditions. Additionally, when the model was used as an aid, human performance improved, with aided resident performance approximating unaided fellowship-trained expert performance.
Conclusions: Our deep learning model identified and classified hip fractures with at least expert-level accuracy, and when used as an aid improved human performance, with aided resident performance approximating that of unaided fellowship-trained attendings.

Submitted 10 September, 2019; originally announced September 2019.

Comments: Presented at Orthopaedic Research Society, Austin, TX, Feb 2, 2019, currently in submission for publication

arXiv:1908.09953 [pdf, other]

Macroscopic Modeling, Calibration, and Simulation of Managed Lane-Freeway Networks, Part II: Network-scale Calibration and Case Studies

Authors: Matthew A. Wright, Roberto Horowitz, Alex A. Kurzhanskiy

Abstract: In Part I of this paper series, several macroscopic traffic model elements for mathematically describing freeway networks equipped with managed lane facilities were proposed. These modeling techniques seek to capture at the macroscopic scale the complex phenomena that occur on managed lane-freeway networks, where two parallel traffic flows interact with each other both in the physical sense (how and where cars flow between the two lane groups) and the physiological sense (how driving behaviors are changed by being adjacent to a quantitatively and qualitatively different traffic flow). The local descriptions we developed in Part I are not the only modeling complexity introduced in managed lane-freeway networks. The complex topologies mean that network-scale modeling of a freeway corridor is increased in complexity as well. The already-difficult model calibration problem for a dynamic model of a freeway becomes more complex when the freeway becomes, in effect, two interrelating flow streams. In the present paper, we present an iterative-learning-based approach to calibrating our model's physical and driver-behavioral parameters. We consider the common situation where a complex traffic model needs to be calibrated to recreate real-world baseline traffic behavior, such that counterfactuals can be generated by training purposes. Our method is used to identify traditional freeway parameters as well as the proposed parameters that describe managed lane-freeway-network-specific behaviors.
We validate our model and calibration methodology with case studies of simulations of two managed lane-equipped California freeways.

Submitted 26 August, 2019; originally announced August 2019.

Comments: Part I is here: arXiv:1609.09470

arXiv:1905.13428 [pdf, other]

Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning

Authors: Matthew A. Wright, Roberto Horowitz

Abstract: Many potential applications of reinforcement learning in the real world involve interacting with other agents whose numbers vary over time. We propose new neural policy architectures for these multi-agent problems. In contrast to other methods of training an individual, discrete policy for each agent and then enforcing cooperation through some additional inter-policy mechanism, we follow the spirit of recent work on the power of relational inductive biases in deep networks by learning multi-agent relationships at the policy level via an attentional architecture. In our method, all agents share the same policy, but independently apply it in their own context to aggregate the other agents' state information when selecting their next action. The structure of our architectures allows them to be applied on environments with varying numbers of agents. We demonstrate our architecture on a benchmark multi-agent autonomous vehicle coordination problem, obtaining superior results to a full-knowledge, fully-centralized reference solution, and significantly outperforming it when scaling to large numbers of agents.

Submitted 31 May, 2019; originally announced May 2019.

arXiv:1904.08831 [pdf, other]

doi 10.1109/ITSC.2019.8917174

Neural-Attention-Based Deep Learning Architectures for Modeling Traffic Dynamics on Lane Graphs

Authors: Matthew A. Wright, Simon F. G. Ehlers, Roberto Horowitz

Abstract: Deep neural networks can be powerful tools, but require careful application-specific design to ensure that the most informative relationships in the data are learnable. In this paper, we apply deep neural networks to the nonlinear spatiotemporal physics problem of vehicle traffic dynamics. We consider problems of estimating macroscopic quantities (e.g., the queue at an intersection) at a lane level. First-principles modeling at the lane scale has been a challenge due to complexities in modeling social behaviors like lane changes, and those behaviors' resultant macro-scale effects. Following domain knowledge that upstream/downstream lanes and neighboring lanes affect each other's traffic flows in distinct ways, we apply a form of neural attention that allows the neural network layers to aggregate information from different lanes in different manners. Using a microscopic traffic simulator as a testbed, we obtain results showing that an attentional neural network model can use information from nearby lanes to improve predictions, and that explicitly encoding the lane-to-lane relationship types significantly improves performance. We also demonstrate the transfer of our learned neural network to a more complex road network, discuss how its performance degradation may be attributable to new traffic behaviors induced by increased topological complexity, and motivate learning dynamics models from many road network topologies.

Submitted 14 July, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

Comments: To appear at 2019 IEEE Conference on Intelligent Transportation Systems

arXiv:1809.01271 [pdf, other]

doi 10.1109/MITS.2020.2994098

A Framework for Robust Assimilation of Potentially Malign Third-Party Data, and its Statistical Meaning

Authors: Matthew A. Wright, Roberto Horowitz

Abstract: This paper presents a model-based method for fusing data from multiple sensors with a hypothesis-test-based component for rejecting potentially faulty or otherwise malign data. Our framework is based on an extension of the classic particle filter algorithm for real-time state estimation of uncertain systems with nonlinear dynamics with partial and noisy observations. This extension, based on classical statistical theories, utilizes statistical tests against the system's observation model. We discuss the application of the two major statistical testing frameworks, Fisherian significance testing and Neyman-Pearsonian hypothesis testing, to the Monte Carlo and sensor fusion settings. The Monte Carlo Neyman-Pearson test we develop is useful when one has a reliable model of faulty data, while the Fisher one is applicable when one may not have a model of faults, which may occur when dealing with third-party data, like GNSS data of transportation system users. These statistical tests can be combined with a particle filter to obtain a Monte Carlo state estimation scheme that is robust to faulty or outlier data. We present a synthetic freeway traffic state estimation problem where the filters are able to reject simulated faulty GNSS measurements. The fault-model-free Fisher filter, while underperforming the Neyman-Pearson one when the latter has an accurate fault model, outperforms it when the assumed fault model is incorrect.

Submitted 4 March, 2019; v1 submitted 4 September, 2018; originally announced September 2018.

Comments: IEEE Intelligent Transportation Systems Magazine, special issue on GNSS-based positioning

Journal ref: IEEE Intelligent Transportation Systems Magazine, vol. 12, no. 3, pp. 147-156, Fall 2020

arXiv:1804.05119 [pdf, other]

doi 10.1115/DSCC2018-9125

A Dynamic-System-Based Approach to Modeling Driver Movements Across General-Purpose/Managed Lane Interfaces

Authors: Matthew A. Wright, Roberto Horowitz, Alex A. Kurzhanskiy

Abstract: To help mitigate road congestion caused by the unrelenting growth of traffic demand, many transportation authorities have implemented managed lane policies, which restrict certain freeway lanes to certain types of vehicles. It was originally thought that managed lanes would improve the use of existing infrastructure through demand-management behaviors like carpooling, but implementations have often been characterized by unpredicted phenomena that are sometimes detrimental to system performance. The development of traffic models that can capture these sorts of behaviors is a key step for helping managed lanes deliver on their promised gains. Towards this goal, this paper presents an approach for solving for driver behavior of entering and exiting managed lanes at the macroscopic (i.e., fluid approximation of traffic) scale. Our method is inspired by recent work in extending a dynamic-system-based modeling framework from traffic behaviors on individual roads, to models at junctions, and can be considered a further extension of this dynamic-system paradigm to the route/lane choice problem. Unlike traditional route choice models that are often based on discrete-choice methods and often rely on computing and comparing drivers' estimated travel times from taking different routes, our method is agnostic to the particular choice of physical traffic model and is suited specifically towards making decisions at these interfaces using only local information. These features make it a natural drop-in component to extend existing dynamic traffic modeling methods.

Submitted 3 July, 2018; v1 submitted 13 April, 2018; originally announced April 2018.

Comments: 2018 ASME Dynamic Systems and Control Conference (DSCC 2018)

Journal ref: Proceedings of the 2018 ASME Dynamic Systems and Controls Conference, Volume 2, V002T15A003

arXiv:1707.09346 [pdf, other]

Generic second-order macroscopic traffic node model for general multi-input multi-output road junctions via a dynamic system approach

Authors: Matthew A. Wright, Roberto Horowitz

Abstract: This paper addresses an open problem in traffic modeling: the second-order macroscopic node problem. A second-order macroscopic traffic model, in contrast to a first-order model, allows for variation of driving behavior across subpopulations of vehicles in the flow. The second-order models are thus more descriptive (e.g., they have been used to model variable mixtures of behaviorally-different traffic, like car/truck traffic, autonomous/human-driven traffic, etc.), but are much more complex. The second-order node problem is a particularly complex problem, as it requires the resolution of discontinuities in traffic density and mixture characteristics, and solving of throughflows for arbitrary numbers of input and output roads to a node (in other words, this is an arbitrary-dimensional Riemann problem with two conserved quantities). In this paper, we extend the well-known "Generic Class of Node Model" constraints to the second order and present a simple solution algorithm to the second-order node problem. Our solution makes use of a recently-introduced dynamic system characterization of the first-order node model problem, which gives insight and intuition as to the continuous-time dynamics implicit in node models. We further argue that the common "supply and demand" construction of node models that decouples them from link models is not suitable to the second-order node problem.
Our second-order node model and solution method have immediate applications in allowing modeling of behaviorally-complex traffic flows of contemporary interest (like partially-autonomous-vehicle flows) in arbitrary road networks.

Submitted 18 June, 2019; v1 submitted 28 July, 2017; originally announced July 2017.

arXiv:1609.09470 [pdf, other]

Macroscopic Modeling, Calibration, and Simulation of Managed Lane-Freeway Networks, Part I: Topological and Phenomenological Modeling

Authors: Matthew A. Wright, Roberto Horowitz, Alex A. Kurzhanskiy

Abstract: To help mitigate road congestion caused by the unrelenting growth of traffic demand, many transit authorities have implemented managed lane policies. Managed lanes typically run parallel to a freeway's standard, general-purpose (GP) lanes, but are restricted to certain types of vehicles. It was originally thought that managed lanes would improve the use of existing infrastructure through incentivization of demand-management behaviors like carpooling, but implementations have often been characterized by unpredicted phenomena that are often detrimental to system performance. This paper presents several macroscopic traffic modeling tools we have used for the study of freeways equipped with managed lanes, or "managed lane-freeway networks." The proposed framework is based on the widely-used first-order kinematic wave theory. In this model, the GP and the managed lanes are modeled as parallel links connected by nodes, where certain types of traffic may switch between GP and managed lane links. Two types of managed lane topologies are considered: full-access, where vehicles can switch between the GP and the managed lanes anywhere; and separated, where such switching is allowed only at certain locations called gates. We also describe methods to incorporate three phenomena into our model that are particular to managed lane-freeway networks. The inertia effect reflects drivers' inclination to stay in their lane as long as possible and switch only if this would obviously improve their travel condition. The friction effect reflects the empirically-observed driver fear of moving fast in a managed lane while traffic in the adjacent GP lanes moves slowly due to congestion. The smoothing effect describes how managed lanes can increase throughput at bottlenecks by reducing lane changes. We present simple models for each of these phenomena that fit within the general macroscopic theory.

Submitted 11 June, 2019; v1 submitted 29 September, 2016; originally announced September 2016.

Comments: The above abstract is slightly abbreviated, please see the document for the full abstract

arXiv:1609.06795 [pdf, other]

doi 10.1109/CDC.2017.8264529

Particle-Filter-Enabled Real-Time Sensor Fault Detection Without a Model of Faults

Authors: Matthew A. Wright, Roberto Horowitz

Abstract: We are experiencing an explosion in the number of sensors measuring our activities and the world around us. These sensors are spread throughout the built environment and can help us perform state estimation and control of related systems, but they are often built and/or maintained by third parties or system users. As a result, by outsourcing system measurement to third parties, the controller must accept their measurements without being able to directly verify the sensors' correct operation. Instead, detection and rejection of measurements from faulty sensors must be done with the raw data only. Towards this goal, we present a method of detecting possibly faulty behavior of sensors. The method does not require that the control designer have any model of faulty sensor behavior. As we discuss, it turns out that the widely-used particle filter state estimation algorithm provides the ingredients necessary for a hypothesis test against all ranges of correct operating behavior, obviating the need for a fault model to compare measurements. We demonstrate the applicability of our method by showing its ability to reject faulty measurements and improve state estimation accuracy in a nonlinear vehicle traffic model without information about the characteristics of the generated faulty measurements. In our test, we correctly identify nearly 90% of measurements as faulty or non-faulty without having any fault model. This leads to only a 3% increase in state estimation error over a theoretical 100%-accurate fault detector.

Submitted 21 September, 2017; v1 submitted 21 September, 2016; originally announced September 2016.

Comments: To appear at the 56th IEEE Conference on Decision and Control (CDC 2017)

Journal ref: Proceedings of the 56th IEEE Conference on Decision and Control (CDC 2017), pp. 5757-5763, Dec. 2017

arXiv:1608.07623 [pdf, other]

doi 10.1016/j.ifacol.2016.10.307

A dynamic system characterization of road network node models

Authors: Matthew A. Wright, Roberto Horowitz, Alex A. Kurzhanskiy

Abstract: The propagation of traffic congestion along roads is a commonplace nonlinear phenomenon. When many roads are connected in a network, congestion can spill from one road to others as drivers queue to enter a congested road, creating further nonlinearities in the network dynamics. This paper considers the node model problem, which refers to methods for solving for cross-flows when roads meet at a junction. We present a simple hybrid dynamic system that, given a macroscopic snapshot of the roads entering and exiting a node, intuitively models the node's throughflows over time. This dynamic system produces solutions to the node model problem that are equal to those produced by many popular node models without intuitive physical meanings. We also show how the earlier node models can be rederived as executions of our dynamic system. The intuitive physical description supplied by our system provides a base for control of the road junction system dynamics, as well as the emergent network dynamics.

Submitted 26 August, 2016; originally announced August 2016.

Comments: Appeared at NOLCOS 2016, 10th IFAC Symposium on Nonlinear Control Systems

Journal ref: IFAC-PapersOnLine, Volume 49, Issue 18, 2016, Pages 1054-1059

arXiv:1601.01054 [pdf, other]

doi 10.1016/j.trb.2017.09.001

On node models for high-dimensional road networks

Authors: Matthew A. Wright, Gabriel Gomes, Roberto Horowitz, Alex A. Kurzhanskiy

Abstract: Macroscopic traffic models are necessary for simulation and study of traffic's complex macro-scale dynamics, and are often used by practitioners for road network planning, integrated corridor management, and other applications. These models have two parts: a link model, which describes traffic flow behavior on individual roads, and a node model, which describes behavior at road junctions. As the road networks under study become larger and more complex (nowadays often including arterial networks), the node model becomes more important. This paper focuses on the first order node model and has two main contributions. First, we formalize the multi-commodity flow distribution at a junction as an optimization problem with all the necessary constraints. Most interesting here is the formalization of input flow priorities. Then, we discuss a very common "conservation of turning fractions" or "first-in-first-out" (FIFO) constraint, and how it often produces unrealistic spillback. This spillback occurs when, at a diverge, a queue develops for a movement that only a few lanes service, but FIFO requires that all lanes experience spillback from this queue. As we show, avoiding this unrealistic spillback while retaining FIFO in the node model requires complicated network topologies. Our second contribution is a "partial FIFO" mechanism that avoids this unrealistic spillback, and a node model and solution algorithm that incorporates this mechanism. The partial FIFO mechanism is parameterized through intervals that describe how individual movements influence each other, can be intuitively described from physical lane geometry and turning movement rules, and allows tuning to describe a link as having anything between full FIFO and no FIFO. Excepting the FIFO constraint, the present node model also fits within the well-established "general class of first-order node models" for multi-commodity flows.

Submitted 1 September, 2017; v1 submitted 5 January, 2016; originally announced January 2016.

Comments: The abstract on this info page is slightly abbreviated, please see the paper for the full abstract

Journal ref: Transportation Research Part B: Methodological, vol. 105, pp. 212-234, November 2017

arXiv:1510.06702 [pdf, other]

doi 10.1109/TITS.2016.2565438

Fusing Loop and GPS Probe Measurements to Estimate Freeway Density

Authors: Matthew Wright, Roberto Horowitz

Abstract: In an age of ever-increasing penetration of GPS-enabled mobile devices, the potential of real-time "probe" location information for estimating the state of transportation networks is receiving increasing attention. Much work has been done on using probe data to estimate the current speed of vehicle traffic (or equivalently, trip travel time). While travel times are useful to individual drivers, the state variable for a large class of traffic models and control algorithms is vehicle density. Our goal is to use probe data to supplement traditional, fixed-location loop detector data for density estimation. To this end, we derive a method based on Rao-Blackwellized particle filters, a sequential Monte Carlo scheme. We present a simulation where we obtain a 30% reduction in density mean absolute percentage error from fusing loop and probe data, vs. using loop data alone. We also present results using real data from a 19-mile freeway section in Los Angeles, California, where we obtain a 31% reduction. In addition, our method's estimate when using only the real-world probe data, and no loop data, outperformed the estimate produced when only loop data were used (an 18% reduction). These results demonstrate that probe data can be used for traffic density estimation.

Submitted 18 May, 2016; v1 submitted 22 October, 2015; originally announced October 2015.

Journal ref: IEEE Transactions on Intelligent Transportation Systems, vol. 17, no. 12, pp. 3577-3590, Dec. 2016

arXiv:1509.04995 [pdf, other]

A new model for multi-commodity macroscopic modeling of complex traffic networks

Authors: Matthew Wright, Gabriel Gomes, Roberto Horowitz, Alex A. Kurzhanskiy

Abstract: We propose a macroscopic modeling framework for a network of roads and multi-commodity traffic. The proposed framework is based on the Lighthill-Whitham-Richards kinematic wave theory; more precisely, on its discretization, the Cell Transmission Model (CTM), adapted for networks and multi-commodity traffic. The resulting model is called the Link-Node CTM (LNCTM). In the LNCTM, we use the fundamental diagram of an "inverse lambda" shape that allows modeling of the capacity drop and the hysteresis behavior of the traffic state in a link that goes from free flow to congestion and back. A model of the node with multiple input and multiple output links accepting multi-commodity traffic is a cornerstone of the LNCTM. We present the multi-input-multi-output (MIMO) node model for multi-commodity traffic that supersedes previously developed node models. The analysis and comparison with previous node models are provided. Sometimes, certain traffic commodities may choose between multiple output links in a node based on the current traffic state of the node's input and output links. For such situations, we propose a local traffic assignment algorithm that computes how incoming traffic of a certain commodity should be distributed between output links, if this information is not known a priori.

Submitted 16 October, 2015; v1 submitted 16 September, 2015; originally announced September 2015.

Comments: 34 pages with appendix and examples. Figures in black and white. v3: Typos corrected

Showing 1–22 of 22 results for author: Wright, A