-
Lévy flight for electrons in graphene in the presence of regions with enhanced spin-orbit coupling
Authors:
Diego B. Fonseca,
Anderson L. R. Barbosa,
Luiz Felipe C. Pereira
Abstract:
We propose an electronic Lévy glass built from graphene nanoribbons in the presence of regions with enhanced spin-orbit coupling. Although electrons in graphene nanoribbons present a low spin-orbit coupling strength, it can be increased by a proximity effect with an appropriate substrate. We consider graphene nanoribbons with different edge types, which contain circular regions with a tunable Rashba spin-orbit coupling, whose diameters follow a power-law distribution. We find that spin-orbital clusters induce a transition from superdiffusive to diffusive charge transport, similar to what we recently reported for nanoribbons with electrostatic clusters [Phys. Rev. B 107, 155432 (2023)]. We also investigate spin polarization in the spin-orbital Lévy glasses, and show that a finite spin polarization can be found only in the superdiffusive regime. In contrast, the spin polarization vanishes in the diffusive regime, making the electronic Lévy glass a useful device whose electronic transmission and spin polarization can be controlled by its Fermi energy. Finally, we apply a multifractal analysis to charge transmission and spin polarization, and find that the transmission time series in the superdiffusive regime are multifractal, while they tend to be monofractal in the diffusive regime. In contrast, spin polarization time series are multifractal in both regimes, characterizing a marked difference between mesoscopic fluctuations of charge transport and spin polarization in the proposed electronic Lévy glass.
Submitted 16 May, 2024;
originally announced May 2024.
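As a rough illustration of the power-law disorder described in the abstract above, the following sketch draws cluster diameters by inverse-transform sampling. The Pareto form of the density, the exponent `alpha`, the cutoff `d_min`, and the function name are all assumptions made for illustration; the abstract does not specify them.

```python
import random


def sample_power_law_diameters(n, d_min=1.0, alpha=1.5, seed=0):
    """Draw n diameters with density p(d) ~ d**-(alpha + 1) for d >= d_min.

    Hypothetical helper: d_min and alpha are illustrative values only.
    """
    rng = random.Random(seed)
    # Inverse-transform sampling: the CDF is F(d) = 1 - (d_min / d)**alpha,
    # so d = d_min * (1 - u)**(-1 / alpha) for u uniform in [0, 1).
    return [d_min * (1.0 - rng.random()) ** (-1.0 / alpha) for _ in range(n)]


diameters = sample_power_law_diameters(1000)
```

With a heavy-tailed distribution like this, most diameters sit near `d_min` while a few are much larger, which is the qualitative ingredient of a Lévy glass.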
-
Length and torsion dependence of thermal conductivity in twisted graphene nanoribbons
Authors:
Alexandre F. Fonseca,
Luiz Felipe C. Pereira
Abstract:
Research on the physical properties of materials at the nanoscale is crucial for the development of breakthrough nanotechnologies. One of the key properties to consider is the ability to conduct heat, i.e., its thermal conductivity. Graphene is a remarkable nanostructure with exceptional physical properties, including one of the highest thermal conductivities (TC) ever measured. Graphene nanoribbons (GNRs) share most fundamental properties with graphene, with the added benefit of having a controllable electronic bandgap. One method to achieve such control is by twisting the GNR, which can tailor its electronic properties, as well as change its TC. Here, we revisit the dependence of the TC of twisted GNRs (TGNRs) on the number of applied turns to the GNR by calculating more precise and mathematically well-defined geometric parameters related to the TGNR shape, namely, its twist and writhe. We show that the dependence of the TC on twist is not a simple function of the number of turns initially applied to a straight GNR. In fact, we show that the TC of TGNRs requires at least two parameters to be properly described. Our conclusions are supported by atomistic molecular dynamics simulations to obtain the TC of suspended TGNRs prepared under different values of initially applied turns and different sizes of their suspended part. Among possible choices of parameter pairs, we show that TC can be appropriately described by the initial number of turns and the initial twist density of the TGNRs.
Submitted 30 April, 2024;
originally announced April 2024.
-
Thermodynamic properties of an electron gas in a two-dimensional quantum dot: an approach using density of states
Authors:
Luís Fernando C. Pereira,
Edilberto O. Silva
Abstract:
Potential applications of quantum dots in the nanotechnology industry make these systems an important field of study in various areas of physics. In particular, thermodynamics has a significant role in technological innovations. With this in mind, we studied some thermodynamic properties in quantum dots, such as entropy and heat capacity, as a function of the magnetic field over a wide range of temperatures. The density of states plays an important role in our analyses. At low temperatures, the variation in the magnetic field induces an oscillatory behavior in all thermodynamic properties. The depopulation of subbands is the trigger for the appearance of the oscillations.
Submitted 7 March, 2024;
originally announced March 2024.
-
Bayesian Active Learning for Censored Regression
Authors:
Frederik Boe Hüttel,
Christoffer Riis,
Filipe Rodrigues,
Francisco Câmara Pereira
Abstract:
Bayesian active learning is based on information theoretical approaches that focus on maximising the information that new observations provide to the model parameters. This is commonly done by maximising the Bayesian Active Learning by Disagreement (BALD) acquisition function. However, we highlight that it is challenging to estimate BALD when the new data points are subject to censorship, where only clipped values of the targets are observed. To address this, we derive the entropy and the mutual information for censored distributions and derive the BALD objective for active learning in censored regression ($\mathcal{C}$-BALD). We propose a novel modelling approach to estimate the $\mathcal{C}$-BALD objective and use it for active learning in the censored setting. Across a wide range of datasets and models, we demonstrate that $\mathcal{C}$-BALD outperforms other Bayesian active learning methods in censored regression.
Submitted 19 February, 2024;
originally announced February 2024.
-
Rotating effects on the photoionization cross-section of a 2D quantum ring
Authors:
Carlos Magno O. Pereira,
Frankbelson dos S. Azevedo,
Luís Fernando C. Pereira,
Edilberto O. Silva
Abstract:
In this letter, we investigate the nonrelativistic quantum motion of a charged particle within a rotating frame, taking into account the Aharonov-Bohm (AB) effect and a uniform magnetic field. Our analysis entails the derivation of the equation of motion and the corresponding radial equation to describe the system. Solving the resulting radial equation enables us to determine the eigenvalues and eigenfunctions, providing a clear expression for the energy levels. Furthermore, our numerical analysis highlights the substantial influence of rotation on both energy levels and optical properties. Specifically, we evaluate the photoionization cross-section (PCS) with and without the effects of rotation. To elucidate the impact of rotation on the photoionization process of the system, we present graphics that offer an appealing visualization of the intrinsic nature of the physics involved.
Submitted 25 January, 2024;
originally announced January 2024.
-
Study on the effects of anisotropic effective mass on electronic properties, magnetization and persistent current in semiconductor quantum ring with conical geometry
Authors:
Francisco A. G. de Lira,
Luís Fernando C. Pereira,
Edilberto O. Silva
Abstract:
We study a 2D mesoscopic ring with an anisotropic effective mass considering surface quantum confinement effects. We consider that the ring is defined on the surface of a cone, which can be controlled topologically and mapped to the 2D ring in flat space. We demonstrate through numerical analysis that the electronic properties, the magnetization, and the persistent current undergo significant changes due to quantum confinement and non-isotropic mass. We investigate these changes in the direct band gap semiconductors SiC, ZnO, GaN, and AlN. There is a plus (or minus) shift in the energy sub-bands for different values of curvature parameter and anisotropy. Manifestations of this nature are also seen in the Fermi energy profile as a function of the magnetic field and in the ring width as a function of the curvature parameter. Aharonov-Bohm (AB) and de Haas-van Alphen (dHvA) oscillations are also studied, and we find that they are sensitive to variations in curvature and anisotropy.
Submitted 16 November, 2023;
originally announced November 2023.
-
Deep Evidential Learning for Bayesian Quantile Regression
Authors:
Frederik Boe Hüttel,
Filipe Rodrigues,
Francisco Câmara Pereira
Abstract:
It is desirable to have accurate uncertainty estimation from a single deterministic forward-pass model, as traditional methods for uncertainty quantification are computationally expensive. However, this is difficult because single forward-pass models do not sample weights during inference and often make assumptions about the target distribution, such as assuming it is Gaussian. This can be restrictive in regression tasks, where the mean and standard deviation are inadequate to model the target distribution accurately. This paper proposes a deep Bayesian quantile regression model that can estimate the quantiles of a continuous target distribution without the Gaussian assumption. The proposed method is based on evidential learning, which allows the model to capture aleatoric and epistemic uncertainty with a single deterministic forward-pass model. This makes the method efficient and scalable to large models and datasets. We demonstrate that the proposed method achieves calibrated uncertainties on non-Gaussian distributions, disentanglement of aleatoric and epistemic uncertainty, and robustness to out-of-distribution samples.
Submitted 21 August, 2023;
originally announced August 2023.
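For readers unfamiliar with quantile regression, the sketch below shows the standard pinball (quantile) loss that quantile estimators commonly minimise. It is a generic textbook loss, not the evidential objective proposed in the abstract above, and the function name is an assumption made for illustration.

```python
def pinball_loss(y_true, y_pred, q):
    """Pinball loss for quantile level q in (0, 1) on a single observation."""
    diff = y_true - y_pred
    # Under-prediction (diff > 0) is weighted by q,
    # over-prediction (diff < 0) by (1 - q).
    return max(q * diff, (q - 1.0) * diff)


# For q = 0.9, predicting too low costs nine times more than predicting too high.
under = pinball_loss(3.0, 1.0, 0.9)
over = pinball_loss(1.0, 3.0, 0.9)
```

Because the loss is asymmetric, its minimiser is the q-th quantile of the target rather than its mean, which is why no Gaussian assumption is needed.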
-
Applied metamodelling for ATM performance simulations
Authors:
Christoffer Riis,
Francisco N. Antunes,
Tatjana Bolić,
Gérald Gurtner,
Andrew Cook,
Carlos Lima Azevedo,
Francisco Câmara Pereira
Abstract:
The use of air traffic management (ATM) simulators for planning and operations can be challenging due to their modelling complexity. This paper presents XALM (eXplainable Active Learning Metamodel), a three-step framework integrating active learning and SHAP (SHapley Additive exPlanations) values into simulation metamodels for supporting ATM decision-making. XALM efficiently uncovers hidden relationships among input and output variables in ATM simulators, those usually of interest in policy analysis. Our experiments show XALM's predictive performance comparable to the XGBoost metamodel with fewer simulations. Additionally, XALM exhibits superior explanatory capabilities compared to non-active learning metamodels.
Using the 'Mercury' (flight and passenger) ATM simulator, XALM is applied to a real-world scenario in Paris Charles de Gaulle airport, extending an arrival manager's range and scope by analysing six variables. This case study illustrates XALM's effectiveness in enhancing simulation interpretability and understanding variable interactions. By addressing computational challenges and improving explainability, XALM complements traditional simulation-based analyses.
Lastly, we discuss two practical approaches for reducing the computational burden of the metamodelling further: we introduce a stopping criterion for active learning based on the inherent uncertainty of the metamodel, and we show how the simulations used for the metamodel can be reused across key performance indicators, thus decreasing the overall number of simulations needed.
Submitted 7 August, 2023;
originally announced August 2023.
-
Analyzing the Reporting Error of Public Transport Trips in the Danish National Travel Survey Using Smart Card Data
Authors:
Georges Sfeir,
Filipe Rodrigues,
Maya Abou Zeid,
Francisco Camara Pereira
Abstract:
Household travel surveys have been used for decades to collect individuals' and households' travel behavior. However, self-reported surveys are subject to recall bias, as respondents might struggle to recall and report their activities accurately. This study examines the time reporting error of public transit users in a nationwide household travel survey by matching, at the individual level, five consecutive years of data from two sources, namely the Danish National Travel Survey (TU) and the Danish Smart Card system (Rejsekort). Survey respondents are matched with travel cards from the Rejsekort data solely based on the respondents' declared spatiotemporal travel behavior. Approximately 70% of the respondents were successfully matched with Rejsekort travel cards. The findings reveal a median time reporting error of 11.34 minutes, with an interquartile range of 28.14 minutes. Furthermore, a statistical analysis was performed to explore the relationships between the survey respondents' reporting error and their socio-economic and demographic characteristics. The results indicate that females and respondents with a fixed schedule are in general more accurate than males and respondents with a flexible schedule in reporting their times of travel. Moreover, trips reported during weekdays or via the internet displayed higher accuracies compared to trips reported during weekends and holidays or via telephone interviews. This disaggregated analysis provides valuable insights that could help in improving the design and analysis of travel surveys, as well as accounting for reporting errors/biases in travel survey-based applications. Furthermore, it offers valuable insights underlying the psychology of travel recall by survey respondents.
Submitted 1 July, 2024; v1 submitted 2 August, 2023;
originally announced August 2023.
-
Learning and Generalizing Polynomials in Simulation Metamodeling
Authors:
Jesper Hauch,
Christoffer Riis,
Francisco C. Pereira
Abstract:
The ability to learn polynomials and generalize out-of-distribution is essential for simulation metamodels in many disciplines of engineering, where the time step updates are described by polynomials. While feed forward neural networks can fit any function, they cannot generalize out-of-distribution for higher-order polynomials. Therefore, this paper collects and proposes multiplicative neural network (MNN) architectures that are used as recursive building blocks for approximating higher-order polynomials. Our experiments show that MNNs are better than baseline models at generalizing, and their performance in validation is true to their performance in out-of-distribution tests. In addition to MNN architectures, a simulation metamodeling approach is proposed for simulations with polynomial time step updates. For these simulations, simulating a time interval can be performed in fewer steps by increasing the step size, which entails approximating higher-order polynomials. While our approach is compatible with any simulation with polynomial time step updates, a demonstration is shown for an epidemiology simulation model, which also shows the inductive bias in MNNs for learning and generalizing higher-order polynomials.
Submitted 20 July, 2023;
originally announced July 2023.
-
Lattice Thermal Conductivity of 2D Nanomaterials: A Simple Semi-Empirical Approach
Authors:
R. M. Tromer,
I. M. Felix,
L. F. C. Pereira,
M. G. E. da Luz,
L. A. Ribeiro Junior,
D. S. Galvão
Abstract:
Extracting reliable information on certain physical properties of materials, such as thermal transport, can be very computationally demanding. Aiming to overcome such difficulties in the particular case of lattice thermal conductivity (LTC) of 2D nanomaterials, we propose a simple, fast, and accurate semi-empirical approach for its calculation. The approach is based on parameterized thermochemical equations and Arrhenius-like fitting procedures, thus avoiding molecular dynamics or ab initio protocols, which frequently demand computationally expensive simulations. As proof of concept, we obtain the LTC of some prototypical physical systems, such as graphene (and other 2D carbon allotropes), hexagonal boron nitride (hBN), silicene, germanene, binary and ternary BNC lattices, and two examples of the fullerene network family. Our values are in good agreement with other theoretical and experimental estimations, yet are derived in a rather straightforward way, at a fraction of the computational cost.
Submitted 3 July, 2023;
originally announced July 2023.
-
Optical and electronic properties of a two-dimensional quantum ring under rotating effects
Authors:
Daniel F. Lima,
Frankbelson dos S. Azevedo,
Luís Fernando C. Pereira,
Cleverson Filgueiras,
Edilberto O. Silva
Abstract:
This work presents a study on the nonrelativistic quantum motion of a charged particle in a rotating frame, considering the Aharonov-Bohm effect and a uniform magnetic field. We derive the equation of motion and the corresponding radial equation to describe the system. The Schrödinger equation with minimal coupling incorporates rotation effects by substituting the momentum operator with an effective four-potential. Additionally, a radial potential term, dependent on the average radius of the ring, is introduced. The analysis is restricted to motion in a two-dimensional plane, neglecting the degree of freedom in the $z$-direction. By solving the radial equation, we determine the eigenvalues and eigenfunctions, allowing for an explicit expression of the energy. The probability distribution is analyzed for varying rotating parameter values, revealing a shift of the distribution as the rotation changes, resulting in a centrifugal effect and occupation of the ring's edges. Furthermore, numerical analysis demonstrates the significant rotational effects on energy levels and optical properties, including optical absorption and refractive coefficients.
Submitted 25 May, 2023;
originally announced May 2023.
-
Graph Reinforcement Learning for Network Control via Bi-Level Optimization
Authors:
Daniele Gammelli,
James Harrison,
Kaidi Yang,
Marco Pavone,
Filipe Rodrigues,
Francisco C. Pereira
Abstract:
Optimization problems over dynamic networks have been extensively studied and widely used in the past decades to formulate numerous real-world problems. However, (1) traditional optimization-based approaches do not scale to large networks, and (2) the design of good heuristics or approximation algorithms often requires significant manual trial-and-error. In this work, we argue that data-driven strategies can automate this process and learn efficient algorithms without compromising optimality. To do so, we present network control problems through the lens of reinforcement learning and propose a graph network-based framework to handle a broad class of problems. Instead of naively computing actions over high-dimensional graph elements, e.g., edges, we propose a bi-level formulation where we (1) specify a desired next state via RL, and (2) solve a convex program to best achieve it, leading to drastically improved scalability and performance. We further highlight a collection of desirable features to system designers, investigate design decisions, and present experiments on real-world control problems showing the utility, scalability, and flexibility of our framework.
Submitted 15 May, 2023;
originally announced May 2023.
-
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning
Authors:
Carolin Schmidt,
Daniele Gammelli,
Francisco Camara Pereira,
Filipe Rodrigues
Abstract:
Autonomous Mobility-on-Demand (AMoD) systems are an evolving mode of transportation in which a centrally coordinated fleet of self-driving vehicles dynamically serves travel requests. The control of these systems is typically formulated as a large network optimization problem, and reinforcement learning (RL) has recently emerged as a promising approach to solve the open challenges in this space. Recent centralized RL approaches focus on learning from online data, ignoring the per-sample-cost of interactions within real-world transportation systems. To address these limitations, we propose to formalize the control of AMoD systems through the lens of offline reinforcement learning and learn effective control strategies using solely offline data, which is readily available to current mobility operators. We further investigate design decisions and provide empirical evidence based on data from real-world mobility systems showing how offline learning allows to recover AMoD control policies that (i) exhibit performance on par with online methods, (ii) allow for sample-efficient online fine-tuning and (iii) eliminate the need for complex simulation environments. Crucially, this paper demonstrates that offline RL is a promising paradigm for the application of RL-based solutions within economically-critical systems, such as mobility systems.
Submitted 25 August, 2023; v1 submitted 28 February, 2023;
originally announced February 2023.
-
Attitudes and Latent Class Choice Models using Machine learning
Authors:
Lorena Torres Lahoz,
Francisco Camara Pereira,
Georges Sfeir,
Ioanna Arkoudi,
Mayara Moraes Monteiro,
Carlos Lima Azevedo
Abstract:
Latent Class Choice Models (LCCM) are extensions of discrete choice models (DCMs) that capture unobserved heterogeneity in the choice process by segmenting the population based on the assumption of preference similarities. We present a method of efficiently incorporating attitudinal indicators in the specification of LCCM, by introducing Artificial Neural Networks (ANN) to formulate latent variable constructs. This formulation overcomes structural equations in its capability of exploring the relationship between the attitudinal indicators and the decision choice, given the Machine Learning (ML) flexibility and power in capturing unobserved and complex behavioural features, such as attitudes and beliefs, all while still maintaining the consistency of the theoretical assumptions presented in the Generalized Random Utility model and the interpretability of the estimated parameters. We test our proposed framework for estimating a Car-Sharing (CS) service subscription choice with stated preference data from Copenhagen, Denmark. The results show that our proposed approach provides a complete and realistic segmentation, which helps design better policies.
Submitted 20 February, 2023;
originally announced February 2023.
-
A Lévy flight for electrons in graphene: superdiffusive-to-diffusive transport transition
Authors:
Diego B. Fonseca,
Luiz Felipe C. Pereira,
Anderson L. R. Barbosa
Abstract:
In this work we propose an electronic Lévy flight device, analogous to a recent optical realization. To that end, we investigate the transmission of electrons in graphene nanoribbons in the presence of circular electrostatic clusters, whose diameters follow a power-law distribution. We analyze the effect of the electrostatic clusters on the electronic transport regime of the nanoribbons, in terms of its diffusion behavior. Our numerical calculations show that the presence of circular electrostatic clusters induces a transition from Lévy (superdiffusive) to diffusive transport as the energy increases. Furthermore, we argue that in our electronic Lévy flight device, superdiffusive transport is an exclusive feature of the low-energy quantum regime, while diffusive transport is a feature of the semiclassical regime. Therefore, we attribute the observed transition to the chiral symmetry breaking, once the energy moves away from the Dirac point of graphene.
Submitted 20 April, 2023; v1 submitted 4 February, 2023;
originally announced February 2023.
-
Mind the Gap: Modelling Difference Between Censored and Uncensored Electric Vehicle Charging Demand
Authors:
Frederik Boe Hüttel,
Filipe Rodrigues,
Francisco Câmara Pereira
Abstract:
Electric vehicle charging demand models, with charging records as input, will inherently be biased toward the supply of available chargers. These models often fail to account for demand lost from occupied charging stations and competitors. The lost demand suggests that the actual demand is likely higher than the charging records reflect, i.e., the true demand is latent (unobserved), and the observations are censored. As a result, machine learning models that rely on these observed records for forecasting charging demand may be limited in their application in future infrastructure expansion and supply management, as they do not estimate the true demand for charging. We propose using censorship-aware models to model charging demand to address this limitation. These models incorporate censorship in their loss functions and learn the true latent demand distribution from observed charging records. We study how occupied charging stations and competing services censor demand using GPS trajectories from cars in Copenhagen, Denmark. We find that censorship occurs up to $61\%$ of the time in some areas of the city. We use the observed charging demand from our study to estimate the true demand and find that censorship-aware models provide better prediction and uncertainty estimation of actual demand than censorship-unaware models. We suggest that future charging models based on charging records should account for censoring to expand the application areas of machine learning models in supply management and infrastructure expansion.
Submitted 30 May, 2023; v1 submitted 16 January, 2023;
originally announced January 2023.
-
Wave transmission and its universal fluctuations in one-dimensional systems with Lévy-like disorder: Schrödinger, Klein-Gordon and Dirac equations
Authors:
Anderson L. R. Barbosa,
Jonas R. F. Lima,
Luiz Felipe C. Pereira
Abstract:
We investigate the propagation of waves in one-dimensional systems with Lévy-type disorder. We perform a complete analysis of non-relativistic and relativistic wave transmission through potential barriers whose width, separation or both follow Lévy distributions characterized by an exponent $0 < α < 1$. For the first two cases, where one of the parameters is fixed, non-relativistic and relativistic waves present anomalous localization, $\langle T \rangle \propto L^{-α}$. However, for the latter case, in which both parameters follow a Lévy distribution, non-relativistic and relativistic waves present a transition between anomalous and standard localization as the incidence energy increases relative to the barrier height. Moreover, we obtain the localization diagram delimiting anomalous and standard localization regimes, in terms of incidence angle and energy. Finally, we verify that transmission fluctuations, characterized by their standard deviation, are universal, independent of barrier architecture, wave equation type, incidence energy and angle, further extending earlier studies on electronic localization.
Submitted 17 October, 2022;
originally announced October 2022.
-
Mid-Infrared Photothermal-Fluorescence in Situ Hybridization for Functional Analysis and Genetic Identification of Single Cells
Authors:
Yeran Bai,
Zhongyue Guo,
Fátima C. Pereira,
Michael Wagner,
Ji-Xin Cheng
Abstract:
Simultaneous identification and metabolic analysis of microbes with single-cell resolution and high throughput is necessary to answer the question of "who eats what, when, and where" in complex microbial communities. Here, we present a mid-infrared photothermal-fluorescence in situ hybridization (MIP-FISH) platform that enables direct bridging of genotype and phenotype. Through multiple improvements of MIP imaging, the sensitive detection of isotopically-labelled compounds incorporated into proteins of individual bacterial cells became possible, while simultaneous detection of FISH labelling with rRNA-targeted probes enabled the identification of the analyzed cells. In proof-of-concept experiments, we showed that the clear spectral red shift in the protein amide I region due to incorporation of $^{13}$C atoms originating from $^{13}$C-labelled-glucose can be exploited by MIP-FISH to discriminate and identify $^{13}$C-labelled bacterial cells within a complex human gut microbiome sample. The presented methods open new opportunities for single-cell structure-function analyses for microbiology.
Submitted 6 September, 2022;
originally announced September 2022.
-
Determining Causality in Travel Mode Choice
Authors:
Rishabh Singh Chauhan,
Christoffer Riis,
Shishir Adhikari,
Sybil Derrible,
Elena Zheleva,
Charisma F. Choudhury,
Francisco Camara Pereira
Abstract:
This article presents one of the pioneering studies on causal modeling in travel mode choice decision-making using causal discovery algorithms. These models are a major advancement from conventional correlation-based techniques. We propose a novel methodology that combines causal discovery with structural equation modeling (SEM). This modeling approach overcomes some of the limitations of SEM by combining the strengths of both causal discovery and SEM. Causal discovery algorithms determine causal graphs from observational data and domain knowledge, and SEMs estimate direct causal effects and test the performance of causal discovery algorithms. In this study, we test four causal discovery algorithms: Peter-Clark (PC), Fast Causal Inference (FCI), Fast Greedy Equivalence Search (FGES), and Direct Linear Non-Gaussian Acyclic Models (DirectLiNGAM). The results show that DirectLiNGAM based SEM model best captures causality in mode choice behavior. It passes several goodness-of-fit tests, including Root Mean Square Error of Approximation (RMSEA) and Goodness-of-Fit Index (GFI), and it achieves the lowest Bayesian Information Criterion (BIC) value. The analyses are conducted on data collected from the 2017 National Household Travel Survey in the New York Metropolitan area.
Submitted 24 April, 2023; v1 submitted 10 August, 2022;
originally announced August 2022.
-
Modification of Landau levels in a two-dimensional ring due to rotation effects and edge states
Authors:
Luís Fernando C. Pereira,
Edilberto O. Silva
Abstract:
We investigate the properties of a two-dimensional quantum ring under rotating and external magnetic field effects. We initially analyse the Landau levels and inertial effects on them. Among the results obtained, we emphasize that the rotation lifted the degeneracy of Landau levels. When electrons are confined in a two-dimensional ring, which is modeled by a hard wall potential, the eigenstates are described by Landau states as long as the eigenstates are not too close to the edges of the ring. On the other hand, near the edges of the ring, the energies increase monotonically. These states are known as edge states. Edge states have an important effect on the physical properties of the ring. Thus, we analyze the Fermi energy and magnetization. In the specific case of magnetization, we consider two approaches. In the first, we obtain an analytical result for magnetization but without considering rotation. Numerical results showed the de Haas-Van Alphen (dHvA) oscillations. In the second, we consider rotating effects. In addition to the dHvA oscillations, we also verify the Aharonov-Bohm-type (AB) oscillations, which are associated with the presence of edge states. We discuss the effects of rotation on the results and find that rotation is responsible for inducing Aharonov-Bohm-type oscillations.
Submitted 25 July, 2022;
originally announced July 2022.
-
Bayesian Active Learning with Fully Bayesian Gaussian Processes
Authors:
Christoffer Riis,
Francisco Antunes,
Frederik Boe Hüttel,
Carlos Lima Azevedo,
Francisco Câmara Pereira
Abstract:
The bias-variance trade-off is a well-known problem in machine learning that only gets more pronounced the less available data there is. In active learning, where labeled data is scarce or difficult to obtain, neglecting this trade-off can cause inefficient and non-optimal querying, leading to unnecessary data labeling. In this paper, we focus on active learning with Gaussian Processes (GPs). For the GP, the bias-variance trade-off is made by optimization of the two hyperparameters: the length scale and noise-term. Considering that the optimal mode of the joint posterior of the hyperparameters is equivalent to the optimal bias-variance trade-off, we approximate this joint posterior and utilize it to design two new acquisition functions. The first one is a Bayesian variant of Query-by-Committee (B-QBC), and the second is an extension that explicitly minimizes the predictive variance through a Query by Mixture of Gaussian Processes (QB-MGP) formulation. Across six simulators, we empirically show that B-QBC, on average, achieves the best marginal likelihood, whereas QB-MGP achieves the best predictive performance. We show that incorporating the bias-variance trade-off in the acquisition functions mitigates unnecessary and expensive data labeling.
Submitted 14 January, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.
-
Open vs Closed-ended questions in attitudinal surveys -- comparing, combining, and interpreting using natural language processing
Authors:
Vishnu Baburajan,
João de Abreu e Silva,
Francisco Camara Pereira
Abstract:
To improve the traveling experience, researchers have been analyzing the role of attitudes in travel behavior modeling. Although most researchers use closed-ended surveys, the appropriate method to measure attitudes is debatable. Topic Modeling could significantly reduce the time to extract information from open-ended responses and eliminate subjective bias, thereby alleviating analyst concerns. Our research uses Topic Modeling to extract information from open-ended questions and compare its performance with closed-ended responses. Furthermore, some respondents might prefer answering questions using their preferred questionnaire type. So, we propose a modeling framework that allows respondents to use their preferred questionnaire type to answer the survey and enable analysts to use the modeling frameworks of their choice to predict behavior. We demonstrate this using a dataset collected from the USA that measures the intention to use Autonomous Vehicles for commute trips. Respondents were presented with alternative questionnaire versions (open- and closed- ended). Since our objective was also to compare the performance of alternative questionnaire versions, the survey was designed to eliminate influences resulting from statements, behavioral framework, and the choice experiment. Results indicate the suitability of using Topic Modeling to extract information from open-ended responses; however, the models estimated using the closed-ended questions perform better compared to them. Besides, the proposed model performs better compared to the models used currently. Furthermore, our proposed framework will allow respondents to choose the questionnaire type to answer, which could be particularly beneficial to them when using voice-based surveys.
Submitted 3 May, 2022;
originally announced May 2022.
-
Transfer learning for cross-modal demand prediction of bike-share and public transit
Authors:
Mingzhuang Hua,
Francisco Camara Pereira,
Yu Jiang,
Xuewu Chen
Abstract:
The urban transportation system is a combination of multiple transport modes, and the interdependencies across those modes exist. This means that the travel demand across different travel modes could be correlated as one mode may receive demand from or create demand for another mode, not to mention natural correlations between different demand time series due to general demand flow patterns across the network. It is expectable that cross-modal ripple effects become more prevalent, with Mobility as a Service. Therefore, by propagating demand data across modes, a better demand prediction could be obtained. To this end, this study explores various machine learning models and transfer learning strategies for cross-modal demand prediction. The trip data of bike-share, metro, and taxi are processed as the station-level passenger flows, and then the proposed prediction method is tested in the large-scale case studies of Nanjing and Chicago. The results suggest that prediction models with transfer learning perform better than unimodal prediction models. Furthermore, stacked Long Short-Term Memory model performs particularly well in cross-modal demand prediction. These results verify our combined method's forecasting improvement over existing benchmarks and demonstrate the good transferability for cross-modal demand prediction in multiple cities.
Submitted 17 March, 2022;
originally announced March 2022.
-
Large Scale Passenger Detection with Smartphone/Bus Implicit Interaction and Multisensory Unsupervised Cause-effect Learning
Authors:
Valentino Servizi,
Dan R. Persson,
Francisco C. Pereira,
Hannah Villadsen,
Per Bækgaard,
Jeppe Rich,
Otto A. Nielsen
Abstract:
Intelligent Transportation Systems (ITS) underpin the concept of Mobility as a Service (MaaS), which requires universal and seamless users' access across multiple public and private transportation systems while allowing operators' proportional revenue sharing. Current user sensing technologies such as Walk-in/Walk-out (WIWO) and Check-in/Check-out (CICO) have limited scalability for large-scale deployments. These limitations prevent ITS from supporting analysis, optimization, calculation of revenue sharing, and control of MaaS comfort, safety, and efficiency. We focus on the concept of implicit Be-in/Be-out (BIBO) smartphone-sensing and classification.
To close the gap and enhance smartphones towards MaaS, we developed a proprietary smartphone-sensing platform collecting contemporary Bluetooth Low Energy (BLE) signals from BLE devices installed on buses and Global Positioning System (GPS) locations of both buses and smartphones. To enable the training of a model based on GPS features against the BLE pseudo-label, we propose the Cause-Effect Multitask Wasserstein Autoencoder (CEMWA). CEMWA combines and extends several frameworks around Wasserstein autoencoders and neural networks. As a dimensionality reduction tool, CEMWA obtains an auto-validated representation of a latent space describing users' smartphones within the transport system. This representation allows BIBO clustering via DBSCAN.
We perform an ablation study of CEMWA's alternative architectures and benchmark against the best available supervised methods. We analyze performance's sensitivity to label quality. Under the naïve assumption of accurate ground truth, XGBoost outperforms CEMWA. Although XGBoost and Random Forest prove to be tolerant to label noise, CEMWA is agnostic to label noise by design and provides the best performance with an 88\% F1 score.
Submitted 24 February, 2022;
originally announced February 2022.
-
"Is not the truth the truth?": Analyzing the Impact of User Validations for Bus In/Out Detection in Smartphone-based Surveys
Authors:
Valentino Servizi,
Dan R. Persson,
Francisco C. Pereira,
Hannah Villadsen,
Per Bækgaard,
Inon Peled,
Otto A. Nielsen
Abstract:
Passenger flow allows the study of users' behavior through the public network and assists in designing new facilities and services. This flow is observed through interactions between passengers and infrastructure. For this task, Bluetooth technology and smartphones represent the ideal solution. The latter component allows users' identification, authentication, and billing, while the former allows short-range implicit interactions, device-to-device. To assess the potential of such a use case, we need to verify how robust Bluetooth signal and related machine learning (ML) classifiers are against the noise of realistic contexts. Therefore, we model binary passenger states with respect to a public vehicle, where one can either be-in or be-out (BIBO). The BIBO label identifies a fundamental building block of continuously-valued passenger flow. This paper describes the Human-Computer interaction experimental setting in a semi-controlled environment, which involves: two autonomous vehicles operating on two routes, serving three bus stops and eighteen users, as well as a proprietary smartphone-Bluetooth sensing platform. The resulting dataset includes multiple sensors' measurements of the same event and two ground-truth levels, the first being validation by participants, the second by three video-cameras surveilling buses and track. We performed a Monte-Carlo simulation of labels-flip to emulate human errors in the labeling process, as is known to happen in smartphone surveys; next we used such flipped labels for supervised training of ML classifiers. The impact of errors on model performance bias can be large. Results show ML tolerance to label flips caused by human or machine errors up to 30%.
Submitted 24 February, 2022;
originally announced February 2022.
-
Graph Meta-Reinforcement Learning for Transferable Autonomous Mobility-on-Demand
Authors:
Daniele Gammelli,
Kaidi Yang,
James Harrison,
Filipe Rodrigues,
Francisco C. Pereira,
Marco Pavone
Abstract:
Autonomous Mobility-on-Demand (AMoD) systems represent an attractive alternative to existing transportation paradigms, currently challenged by urbanization and increasing travel needs. By centrally controlling a fleet of self-driving vehicles, these systems provide mobility service to customers and are currently starting to be deployed in a number of cities around the world. Current learning-based approaches for controlling AMoD systems are limited to the single-city scenario, whereby the service operator is allowed to take an unlimited amount of operational decisions within the same transportation system. However, real-world system operators can hardly afford to fully re-train AMoD controllers for every city they operate in, as this could result in a high number of poor-quality decisions during training, making the single-city strategy a potentially impractical solution. To address these limitations, we propose to formalize the multi-city AMoD problem through the lens of meta-reinforcement learning (meta-RL) and devise an actor-critic algorithm based on recurrent graph neural networks. In our approach, AMoD controllers are explicitly trained such that a small amount of experience within a new city will produce good system performance. Empirically, we show how control policies learned through meta-RL are able to achieve near-optimal performance on unseen cities by learning rapidly adaptable policies, thus making them more robust not only to novel environments, but also to distribution shifts common in real-world operations, such as special events, unexpected congestion, and dynamic pricing schemes.
Submitted 14 February, 2022;
originally announced February 2022.
-
Unboxing the graph: Neural Relational Inference for Mobility Prediction
Authors:
Mathias Niemann Tygesen,
Francisco C. Pereira,
Filipe Rodrigues
Abstract:
Predicting the supply and demand of transport systems is vital for efficient traffic management, control, optimization, and planning. For example, predicting where from/to and when people intend to travel by taxi can support fleet managers to distribute resources; better predicting traffic speeds/congestion allows for pro-active control measures or for users to better choose their paths. Making spatio-temporal predictions is known to be a hard task, but recently Graph Neural Networks (GNNs) have been widely applied on non-euclidean spatial data. However, most GNN models require a predefined graph, and so far, researchers rely on heuristics to generate this graph for the model to use. In this paper, we use Neural Relational Inference to learn the optimal graph for the model. Our approach has several advantages: 1) a Variational Auto Encoder structure allows for the graph to be dynamically determined by the data, potentially changing through time; 2) the encoder structure allows the use of external data in the generation of the graph; 3) it is possible to place Bayesian priors on the generated graphs to encode domain knowledge. We conduct experiments on two datasets, namely the NYC Yellow Taxi and the PEMS road traffic datasets. In both datasets, we outperform benchmarks and show performance comparable to state-of-the-art. Furthermore, we do an in-depth analysis of the learned graphs, providing insights on what kinds of connections GNNs use for spatio-temporal predictions in the transport domain.
Submitted 25 January, 2022;
originally announced January 2022.
-
Combining Discrete Choice Models and Neural Networks through Embeddings: Formulation, Interpretability and Performance
Authors:
Ioanna Arkoudi,
Carlos Lima Azevedo,
Francisco C. Pereira
Abstract:
This study proposes a novel approach that combines theory and data-driven choice models using Artificial Neural Networks (ANNs). In particular, we use continuous vector representations, called embeddings, for encoding categorical or discrete explanatory variables with a special focus on interpretability and model transparency. Although embedding representations within the logit framework have been conceptualized by Pereira (2019), their dimensions do not have an absolute definitive meaning, hence offering limited behavioral insights in this earlier work. The novelty of our work lies in enforcing interpretability to the embedding vectors by formally associating each of their dimensions to a choice alternative. Thus, our approach brings benefits much beyond a simple parsimonious representation improvement over dummy encoding, as it provides behaviorally meaningful outputs that can be used in travel demand analysis and policy decisions. Additionally, in contrast to previously suggested ANN-based Discrete Choice Models (DCMs) that either sacrifice interpretability for performance or are only partially interpretable, our models preserve interpretability of the utility coefficients for all the input variables despite being based on ANN principles. The proposed models were tested on two real world datasets and evaluated against benchmark and baseline models that use dummy-encoding. The results of the experiments indicate that our models deliver state-of-the-art predictive performance, outperforming existing ANN-based models while drastically reducing the number of required network parameters.
Submitted 30 September, 2021; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Predictive and Prescriptive Performance of Bike-Sharing Demand Forecasts for Inventory Management
Authors:
Daniele Gammelli,
Yihua Wang,
Dennis Prak,
Filipe Rodrigues,
Stefan Minner,
Francisco Camara Pereira
Abstract:
Bike-sharing systems are a rapidly developing mode of transportation and provide an efficient alternative to passive, motorized personal mobility. The asymmetric nature of bike demand causes the need for rebalancing bike stations, which is typically done during night time. To determine the optimal starting inventory level of a station for a given day, a User Dissatisfaction Function (UDF) models user pickups and returns as non-homogeneous Poisson processes with piece-wise linear rates. In this paper, we devise a deep generative model directly applicable in the UDF by introducing a variational Poisson recurrent neural network model (VP-RNN) to forecast future pickup and return rates. We empirically evaluate our approach against both traditional and learning-based forecasting methods on real trip travel data from the city of New York, USA, and show how our model outperforms benchmarks in terms of system efficiency and demand satisfaction. By explicitly focusing on the combination of decision-making algorithms with learning-based forecasting methods, we highlight a number of shortcomings in literature. Crucially, we show how more accurate predictions do not necessarily translate into better inventory decisions. By providing insights into the interplay between forecasts, model assumptions, and decisions, we point out that forecasts and decision models should be carefully evaluated and harmonized to optimally control shared mobility systems.
Submitted 28 July, 2021;
originally announced August 2021.
-
Deep Spatio-Temporal Forecasting of Electrical Vehicle Charging Demand
Authors:
Frederik Boe Hüttel,
Inon Peled,
Filipe Rodrigues,
Francisco C. Pereira
Abstract:
Electric vehicles can offer a low carbon emission solution to reverse rising emission trends. However, this requires that the energy used to meet the demand is green. To meet this requirement, accurate forecasting of the charging demand is vital. Short and long-term charging demand forecasting will allow for better optimisation of the power grid and future infrastructure expansions. In this paper, we propose to use publicly available data to forecast the electric vehicle charging demand. To model the complex spatial-temporal correlations between charging stations, we argue that Temporal Graph Convolution Models are the most suitable to capture the correlations. The proposed Temporal Graph Convolutional Networks provide the most accurate forecasts for short and long-term forecasting compared with other forecasting methods.
Submitted 21 June, 2021;
originally announced June 2021.
-
Improving the Accuracy and Efficiency of Online Calibration for Simulation-based Dynamic Traffic Assignment
Authors:
Haizheng Zhang,
Ravi Seshadri,
A. Arun Prakash,
Constantinos Antoniou,
Francisco C. Pereira,
Moshe Ben-Akiva
Abstract:
Simulation-based Dynamic Traffic Assignment models have important applications in real-time traffic management and control. The efficacy of these systems rests on the ability to generate accurate estimates and predictions of traffic states, which necessitates online calibration. A widely used solution approach for online calibration is the Extended Kalman Filter (EKF), which -- although appealing in its flexibility to incorporate any class of parameters and measurements -- poses several challenges with regard to calibration accuracy and scalability, especially in congested situations for large-scale networks. This paper addresses these issues in turn so as to improve the accuracy and efficiency of EKF-based online calibration approaches for large and congested networks. First, the concept of state augmentation is revisited to handle violations of the Markovian assumption typically implicit in online applications of the EKF. Second, a method based on graph-coloring is proposed to operationalize the partitioned finite-difference approach that enhances scalability of the gradient computations.
Several synthetic experiments and a real world case study demonstrate that application of the proposed approaches yields improvements in terms of both prediction accuracy and computational performance. The work has applications in real-world deployments of simulation-based dynamic traffic assignment systems.
Submitted 31 May, 2021;
originally announced May 2021.
-
Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand Systems
Authors:
Daniele Gammelli,
Kaidi Yang,
James Harrison,
Filipe Rodrigues,
Francisco C. Pereira,
Marco Pavone
Abstract:
Autonomous mobility-on-demand (AMoD) systems represent a rapidly developing mode of transportation wherein travel requests are dynamically handled by a coordinated fleet of robotic, self-driving vehicles. Given a graph representation of the transportation network - one where, for example, nodes represent areas of the city, and edges the connectivity between them - we argue that the AMoD control problem is naturally cast as a node-wise decision-making problem. In this paper, we propose a deep reinforcement learning framework to control the rebalancing of AMoD systems through graph neural networks. Crucially, we demonstrate that graph neural networks enable reinforcement learning agents to recover behavior policies that are significantly more transferable, generalizable, and scalable than policies learned through other approaches. Empirically, we show how the learned policies exhibit promising zero-shot transfer capabilities when faced with critical portability tasks such as inter-city generalization, service area expansion, and adaptation to potentially complex urban topologies.
Submitted 16 August, 2021; v1 submitted 23 April, 2021;
originally announced April 2021.
-
Modeling Censored Mobility Demand through Quantile Regression Neural Networks
Authors:
Frederik Boe Hüttel,
Inon Peled,
Filipe Rodrigues,
Francisco C. Pereira
Abstract:
Shared mobility services require accurate demand models for effective service planning. On the one hand, modeling the full probability distribution of demand is advantageous because the entire uncertainty structure preserves valuable information for decision-making. On the other hand, demand is often observed through the usage of the service itself, so that the observations are censored, as they are inherently limited by available supply. Since the 1980s, various works on Censored Quantile Regression models have performed well under such conditions. Further, in the last two decades, several papers have proposed to implement these models flexibly through Neural Networks. However, the models in current works estimate the quantiles individually, thus incurring a computational overhead and ignoring valuable relationships between the quantiles. We address this gap by extending current Censored Quantile Regression models to learn multiple quantiles at once and apply these to synthetic baseline datasets and datasets from two shared mobility providers in the Copenhagen metropolitan area in Denmark. The results show that our extended models yield fewer quantile crossings and less computational overhead without compromising model performance.
Submitted 9 July, 2022; v1 submitted 2 April, 2021;
originally announced April 2021.
-
Population synthesis for urban resident modeling using deep generative models
Authors:
Martin Johnsen,
Oliver Brandt,
Sergio Garrido,
Francisco C. Pereira
Abstract:
The impacts of new real estate developments are strongly associated with their population distribution (types and compositions of households, incomes, social demographics) conditioned on aspects such as dwelling typology, price, location, and floor level. This paper presents a machine-learning-based method to model the population distribution of upcoming developments of new buildings within larger neighborhood/condo settings.
We use a real data set from Ecopark Township, a real estate development project in Hanoi, Vietnam, where we study two machine learning algorithms from the deep generative models literature to create a population of synthetic agents: Conditional Variational Auto-Encoder (CVAE) and Conditional Generative Adversarial Networks (CGAN). A large experimental study was performed, showing that the CVAE outperforms both the empirical distribution, a non-trivial baseline model, and the CGAN in estimating the population distribution of new real estate development projects.
Submitted 13 November, 2020;
originally announced November 2020.
-
On the Quality Requirements of Demand Prediction for Dynamic Public Transport
Authors:
Inon Peled,
Kelvin Lee,
Yu Jiang,
Justin Dauwels,
Francisco C. Pereira
Abstract:
As Public Transport (PT) becomes more dynamic and demand-responsive, it increasingly depends on predictions of transport demand. But how accurate need such predictions be for effective PT operation? We address this question through an experimental case study of PT trips in Metropolitan Copenhagen, Denmark, which we conduct independently of any specific prediction models. First, we simulate errors in demand prediction through unbiased noise distributions that vary considerably in shape. Using the noisy predictions, we then simulate and optimize demand-responsive PT fleets via a linear programming formulation and measure their performance. Our results suggest that the optimized performance is mainly affected by the skew of the noise distribution and the presence of infrequently large prediction errors. In particular, the optimized performance can improve under non-Gaussian vs. Gaussian noise. We also find that dynamic routing could reduce trip time by at least 23% vs. static routing. This reduction is estimated at 809,000 EUR/year in terms of Value of Travel Time Savings for the case study.
Submitted 6 November, 2021; v1 submitted 31 August, 2020;
originally announced August 2020.
-
Majority-vote model with limited visibility: an investigation into filter bubbles
Authors:
Andre L. M. Vilela,
Luiz Felipe C. Pereira,
Laercio Dias,
H. Eugene Stanley,
Luciano R. da Silva
Abstract:
The dynamics of opinion formation in a society is a complex phenomenon where many variables play an important role. Recently, the influence of algorithms that filter which content is fed to social network users has come under scrutiny. Supposedly, the algorithms promote marketing strategies, but can also facilitate the formation of filter bubbles in which a user is most likely exposed to opinions that conform to their own. In the two-state majority-vote model an individual adopts an opinion contrary to the majority of its neighbors with probability $q$, defined as the noise parameter. Here, we introduce a visibility parameter $V$ in the dynamics of the majority-vote model, which equals the probability of an individual ignoring the opinion of each one of its neighbors. For $V=0.5$ each individual will, on average, ignore the opinion of half of its neighboring nodes. We employ Monte Carlo simulations to calculate the critical noise parameter as a function of the visibility $q_c(V)$ and obtain the phase diagram of the model. We find that the critical noise is an increasing function of the visibility parameter, such that a lower value of $V$ favors dissensus. Via finite-size scaling analysis we obtain the critical exponents of the model, which are visibility-independent, and show that the model belongs to the Ising universality class. We compare our results to the case of a network submitted to a static site dilution, and find that the limited visibility model is a more subtle way of inducing opinion polarization in a social network.
Submitted 31 August, 2020; v1 submitted 26 August, 2020;
originally announced August 2020.
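The update rule described in the majority-vote abstract above can be sketched in a few lines. This is only an illustration, not the authors' code: it takes the abstract's wording literally, so each neighbor is ignored independently with probability $V$, and it assumes opinions are stored as +1/-1 and that a tie among visible neighbors is resolved by a coin flip. The names `majority_vote_update` and `ring_neighbors` are hypothetical.

```python
import random


def ring_neighbors(n):
    """Hypothetical helper: each node on a ring sees its two nearest neighbors."""
    return [[(i - 1) % n, (i + 1) % n] for i in range(n)]


def majority_vote_update(spins, neighbors, q, V, rng):
    """Attempt one opinion update on a randomly chosen node and return its index.

    spins     : list of +1/-1 opinions, modified in place
    neighbors : adjacency list, neighbors[i] is the list of neighbors of node i
    q         : noise parameter (probability of opposing the visible majority)
    V         : per-neighbor probability of being ignored (literal reading of the abstract)
    """
    i = rng.randrange(len(spins))
    # Keep each neighbor only if it is not ignored.
    visible = [j for j in neighbors[i] if rng.random() >= V]
    field = sum(spins[j] for j in visible)
    if field == 0:
        # Assumption: with no visible majority, pick an opinion at random.
        spins[i] = rng.choice((-1, 1))
    else:
        majority = 1 if field > 0 else -1
        # Oppose the visible majority with probability q, follow it otherwise.
        spins[i] = -majority if rng.random() < q else majority
    return i


if __name__ == "__main__":
    rng = random.Random(1)
    spins = [rng.choice((-1, 1)) for _ in range(100)]
    nbrs = ring_neighbors(100)
    for _ in range(10_000):
        majority_vote_update(spins, nbrs, q=0.05, V=0.5, rng=rng)
    print("magnetization per node:", abs(sum(spins)) / len(spins))
```

A ring is used only to keep the sketch self-contained; the abstract does not say which lattice or network the simulations use.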
-
Estimating Causal Effects with the Neural Autoregressive Density Estimator
Authors:
Sergio Garrido,
Stanislav S. Borysov,
Jeppe Rich,
Francisco C. Pereira
Abstract:
Estimation of causal effects is fundamental in situations where the underlying system will be subject to active interventions. Part of building a causal inference engine is defining how variables relate to each other, that is, defining the functional relationship between variables given conditional dependencies. In this paper, we deviate from the common assumption of linear relationships in causal models by making use of neural autoregressive density estimators and use them to estimate causal effects within Pearl's do-calculus framework. Using synthetic data, we show that the approach can retrieve causal effects from non-linear systems without explicitly modeling the interactions between the variables.
Submitted 1 March, 2021; v1 submitted 17 August, 2020;
originally announced August 2020.
-
Semi-nonparametric Latent Class Choice Model with a Flexible Class Membership Component: A Mixture Model Approach
Authors:
Georges Sfeir,
Maya Abou-Zeid,
Filipe Rodrigues,
Francisco Camara Pereira,
Isam Kaysi
Abstract:
This study presents a semi-nonparametric Latent Class Choice Model (LCCM) with a flexible class membership component. The proposed model formulates the latent classes using mixture models as an alternative approach to the traditional random utility specification with the aim of comparing the two approaches on various measures including prediction accuracy and representation of heterogeneity in the choice process. Mixture models are parametric model-based clustering techniques that have been widely used in areas such as machine learning, data mining and pattern recognition for clustering and classification problems. An Expectation-Maximization (EM) algorithm is derived for the estimation of the proposed model. Using two different case studies on travel mode choice behavior, the proposed model is compared to traditional discrete choice models on the basis of parameter estimates' signs, value of time, statistical goodness-of-fit measures, and cross-validation tests. Results show that mixture models improve the overall performance of latent class choice models by providing better out-of-sample prediction accuracy in addition to better representations of heterogeneity without weakening the behavioral and economic interpretability of the choice models.
Submitted 6 July, 2020;
originally announced July 2020.
-
Study of electronic properties, Magnetization and persistent currents in a mesoscopic ring by controlled curvature
Authors:
Luís Fernando C. Pereira,
Fabiano M. Andrade,
Cleverson Filgueiras,
Edilberto O. Silva
Abstract:
We study the model of a noninteracting spinless electron gas confined to the two-dimensional localized surface of a cone in the presence of external magnetic fields. The localized region is characterized by an annular radial potential. We write the Schrödinger equation and use the thin-layer quantization procedure to calculate the wavefunctions and the energy spectrum. This procedure gives rise to a geometry-induced potential, which depends on both the mean and the Gaussian curvatures. Nevertheless, since we consider a ring of mesoscopic size, the effects of the Gaussian curvature on the energy spectrum are negligible. The magnetization and the persistent current are analyzed. In the former, we observe oscillations of both the Aharonov-Bohm (AB) and de Haas-van Alphen (dHvA) types. In the latter, only AB-type oscillations are observed. In both cases, the curvature increases the amplitude of the oscillations.
Submitted 2 May, 2020;
originally announced May 2020.
-
QTIP: Quick simulation-based adaptation of Traffic model per Incident Parameters
Authors:
Inon Peled,
Raghuveer Kamalakar,
Carlos Lima Azevedo,
Francisco C. Pereira
Abstract:
Current data-driven traffic prediction models are usually trained with large datasets, e.g. several months of speeds and flows. Such models provide a very good fit for ordinary road conditions, but often fail just when they are most needed: when traffic suffers a sudden and significant disruption, such as a road incident. In this work, we describe QTIP: a simulation-based framework for quasi-instantaneous adaptation of prediction models upon traffic disruption. In a nutshell, QTIP performs real-time simulations of the affected road for multiple scenarios, analyzes the results, and suggests a change to an ordinary prediction model accordingly. QTIP constructs the simulated scenarios per properties of the incident, as conveyed by immediate distress signals from affected vehicles. Such real-time signals are provided by In-Vehicle Monitor Systems, which are becoming increasingly prevalent world-wide. We test QTIP in a case study of a Danish motorway, and the results show that QTIP can improve traffic prediction in the first critical minutes of road incidents.
Submitted 9 March, 2020;
originally announced March 2020.
-
A Neural-embedded Choice Model: TasteNet-MNL Modeling Taste Heterogeneity with Flexibility and Interpretability
Authors:
Yafei Han,
Francisco Camara Pereira,
Moshe Ben-Akiva,
Christopher Zegras
Abstract:
Discrete choice models (DCMs) require a priori knowledge of the utility functions, especially how tastes vary across individuals. Utility misspecification may lead to biased estimates, inaccurate interpretations and limited predictability. In this paper, we utilize a neural network to learn taste representation. Our formulation consists of two modules: a neural network (TasteNet) that learns taste parameters (e.g., time coefficient) as flexible functions of individual characteristics; and a multinomial logit (MNL) model with utility functions defined with expert knowledge. Taste parameters learned by the neural network are fed into the choice model and link the two modules.
Our approach extends the L-MNL model (Sifringer et al., 2020) by allowing the neural network to learn the interactions between individual characteristics and alternative attributes. Moreover, we formalize and strengthen the interpretability condition - requiring realistic estimates of behavior indicators (e.g., value-of-time, elasticity) at the disaggregated level, which is crucial for a model to be suitable for scenario analysis and policy decisions. Through a unique network architecture and parameter transformation, we incorporate prior knowledge and guide the neural network to output realistic behavior indicators at the disaggregated level. We show that TasteNet-MNL reaches the ground-truth model's predictability and recovers the nonlinear taste functions on synthetic data. Its estimated value-of-time and choice elasticities at the individual level are close to the ground truth. On a publicly available Swissmetro dataset, TasteNet-MNL outperforms benchmark MNL and Mixed Logit models in predictability. It learns a broader spectrum of taste variations within the population and suggests a higher average value-of-time.
Submitted 1 July, 2022; v1 submitted 3 February, 2020;
originally announced February 2020.
-
Uncovering life-course patterns with causal discovery and survival analysis
Authors:
Bojan Kostic,
Romain Crastes dit Sourd,
Stephane Hess,
Joachim Scheiner,
Christian Holz-Rau,
Francisco C. Pereira
Abstract:
We provide a novel approach and an exploratory study for modelling life event choices and occurrence from a probabilistic perspective through causal discovery and survival analysis. Our approach is formulated as a bi-level problem. In the upper level, we build the life events graph, using causal discovery tools. In the lower level, for the pairs of life events, time-to-event modelling through survival analysis is applied to model time-dependent transition probabilities. Several life events were analysed, such as getting married, buying a new car, child birth, home relocation and divorce, together with the socio-demographic attributes for survival modelling, some of which are age, nationality, number of children, number of cars and home ownership. The data originates from a survey conducted in Dortmund, Germany, with the questionnaire containing a series of retrospective questions about residential and employment biography, travel behaviour and holiday trips, as well as socio-economic characteristics. Although survival analysis has been used in the past to analyse life-course data, this is the first time that a bi-level model has been formulated. The inclusion of a causal discovery algorithm in the upper level allows us to first identify causal relationships between life-course events and then understand the factors that might influence transition rates between events. This is very different from more classic choice models where causal relationships are subject to expert interpretations based on model results.
Submitted 30 January, 2020;
originally announced January 2020.
-
Estimating Latent Demand of Shared Mobility through Censored Gaussian Processes
Authors:
Daniele Gammelli,
Inon Peled,
Filipe Rodrigues,
Dario Pacino,
Haci A. Kurtaran,
Francisco C. Pereira
Abstract:
Transport demand is highly dependent on supply, especially for shared transport services where availability is often limited. As observed demand cannot be higher than available supply, historical transport data typically represents a biased, or censored, version of the true underlying demand pattern. Without explicitly accounting for this inherent distinction, predictive models of demand would necessarily represent a biased version of true demand, thus less effectively predicting the needs of service users. To counter this problem, we propose a general method for censorship-aware demand modeling, for which we devise a censored likelihood function. We apply this method to the task of shared mobility demand prediction by incorporating the censored likelihood within a Gaussian Process model, which can flexibly approximate arbitrary functional forms. Experiments on artificial and real-world datasets show how taking into account the limiting effect of supply on demand is essential in the process of obtaining an unbiased predictive model of user demand behavior.
Submitted 17 February, 2020; v1 submitted 21 January, 2020;
originally announced January 2020.
-
Suppression of coherent thermal transport in quasiperiodic graphene-hBN superlattice ribbons
Authors:
Isaac M. Felix,
Luiz Felipe C. Pereira
Abstract:
Nanostructured superlattices are promising materials for novel electronic devices due to their adjustable physical properties. Periodic superlattices facilitate coherent phonon thermal transport due to constructive wave interference at the boundaries between the materials. However, it is possible to induce a crossover from coherent to incoherent transport regimes by adjusting the superlattice period. We have recently observed such crossover in periodic graphene-boron nitride nanoribbons as the length of individual domains was increased. In general, transport properties are dominated by translational symmetry and the presence of unconventional symmetries leads to unusual transport characteristics. Here we perform non-equilibrium molecular dynamics simulations to investigate phonon heat transport in graphene-hBN superlattices following the Fibonacci quasiperiodic sequence, which lie between periodic and disordered structures. Our simulations show that the quasiperiodicity can suppress coherent phonon thermal transport in these superlattices. This behavior is related to the increasing number of interfaces per unit cell as the Fibonacci generation increases, hindering phonon coherence along the superlattice. The suppression of coherent thermal transport in graphene-hBN superlattices enables a higher degree of control on heat conduction at the nanoscale, and shows potential for application in the design of novel thermal management devices.
Submitted 9 January, 2020;
originally announced January 2020.
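The abstract above attributes the loss of coherence to the growing number of interfaces per unit cell as the Fibonacci generation increases. A minimal sketch of that counting argument follows. It assumes the standard Fibonacci substitution rule (A to AB, B to A) and lets "A" stand for a graphene domain and "B" for an hBN domain; the labelling, the starting word, and the function names are illustrative assumptions, not taken from the paper.

```python
def fibonacci_word(generation):
    """Build a Fibonacci word by applying A -> AB, B -> A `generation` times to "A"."""
    word = "A"
    for _ in range(generation):
        word = "".join("AB" if c == "A" else "A" for c in word)
    return word


def count_interfaces(word, periodic=False):
    """Count boundaries between unlike neighboring domains in the word.

    With periodic=True the last domain is also compared with the first,
    as would happen when the unit cell is repeated along the ribbon.
    """
    pairs = list(zip(word, word[1:]))
    if periodic and len(word) > 1:
        pairs.append((word[-1], word[0]))
    return sum(a != b for a, b in pairs)


if __name__ == "__main__":
    for g in range(7):
        w = fibonacci_word(g)
        print(g, len(w), count_interfaces(w), w)
```

Under these assumptions the unit cell length follows the Fibonacci numbers (1, 2, 3, 5, 8, ...) and the interface count grows with each generation, which is the structural trend the abstract points to.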
-
Mining User Behaviour from Smartphone data: a literature review
Authors:
Valentino Servizi,
Francisco C. Pereira,
Marie K. Anderson,
Otto A. Nielsen
Abstract:
To study users' travel behaviour and travel time between origin and destination, researchers employ travel surveys. Although there is consensus in the field about their potential, after over ten years of research and field experimentation, Smartphone-based travel surveys have still not taken off at a large scale. Here, computer intelligence algorithms take the role that operators have in Traditional Travel Surveys; since we train each algorithm on data, performance rests on the data quality, and thus on the ground truth. Inaccurate validations negatively affect labels, algorithm training, and travel diary precision, and therefore data validation, within a very critical loop. Interestingly, boundaries have proven burdensome to push even for Machine Learning methods. To support optimal investment decisions for practitioners, we expose the drivers they should consider when assessing what they need against what they get. This paper highlights and examines the critical aspects of the underlying research and provides some recommendations: (i) from the device perspective, on the main physical limitations; (ii) from the application perspective, the methodological framework deployed for the automatic generation of travel diaries; (iii) from the ground truth perspective, the relationship between user interaction, methods, and data.
Submitted 3 February, 2020; v1 submitted 24 December, 2019;
originally announced December 2019.
-
Influence of rotation on the electronic states, magnetization and persistent current in 1D quantum ring
Authors:
Luís Fernando C. Pereira,
Márcio M. Cunha,
Edilberto O. Silva
Abstract:
Inertial effects can affect several properties of physical systems. In particular, in the context of quantum mechanics, such effects have been studied in diverse contexts. In this paper, starting from the Schrödinger equation for a rotating frame, we describe the influence of rotation on the energy levels of a quantum particle constrained to a one-dimensional ring in the presence of a uniform magnetic field. We also investigate how the persistent current and the magnetization in the ring are influenced by temperature and rotating effects.
Submitted 5 December, 2019;
originally announced December 2019.
-
Effects of curvature on the electronic states of a two-dimensional mesoscopic ring
Authors:
Luís Fernando C. Pereira,
Fabiano M. Andrade,
Cleverson Filgueiras,
Edilberto O. Silva
Abstract:
The effects of surface curvature on the motion of electrons in a mesoscopic two-dimensional ring on a cone in the presence of external magnetic fields are examined. The approach follows the thin-layer quantization procedure, which gives rise to a geometry-induced potential. Due to the annular geometric shape of the sample, only the mean curvature has relevant effects on the model. Nevertheless, the most significant contribution of the mean curvature occurs in the state $m=0$, which tends to decrease the energies when the magnetic field is null. The effects of curvature are also manifested in the cyclotron frequency as well as in the effective angular momentum through the $α$ parameter, which can be controlled in such a way that the magnitude of these effects becomes explicit. This is verified in the energies and wave functions of the system. A decrease in the number of occupied states at the Fermi energy is observed. As a consequence, there is an alteration in the radial range of the conducting region of the sample. This fact is confirmed by studying the variations in the radii of the states.
Submitted 31 October, 2019;
originally announced November 2019.
-
Prediction of rare feature combinations in population synthesis: Application of deep generative modelling
Authors:
Sergio Garrido,
Stanislav S. Borysov,
Francisco C. Pereira,
Jeppe Rich
Abstract:
In population synthesis applications, when considering populations with many attributes, a fundamental problem is the estimation of rare combinations of feature attributes. Unsurprisingly, it is notably more difficult to reliably represent the sparser regions of such multivariate distributions and in particular combinations of attributes which are absent from the original sample. In the literature this is commonly known as sampling zeros for which no systematic solution has been proposed so far. In this paper, two machine learning algorithms, from the family of deep generative models, are proposed for the problem of population synthesis and with particular attention to the problem of sampling zeros. Specifically, we introduce the Wasserstein Generative Adversarial Network (WGAN) and the Variational Autoencoder (VAE), and adapt these algorithms for a large-scale population synthesis application. The models are implemented on a Danish travel survey with a feature-space of more than 60 variables. The models are validated in a cross-validation scheme and a set of new metrics for the evaluation of the sampling-zero problem is proposed. Results show how these models are able to recover sampling zeros while keeping the estimation of truly impossible combinations, the structural zeros, at a comparatively low level. Particularly, for a low dimensional experiment, the VAE, the marginal sampler and the fully random sampler generate 5%, 21% and 26%, respectively, more structural zeros per sampling zero generated by the WGAN, while for a high dimensional case, these figures escalate to 44%, 2217% and 170440%, respectively. This research directly supports the development of agent-based systems and in particular cases where detailed socio-economic or geographical representations are required.
Submitted 17 September, 2019;
originally announced September 2019.
-
Rethinking travel behavior modeling representations through embeddings
Authors:
Francisco C. Pereira
Abstract:
This paper introduces the concept of travel behavior embeddings, a method for re-representing discrete variables that are typically used in travel demand modeling, such as mode, trip purpose, education level, family type or occupation. This re-representation process essentially maps those variables into a latent space called the embedding space. The benefit of this is that such spaces allow for richer nuances than the typical transformations used in categorical variables (e.g. dummy encoding, contrasted encoding, principal components analysis). While the usage of latent variable representations is not new per se in travel demand modeling, the idea presented here brings several innovations: it is an entirely data-driven algorithm; it is informative and consistent, since the latent space can be visualized and interpreted based on distances between different categories; it preserves interpretability of coefficients, despite being based on Neural Network principles; and it is transferable, in that embeddings learned from one dataset can be reused for other ones, as long as travel behavior remains consistent between the datasets.
The idea is strongly inspired by natural language processing techniques, namely the word2vec algorithm. This algorithm is behind recent developments such as automatic translation and next word prediction. Our method is demonstrated using a mode choice model, and shows improvements of up to 60% with respect to initial likelihood, and up to 20% with respect to likelihood of the corresponding traditional model (i.e. using dummy variables) in out-of-sample evaluation. We provide a new Python package, called PyTre (PYthon TRavel Embeddings), that others can straightforwardly use to replicate our results or improve their own models. Our experiments are themselves based on an open dataset (swissmetro).
Submitted 31 August, 2019;
originally announced September 2019.