-
Error Analysis of Shapley Value-Based Model Explanations: An Informative Perspective
Authors:
Ningsheng Zhao,
Jia Yuan Yu,
Krzysztof Dzieciolowski,
Trang Bui
Abstract:
Shapley value attribution (SVA) is an increasingly popular explainable AI (XAI) method, which quantifies the contribution of each feature to the model's output. However, recent work has shown that most existing methods to implement SVAs have some drawbacks, resulting in biased or unreliable explanations that fail to correctly capture the true intrinsic relationships between features and model outputs. Moreover, the mechanism and consequences of these drawbacks have not been discussed systematically. In this paper, we propose a novel error theoretical analysis framework, in which the explanation errors of SVAs are decomposed into two components: observation bias and structural bias. We further clarify the underlying causes of these two biases and demonstrate that there is a trade-off between them. Based on this error analysis framework, we develop two novel concepts: over-informative and under-informative explanations. We demonstrate how these concepts can be effectively used to understand potential errors of existing SVA methods. In particular, for the widely deployed assumption-based SVAs, we find that they can easily be under-informative due to the distribution drift caused by distributional assumptions. We propose a measurement tool to quantify such a distribution drift. Finally, our experiments illustrate how different existing SVA methods can be over- or under-informative. Our work sheds light on how errors arise in the estimation of SVAs and encourages the development of new, less error-prone methods.
Submitted 29 May, 2024; v1 submitted 21 April, 2024;
originally announced April 2024.
-
Evidence for Modified Quark-Gluon Distributions in Nuclei by Correlated Nucleon Pairs
Authors:
nCTEQ Collaboration,
A. W. Denniston,
T. Jezo,
A. Kusina,
N. Derakhshanian,
P. Duwentaster,
O. Hen,
C. Keppel,
M. Klasen,
K. Kovarik,
J. G. Morfin,
K. F. Muzakka,
F. I. Olness,
E. Piasetzky,
P. Risse,
R. Ruiz,
I. Schienbein,
J. Y. Yu
Abstract:
We extend the QCD Parton Model analysis using a factorized nuclear structure model incorporating individual nucleons and pairs of correlated nucleons. Our analysis of high-energy data from lepton Deep-Inelastic Scattering, Drell-Yan and W/Z production simultaneously extracts the universal effective distribution of quarks and gluons inside correlated nucleon pairs, and their nucleus-specific fractions. Such successful extraction of these universal distributions marks a significant advance in our understanding of nuclear structure properties connecting nucleon- and parton-level quantities.
Submitted 26 December, 2023;
originally announced December 2023.
-
Towards a New nCTEQ global nPDF release
Authors:
P. Risse,
N. Derakhshanian,
P. Duwentäster,
T. Ježo,
C. Keppel,
M. Klasen,
K. Kovařík,
A. Kusina,
C. Léger,
J. G. Morfín,
F. I. Olness,
R. Ruiz,
I. Schienbein,
J. Y. Yu
Abstract:
We discuss the foundation for a new global nCTEQ nuclear PDF analysis, combining a number of our previous analyses into one consistent framework with updates to the underlying theoretical treatment as well as the addition of new available data. In particular, the new global release will be the first nCTEQ release containing neutrino DIS scattering data in a consistent manner together with JLab high-x DIS data and new LHC p-Pb data. These additions will improve the data-driven description of nuclear PDFs in new regions, especially the strange quark and the gluon PDF at low-x.
Submitted 15 July, 2023;
originally announced July 2023.
-
Target mass corrections in lepton--nucleus DIS: theory and applications to nuclear PDFs
Authors:
R. Ruiz,
K. F. Muzakka,
C. Leger,
P. Risse,
A. Accardi,
P. Duwentäster,
T. J. Hobbs,
T. Ježo,
C. Keppel,
M. Klasen,
K. Kovařík,
A. Kusina,
J. G. Morfín,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
Motivated by the wide range of kinematics covered by current and planned deep-inelastic scattering (DIS) facilities, we revisit the formalism, practical implementation, and numerical impact of target mass corrections (TMCs) for DIS on unpolarized nuclear targets. An important aspect is that we only use nuclear and later partonic degrees of freedom, carefully avoiding a picture of the nucleus in terms of nucleons. After establishing that formulae used for individual nucleon targets $(p,n)$, derived in the Operator Product Expansion (OPE) formalism, are indeed applicable to nuclear targets, we rewrite expressions for nuclear TMCs in terms of re-scaled (or averaged) kinematic variables. As a consequence, we find a representation for nuclear TMCs that is approximately independent of the nuclear target. We go on to construct a single-parameter fit for all nuclear targets that is in good numerical agreement with full computations of TMCs. We discuss in detail qualitative and quantitative differences between nuclear TMCs built in the OPE and the parton model formalisms, as well as give numerical predictions for current and future facilities.
Submitted 12 March, 2024; v1 submitted 18 January, 2023;
originally announced January 2023.
-
Global analyses of nuclear PDFs with heavy-quark and neutrino data
Authors:
M. Klasen,
P. Duwentäster,
T. Jezo,
K. Kovarik,
A. Kusina,
J. G. Morfin,
K. F. Muzakka,
F. I. Olness,
R. Ruiz,
I. Schienbein,
J. Y. Yu
Abstract:
We discuss the two most recent global analyses of nuclear parton distribution functions within the nCTEQ approach. LHC data on $W/Z$-boson, single-inclusive hadron and heavy quark/quarkonium production are shown to not only significantly reduce the gluon uncertainty down to $x\geq10^{-5}$, but to also influence the strange quark density. The latter is further constrained by neutrino deep-inelastic scattering and charm dimuon production data, whose consistency with neutral-current experiments is also re-evaluated.
Submitted 19 October, 2022;
originally announced October 2022.
-
On Unique Ergodicity Of Coupled AIMD Flows
Authors:
Pietro Ferraro,
Jia Yuan Yu,
Ramen Ghosh,
Syed Eqbal Alam,
Jakub Marecek,
Fabian Wirth,
Robert Shorten
Abstract:
The AIMD algorithm, which underpins the Transmission Control Protocol (TCP) for transporting data packets in communication networks, is perhaps the most successful control algorithm ever deployed. Recently, its use has been extended beyond communication networks, and successful applications of the AIMD algorithm have been reported in transportation, energy, and mathematical biology. A very recent development in the use of AIMD is its application in solving large-scale optimization and distributed control problems without the need for inter-agent communication. In this context, an interesting problem arises when multiple AIMD networks are coupled in some sense (usually through a nonlinearity). The purpose of this note is to prove that such systems in certain settings inherit the ergodic properties of individual AIMD networks. This result has important consequences for the convergence of the aforementioned optimization algorithms. The arguments in the paper also correct conceptual and technical errors in [1].
Submitted 27 September, 2022;
originally announced September 2022.
-
Compatibility of Neutrino DIS Data and Its Impact on Nuclear Parton Distribution Functions
Authors:
K. F. Muzakka,
P. Duwentäster,
T. J. Hobbs,
T. Ježo,
M. Klasen,
K. Kovařík,
A. Kusina,
J. G. Morfín,
F. I. Olness,
R. Ruiz,
I. Schienbein,
J. Y. Yu
Abstract:
In global analyses of nuclear parton distribution functions (nPDFs), neutrino deep-inelastic scattering (DIS) data have been argued to exhibit tensions with the data from charged-lepton DIS. Using the nCTEQ framework, we investigate these possible tensions both internally and with the data sets used in our recent nPDF analysis nCTEQ15WZSIH. We take into account nuclear effects in the calculation of the deuteron structure function $F_2^D$ using the CJ15 analysis. The resulting nPDF fit, nCTEQ15WZSIHdeut, serves as the basis for our comparison with inclusive neutrino DIS and charm dimuon production data. Using $χ^2$ hypothesis testing, we confirm evidence of tensions with these data and study the impact of the proton PDF baseline as well as the treatment of data correlation and normalization uncertainties. We identify the experimental data and kinematic regions that generate the tensions and present several possible approaches by which a consistent global analysis with neutrino data can be performed. We show that the tension can be relieved using a kinematic cut at low $x$ ($x>0.1$) and also investigate the possibility of managing the tensions by using uncorrelated systematic errors. Finally, we present a different approach identifying a subset of neutrino data which leads to a consistent global analysis without any additional cuts. Understanding these tensions between the neutrino and charged-lepton DIS data is important not only for a better flavor separation in global analyses of nuclear and proton PDFs, but also for neutrino physics and for searches for physics beyond the Standard Model.
Submitted 27 April, 2022;
originally announced April 2022.
-
Impact of heavy quark and quarkonium data on nuclear gluon PDFs
Authors:
P. Duwentäster,
T. Ježo,
M. Klasen,
K. Kovařík,
A. Kusina,
K. F. Muzakka,
F. I. Olness,
R. Ruiz,
I. Schienbein,
J. Y. Yu
Abstract:
A clear understanding of nuclear parton distribution functions (nPDFs) plays a crucial role in the interpretation of collider data taken at the Relativistic Heavy Ion Collider (RHIC), the Large Hadron Collider (LHC) and in the near future at the Electron-Ion Collider (EIC). Even with the recent inclusions of vector boson and light meson production data, the uncertainty of the gluon PDF remains substantial and limits the interpretation of heavy ion collision data. To obtain new constraints on the nuclear gluon PDF, we extend our recent nCTEQ15WZ+SIH analysis to inclusive quarkonium and open heavy-flavor meson production data from the LHC. This vast new data set covers a wide kinematic range and puts strong constraints on the nuclear gluon PDF down to $x\lesssim 10^{-5}$. The theoretical predictions for these data sets are obtained from a data-driven approach, where proton-proton data are used to determine effective scattering matrix elements. This approach is validated with detailed comparisons to existing next-to-leading order (NLO) calculations in non-relativistic QCD (NRQCD) for quarkonia and in the general-mass variable-flavor-number scheme (GMVFNS) for the open heavy-flavored mesons. In addition, the uncertainties from the data-driven approach are determined using the Hessian method and accounted for in the PDF fits. This extension of our previous analyses represents an important step toward the next generation of PDFs not only by including new data sets, but also by exploring new methods for future analyses.
Submitted 21 April, 2022;
originally announced April 2022.
-
Reward Modeling for Mitigating Toxicity in Transformer-based Language Models
Authors:
Farshid Faal,
Ketra Schmitt,
Jia Yuan Yu
Abstract:
Transformer-based language models are able to generate fluent text and be efficiently adapted across various natural language generation tasks. However, language models that are pretrained on large unlabeled web text corpora have been shown to suffer from degenerating toxic content and social bias behaviors, consequently hindering their safe deployment. Various detoxification methods were proposed to mitigate the language model's toxicity; however, these methods struggled to detoxify language models when conditioned on prompts that contain specific social identities related to gender, race, or religion. In this study, we propose Reinforce-Detoxify, a reinforcement learning-based method for mitigating toxicity in language models. We address the challenge of safety in language models and propose a new reward model that is able to detect toxic content and mitigate unintended bias towards social identities in toxicity prediction. The experiments demonstrate that the Reinforce-Detoxify method for language model detoxification outperforms existing detoxification approaches in automatic evaluation metrics, indicating that our approach is effective at language model detoxification and less prone to unintended bias toward social identities in generated content.
Submitted 27 July, 2022; v1 submitted 19 February, 2022;
originally announced February 2022.
-
Constraining the nuclear gluon PDF with inclusive hadron production data
Authors:
P. Duwentäster,
L. A. Husová,
T. Ježo,
M. Klasen,
K. Kovařík,
A. Kusina,
K. F. Muzakka,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
The nuclear parton distribution functions (nPDFs) of gluons are known to be difficult to determine with fits of deep inelastic scattering (DIS) and Drell-Yan (DY) data alone. Therefore, the nCTEQ15 analysis of nuclear PDFs added inclusive neutral pion production data from RHIC to help in constraining the gluon. In this analysis, we present a new global analysis of nuclear PDFs based on a much larger set of single inclusive light hadron data from RHIC and the LHC. Using our new nCTEQ code (nCTEQ++) with an optimized version of INCNLO we study systematically the limitations of the theory and the impact of the fragmentation function uncertainty.
Submitted 30 July, 2021;
originally announced July 2021.
-
Impact of W and Z Production Data and Compatibility of Neutrino DIS Data in Nuclear Parton Distribution Functions
Authors:
K. F. Muzakka,
P. Duwentäster,
T. J. Hobbs,
T. Ježo,
M. Klasen,
K. Kovařík,
A. Kusina,
J. G. Morfín,
F. I. Olness,
R. Ruiz,
I. Schienbein,
J. Y. Yu
Abstract:
Vector boson production and neutrino deep-inelastic scattering (DIS) data are crucial for constraining the strange quark parton distribution function (PDF) and more generally for flavor decomposition in PDF extractions. We extend the nCTEQ15 nuclear PDFs (nPDFs) by adding the recent $W$ and $Z$ production data from the LHC in a global nPDF fit. The new nPDF set, referred to as nCTEQ15WZ, is used as a starting point for a follow-up study in which we assess the compatibility of neutrino DIS data with charged lepton DIS data. Specifically, we re-analyze neutrino DIS data from NuTeV, Chorus, and CDHSW, as well as dimuon data from CCFR and NuTeV. To scrutinize the level of compatibility, different kinematic regions of the neutrino data are investigated. Fits to the neutrino data alone and a preliminary global fit are performed and compared to nCTEQ15WZ.
Submitted 17 March, 2022; v1 submitted 28 July, 2021;
originally announced July 2021.
-
Impact of inclusive hadron production data on nuclear gluon PDFs
Authors:
nCTEQ Collaboration,
P. Duwentäster,
L. A. Husová,
T. Ježo,
M. Klasen,
K. Kovařík,
A. Kusina,
K. F. Muzakka,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
A precise knowledge of nuclear parton distribution functions (nPDFs) is -- among other things -- important for the unambiguous interpretation of hard process data taken in pA and AA collisions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). The available fixed target data for deep inelastic scattering (DIS) and Drell-Yan (DY) lepton pair production mainly constrain the light quark distributions. It is hence crucial to include more and more collider data in global analyses of nPDFs in order to better pin down the different parton flavors, in particular the gluon distribution at small x. To help constrain the nuclear gluon PDF, we extend the nCTEQ15 analysis by including single inclusive hadron (SIH) production data from RHIC (PHENIX and STAR) and LHC (ALICE). In addition to the DIS, DY and SIH data sets, we will also include LHC W/Z production data. As the SIH calculation is dependent on hadronic fragmentation functions (FFs), we use a variety of FFs available in the literature to properly estimate this source of uncertainty. We study the impact of these data on the PDFs, and compare with both the nCTEQ15 and nCTEQ15WZ sets. The calculations are performed using a new implementation of the nCTEQ code (nCTEQ++) including a modified version of INCNLO which allows faster calculations using pre-computed grids. The extension of the nCTEQ15 analysis to include the SIH data represents an important step toward the next generation of PDFs.
Submitted 30 November, 2021; v1 submitted 20 May, 2021;
originally announced May 2021.
-
Multi-resource allocation for federated settings: A non-homogeneous Markov chain model
Authors:
Syed Eqbal Alam,
Fabian Wirth,
Jia Yuan Yu
Abstract:
In a federated setting, agents coordinate with a central agent or a server to solve an optimization problem in which agents do not share their information with each other. Wirth and his co-authors, in a recent paper, describe how the basic additive-increase multiplicative-decrease (AIMD) algorithm can be modified in a straightforward manner to solve a class of optimization problems for federated settings for a single shared resource with no inter-agent communication. The AIMD algorithm is one of the most successful distributed resource allocation algorithms currently deployed in practice. It is best known as the backbone of the Internet and is also widely explored in other application areas. We extend the single-resource algorithm to multiple heterogeneous shared resources that emerge in smart cities, sharing economy, and many other applications. Our main results show the convergence of the average allocations to the optimal values. We model the system as a non-homogeneous Markov chain with place-dependent probabilities. Furthermore, simulation results are presented to demonstrate the efficacy of the algorithms and to highlight the main features of our analysis.
Submitted 24 May, 2021; v1 submitted 26 April, 2021;
originally announced April 2021.
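To illustrate the basic additive-increase multiplicative-decrease (AIMD) rule named in the abstract above, here is a minimal Python sketch. It assumes a deterministic, synchronized textbook variant with a single shared resource: every agent adds `alpha` each step, and every agent scales its allocation by `beta` once the aggregate exceeds `capacity`. The function name `simulate_aimd` and all parameter names are hypothetical and not taken from the paper; the paper's multi-resource algorithm and its Markov chain analysis are not reproduced here.

```python
def simulate_aimd(n_agents, capacity, alpha=1.0, beta=0.5, steps=50):
    """Simulate synchronized AIMD for one shared resource (illustrative only).

    Returns the list of allocation vectors, one per step.
    """
    if n_agents < 1 or not 0.0 < beta < 1.0 or alpha <= 0.0:
        raise ValueError("need n_agents >= 1, 0 < beta < 1, alpha > 0")
    allocations = [0.0] * n_agents
    history = []
    for _ in range(steps):
        if sum(allocations) > capacity:
            # Capacity event: every agent backs off multiplicatively.
            allocations = [beta * x for x in allocations]
        else:
            # No congestion signal: every agent increases additively.
            allocations = [x + alpha for x in allocations]
        history.append(list(allocations))
    return history


history = simulate_aimd(n_agents=2, capacity=10.0)
peak_aggregate = max(sum(step) for step in history)
```

In this sketch the only signal agents need is whether the aggregate exceeded capacity, which matches the abstract's point that agents need not communicate with each other.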
-
Bias-Corrected Peaks-Over-Threshold Estimation of the CVaR
Authors:
Dylan Troop,
Frédéric Godin,
Jia Yuan Yu
Abstract:
The conditional value-at-risk (CVaR) is a useful risk measure in fields such as machine learning, finance, insurance, energy, etc. When measuring very extreme risk, the commonly used CVaR estimation method of sample averaging does not work well due to limited data above the value-at-risk (VaR), the quantile corresponding to the CVaR level. To mitigate this problem, the CVaR can be estimated by extrapolating above a lower threshold than the VaR using a generalized Pareto distribution (GPD), which is often referred to as the peaks-over-threshold (POT) approach. This method often requires a very high threshold to fit well, leading to high variance in estimation, and can induce significant bias if the threshold is chosen too low. In this paper, we derive a new expression for the GPD approximation error of the CVaR, a bias term induced by the choice of threshold, as well as a bias correction method for the estimated GPD parameters. This leads to the derivation of a new estimator for the CVaR that we prove to be asymptotically unbiased. In a practical setting, we show through experiments that our estimator provides a significant performance improvement compared with competing CVaR estimators in finite samples. As a consequence of our bias correction method, it is also shown that a much lower threshold can be selected without introducing significant bias. This allows a larger portion of data to be used in CVaR estimation compared with the typical POT approach, leading to more stable estimates. As secondary results, a new estimator for a second-order parameter of heavy-tailed distributions is derived, as well as a confidence interval for the CVaR which enables quantifying the level of variability in our estimator.
Submitted 8 March, 2021;
originally announced March 2021.
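The two baseline estimators contrasted in the abstract above, sample averaging and the standard peaks-over-threshold (POT) extrapolation, can be sketched as follows. This is an illustration under stated assumptions, not the paper's bias-corrected estimator: the function names are hypothetical, the empirical tail is taken as all observations at or above the empirical quantile (one of several conventions), and the POT function uses the standard textbook GPD formulas with the shape, scale, and tail fraction supplied by the caller rather than fitted.

```python
import math


def empirical_var_cvar(losses, level):
    """Sample-averaging estimate of (VaR, CVaR) at the given level."""
    if not 0.0 < level < 1.0:
        raise ValueError("level must lie strictly between 0 and 1")
    ordered = sorted(losses)
    if not ordered:
        raise ValueError("losses must be non-empty")
    # Index of the empirical quantile (assumed convention: ceil(level * n)).
    k = min(max(math.ceil(level * len(ordered)), 1), len(ordered))
    var = ordered[k - 1]
    tail = ordered[k - 1:]  # observations at or above the VaR
    return var, sum(tail) / len(tail)


def pot_var_cvar(threshold, shape, scale, tail_fraction, level):
    """Textbook POT estimate of (VaR, CVaR) from given GPD parameters.

    Assumes shape < 1 and shape != 0, and level >= 1 - tail_fraction,
    where tail_fraction is the share of observations above the threshold.
    """
    if shape >= 1.0 or shape == 0.0:
        raise ValueError("this sketch requires shape < 1 and shape != 0")
    if not 0.0 < tail_fraction <= 1.0 or not 0.0 < level < 1.0:
        raise ValueError("tail_fraction in (0, 1] and level in (0, 1) required")
    ratio = (1.0 - level) / tail_fraction
    var = threshold + (scale / shape) * (ratio ** (-shape) - 1.0)
    cvar = var / (1.0 - shape) + (scale - shape * threshold) / (1.0 - shape)
    return var, cvar


var_emp, cvar_emp = empirical_var_cvar([1, 2, 3, 4, 5, 6, 7, 8], 0.75)
var_pot, cvar_pot = pot_var_cvar(threshold=0.0, shape=0.5, scale=1.0,
                                 tail_fraction=1.0, level=0.75)
```

At high levels the empirical tail contains only a handful of points, which is the limited-data problem the abstract describes; the POT variant instead relies on every observation above the (lower) threshold through the fitted GPD parameters.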
-
nCTEQ15HIX -- Extending nPDF Analyses into the High-$x$, Low $Q^2$ Region
Authors:
E. P. Segarra,
T. Ježo,
A. Accardi,
P. Duwentäster,
O. Hen,
T. J. Hobbs,
C. Keppel,
M. Klasen,
K. Kovařík,
A. Kusina,
J. G. Morfín,
K. F. Muzakka,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
We use the nCTEQ analysis framework to investigate nuclear Parton Distribution Functions (nPDFs) in the region of large x and intermediate-to-low $Q$, with special attention to recent JLab Deep Inelastic Scattering data on nuclear targets. This data lies in a region which is often excluded by $W$ and $Q$ cuts in global nPDF analyses. As we relax these cuts, we enter a new kinematic region, which requires new phenomenology. In particular, we study the impact of i) target mass corrections, ii) higher twist corrections, iii) deuteron corrections, and iv) the shape of the nuclear PDF parametrization at large-$x$ close to one. Using the above tools, we produce a new nPDF set (named nCTEQ15HIX) which yields a good description of the new JLab data in this challenging kinematic region, and displays reduced uncertainties at large $x$, in particular for up and down quark flavors.
Submitted 6 September, 2021; v1 submitted 21 December, 2020;
originally announced December 2020.
-
Impact of LHC vector boson production in heavy ion collisions on strange PDFs
Authors:
A. Kusina,
T. Ježo,
D. B. Clark,
P. Duwentäster,
E. Godat,
T. J. Hobbs,
J. Kent,
M. Klasen,
K. Kovařík,
F. Lyonnet,
K. F. Muzakka,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
The extraction of the strange quark parton distribution function (PDF) poses a long-standing puzzle. Measurements from neutrino-nucleus deep inelastic scattering (DIS) experiments suggest the strange quark is suppressed compared to the light sea quarks, while recent studies of W/Z boson production at the LHC imply a larger strange component at small x values. As the parton flavor determination in the proton depends on nuclear corrections, e.g. from heavy-target DIS, LHC heavy ion measurements can provide a distinct perspective to help clarify this situation. In this investigation we extend the nCTEQ15 nPDFs to study the impact of the LHC proton-lead W/Z production data on both the flavor differentiation and nuclear corrections. This complementary data set provides new insights on both the LHC W/Z proton analyses and the neutrino-nucleus DIS data. We identify these new nPDFs as nCTEQ15WZ. Our calculations are performed using a new implementation of the nCTEQ code (nCTEQ++) based on C++ which enables us to easily interface to external programs such as HOPPET, APPLgrid and MCFM. Our results indicate that, as suggested by the proton data, the small x nuclear strange sea appears larger than previously expected, even when the normalization of the W/Z data is accommodated in the fit. Extending the nCTEQ15 analysis to include LHC W/Z data represents an important step as we advance toward the next generation of nPDFs.
Submitted 17 July, 2020;
originally announced July 2020.
-
Generating Embroidery Patterns Using Image-to-Image Translation
Authors:
Mohammad Akif Beg,
Jia Yuan Yu
Abstract:
In many scenarios in computer vision, machine learning, and computer graphics, there is a requirement to learn the mapping from an image of one domain to an image of another domain, called image-to-image translation. Examples include style transfer, object transfiguration, visually altering the appearance of weather conditions in an image, changing the appearance of a day image into a night image or vice versa, and photo enhancement, to name a few. In this paper, we propose two machine learning techniques to solve the embroidery image-to-image translation. Our goal is to generate a preview image which looks similar to an embroidered image, from a user-uploaded image. Our techniques are modifications of two existing techniques, neural style transfer and the cycle-consistent generative adversarial network. Neural style transfer renders the semantic content of an image from one domain in the style of a different image in another domain, whereas a cycle-consistent generative adversarial network learns the mapping from an input image to an output image without any paired training data, and also learns a loss function to train this mapping. Furthermore, the techniques we propose are independent of any embroidery attributes, such as elevation of the image, light source, start and end points of a stitch, type of stitch used, fabric type, etc. Given the user image, our techniques can generate a preview image which looks similar to an embroidered image. We train and test our proposed techniques on an embroidery dataset which consists of simple 2D images. To do so, we prepare an unpaired embroidery dataset with more than 8000 user-uploaded images along with embroidered images. Empirical results show that these techniques successfully generate an approximate preview of an embroidered version of a user image, which can help users in decision making.
Submitted 5 March, 2020;
originally announced March 2020.
-
The Convergence of Finite-Averaging of AIMD for Distributed Heterogeneous Resource Allocations
Authors:
Syed Eqbal Alam,
Fabian Wirth,
Jia Yuan Yu,
Robert Shorten
Abstract:
In several social choice problems, agents collectively make decisions over the allocation of multiple divisible and heterogeneous resources with capacity constraints to maximize utilitarian social welfare. The agents are constrained through computational or communication resources or privacy considerations. In this paper, we analyze the convergence of a recently proposed distributed solution that allocates such resources to agents with minimal communication. It is based on the randomized additive-increase and multiplicative-decrease (AIMD) algorithm. The agents are not required to exchange information with each other, only a little with a central agent that keeps track of the aggregate resource allocated at a time. We formulate the time-averaged allocations over a finite window size and model the system as a Markov chain with place-dependent probabilities. Furthermore, we show that the time-averaged allocation vector converges to a unique invariant measure, and that the ergodic property holds.
Submitted 24 January, 2020; v1 submitted 18 January, 2020;
originally announced January 2020.
-
Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR
Authors:
Dylan Troop,
Frédéric Godin,
Jia Yuan Yu
Abstract:
In a wide variety of sequential decision-making problems, it can be important to estimate the impact of rare events in order to minimize risk exposure. A popular risk measure is the conditional value-at-risk (CVaR), which is commonly estimated by averaging observations that occur beyond a quantile at a given confidence level. When this confidence level is very high, this estimation method can exhibit high variance due to the limited number of samples above the corresponding quantile. To mitigate this problem, extreme value theory can be used to derive an estimator for the CVaR that uses extrapolation beyond available samples. This estimator requires the selection of a threshold parameter to work well, which is a difficult challenge that has been widely studied in the extreme value theory literature. In this paper, we present an estimation procedure for the CVaR that combines extreme value theory and a recently introduced method of automated threshold selection by \cite{bader2018automated}. Under appropriate conditions, we estimate the tail risk using a generalized Pareto distribution. We empirically compare this estimation procedure with the commonly used method of sample averaging, and show an improvement in performance for some distributions. Finally, we show how the estimation procedure can be used in reinforcement learning by applying our method to the multi-armed bandit problem where the goal is to avoid catastrophic risk.
Submitted 10 December, 2020; v1 submitted 3 December, 2019;
originally announced December 2019.
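The sample-averaging baseline mentioned in the abstract above can be sketched as follows. This is only an illustrative sketch, not the authors' code: the function name `empirical_cvar`, the convention that larger values are worse (losses), and the choice to include the quantile itself in the tail are all assumptions made here.

```python
import math


def empirical_cvar(losses, alpha):
    """Sample-average CVaR: mean of the losses at or beyond the empirical alpha-quantile.

    Assumed convention (not taken from the paper): larger values are worse,
    and the quantile observation itself is counted as part of the tail.
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    if not losses:
        raise ValueError("need at least one observation")
    ordered = sorted(losses)
    n = len(ordered)
    # Index of the empirical alpha-quantile (the value-at-risk).
    k = max(0, min(n - 1, math.ceil(alpha * n) - 1))
    tail = ordered[k:]
    return sum(tail) / len(tail)
```

With a confidence level close to 1, `tail` contains only a handful of observations, which is the source of the high variance that the abstract attributes to this estimator.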
-
A Price-Based Iterative Double Auction for Charger Sharing Markets
Authors:
Jie Gao,
Terrence Wong,
Chun Wang,
Jia Yuan Yu
Abstract:
The unprecedented growth of demand for charging electric vehicles (EVs) calls for novel expansion solutions to today's charging networks. Riding on the wave of the proliferation of the sharing economy, Airbnb-like charger sharing markets open the opportunity to expand the existing charging networks without requiring costly and time-consuming infrastructure investments, yet the successful design of such markets relies on innovations at the interface between game theory, mechanism design, and large-scale optimization. In this paper, we propose a price-based iterative double auction for charger sharing markets where charger owners rent out their under-utilized chargers to the charge-needing EV drivers. Charger owners and EV drivers form a two-sided market which is cleared by a price-based double auction. Chargers' locations, availability, and time unit costs as well as the EV drivers' time, distance constraints, and preferences are considered in the allocation and scheduling process. The goal is to compute social welfare maximizing allocations which benefit both charger owners and EV drivers and, in turn, ensure the continuous growth of the market. We prove that the proposed double auction is budget balanced, individually rational, and that it is a weakly dominant strategy for EV drivers and charger owners to truthfully report their charging time constraints. In addition, results from our computational study show that the double auction achieves on average 94% efficiency compared with the optimal solutions and scales well to larger problem instances.
Submitted 30 September, 2019;
originally announced October 2019.
-
nCTEQ PDFs at the LHC: Vector boson production in heavy ion collisions
Authors:
The nCTEQ Collaboration,
D. B. Clark,
E. Godat,
T. J. Hobbs,
T. Ježo,
J. Kent,
C. Keppel,
M. Klasen,
K. Kovarík,
A. Kusina,
F. Lyonnet,
J. G. Morfin,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
Extraction of the strange quark PDF is a long-standing puzzle. We use the nCTEQ nPDFs with uncertainties to study the impact of the LHC W/Z production data on both the flavor differentiation and nuclear corrections; this complements the information from neutrino-DIS data. As the proton flavor determination is dependent on nuclear corrections (from heavy target DIS, for example), LHC heavy ion measurements can also help improve proton PDFs. We introduce a new implementation of the nCTEQ code (nCTEQ++) based on C++ which has a modular structure and enables us to easily integrate programs such as HOPPET, APPLgrid, and MCFM. Using ApplGrids generated from MCFM, we use nCTEQ++ to perform a preliminary fit including the pPb LHC W/Z vector boson data.
Submitted 1 September, 2019;
originally announced September 2019.
-
Variance-Based Risk Estimations in Markov Processes via Transformation with State Lumping
Authors:
Shuai Ma,
Jia Yuan Yu
Abstract:
Variance plays a crucial role in risk-sensitive reinforcement learning, and most risk measures can be analyzed via variance. In this paper, we consider two law-invariant risks as examples: mean-variance risk and exponential utility risk. With the aid of the state-augmentation transformation (SAT), we show that the two risks can be estimated in Markov decision processes (MDPs) with a stochastic transition-based reward and a randomized policy. To relieve the enlarged state space, a novel definition of isotopic states is proposed for state lumping, considering the special structure of the transformed transition probability. In the numerical experiment, we illustrate state lumping in the SAT, errors from a naive reward simplification, and the validity of the SAT for the two risk estimations.
Submitted 9 July, 2019;
originally announced July 2019.
-
A Scheme for Dynamic Risk-Sensitive Sequential Decision Making
Authors:
Shuai Ma,
Jia Yuan Yu,
Ahmet Satir
Abstract:
We present a scheme for sequential decision making with a risk-sensitive objective and constraints in a dynamic environment. A neural network is trained as an approximator of the mapping from the parameter space to the space of risk and policy with risk-sensitive constraints. For a given risk-sensitive problem, in which the objective and constraints are, or can be estimated by, functions of the mean and variance of return, we generate a synthetic dataset as training data. Parameters defining a targeted process might be dynamic, i.e., they might vary over time, so we sample them within specified intervals to deal with these dynamics. We show that: (i) most risk measures can be estimated using return variance; (ii) by virtue of the state-augmentation transformation, practical problems modeled by Markov decision processes with stochastic rewards can be solved in a risk-sensitive scenario; and (iii) the proposed scheme is validated by a numerical experiment.
Submitted 9 July, 2019;
originally announced July 2019.
-
Derandomized Distributed Multi-resource Allocation with Little Communication Overhead
Authors:
Syed Eqbal Alam,
Robert Shorten,
Fabian Wirth,
Jia Yuan Yu
Abstract:
We study a class of distributed optimization problems for multiple shared resource allocation in Internet-connected devices. We propose a derandomized version of an existing stochastic additive-increase and multiplicative-decrease (AIMD) algorithm. The proposed solution uses a one-bit feedback signal for each resource between the system and the Internet-connected devices and does not require inter-device communication. Additionally, the Internet-connected devices do not compromise their privacy and the solution does not depend on the number of participating devices. In the system, each Internet-connected device has private cost functions which are strictly convex, twice continuously differentiable and increasing. We show empirically that the long-term average allocations of multiple shared resources converge to optimal allocations and the system achieves minimum social cost. Furthermore, we show that the proposed derandomized AIMD algorithm converges faster than the stochastic AIMD algorithm and that both approaches provide approximately the same solutions.
Submitted 21 December, 2018;
originally announced December 2018.
-
Distributed Algorithms for Internet-of-Things-enabled Prosumer Markets: A Control Theoretic Perspective
Authors:
Syed Eqbal Alam,
Robert Shorten,
Fabian Wirth,
Jia Yuan Yu
Abstract:
The Internet-of-Things (IoT) enables the development of sharing economy applications. In many sharing economy scenarios, agents both produce and consume a resource; we call them prosumers. A community of prosumers agrees to sell excess resource to another community in a prosumer market. In this chapter, we propose a control theoretic approach to regulate the number of prosumers in a prosumer community, where each prosumer has a cost function that is coupled through its time-averaged production and consumption of the resource. Furthermore, each prosumer runs its distributed algorithm and takes only binary decisions in a probabilistic way, whether to produce one unit of the resource or not and to consume one unit of the resource or not. In the proposed approach, prosumers do not explicitly exchange information with each other due to privacy reasons, but a little exchange of information is required for feedback signals, broadcast by a central agency. In the proposed approach, prosumers achieve the optimal values asymptotically. Furthermore, the proposed approach is suitable to implement in an IoT context with minimal demands on infrastructure. We describe two use cases: community-based car sharing and collaborative energy storage for prosumer markets. We also present simulation results to check the efficacy of the algorithms.
Submitted 25 March, 2019; v1 submitted 18 December, 2018;
originally announced December 2018.
-
Distributed and Efficient Resource Balancing Among Many Suppliers and Consumers
Authors:
Kamal Chaturvedi,
Jia Yuan Yu,
Shrisha Rao
Abstract:
Achieving a balance of supply and demand in a multi-agent system with many individual self-interested and rational agents that act as suppliers and consumers is a natural problem in a variety of real-life domains, such as smart power grids, data centers, and others. In this paper, we address the profit-maximization problem for a group of distributed supplier and consumer agents, with no inter-agent communication. We simulate a scenario of a market with $S$ suppliers and $C$ consumers such that at every instant, each supplier agent supplies a certain quantity and simultaneously, each consumer agent consumes a certain quantity. The information about the total amount supplied and consumed is only kept with the center. The proposed algorithm combines the classical additive-increase multiplicative-decrease (AIMD) algorithm with a probabilistic rule for the agents to respond to a capacity signal. This leads to a nonhomogeneous Markov chain and we show almost sure convergence of this chain to the social optimum, for our market of distributed supplier and consumer agents. Employing this AIMD-type algorithm, the center sends a feedback message to the agents on the supplier side if there is a scenario of excess supply, or to the consumer agents if there is excess consumption. Each agent has a concave utility function whose derivative tends to 0 when an optimum quantity is supplied/consumed. Hence, when social convergence is reached, each agent supplies or consumes a quantity which leads to its individual maximum profit, without communicating with any other agents. Our simulations show the efficacy of this approach.
Submitted 14 September, 2018;
originally announced September 2018.
-
PDF Flavor Determination and the nCTEQ PDFs
Authors:
nCTEQ Collaboration,
E. Godat,
D. B. Clark,
T. J. Hobbs,
T. Jezo,
J. Kent,
C. Keppel,
K. Kovarik,
A. Kusina,
F. Lyonnet,
J. G. Morfin,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
Recent LHC W/Z vector boson production data in proton-lead collisions are quite sensitive to the heavier flavors (especially the strange PDF), and this complements the information from neutrino-DIS data. As the proton flavor determination is dependent on nuclear corrections (from heavy target DIS, for example), LHC heavy ion measurements can also help improve proton PDFs. We introduce a new implementation of the nCTEQ code (nCTEQ++) based on C++ which has a modular structure and enables us to easily integrate programs such as HOPPET, APPLgrid, and MCFM. Using ApplGrids generated from MCFM, we use nCTEQ++ to perform a fit including the $pPb$ LHC W/Z vector boson data.
Submitted 22 August, 2018;
originally announced August 2018.
-
Efficient Single-Shot Multibox Detector for Construction Site Monitoring
Authors:
Viral Thakar,
Himani Saini,
Walid Ahmed,
Mohammad M Soltani,
Ahmed Aly,
Jia Yuan Yu
Abstract:
Asset monitoring in construction sites is an intricate, manually intensive task that can benefit greatly from automated solutions engineered using deep neural networks. We use the Single-Shot Multibox Detector (SSD), for its fine balance between speed and accuracy, to leverage ubiquitously available images and videos from the surveillance cameras on the construction sites and automate the monitoring tasks, hence enabling project managers to better track the performance and optimize the utilization of each resource. We propose to improve the performance of SSD by clustering the predicted boxes instead of using a greedy approach like non-maximum suppression. We do so using Affinity Propagation Clustering (APC) to cluster the predicted boxes based on the similarity index computed using the spatial features as well as the location of predicted boxes. In our attempts, we have been able to improve the mean average precision of SSD by 3.77% on a custom dataset consisting of images from construction sites and by 1.67% on the PASCAL VOC Challenge.
Submitted 19 August, 2018; v1 submitted 16 August, 2018;
originally announced August 2018.
-
Ensemble-based Adaptive Single-shot Multi-box Detector
Authors:
Viral Thakar,
Walid Ahmed,
Mohammad M Soltani,
Jia Yuan Yu
Abstract:
We propose two improvements to the single-shot multibox detector (SSD). First, we propose an adaptive approach for default box selection in SSD. This uses data to reduce the uncertainty in the selection of the best aspect ratios for the default boxes and improves the performance of SSD for datasets containing small and complex objects (e.g., equipment at construction sites). We do so by finding the distribution of aspect ratios of the given training dataset, and then choosing representative values. Second, we propose an ensemble algorithm, using SSD as components, which improves the performance of SSD, especially for small training datasets. Compared to the conventional SSD algorithm, adaptive box selection improves mean average precision by 3%, while ensemble-based SSD improves it by 8%.
Submitted 16 August, 2018;
originally announced August 2018.
-
Communication-efficient Distributed Multi-resource Allocation
Authors:
Syed Eqbal Alam,
Robert Shorten,
Fabian Wirth,
Jia Yuan Yu
Abstract:
In several smart city applications, multiple resources must be allocated among competing agents that are coupled through such shared resources and are constrained, either through limitations of communication infrastructure or privacy considerations. We propose a distributed algorithm to solve such distributed multi-resource allocation problems with no direct inter-agent communication. We do so by extending a recently introduced additive-increase multiplicative-decrease (AIMD) algorithm, which uses very little communication between the system and agents. Namely, a control unit broadcasts a one-bit signal to agents whenever one of the allocated resources exceeds capacity. Agents then respond to this signal in a probabilistic manner. In the proposed algorithm, each agent decides its resource demand locally and is unaware of the resource allocation of other agents. In empirical results, we observe that the average allocations converge over time to optimal allocations.
Submitted 27 July, 2018;
originally announced July 2018.
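The one-bit feedback loop described in the abstract above can be illustrated with a minimal single-resource sketch. It is not the authors' algorithm: the function name `aimd_step`, the parameters `alpha`, `beta`, and `backoff_prob`, and the use of a constant back-off probability are assumptions made here for illustration, and the multi-resource coupling is omitted.

```python
import random


def aimd_step(allocs, capacity, alpha=1.0, beta=0.5, backoff_prob=0.5, rng=random):
    """One round of a randomized AIMD loop for a single shared resource.

    Hypothetical simplification: every agent uses the same constant
    back-off probability, and only one resource is modeled.
    Returns the new allocations and the one-bit capacity signal.
    """
    # Additive increase: each agent raises its demand by alpha.
    grown = [x + alpha for x in allocs]
    # The control unit only observes the aggregate and broadcasts one bit.
    capacity_event = sum(grown) > capacity
    if not capacity_event:
        return grown, False
    # Multiplicative decrease: each agent independently backs off
    # with probability backoff_prob, without seeing other agents' allocations.
    reduced = [x * beta if rng.random() < backoff_prob else x for x in grown]
    return reduced, True
```

Each agent needs only its own allocation and the broadcast bit, which is why no inter-agent communication is required in this kind of scheme.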
-
Distributed, Private, and Derandomized Allocation Algorithm for EV Charging
Authors:
Hamid Nabati,
Jia Yuan Yu
Abstract:
Efficient resource allocation is challenging when the privacy of users is important. Distributed approaches have recently been used extensively to find a solution for such problems. In this work, the efficiency of the distributed AIMD algorithm for allocation of subsidized goods is studied. First, a suitable utility function is assigned to each user describing the amount of satisfaction that it has from the allocated resource. Then the resource allocation is defined as a total utilitarianism problem, that is, the optimization of the sum of the users' utility functions subject to a capacity constraint. Recently, a stochastic state-dependent variant of the AIMD algorithm was used for allocation of common goods among users with strictly increasing and concave utility functions. Here, the stochastic AIMD algorithm is derandomized and its efficiency is compared with the stochastic version. Moreover, the algorithm is improved to allocate subsidized goods to users with concave and non-monotone utility functions as well as users with sigmoidal utility functions. To illustrate the effectiveness of the proposed solutions, simulation results are presented for a public renewable-energy powered charging station in which electric vehicles (EVs) compete to be recharged.
Submitted 16 April, 2018;
originally announced April 2018.
-
State-Augmentation Transformations for Risk-Sensitive Reinforcement Learning
Authors:
Shuai Ma,
Jia Yuan Yu
Abstract:
In the framework of MDPs, although the general reward function takes three arguments (current state, action, and successor state), it is often simplified to a function of two arguments (current state and action). The former is called a transition-based reward function, whereas the latter is called a state-based reward function. When the objective involves the expected cumulative reward only, this simplification works perfectly. However, when the objective is risk-sensitive, this simplification leads to an incorrect value. We present state-augmentation transformations (SATs), which preserve the reward sequences as well as the reward distributions and the optimal policy in risk-sensitive reinforcement learning. In risk-sensitive scenarios, we first prove that, for every MDP with a stochastic transition-based reward function, there exists an MDP with a deterministic state-based reward function such that, for any given (randomized) policy for the first MDP, there exists a corresponding policy for the second MDP such that both Markov reward processes share the same reward sequence. Second, we illustrate two situations that require the proposed SATs in an inventory control problem. One is using Q-learning (or other learning methods) on MDPs with transition-based reward functions, and the other is applying methods designed for Markov processes with deterministic state-based reward functions to Markov processes with general reward functions. We show the advantage of the SATs by considering Value-at-Risk as an example, which is a risk measure on the reward distribution instead of the measures (such as mean and variance) of the distribution. We illustrate the error in the reward distribution estimation from the direct use of Q-learning, and show how the SATs enable a variance formula to work on Markov processes with general reward functions.
Submitted 29 November, 2018; v1 submitted 16 April, 2018;
originally announced April 2018.
-
On the Control of Agents Coupled through Shared Unit-demand Resources
Authors:
Syed Eqbal Alam,
Robert Shorten,
Fabian Wirth,
Jia Yuan Yu
Abstract:
We consider a control problem involving several agents coupled through multiple unit-demand resources. Such resources are indivisible, and each agent's consumption is modeled as a Bernoulli random variable. Controlling the number of such agents in a probabilistic manner, subject to capacity constraints, is ubiquitous in smart cities. For instance, such agents can be humans in a feedback loop who respond to a price signal, or automated decision-support systems that strive toward system-level goals. In this paper, we consider both a single feedback loop corresponding to a single resource and multiple coupled feedback loops corresponding to multiple resources consumed by the same population of agents. For example, when a network of devices allocates resources to deliver several services, these services are coupled through capacity constraints on the resources. We propose a new algorithm with fundamental guarantees of convergence and optimality, and present an example illustrating its performance.
Submitted 29 April, 2019; v1 submitted 27 March, 2018;
originally announced March 2018.
-
The Merits of Sharing a Ride
Authors:
Pooyan Ehsani,
Jia Yuan Yu
Abstract:
The culture of sharing instead of ownership is sharply increasing in individuals' behavior. Particularly in transportation, concepts of sharing a ride in either carpooling or ridesharing have been recently adopted. An efficient optimization approach to match passengers in real time is the core of any ridesharing system. In this paper, we model ridesharing as an online matching problem on general graphs such that passengers do not drive private cars and use shared taxis. We propose an optimization algorithm to solve it. The outlined algorithm calculates the optimal waiting time when a passenger arrives. This leads to a matching with minimal overall overheads while maximizing the number of partnerships. To evaluate the behavior of our algorithm, we used a real-life NYC taxi data set. The results show a substantial reduction in overall overheads.
Submitted 19 December, 2017;
originally announced December 2017.
-
LHC data and its impact on nCTEQ15 PDFs
Authors:
D. B. Clark,
E. Godat,
T. Jezo,
C. Keppel,
K. Kovarik,
A. Kusina,
F. Lyonnet,
J. G. Morfin,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
The LHC heavy ion data for W/Z production can provide new incisive information on the PDFs. This data is sensitive to the heavier quark flavors (strange and charm) in a high energy kinematic region; this can facilitate the determination of PDFs in the small x region where previous data was limited. At present, the flavor separation of the proton PDFs is dependent on DIS data from nuclear targets. Therefore, improved nuclear corrections can also yield enhanced flavor determination of both the proton and nuclear PDFs.
Submitted 21 December, 2017;
originally announced December 2017.
-
Distributed Multi-resource Allocation with Little Communication Overhead
Authors:
Syed Eqbal Alam,
Robert Shorten,
Fabian Wirth,
Jia Yuan Yu
Abstract:
We propose a distributed algorithm to solve a special distributed multi-resource allocation problem with no direct inter-agent communication. We do so by extending a recently introduced additive-increase multiplicative-decrease (AIMD) algorithm, which uses very little communication between the system and agents. Namely, a control unit broadcasts a one-bit signal to agents whenever one of the allocated resources exceeds capacity. Agents then respond to this signal in a probabilistic manner. In the proposed algorithm, each agent is unaware of the resource allocation of other agents. We also propose a version of the AIMD algorithm for multiple binary resources (e.g., parking spaces). Binary resources are indivisible unit-demand resources, and each agent is allocated either one unit of the resource or none. In empirical results, we observe that in both cases, the average allocations converge over time to optimal allocations.
Submitted 6 November, 2017;
originally announced November 2017.
-
High-energy-density electron-positron pair plasma production and its dynamics in the relativistic transparency regime
Authors:
W. Y. Liu,
W. Luo,
T. Yuan,
J. Y. Yu,
M. Chen,
Z. M. Sheng
Abstract:
High-energy-density electron-positron pair plasma production and its dynamics in a thin foil illuminated by two counter-propagating laser pulses are investigated through multi-dimensional particle-in-cell simulations. We compare the production of electron-positron pairs and gamma-photons via quantum electrodynamics processes in the relativistic transparent and opaque regimes, and find that the target transparency can significantly enhance the electron-positron pair production due to the formation of a stable standing wave (SW). An optimum foil density of 200 - 280 n_c (n_c is the laser critical density) is found for enhancing electron-positron pair production when laser intensity reaches a few 10e23 W/cm2. At such foil density, laser energy conversion to electron-positron pairs is approximately four times higher than at foil density of 710n_c, whereas laser energy conversion to gamma-photons remains almost the same. Consequently, a high-density electron-positron plasma with a maximum intensity above 10e20 W/cm2 is produced. Modulation dynamics of the created pair plasmas are further observed when the target foil becomes transparent. It is shown that stable SWs formed directly by two counter-propagating lasers not only trap the created electron-positron pairs at their nodes, but also periodically modulate the average energy and the phase-space and angular distributions of the trapped particles. However, similar trapping and modulation effects become obscure in the opaque regime due to the absence of a stable SW field.
Submitted 22 June, 2017;
originally announced June 2017.
-
Distributionally Robust Optimisation in Congestion Control
Authors:
Jakub Marecek,
Robert Shorten,
Jia Yuan Yu
Abstract:
The effects of real-time provision of travel-time information on the behaviour of drivers are considered. The model of Marecek et al. [arXiv:1406.7639, Int. J. Control 88(10), 2015] is extended to consider uncertainty in the response of a driver to an interval provided per route. Specifically, it is suggested that one can optimise over all distributions of a random variable associated with the driver's response with the first two moments fixed, and for each route, over the sub-intervals within the minimum and maximum in a certain number of previous realisations of the travel time per the route.
Submitted 25 May, 2017;
originally announced May 2017.
-
LHC lead data and nuclear PDFs
Authors:
A. Kusina,
F. Lyonnet,
D. B. Clark,
E. Godat,
T. Jezo,
K. Kovarik,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
We compare predictions of nCTEQ15 nuclear parton distribution functions with proton-lead vector boson production data from the LHC. We select data sets that are most sensitive to nuclear PDFs and have potential to constrain them. We identify the kinematic regions and flavours where these data can bring new information and will have largest impact on the nuclear PDFs. Finally, we estimate the effect of including these data in a global analysis using a reweighting method.
Submitted 18 May, 2017;
originally announced May 2017.
-
Transition-based versus State-based Reward Functions for MDPs with Value-at-Risk
Authors:
Shuai Ma,
Jia Yuan Yu
Abstract:
In reinforcement learning, the reward function on current state and action is widely used. When the objective is about the expectation of the (discounted) total reward only, it works perfectly. However, if the objective involves the total reward distribution, the result will be wrong. This paper studies Value-at-Risk (VaR) problems in short- and long-horizon Markov decision processes (MDPs) with two reward functions, which share the same expectations. Firstly we show that with VaR objective, when the real reward function is transition-based (with respect to action and both current and next states), the simplified (state-based, with respect to action and current state only) reward function will change the VaR. Secondly, for long-horizon MDPs, we estimate the VaR function with the aid of spectral theory and the central limit theorem. Thirdly, since the estimation method is for a Markov reward process with the reward function on current state only, we present a transformation algorithm for the Markov reward process with the reward function on current and next states, in order to estimate the VaR function with an intact total reward distribution.
Submitted 29 November, 2018; v1 submitted 6 December, 2016;
originally announced December 2016.
-
Vector boson production in proton-lead and lead-lead collisions at the LHC and its impact on nCTEQ15 PDFs
Authors:
A. Kusina,
F. Lyonnet,
D. B. Clark,
E. Godat,
T. Jezo,
K. Kovarik,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
We provide a comprehensive comparison of W/Z vector boson production data in proton-lead and lead-lead collisions at the LHC with predictions obtained using the nCTEQ15 PDFs. We identify the measurements which have the largest potential impact on the PDFs, and estimate the effect of including these data using a Monte Carlo reweighting method. We find this data set can provide information about both the nuclear corrections and the heavy flavor (strange) PDF components. As the proton flavor determination is dependent on nuclear corrections (from heavy target DIS, for example), this information can also help improve the proton PDFs.
Submitted 1 August, 2017; v1 submitted 6 October, 2016;
originally announced October 2016.
-
Pricing Vehicle Sharing with Proximity Information
Authors:
Jakub Marecek,
Robert Shorten,
Jia Yuan Yu
Abstract:
For vehicle sharing schemes, where drop-off positions are not fixed, we propose a pricing scheme, where the price depends in part on the distance between where a vehicle is being dropped off and where the closest shared vehicle is parked. Under certain restrictive assumptions, we show that this pricing leads to a socially optimal spread of the vehicles within a region.
Submitted 25 January, 2016;
originally announced January 2016.
-
Central-limit approach to risk-aware Markov decision processes
Authors:
Pengqian Yu,
Jia Yuan Yu,
Huan Xu
Abstract:
Whereas classical Markov decision processes maximize the expected reward, we consider minimizing the risk. We propose to evaluate the risk associated to a given policy over a long-enough time horizon with the help of a central limit theorem. The proposed approach works whether the transition probabilities are known or not. We also provide a gradient-based policy improvement algorithm that converges to a local optimum of the risk objective.
Submitted 2 December, 2015;
originally announced December 2015.
-
Two Phase $Q$-learning for Bidding-based Vehicle Sharing
Authors:
Yinlam Chow,
Jia Yuan Yu,
Marco Pavone
Abstract:
We consider one-way vehicle sharing systems where customers can rent a car at one station and drop it off at another. The problem we address is to optimize the distribution of cars, and quality of service, by pricing rentals appropriately. We propose a bidding approach that is inspired by auctions and takes into account the significant uncertainty inherent in the problem data (e.g., pick-up and drop-off locations, time of requests, and duration of trips). Specifically, in contrast to current vehicle sharing systems, the operator does not set prices. Instead, customers submit bids and the operator decides whether to rent or not. The operator can even accept negative bids to motivate drivers to rebalance available cars to unpopular destinations within a city. We model the operator's sequential decision-making problem as a \emph{constrained Markov decision problem} (CMDP) and propose and rigorously analyze a novel two phase $Q$-learning algorithm for its solution. Numerical experiments are presented and discussed.
Submitted 19 October, 2015; v1 submitted 29 September, 2015;
originally announced September 2015.
-
nCTEQ15 - Global analysis of nuclear parton distributions with uncertainties
Authors:
A. Kusina,
K. Kovarik,
T. Jezo,
D. B. Clark,
C. Keppel,
F. Lyonnet,
J. G. Morfin,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
We present the first official release of the nCTEQ nuclear parton distribution functions with errors. The main addition to the previous nCTEQ PDFs is the introduction of PDF uncertainties based on the Hessian method. Another important addition is the inclusion of pion production data from RHIC that give us a handle on constraining the gluon PDF. This contribution summarizes our results from arXiv:1509.00792 and concentrates on the comparison with other groups providing nuclear parton distributions.
Submitted 6 September, 2015;
originally announced September 2015.
-
nCTEQ15 - Global analysis of nuclear parton distributions with uncertainties in the CTEQ framework
Authors:
K. Kovarik,
A. Kusina,
T. Jezo,
D. B. Clark,
C. Keppel,
F. Lyonnet,
J. G. Morfin,
F. I. Olness,
J. F. Owens,
I. Schienbein,
J. Y. Yu
Abstract:
We present the new nCTEQ15 set of nuclear parton distribution functions with uncertainties. This fit extends the CTEQ proton PDFs to include the nuclear dependence using data on nuclei all the way up to $^{208}$Pb. The uncertainties are determined using the Hessian method with an optimal rescaling of the eigenvectors to accurately represent the uncertainties for the chosen tolerance criteria. In addition to the Deep Inelastic Scattering (DIS) and Drell-Yan (DY) processes, we also include inclusive pion production data from RHIC to help constrain the nuclear gluon PDF. Furthermore, we investigate the correlation of the data sets with specific nPDF flavor components, and assess the impact of individual experiments. We also provide comparisons of the nCTEQ15 set with recent fits from other groups.
Submitted 15 February, 2016; v1 submitted 2 September, 2015;
originally announced September 2015.
-
A Fair Assignment of Drivers to Parking Lots
Authors:
Nicole Taheri,
Jia Yuan Yu,
Robert Shorten
Abstract:
Searching for a parking spot can waste time and gasoline. This waste can be reduced by assigning drivers to parking lots based on their destination and arrival time. In such a system, drivers could request a parking spot in advance and be alerted (e.g., via their phone or vehicle) of their assignment to a specific parking lot or available spot. In this paper, a parking assignment system is described to allocate parking spaces in a fair and equitable manner. Heuristics are developed to solve the underlying large scale optimization problem. The efficacy of the system is demonstrated by applying our algorithms to real data sets.
Submitted 13 July, 2015;
originally announced July 2015.
-
On the Design of Campus Parking Systems with QoS guarantees
Authors:
Wynita Griggs,
Jia Yuan Yu,
Fabian Wirth,
Florian Haeusler,
Robert Shorten
Abstract:
Parking spaces are resources that can be pooled together and shared, especially when there are complementary day-time and night-time users. We answer two design questions. First, given a quality of service requirement, how many spaces should be set aside as contingency during day-time for night-time users? Next, how can we replace the first-come-first-served access method by one that aims at optimal efficiency while keeping user preferences private?
Submitted 9 June, 2015;
originally announced June 2015.
-
Update on nCTEQ PDFs: nuclear PDF uncertainties and LHC applications
Authors:
A. Kusina,
K. Kovarik,
T. Jezo,
D. B. Clark,
F. I. Olness,
I. Schienbein,
J. Y. Yu
Abstract:
We present updated nCTEQ nuclear parton distribution functions with errors including pion production data from RHIC. We compare them with the results of other groups and present selected LHC applications.
Submitted 5 August, 2014;
originally announced August 2014.
-
Search for a Light Sterile Neutrino at Daya Bay
Authors:
F. P. An,
A. B. Balantekin,
H. R. Band,
W. Beriguete,
M. Bishai,
S. Blyth,
I. Butorov,
G. F. Cao,
J. Cao,
Y. L. Chan,
J. F. Chang,
L. C. Chang,
Y. Chang,
C. Chasman,
H. Chen,
Q. Y. Chen,
S. M. Chen,
X. Chen,
X. Chen,
Y. X. Chen,
Y. Chen,
Y. P. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings
, et al. (210 additional authors not shown)
Abstract:
A search for light sterile neutrino mixing was performed with the first 217 days of data from the Daya Bay Reactor Antineutrino Experiment. The experiment's unique configuration of multiple baselines from six 2.9~GW$_{\rm th}$ nuclear reactors to six antineutrino detectors deployed in two near (effective baselines 512~m and 561~m) and one far (1579~m) underground experimental halls makes it possible to test for oscillations to a fourth (sterile) neutrino in the $10^{\rm -3}~{\rm eV}^{2} < |Δm_{41}^{2}| < 0.3~{\rm eV}^{2}$ range. The relative spectral distortion due to electron antineutrino disappearance was found to be consistent with that of the three-flavor oscillation model. The derived limits on $\sin^22θ_{14}$ cover the $10^{-3}~{\rm eV}^{2} \lesssim |Δm^{2}_{41}| \lesssim 0.1~{\rm eV}^{2}$ region, which was largely unexplored.
Submitted 8 October, 2014; v1 submitted 27 July, 2014;
originally announced July 2014.