-
Direct Measurement of the Critical Cooling Rate for the Vitrification of Water
Authors:
Nathan J. Mowry,
Constantin R. Kruger,
Marcel Drabbels,
Ulrich J. Lorenz
Abstract:
The vitrification of aqueous solutions through rapid cooling is a remarkable achievement that launched the field of cryo-electron microscopy (cryo-EM) and has enabled the cryopreservation of biological specimens. For judging the feasibility of a vitrification experiment, the critical cooling rate of pure water is a frequently cited reference quantity. However, an accurate determination has remaine…
▽ More
The vitrification of aqueous solutions through rapid cooling is a remarkable achievement that launched the field of cryo-electron microscopy (cryo-EM) and has enabled the cryopreservation of biological specimens. For judging the feasibility of a vitrification experiment, the critical cooling rate of pure water is a frequently cited reference quantity. However, an accurate determination has remained elusive, with estimates varying by several orders of magnitude. Here, we employ in situ and time-resolved electron microscopy to obtain a precise measurement. We use shaped microsecond laser pulses to briefly melt an amorphous ice sample before flash freezing it with a variable, well-defined cooling rate. This allows us to directly measure the critical cooling rate of pure water, which we determine to be $6.4\cdot10^{6}$ K/s. Our experimental approach also expands the toolkit of microsecond time-resolved cryo-EM, an emerging technique, in which a cryo sample is similarly flash melted and revitrified with a laser pulse.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Shaped Laser Pulses for Microsecond Time-Resolved Cryo-EM: Outrunning Crystallization During Flash Melting
Authors:
Constantin R. Krüger,
Nathan J. Mowry,
Marcel Drabbels,
Ulrich J. Lorenz
Abstract:
Water vitrifies if cooled at rates above $3 \cdot 10^5$ K/s. Surprisingly, this process cannot simply be reversed by heating the resulting amorphous ice at a similar rate. Instead, we have recently shown that the sample transiently crystallizes even if the heating rate is more than one order of magnitude higher. This may present an issue for microsecond time-resolved cryo-electron microscopy exper…
▽ More
Water vitrifies if cooled at rates above $3 \cdot 10^5$ K/s. Surprisingly, this process cannot simply be reversed by heating the resulting amorphous ice at a similar rate. Instead, we have recently shown that the sample transiently crystallizes even if the heating rate is more than one order of magnitude higher. This may present an issue for microsecond time-resolved cryo-electron microscopy experiments, in which vitreous ice samples are briefly flash melted with a laser pulse, since transient crystallization could potentially alter the dynamics of the embedded proteins. Here, we demonstrate how shaped microsecond laser pulses can be used to increase the heating rate and outrun crystallization during flash melting of amorphous solid water (ASW) samples. We use time-resolved electron diffraction experiments to determine that the critical heating rate is about $10^8$ K/s, more than two orders of magnitude higher than the critical cooling rate. Our experiments add to the toolbox of the emerging field of microsecond time-resolved cryo-electron microscopy by demonstrating a straightforward approach for avoiding crystallization during laser melting and for achieving significantly higher heating rates, which paves the way for nanosecond time-resolved experiments.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
Flash Melting Amorphous Ice
Authors:
Nathan J. Mowry,
Constantin R. Krüger,
Gabriele Bongiovanni,
Marcel Drabbels,
Ulrich J. Lorenz
Abstract:
Water can be vitrified if it is cooled at rates exceeding $3*10^5$ K/s. This makes it possible to outrun crystallization in so-called no man's land, a range of deeply supercooled temperatures where water crystallizes rapidly. One would naively assume that the process can simply be reversed by heating the resulting amorphous ice at a similar rate. We demonstrate that this is not the case. When amor…
▽ More
Water can be vitrified if it is cooled at rates exceeding $3*10^5$ K/s. This makes it possible to outrun crystallization in so-called no man's land, a range of deeply supercooled temperatures where water crystallizes rapidly. One would naively assume that the process can simply be reversed by heating the resulting amorphous ice at a similar rate. We demonstrate that this is not the case. When amorphous ice samples are flash melted with a microsecond laser pulse, time-resolved electron diffraction reveals that the sample transiently crystallizes despite a heating rate of more than $5*10^6$ K/s, demonstrating that the critical heating rate for outrunning crystallization is significantly higher than the critical cooling rate during vitrification. Moreover, we observe different crystallization kinetics for amorphous solid water (ASW) and hyperquenched glassy water (HGW), which suggests that the supercooled liquids formed during laser heating transiently retain distinct non-equilibrium structures that are associated with different nucleation rates. These experiments open up new avenues for elucidating the crystallization mechanism of water and studying its dynamics in no man's land. They also add important mechanistic details to the laser melting and revitrification process that is integral to the emerging field of microsecond time-resolved cryo-electron microscopy.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
ForMAX -- a beamline for multiscale and multimodal structural characterization of hierarchical materials
Authors:
K. Nygård,
S. A. McDonald,
J. B. González,
V. Haghighat,
C. Appel,
E. Larsson,
R. Ghanbari,
M. Viljanen,
J. Silva,
S. Malki,
Y. Li,
V. Silva,
C. Weninger,
F. Engelmann,
T. Jeppsson,
G. Felcsuti,
T. Rosén,
K. Gordeyeva,
L. D. Söderberg,
H. Dierks,
Y. Zhang,
Z. Yao,
R. Yang,
E. M. Asimakopoulou,
J. K. Rogalinski
, et al. (13 additional authors not shown)
Abstract:
The ForMAX beamline at the MAX IV Laboratory provides multiscale and multimodal structural characterization of hierarchical materials in the nm to mm range by combining small- and wide-angle x-ray scattering with full-field microtomography. The modular design of the beamline is optimized for easy switching between different experimental modalities. The beamline has a special focus on the developme…
▽ More
The ForMAX beamline at the MAX IV Laboratory provides multiscale and multimodal structural characterization of hierarchical materials in the nm to mm range by combining small- and wide-angle x-ray scattering with full-field microtomography. The modular design of the beamline is optimized for easy switching between different experimental modalities. The beamline has a special focus on the development of novel, fibrous materials from forest resources, but it is also well suited for studies within, e.g., food science and biomedical research.
△ Less
Submitted 2 February, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Tuning Colloidal Reactions
Authors:
Ryan Krueger,
Ella King,
Michael Brenner
Abstract:
The precise control of complex reactions is critical for biological processes ranging from cell division to metabolism. Synthetic analogues of living materials suffer from our inability to tune chemical reactions with precise outcomes. Here, we leverage differentiable simulators to design nontrivial reaction pathways in colloidal assemblies. By optimizing interactions between reactants and substra…
▽ More
The precise control of complex reactions is critical for biological processes ranging from cell division to metabolism. Synthetic analogues of living materials suffer from our inability to tune chemical reactions with precise outcomes. Here, we leverage differentiable simulators to design nontrivial reaction pathways in colloidal assemblies. By optimizing interactions between reactants and substrates, we achieve controlled disassembly of octahedral and icosahedral shells. As a potential engineering target, we design a reaction that provokes the release of a small particle trapped in a shell.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Residency Octree: A Hybrid Approach for Scalable Web-Based Multi-Volume Rendering
Authors:
Lukas Herzberger,
Markus Hadwiger,
Robert Krüger,
Peter Sorger,
Hanspeter Pfister,
Eduard Gröller,
Johanna Beyer
Abstract:
We present a hybrid multi-volume rendering approach based on a novel Residency Octree that combines the advantages of out-of-core volume rendering using page tables with those of standard octrees. Octree approaches work by performing hierarchical tree traversal. However, in octree volume rendering, tree traversal and the selection of data resolution are intrinsically coupled. This makes fine-grain…
▽ More
We present a hybrid multi-volume rendering approach based on a novel Residency Octree that combines the advantages of out-of-core volume rendering using page tables with those of standard octrees. Octree approaches work by performing hierarchical tree traversal. However, in octree volume rendering, tree traversal and the selection of data resolution are intrinsically coupled. This makes fine-grained empty-space skip** costly. Page tables, on the other hand, allow access to any cached brick from any resolution. However, they do not offer a clear and efficient strategy for substituting missing high-resolution data with lower-resolution data. We enable flexible mixed-resolution out-of-core multi-volume rendering by decoupling the cache residency of multi-resolution data from a resolution-independent spatial subdivision determined by the tree. Instead of one-to-one node-to-brick correspondences, each residency octree node is mapped to a set of bricks from different resolution levels. This makes it possible to efficiently and adaptively choose and mix resolutions, adapt sampling rates, and compensate for cache misses. At the same time, residency octrees support fine-grained empty-space skip**, independent of the data subdivision used for caching. Finally, to facilitate collaboration and outreach, and to eliminate local data storage, our implementation is a web-based, pure client-side renderer using WebGPU and WebAssembly. Our method is faster than prior approaches and efficient for many data channels with a flexible and adaptive choice of data resolution.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Decoherence effects in reactor and Gallium neutrino oscillation experiments -- a QFT approach
Authors:
Raphael Krueger,
Thomas Schwetz
Abstract:
We adopt the quantum field theoretical method to calculate the amplitude and event rate for a neutrino oscillation experiment, considering neutrino production, propagation and detection as a single process. This method allows to take into account decoherence effects in the transition amplitude induced by the quantum mechanical uncertainties of all particles involved in the process. We extend the m…
▽ More
We adopt the quantum field theoretical method to calculate the amplitude and event rate for a neutrino oscillation experiment, considering neutrino production, propagation and detection as a single process. This method allows to take into account decoherence effects in the transition amplitude induced by the quantum mechanical uncertainties of all particles involved in the process. We extend the method to include coherence loss due to interactions with the environment, similar to collisional line broadening. In addition to generic decoherence induced at the amplitude level, the formalism allows to include, in a straightforward way, additional dam** effects related to phase-space integrals over momenta of unobserved particles as well as other classical averaging effects. We apply this method to neutrino oscillation searches at reactor and Gallium experiments and confirm that quantum decoherence is many orders of magnitudes smaller than classical averaging effects and therefore unobservable. The method used here can be applied with minimal modifications also to other types of oscillation experiments, e.g., accelerator based beam experiments.
△ Less
Submitted 27 March, 2023;
originally announced March 2023.
-
Triangle Percolation on the Grid
Authors:
Igor Araujo,
Bryce Frederickson,
Robert A. Krueger,
Bernard Lidický,
Tyrrell B. McAllister,
Florian Pfender,
Sam Spiro,
Eric Nathan Stucky
Abstract:
We consider a geometric percolation process partially motivated by recent work of Hejda and Kala. Specifically, we start with an initial set $X \subseteq \mathbb{Z}^2$, and then iteratively check whether there exists a triangle $T \subseteq \mathbb{R}^2$ with its vertices in $\mathbb{Z}^2$ such that $T$ contains exactly four points of $\mathbb{Z}^2$ and exactly three points of $X$. In this case, w…
▽ More
We consider a geometric percolation process partially motivated by recent work of Hejda and Kala. Specifically, we start with an initial set $X \subseteq \mathbb{Z}^2$, and then iteratively check whether there exists a triangle $T \subseteq \mathbb{R}^2$ with its vertices in $\mathbb{Z}^2$ such that $T$ contains exactly four points of $\mathbb{Z}^2$ and exactly three points of $X$. In this case, we add the missing lattice point of $T$ to $X$, and we repeat until no such triangle exists. We study the limit sets $S$, the sets stable under this process, including determining their possible densities and some of their structure.
△ Less
Submitted 10 January, 2024; v1 submitted 27 March, 2023;
originally announced March 2023.
-
Realizable Standard Young Tableaux
Authors:
Igor Araujo,
Alexander E. Black,
Amanda Burcroff,
Yibo Gao,
Robert A. Krueger,
Alex McDonough
Abstract:
Given two vectors $u$ and $v$, their outer sum is given by the matrix $A$ with entries $A_{ij} = u_{i} + v_{j}$. If the entries of $u$ and $v$ are increasing and sufficiently generic, the total ordering of the entries of the matrix is a standard Young tableau of rectangular shape. We call standard Young tableaux arising in this way realizable. The set of realizable tableaux was defined by Mallows…
▽ More
Given two vectors $u$ and $v$, their outer sum is given by the matrix $A$ with entries $A_{ij} = u_{i} + v_{j}$. If the entries of $u$ and $v$ are increasing and sufficiently generic, the total ordering of the entries of the matrix is a standard Young tableau of rectangular shape. We call standard Young tableaux arising in this way realizable. The set of realizable tableaux was defined by Mallows and Vanderbei for studying a deconvolution algorithm, but we show they have appeared in many other contexts including sorting algorithms, quantum computing, random sorting networks, reflection arrangements, fiber polytopes, and Goodman and Pollack's theory of allowable sequences. In our work, we prove tight bounds on the asymptotic number of realizable rectangular tableaux. We also derive tight asymptotics for the number of realizable allowable sequences, which are in bijection with realizable staircase-shaped standard Young tableaux with the notion of realizability coming from the theory of sorting networks. As a consequence, we resolve an open question of Angel, Gorin, and Holroyd from 2012 and improve upon a 1986 result of Goodman and Pollack.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
On oriented cycles in randomly perturbed digraphs
Authors:
Igor Araujo,
József Balogh,
Robert A. Krueger,
Simón Piga,
Andrew Treglown
Abstract:
In 2003, Bohman, Frieze, and Martin initiated the study of randomly perturbed graphs and digraphs. For digraphs, they showed that for every $α>0$, there exists a constant $C$ such that for every $n$-vertex digraph of minimum semi-degree at least $αn$, if one adds $Cn$ random edges then asymptotically almost surely the resulting digraph contains a consistently oriented Hamilton cycle. We generalize…
▽ More
In 2003, Bohman, Frieze, and Martin initiated the study of randomly perturbed graphs and digraphs. For digraphs, they showed that for every $α>0$, there exists a constant $C$ such that for every $n$-vertex digraph of minimum semi-degree at least $αn$, if one adds $Cn$ random edges then asymptotically almost surely the resulting digraph contains a consistently oriented Hamilton cycle. We generalize their result, showing that the hypothesis of this theorem actually asymptotically almost surely ensures the existence of every orientation of a cycle of every possible length, simultaneously. Moreover, we prove that we can relax the minimum semi-degree condition to a minimum total degree condition when considering orientations of a cycle that do not contain a large number of vertices of indegree $1$. Our proofs make use of a variant of an absorbing method of Montgomery.
△ Less
Submitted 13 October, 2023; v1 submitted 20 December, 2022;
originally announced December 2022.
-
A Data Fusion Approach for Ride-sourcing Demand Estimation: A Discrete Choice Model with Sampling and Endogeneity Corrections
Authors:
Rico Krueger,
Michel Bierlaire,
Prateek Bansal
Abstract:
Ride-sourcing services offered by companies like Uber and Didi have grown rapidly in the last decade. Understanding the demand for these services is essential for planning and managing modern transportation systems. Existing studies develop statistical models for ride-sourcing demand estimation at an aggregate level due to limited data availability. These models lack foundations in microeconomic t…
▽ More
Ride-sourcing services offered by companies like Uber and Didi have grown rapidly in the last decade. Understanding the demand for these services is essential for planning and managing modern transportation systems. Existing studies develop statistical models for ride-sourcing demand estimation at an aggregate level due to limited data availability. These models lack foundations in microeconomic theory, ignore competition of ride-sourcing with other travel modes, and cannot be seamlessly integrated into existing individual-level (disaggregate) activity-based models to evaluate system-level impacts of ride-sourcing services. In this paper, we present and apply an approach for estimating ride-sourcing demand at a disaggregate level using discrete choice models and multiple data sources. We first construct a sample of trip-based mode choices in Chicago, USA by enriching household travel survey with publicly available ride-sourcing and taxi trip records. We then formulate a multivariate extreme value-based discrete choice with sampling and endogeneity corrections to account for the construction of the estimation sample from multiple data sources and endogeneity biases arising from supply-side constraints and surge pricing mechanisms in ride-sourcing systems. Our analysis of the constructed dataset reveals insights into the influence of various socio-economic, land use and built environment features on ride-sourcing demand. We also derive elasticities of ride-sourcing demand relative to travel cost and time. Finally, we illustrate how the developed model can be employed to quantify the welfare implications of ride-sourcing policies and regulations such as terminating certain types of services and introducing ride-sourcing taxes.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Electron Diffraction of Water in No Man's Land
Authors:
Constantin R. Krüger,
Nathan J. Mowry,
Gabriele Bongiovanni,
Marcel Drabbels,
Ulrich J. Lorenz
Abstract:
A generally accepted understanding of the anomalous properties of water will only emerge if it becomes possible to systematically characterize water in the deeply supercooled regime, from where the anomalies appear to emanate. This has largely remained elusive because water crystallizes rapidly between 160 K and 232 K. Here, we present an experimental approach to rapidly prepare deeply supercooled…
▽ More
A generally accepted understanding of the anomalous properties of water will only emerge if it becomes possible to systematically characterize water in the deeply supercooled regime, from where the anomalies appear to emanate. This has largely remained elusive because water crystallizes rapidly between 160 K and 232 K. Here, we present an experimental approach to rapidly prepare deeply supercooled water at a well-defined temperature and probe it with electron diffraction before crystallization occurs. We show that as water is cooled from room temperature to cryogenic temperature, its structure evolves smoothly, approaching that of amorphous ice just below 200 K. Our experiments narrow down the range of possible explanations of the origin for the water anomalies and open up new avenues for studying supercooled water.
△ Less
Submitted 8 November, 2022;
originally announced November 2022.
-
Vanishing 2-Qubit Gates with Non-Simplification ZX-Rules
Authors:
Ryan Krueger
Abstract:
Traditional quantum circuit optimization is performed directly at the circuit level. Alternatively, a quantum circuit can be translated to a ZX-diagram which can be simplified using the rules of the ZX-calculus, after which a simplified circuit can be extracted. However, the best-known extraction procedures can drastically increase the number of 2-qubit gates. In this work, we take advantage of th…
▽ More
Traditional quantum circuit optimization is performed directly at the circuit level. Alternatively, a quantum circuit can be translated to a ZX-diagram which can be simplified using the rules of the ZX-calculus, after which a simplified circuit can be extracted. However, the best-known extraction procedures can drastically increase the number of 2-qubit gates. In this work, we take advantage of the fact that local changes in a ZX-diagram can drastically affect the complexity of the extracted circuit. We use a pair of congruences (i.e., non-simplification rewrite rules) based on the graph-theoretic notions of local complementation and pivoting to generate local variants of a simplified ZX-diagram. We explore the space of equivalent ZX-diagrams generated by these congruences using simulated annealing and genetic algorithms to obtain a simplified circuit with fewer 2-qubit gates. On randomly generated circuits, our method can outperform state-of-the-art optimization techniques for low-qubit (<10) circuits. On a set of previously reported benchmark circuits with <=14 qubits, our method outperforms off-the-shelf methods in 87% of cases, consistently reducing overall circuit complexity by an additional ~15-30% and eliminating up to 46% of 2-qubit gates. These preliminary results serve as a proof-of-concept for a new circuit optimization strategy in the ZX-calculus.
△ Less
Submitted 14 September, 2022;
originally announced September 2022.
-
A sharp threshold for a random version of Sperner's Theorem
Authors:
József Balogh,
Robert A. Krueger
Abstract:
The Boolean lattice $\mathcal{P}(n)$ consists of all subsets of $[n] = \{1,\dots, n\}$ partially ordered under the containment relation. Sperner's Theorem states that the largest antichain of the Boolean lattice is given by a middle layer: the collection of all sets of size $\lfloor{n/2}\rfloor$, or also, if $n$ is odd, the collection of all sets of size $\lceil{n/2}\rceil$. Given $p$, choose each…
▽ More
The Boolean lattice $\mathcal{P}(n)$ consists of all subsets of $[n] = \{1,\dots, n\}$ partially ordered under the containment relation. Sperner's Theorem states that the largest antichain of the Boolean lattice is given by a middle layer: the collection of all sets of size $\lfloor{n/2}\rfloor$, or also, if $n$ is odd, the collection of all sets of size $\lceil{n/2}\rceil$. Given $p$, choose each subset of $[n]$ with probability $p$ independently. We show that for every constant $p>3/4$, the largest antichain among these subsets is also given by a middle layer, with probability tending to $1$ as $n$ tends to infinity. This $3/4$ is best possible, and we also characterize the largest antichains for every constant $p>1/2$. Our proof is based on some new variations of Sapozhenko's graph container method.
△ Less
Submitted 20 September, 2023; v1 submitted 23 May, 2022;
originally announced May 2022.
-
The Pattern is in the Details: An Evaluation of Interaction Techniques for Locating, Searching, and Contextualizing Details in Multivariate Matrix Visualizations
Authors:
Yalong Yang,
Wenyu Xia,
Fritz Lekschas,
Carolina Nobre,
Robert Krueger,
Hanspeter Pfister
Abstract:
Matrix visualizations are widely used to display large-scale network, tabular, set, or sequential data. They typically only encode a single value per cell, e.g., through color. However, this can greatly limit the visualizations' utility when exploring multivariate data, where each cell represents a data point with multiple values (referred to as details). Three well-established interaction approac…
▽ More
Matrix visualizations are widely used to display large-scale network, tabular, set, or sequential data. They typically only encode a single value per cell, e.g., through color. However, this can greatly limit the visualizations' utility when exploring multivariate data, where each cell represents a data point with multiple values (referred to as details). Three well-established interaction approaches can be applicable in multivariate matrix visualizations (or MMV): focus+context, pan&zoom, and overview+detail. However, there is little empirical knowledge of how these approaches compare in exploring MMV. We report on two studies comparing them for locating, searching, and contextualizing details in MMV. We first compared four focus+context techniques and found that the fisheye lens overall outperformed the others. We then compared the fisheye lens, to pan&zoom and overview+detail. We found that pan&zoom was faster in locating and searching details, and as good as overview+detail in contextualizing details.
△ Less
Submitted 9 March, 2022;
originally announced March 2022.
-
Mathematical Content Browsing for Print-Disabled Readers Based on Virtual-World Exploration and Audio-Visual Sensory substitution
Authors:
Rynhardt Kruger,
Febe de Wet,
Thomas Niesler
Abstract:
Documents containing mathematical content remain largely inaccessible to blind and visually impaired readers because they are predominantly published as untagged PDF which does not include the semantic data necessary for effective accessibility. We present a browsing approach for print-disabled readers specifically aimed at such mathematical content. This approach draws on the navigational mechani…
▽ More
Documents containing mathematical content remain largely inaccessible to blind and visually impaired readers because they are predominantly published as untagged PDF which does not include the semantic data necessary for effective accessibility. We present a browsing approach for print-disabled readers specifically aimed at such mathematical content. This approach draws on the navigational mechanisms often used to explore the virtual worlds of text adventure games with audio-visual sensory substitution for graphical content. The relative spatial placement of the elements of an equation are represented as a virtual world, so that the reader can navigate from element to element. Text elements are announced conventionally using synthesised speech while graphical elements, such as roots and fraction lines, are rendered using a modification of the vOICe algorithm. The virtual world allows the reader to interactively discover the spatial structure of the equation, while the rendition of graphical elements as sound allows the shape and identity of elements that cannot be synthesised as speech to be discovered and recognised. The browsing approach was evaluated by eleven blind and fourteen sighted participants in a user trial that included the identification of twelve equations extracted from PDF documents. Overall, equations were identified completely correctly in 78% of cases (74% and 83% respectively for blind and sighted subjects). If partial correctness is considered, the performance is substantially higher. We conclude that the integration of a spatial model represented as a virtual world in conjunction with audio-visual sensory substitution for non-textual elements can be an effective way for blind and visually impaired readers to read currently inaccessible mathematical content in PDF documents.
△ Less
Submitted 3 February, 2022;
originally announced February 2022.
-
GenNI: Human-AI Collaboration for Data-Backed Text Generation
Authors:
Hendrik Strobelt,
Jambay Kinley,
Robert Krueger,
Johanna Beyer,
Hanspeter Pfister,
Alexander M. Rush
Abstract:
Table2Text systems generate textual output based on structured data utilizing machine learning. These systems are essential for fluent natural language interfaces in tools such as virtual assistants; however, left to generate freely these ML systems often produce misleading or unexpected outputs. GenNI (Generation Negotiation Interface) is an interactive visual system for high-level human-AI colla…
▽ More
Table2Text systems generate textual output based on structured data utilizing machine learning. These systems are essential for fluent natural language interfaces in tools such as virtual assistants; however, left to generate freely these ML systems often produce misleading or unexpected outputs. GenNI (Generation Negotiation Interface) is an interactive visual system for high-level human-AI collaboration in producing descriptive text. The tool utilizes a deep learning model designed with explicit control states. These controls allow users to globally constrain model generations, without sacrificing the representation power of the deep learning models. The visual interface makes it possible for users to interact with AI systems following a Refine-Forecast paradigm to ensure that the generation system acts in a manner human users find suitable. We report multiple use cases on two experiments that improve over uncontrolled generation approaches, while at the same time providing fine-grained control. A demo and source code are available at https://genni.vizhub.ai .
△ Less
Submitted 19 October, 2021;
originally announced October 2021.
-
Scope2Screen: Focus+Context Techniques for Pathology Tumor Assessment in Multivariate Image Data
Authors:
Jared Jessup,
Robert Krueger,
Simon Warchol,
John Hoffer,
Jeremy Muhlich,
Cecily C. Ritch,
Giorgio Gaglia,
Shannon Coy,
Yu-An Chen,
Jia-Ren Lin,
Sandro Santagata,
Peter K. Sorger,
Hanspeter Pfister
Abstract:
Inspection of tissues using a light microscope is the primary method of diagnosing many diseases, notably cancer. Highly multiplexed tissue imaging builds on this foundation, enabling the collection of up to 60 channels of molecular information plus cell and tissue morphology using antibody staining. This provides unique insight into disease biology and promises to help with the design of patient-…
▽ More
Inspection of tissues using a light microscope is the primary method of diagnosing many diseases, notably cancer. Highly multiplexed tissue imaging builds on this foundation, enabling the collection of up to 60 channels of molecular information plus cell and tissue morphology using antibody staining. This provides unique insight into disease biology and promises to help with the design of patient-specific therapies. However, a substantial gap remains with respect to visualizing the resulting multivariate image data and effectively supporting pathology workflows in digital environments on screen. We, therefore, developed Scope2Screen, a scalable software system for focus+context exploration and annotation of whole-slide, high-plex, tissue images. Our approach scales to analyzing 100GB images of 10^9 or more pixels per channel, containing millions of cells. A multidisciplinary team of visualization experts, microscopists, and pathologists identified key image exploration and annotation tasks involving finding, magnifying, quantifying, and organizing ROIs in an intuitive and cohesive manner. Building on a scope2screen metaphor, we present interactive lensing techniques that operate at single-cell and tissue levels. Lenses are equipped with task-specific functionality and descriptive statistics, making it possible to analyze image features, cell types, and spatial arrangements (neighborhoods) across image channels and scales. A fast sliding-window search guides users to regions similar to those under the lens; these regions can be analyzed and considered either separately or as part of a larger image collection. A novel snapshot method enables linked lens configurations and image statistics to be saved, restored, and shared. We validate our designs with domain experts and apply Scope2Screen in two case studies involving lung and colorectal cancers to discover cancer-relevant image features.
△ Less
Submitted 10 October, 2021;
originally announced October 2021.
-
Face masks, vaccination rates and low crowding drive the demand for the London Underground during the COVID-19 pandemic
Authors:
Prateek Bansal,
Roselinde Kessels,
Rico Krueger,
Daniel J Graham
Abstract:
The COVID-19 pandemic has drastically impacted people's travel behaviour and out-of-home activity participation. While countermeasures are being eased with increasing vaccination rates, the demand for public transport remains uncertain. To investigate user preferences to travel by London Underground during the pandemic, we conducted a stated choice experiment among its pre-pandemic users (N=961).…
▽ More
The COVID-19 pandemic has drastically impacted people's travel behaviour and out-of-home activity participation. While countermeasures are being eased with increasing vaccination rates, the demand for public transport remains uncertain. To investigate user preferences to travel by London Underground during the pandemic, we conducted a stated choice experiment among its pre-pandemic users (N=961). We analysed the collected data using multinomial and mixed logit models. Our analysis provides insights into the sensitivity of the demand for the London Underground with respect to travel attributes (crowding density and travel time), the epidemic situation (confirmed new COVID-19 cases), and interventions (vaccination rates and mandatory face masks). Mandatory face masks and higher vaccination rates are the top two drivers of travel demand for the London Underground during COVID-19. The positive impact of vaccination rates on the Underground demand increases with crowding density, and the positive effect of mandatory face masks decreases with travel time. Mixed logit reveals substantial preference heterogeneity. For instance, while the average effect of mandatory face masks is positive, preferences of around 20% of the pre-pandemic users to travel by the Underground are negatively affected. The estimated demand sensitivities are relevant for supply-demand management in transit systems and the calibration of advanced epidemiological models.
△ Less
Submitted 6 July, 2021;
originally announced July 2021.
-
Sharp threshold for the Erdős-Ko-Rado theorem
Authors:
József Balogh,
Robert A. Krueger,
Haoran Luo
Abstract:
For positive integers $n$ and $k$ with $n\geq 2k+1$, the Kneser graph $K(n,k)$ is the graph with vertex set consisting of all $k$-sets of $\{1,\dots,n\}$, where two $k$-sets are adjacent exactly when they are disjoint. The independent sets of $K(n,k)$ are $k$-uniform intersecting families, and hence the maximum size independent sets are given by the Erdős-Ko-Rado Theorem. Let $K_p(n,k)$ be a rando…
▽ More
For positive integers $n$ and $k$ with $n\geq 2k+1$, the Kneser graph $K(n,k)$ is the graph with vertex set consisting of all $k$-sets of $\{1,\dots,n\}$, where two $k$-sets are adjacent exactly when they are disjoint. The independent sets of $K(n,k)$ are $k$-uniform intersecting families, and hence the maximum size independent sets are given by the Erdős-Ko-Rado Theorem. Let $K_p(n,k)$ be a random spanning subgraph of $K(n,k)$ where each edge is included independently with probability $p$. Bollobás, Narayanan, and Raigorodskii asked for what $p$ does $K_p(n,k)$ have the same independence number as $K(n,k)$ with high probability. For $n=2k+1$, we prove a hitting time result, which gives a sharp threshold for this problem at $p=3/4$. Additionally, completing work of Das and Tran and work of Devlin and Kahn, we determine a sharp threshold function for all $n>2k+1$.
△ Less
Submitted 6 June, 2022; v1 submitted 6 May, 2021;
originally announced May 2021.
-
Lower bounds on the Erdős-Gyárfás problem via color energy graphs
Authors:
József Balogh,
Sean English,
Emily Heath,
Robert A. Krueger
Abstract:
Given positive integers $p$ and $q$, a $(p,q)$-coloring of the complete graph $K_n$ is an edge-coloring in which every $p$-clique receives at least $q$ colors. Erdős and Shelah posed the question of determining $f(n,p,q)$, the minimum number of colors needed for a $(p,q)$-coloring of $K_n$. In this paper, we expand on the color energy technique introduced by Pohoata and Sheffer to prove new lower…
▽ More
Given positive integers $p$ and $q$, a $(p,q)$-coloring of the complete graph $K_n$ is an edge-coloring in which every $p$-clique receives at least $q$ colors. Erdős and Shelah posed the question of determining $f(n,p,q)$, the minimum number of colors needed for a $(p,q)$-coloring of $K_n$. In this paper, we expand on the color energy technique introduced by Pohoata and Sheffer to prove new lower bounds on this function, making explicit the connection between bounds on extremal numbers and $f(n,p,q)$. Using results on the extremal numbers of subdivided complete graphs, theta graphs, and subdivided complete bipartite graphs, we generalize results of Fish, Pohoata, and Sheffer, giving the first nontrivial lower bounds on $f(n,p,q)$ for some pairs $(p,q)$ and improving previous lower bounds for other pairs.
△ Less
Submitted 24 May, 2022; v1 submitted 22 February, 2021;
originally announced February 2021.
-
Automatically Building Diagrams for Olympiad Geometry Problems
Authors:
Ryan Krueger,
Jesse Michael Han,
Daniel Selsam
Abstract:
We present a method for automatically building diagrams for olympiad-level geometry problems and implement our approach in a new open-source software tool, the Geometry Model Builder (GMB). Central to our method is a new domain-specific language, the Geometry Model-Building Language (GMBL), for specifying geometry problems along with additional metadata useful for building diagrams. A GMBL program…
▽ More
We present a method for automatically building diagrams for olympiad-level geometry problems and implement our approach in a new open-source software tool, the Geometry Model Builder (GMB). Central to our method is a new domain-specific language, the Geometry Model-Building Language (GMBL), for specifying geometry problems along with additional metadata useful for building diagrams. A GMBL program specifies (1) how to parameterize geometric objects (or sets of geometric objects) and initialize these parameterized quantities, (2) which quantities to compute directly from other quantities, and (3) additional constraints to accumulate into a (differentiable) loss function. A GMBL program induces a (usually) tractable numerical optimization problem whose solutions correspond to diagrams of the original problem statement, and that we can solve reliably using gradient descent. Of the 39 geometry problems since 2000 appearing in the International Mathematical Olympiad, 36 can be expressed in our logic and our system can produce diagrams for 94% of them on average. To the best of our knowledge, our method is the first in automated geometry diagram construction to generate models for such complex problems.
△ Less
Submitted 30 April, 2021; v1 submitted 1 December, 2020;
originally announced December 2020.
-
Firefighting on the Hexagonal Grid and on Infinite Trees
Authors:
Alexander Dean,
Sean English,
Tongyun Huang,
Robert A. Krueger,
Andy Lee,
Mose Mizrahi,
Casey Wheaton-Werle
Abstract:
The firefighter problem with $k$ firefighters on an infinite graph $G$ is an iterative graph process, defined as follows: Suppose a fire breaks out at a given vertex $v\in V(G)$ on Turn 1. On each subsequent even turn, $k$ firefighters protect $k$ vertices that are not on fire, and on each subsequent odd turn, any vertex that is on fire spreads the fire to all adjacent unprotected vertices. The fi…
▽ More
The firefighter problem with $k$ firefighters on an infinite graph $G$ is an iterative graph process, defined as follows: Suppose a fire breaks out at a given vertex $v\in V(G)$ on Turn 1. On each subsequent even turn, $k$ firefighters protect $k$ vertices that are not on fire, and on each subsequent odd turn, any vertex that is on fire spreads the fire to all adjacent unprotected vertices. The firefighters' goal is to eventually stop the spread of the fire. If there exists a strategy for $k$ firefighters to eventually stop the spread of the fire, then we say $G$ is $k$-containable.
We consider the firefighter problem on the hexagonal grid, which is the graph whose vertices and edges are exactly the vertices and edges of a regular hexagonal tiling of the plane. It is not known if the hexagonal grid is $1$-containable. In arXiv:1305.7076 [math.CO], it was shown that if the firefighters have one firefighter per turn and one extra firefighter on two turns, the firefighters can contain the fire. We improve on this result by showing that even with only one extra firefighter on one turn, the firefighters can still contain the fire.
In addition, we explore $k$-containability for birth sequence trees, which are infinite rooted trees that have the property that every vertex at the same level has the same degree. A birth sequence forest is an infinite forest, each component of which is a birth sequence tree. For birth sequence trees and forests, the fire always starts at the root of each tree. We provide a pseudopolynomial time algorithm to decide if all the vertices at a fixed level can be protected or not.
△ Less
Submitted 6 June, 2021; v1 submitted 10 October, 2020;
originally announced October 2020.
-
Robust discrete choice models with t-distributed kernel errors
Authors:
Rico Krueger,
Michel Bierlaire,
Thomas Gasos,
Prateek Bansal
Abstract:
Outliers in discrete choice response data may result from misclassification and misreporting of the response variable and from choice behaviour that is inconsistent with modelling assumptions (e.g. random utility maximisation). In the presence of outliers, standard discrete choice models produce biased estimates and suffer from compromised predictive accuracy. Robust statistical models are less se…
▽ More
Outliers in discrete choice response data may result from misclassification and misreporting of the response variable and from choice behaviour that is inconsistent with modelling assumptions (e.g. random utility maximisation). In the presence of outliers, standard discrete choice models produce biased estimates and suffer from compromised predictive accuracy. Robust statistical models are less sensitive to outliers than standard non-robust models. This paper analyses two robust alternatives to the multinomial probit (MNP) model. The two models are robit models whose kernel error distributions are heavy-tailed t-distributions to moderate the influence of outliers. The first model is the multinomial robit (MNR) model, in which a generic degrees of freedom parameter controls the heavy-tailedness of the kernel error distribution. The second model, the generalised multinomial robit (Gen-MNR) model, is more flexible than MNR, as it allows for distinct heavy-tailedness in each dimension of the kernel error distribution. For both models, we derive Gibbs samplers for posterior inference. In a simulation study, we illustrate the excellent finite sample properties of the proposed Bayes estimators and show that MNR and Gen-MNR produce more accurate estimates if the choice data contain outliers through the lens of the non-robust MNP model. In a case study on transport mode choice behaviour, MNR and Gen-MNR outperform MNP by substantial margins in terms of in-sample fit and out-of-sample predictive accuracy. The case study also highlights differences in elasticity estimates across models.
△ Less
Submitted 5 December, 2022; v1 submitted 14 September, 2020;
originally announced September 2020.
-
Fast Bayesian Estimation of Spatial Count Data Models
Authors:
Prateek Bansal,
Rico Krueger,
Daniel J. Graham
Abstract:
Spatial count data models are used to explain and predict the frequency of phenomena such as traffic accidents in geographically distinct entities such as census tracts or road segments. These models are typically estimated using Bayesian Markov chain Monte Carlo (MCMC) simulation methods, which, however, are computationally expensive and do not scale well to large datasets. Variational Bayes (VB)…
▽ More
Spatial count data models are used to explain and predict the frequency of phenomena such as traffic accidents in geographically distinct entities such as census tracts or road segments. These models are typically estimated using Bayesian Markov chain Monte Carlo (MCMC) simulation methods, which, however, are computationally expensive and do not scale well to large datasets. Variational Bayes (VB), a method from machine learning, addresses the shortcomings of MCMC by casting Bayesian estimation as an optimisation problem instead of a simulation problem. Considering all these advantages of VB, a VB method is derived for posterior inference in negative binomial models with unobserved parameter heterogeneity and spatial dependence. Pólya-Gamma augmentation is used to deal with the non-conjugacy of the negative binomial likelihood and an integrated non-factorised specification of the variational distribution is adopted to capture posterior dependencies. The benefits of the proposed approach are demonstrated in a Monte Carlo study and an empirical application on estimating youth pedestrian injury counts in census tracts of New York City. The VB approach is around 45 to 50 times faster than MCMC on a regular eight-core processor in a simulation and an empirical study, while offering similar estimation and predictive accuracy. Conditional on the availability of computational resources, the embarrassingly parallel architecture of the proposed VB method can be exploited to further accelerate its estimation by up to 20 times.
△ Less
Submitted 16 October, 2020; v1 submitted 7 July, 2020;
originally announced July 2020.
-
A note about monochromatic components in graphs of large minimum degree
Authors:
Louis DeBiasio,
Robert A. Krueger
Abstract:
For all positive integers $r\geq 3$ and $n$ such that $r^2-r$ divides $n$ and an affine plane of order $r$ exists, we construct an $r$-edge colored graph with minimum degree $(1-\frac{r-2}{r^2-r})n-2$ such that the largest monochromatic component has order less than $\frac{n}{r-1}$. This generalizes an example of Guggiari and Scott and, independently, Rahimi for $r=3$ and thus disproves a conjectu…
▽ More
For all positive integers $r\geq 3$ and $n$ such that $r^2-r$ divides $n$ and an affine plane of order $r$ exists, we construct an $r$-edge colored graph with minimum degree $(1-\frac{r-2}{r^2-r})n-2$ such that the largest monochromatic component has order less than $\frac{n}{r-1}$. This generalizes an example of Guggiari and Scott and, independently, Rahimi for $r=3$ and thus disproves a conjecture of Gyárfás and Sárközy for all integers $r\geq 3$ such that an affine plane of order $r$ exists.
△ Less
Submitted 15 June, 2020;
originally announced June 2020.
-
A New Spatial Count Data Model with Bayesian Additive Regression Trees for Accident Hot Spot Identification
Authors:
Rico Krueger,
Prateek Bansal,
Prasad Buddhavarapu
Abstract:
The identification of accident hot spots is a central task of road safety management. Bayesian count data models have emerged as the workhorse method for producing probabilistic rankings of hazardous sites in road networks. Typically, these methods assume simple linear link function specifications, which, however, limit the predictive power of a model. Furthermore, extensive specification searches…
▽ More
The identification of accident hot spots is a central task of road safety management. Bayesian count data models have emerged as the workhorse method for producing probabilistic rankings of hazardous sites in road networks. Typically, these methods assume simple linear link function specifications, which, however, limit the predictive power of a model. Furthermore, extensive specification searches are precluded by complex model structures arising from the need to account for unobserved heterogeneity and spatial correlations. Modern machine learning (ML) methods offer ways to automate the specification of the link function. However, these methods do not capture estimation uncertainty, and it is also difficult to incorporate spatial correlations. In light of these gaps in the literature, this paper proposes a new spatial negative binomial model, which uses Bayesian additive regression trees to endogenously select the specification of the link function. Posterior inference in the proposed model is made feasible with the help of the Polya-Gamma data augmentation technique. We test the performance of this new model on a crash count data set from a metropolitan highway network. The empirical results show that the proposed model performs at least as well as a baseline spatial count data model with random parameters in terms of goodness of fit and site ranking ability.
△ Less
Submitted 24 May, 2020;
originally announced May 2020.
-
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference
Authors:
Sebastian Gehrmann,
Hendrik Strobelt,
Robert Krüger,
Hanspeter Pfister,
Alexander M. Rush
Abstract:
Automation of tasks can have critical consequences when humans lose agency over decision processes. Deep learning models are particularly susceptible since current black-box approaches lack explainable reasoning. We argue that both the visual interface and model structure of deep learning systems need to take into account interaction design. We propose a framework of collaborative semantic inferen…
▽ More
Automation of tasks can have critical consequences when humans lose agency over decision processes. Deep learning models are particularly susceptible since current black-box approaches lack explainable reasoning. We argue that both the visual interface and model structure of deep learning systems need to take into account interaction design. We propose a framework of collaborative semantic inference (CSI) for the co-design of interactions and models to enable visual collaboration between humans and algorithms. The approach exposes the intermediate reasoning process of models which allows semantic interactions with the visual metaphors of a problem, which means that a user can both understand and control parts of the model reasoning process. We demonstrate the feasibility of CSI with a co-designed case study of a document summarization system.
△ Less
Submitted 24 July, 2019;
originally announced July 2019.
-
Semi-Parametric Hierarchical Bayes Estimates of New Yorkers' Willingness to Pay for Features of Shared Automated Vehicle Services
Authors:
Rico Krueger,
Taha H. Rashidi,
Akshay Vij
Abstract:
In this paper, we contrast parametric and semi-parametric representations of unobserved heterogeneity in hierarchical Bayesian multinomial logit models and leverage these methods to infer distributions of willingness to pay for features of shared automated vehicle (SAV) services. Specifically, we compare the multivariate normal (MVN), finite mixture of normals (F-MON) and Dirichlet process mixture…
▽ More
In this paper, we contrast parametric and semi-parametric representations of unobserved heterogeneity in hierarchical Bayesian multinomial logit models and leverage these methods to infer distributions of willingness to pay for features of shared automated vehicle (SAV) services. Specifically, we compare the multivariate normal (MVN), finite mixture of normals (F-MON) and Dirichlet process mixture of normals (DP-MON) mixing distributions. The latter promises to be particularly flexible in respect to the shapes it can assume and unlike other semi-parametric approaches does not require that its complexity is fixed prior to estimation. However, its properties relative to simpler mixing distributions are not well understood. In this paper, we evaluate the performance of the MVN, F-MON and DP-MON mixing distributions using simulated data and real data sourced from a stated choice study on preferences for SAV services in New York City. Our analysis shows that the DP-MON mixing distribution provides superior fit to the data and performs at least as well as the competing methods at out-of-sample prediction. The DP-MON mixing distribution also offers substantive behavioural insights into the adoption of SAVs. We find that preferences for in-vehicle travel time by SAV with ride-splitting are strongly polarised. Whereas one third of the sample is willing to pay between 10 and 80 USD/h to avoid sharing a vehicle with strangers, the remainder of the sample is either indifferent to ride-splitting or even desires it. Moreover, we estimate that new technologies such as vehicle automation and electrification are relatively unimportant to travellers. This suggests that travellers may primarily derive indirect, rather than immediate benefits from these new technologies through increases in operational efficiency and lower operating costs.
△ Less
Submitted 22 July, 2019;
originally announced July 2019.
-
Generalized Ramsey numbers: forbidding paths with few colors
Authors:
Robert A. Krueger
Abstract:
Let $f(K_n, H, q)$ be the minimum number of colors needed to edge-color $K_n$ so that every copy of $H$ is colored with at least $q$ colors. Originally posed by Erdős and Shelah when $H$ is complete, the asymptotics of this extremal function have been extensively studied when $H$ is a complete graph or a complete balanced bipartite graph. Here we investigate this function for some other $H$, and i…
▽ More
Let $f(K_n, H, q)$ be the minimum number of colors needed to edge-color $K_n$ so that every copy of $H$ is colored with at least $q$ colors. Originally posed by Erdős and Shelah when $H$ is complete, the asymptotics of this extremal function have been extensively studied when $H$ is a complete graph or a complete balanced bipartite graph. Here we investigate this function for some other $H$, and in particular we determine the asymptotic behavior of $f(K_n, P_v, q)$ for almost all values of $v$ and $q$, where $P_v$ is a path on $v$ vertices.
△ Less
Submitted 27 January, 2020; v1 submitted 17 June, 2019;
originally announced June 2019.
-
Autonomous Driving and Residential Location Preferences: Evidence from a Stated Choice Survey
Authors:
Rico Krueger,
Taha H. Rashidi,
Vinayak V. Dixit
Abstract:
The literature suggests that autonomous vehicles (AVs) may drastically change the user experience of private automobile travel by allowing users to engage in productive or relaxing activities while travelling. As a consequence, the generalised cost of car travel may decrease, and car users may become less sensitive to travel time. By facilitating private motorised mobility, AVs may eventually impa…
▽ More
The literature suggests that autonomous vehicles (AVs) may drastically change the user experience of private automobile travel by allowing users to engage in productive or relaxing activities while travelling. As a consequence, the generalised cost of car travel may decrease, and car users may become less sensitive to travel time. By facilitating private motorised mobility, AVs may eventually impact land use and households' residential location choices. This paper seeks to advance the understanding of the potential impacts of AVs on travel behaviour and land use by investigating stated preferences for combinations of residential locations and travel options for the commute in the context of autonomous automobile travel. Our analysis draws from a stated preference survey, which was completed by 512 commuters from the Sydney metropolitan area in Australia and provides insights into travel time valuations in a long-term decision-making context. For the analysis of the stated choice data, mixed logit models are estimated. Based on the empirical results, no changes in the valuation of travel time due to the advent of AVs should be expected. However, given the hypothetical nature of the stated preference survey, the results may be affected by methodological limitations.
△ Less
Submitted 25 September, 2019; v1 submitted 27 May, 2019;
originally announced May 2019.
-
Variational Bayesian Inference for Mixed Logit Models with Unobserved Inter- and Intra-Individual Heterogeneity
Authors:
Rico Krueger,
Prateek Bansal,
Michel Bierlaire,
Ricardo A. Daziano,
Taha H. Rashidi
Abstract:
Variational Bayes (VB), a method originating from machine learning, enables fast and scalable estimation of complex probabilistic models. Thus far, applications of VB in discrete choice analysis have been limited to mixed logit models with unobserved inter-individual taste heterogeneity. However, such a model formulation may be too restrictive in panel data settings, since tastes may vary both bet…
▽ More
Variational Bayes (VB), a method originating from machine learning, enables fast and scalable estimation of complex probabilistic models. Thus far, applications of VB in discrete choice analysis have been limited to mixed logit models with unobserved inter-individual taste heterogeneity. However, such a model formulation may be too restrictive in panel data settings, since tastes may vary both between individuals as well as across choice tasks encountered by the same individual. In this paper, we derive a VB method for posterior inference in mixed logit models with unobserved inter- and intra-individual heterogeneity. In a simulation study, we benchmark the performance of the proposed VB method against maximum simulated likelihood (MSL) and Markov chain Monte Carlo (MCMC) methods in terms of parameter recovery, predictive accuracy and computational efficiency. The simulation study shows that VB can be a fast, scalable and accurate alternative to MSL and MCMC estimation, especially in applications in which fast predictions are paramount. VB is observed to be between 2.8 and 17.7 times faster than the two competing methods, while affording comparable or superior accuracy. Besides, the simulation study demonstrates that a parallelised implementation of the MSL estimator with analytical gradients is a viable alternative to MCMC in terms of both estimation accuracy and computational efficiency, as the MSL estimator is observed to be between 0.9 and 2.1 times faster than MCMC.
△ Less
Submitted 16 January, 2020; v1 submitted 1 May, 2019;
originally announced May 2019.
-
Pólygamma Data Augmentation to address Non-conjugacy in the Bayesian Estimation of Mixed Multinomial Logit Models
Authors:
Prateek Bansal,
Rico Krueger,
Michel Bierlaire,
Ricardo A. Daziano,
Taha H. Rashidi
Abstract:
The standard Gibbs sampler of Mixed Multinomial Logit (MMNL) models involves sampling from conditional densities of utility parameters using Metropolis-Hastings (MH) algorithm due to unavailability of conjugate prior for logit kernel. To address this non-conjugacy concern, we propose the application of Pólygamma data augmentation (PG-DA) technique for the MMNL estimation. The posterior estimates o…
▽ More
The standard Gibbs sampler of Mixed Multinomial Logit (MMNL) models involves sampling from conditional densities of utility parameters using Metropolis-Hastings (MH) algorithm due to unavailability of conjugate prior for logit kernel. To address this non-conjugacy concern, we propose the application of Pólygamma data augmentation (PG-DA) technique for the MMNL estimation. The posterior estimates of the augmented and the default Gibbs sampler are similar for two-alternative scenario (binary choice), but we encounter empirical identification issues in the case of more alternatives ($J \geq 3$).
△ Less
Submitted 13 April, 2019;
originally announced April 2019.
-
Bayesian Estimation of Mixed Multinomial Logit Models: Advances and Simulation-Based Evaluations
Authors:
Prateek Bansal,
Rico Krueger,
Michel Bierlaire,
Ricardo A. Daziano,
Taha H. Rashidi
Abstract:
Variational Bayes (VB) methods have emerged as a fast and computationally-efficient alternative to Markov chain Monte Carlo (MCMC) methods for scalable Bayesian estimation of mixed multinomial logit (MMNL) models. It has been established that VB is substantially faster than MCMC at practically no compromises in predictive accuracy. In this paper, we address two critical gaps concerning the usage a…
▽ More
Variational Bayes (VB) methods have emerged as a fast and computationally-efficient alternative to Markov chain Monte Carlo (MCMC) methods for scalable Bayesian estimation of mixed multinomial logit (MMNL) models. It has been established that VB is substantially faster than MCMC at practically no compromises in predictive accuracy. In this paper, we address two critical gaps concerning the usage and understanding of VB for MMNL. First, extant VB methods are limited to utility specifications involving only individual-specific taste parameters. Second, the finite-sample properties of VB estimators and the relative performance of VB, MCMC and maximum simulated likelihood estimation (MSLE) are not known. To address the former, this study extends several VB methods for MMNL to admit utility specifications including both fixed and random utility parameters. To address the latter, we conduct an extensive simulation-based evaluation to benchmark the extended VB methods against MCMC and MSLE in terms of estimation times, parameter recovery and predictive accuracy. The results suggest that all VB variants with the exception of the ones relying on an alternative variational lower bound constructed with the help of the modified Jensen's inequality perform as well as MCMC and MSLE at prediction and parameter recovery. In particular, VB with nonconjugate variational message passing and the delta-method (VB-NCVMP-Delta) is up to 16 times faster than MCMC and MSLE. Thus, VB-NCVMP-Delta can be an attractive alternative to MCMC and MSLE for fast, scalable and accurate estimation of MMNL models.
△ Less
Submitted 12 December, 2019; v1 submitted 7 April, 2019;
originally announced April 2019.
-
Partitioning the power set of $[n]$ into $C_k$-free parts
Authors:
Eben Blaisdell,
András Gyárfás,
Robert A. Krueger,
Ronen Wdowinski
Abstract:
We show that for $n \geq 3, n\ne 5$, in any partition of $\mathcal{P}(n)$, the set of all subsets of $[n]=\{1,2,\dots,n\}$, into $2^{n-2}-1$ parts, some part must contain a triangle --- three different subsets $A,B,C\subseteq [n]$ such that $A\cap B$, $A\cap C$, and $B\cap C$ have distinct representatives. This is sharp, since by placing two complementary pairs of sets into each partition class, w…
▽ More
We show that for $n \geq 3, n\ne 5$, in any partition of $\mathcal{P}(n)$, the set of all subsets of $[n]=\{1,2,\dots,n\}$, into $2^{n-2}-1$ parts, some part must contain a triangle --- three different subsets $A,B,C\subseteq [n]$ such that $A\cap B$, $A\cap C$, and $B\cap C$ have distinct representatives. This is sharp, since by placing two complementary pairs of sets into each partition class, we have a partition into $2^{n-2}$ triangle-free parts. We also address a more general Ramsey-type problem: for a given graph $G$, find (estimate) $f(n,G)$, the smallest number of colors needed for a coloring of $\mathcal{P}(n)$, such that no color class contains a Berge-$G$ subhypergraph. We give an upper bound for $f(n,G)$ for any connected graph $G$ which is asymptotically sharp (for fixed $k$) when $G=C_k, P_k, S_k$, a cycle, path, or star with $k$ edges. Additional bounds are given for $G=C_4$ and $G=S_3$.
△ Less
Submitted 16 December, 2018;
originally announced December 2018.
-
Visual Pattern-Driven Exploration of Big Data
Authors:
Michael Behrisch,
Robert Krueger,
Fritz Lekschas,
Tobias Schreck,
Nils Gehlenborg,
Hanspeter Pfister
Abstract:
Pattern extraction algorithms are enabling insights into the ever-growing amount of today's datasets by translating reoccurring data properties into compact representations. Yet, a practical problem arises: With increasing data volumes and complexity also the number of patterns increases, leaving the analyst with a vast result space. Current algorithmic and especially visualization approaches ofte…
▽ More
Pattern extraction algorithms are enabling insights into the ever-growing amount of today's datasets by translating reoccurring data properties into compact representations. Yet, a practical problem arises: With increasing data volumes and complexity also the number of patterns increases, leaving the analyst with a vast result space. Current algorithmic and especially visualization approaches often fail to answer central overview questions essential for a comprehensive understanding of pattern distributions and support, their quality, and relevance to the analysis task. To address these challenges, we contribute a visual analytics pipeline targeted on the pattern-driven exploration of result spaces in a semi-automatic fashion. Specifically, we combine image feature analysis and unsupervised learning to partition the pattern space into interpretable, coherent chunks, which should be given priority in a subsequent in-depth analysis. In our analysis scenarios, no ground-truth is given. Thus, we employ and evaluate novel quality metrics derived from the distance distributions of our image feature vectors and the derived cluster model to guide the feature selection process. We visualize our results interactively, allowing the user to drill down from overview to detail into the pattern space and demonstrate our techniques in a case study on biomedical genomic data.
△ Less
Submitted 3 July, 2018;
originally announced July 2018.
-
Large monochromatic components in multicolored bipartite graphs
Authors:
Louis DeBiasio,
Robert A. Krueger,
Gábor N. Sárközy
Abstract:
It is well-known that in every $r$-coloring of the edges of the complete bipartite graph $K_{m,n}$ there is a monochromatic connected component with at least ${m+n\over r}$ vertices. In this paper we study an extension of this problem by replacing complete bipartite graphs by bipartite graphs of large minimum degree. We conjecture that in every $r$-coloring of the edges of an $(X,Y)$-bipartite gra…
▽ More
It is well-known that in every $r$-coloring of the edges of the complete bipartite graph $K_{m,n}$ there is a monochromatic connected component with at least ${m+n\over r}$ vertices. In this paper we study an extension of this problem by replacing complete bipartite graphs by bipartite graphs of large minimum degree. We conjecture that in every $r$-coloring of the edges of an $(X,Y)$-bipartite graph with $|X|=m$, $|Y|=n$, $δ(X,Y) > \left( 1 - \frac{1}{r+1}\right) n$ and $δ(Y,X) > \left( 1 - \frac{1}{r+1}\right) m$, there exists a monochromatic component on at least $\frac{m+n}{r}$ vertices (as in the complete bipartite graph). If true, the minimum degree condition is sharp (in that both inequalities cannot be made weak when $m$ and $n$ are divisible by $r+1$).
We prove the conjecture for $r=2$ and we prove a weaker bound for all $r\geq 3$. As a corollary, we obtain a result about the existence of monochromatic components with at least $\frac{n}{r-1}$ vertices in $r$-colored graphs with large minimum degree.
△ Less
Submitted 9 October, 2019; v1 submitted 13 June, 2018;
originally announced June 2018.
-
Long monochromatic paths and cycles in 2-colored bipartite graphs
Authors:
Louis DeBiasio,
Robert A. Krueger
Abstract:
Gyárfás and Lehel and independently Faudree and Schelp proved that in any 2-coloring of the edges of $K_{n,n}$ there exists a monochromatic path on at least $2\lceil n/2\rceil$ vertices, and this is tight. We prove a stability version of this result which holds even if the host graph is not complete; that is, if $G$ is a balanced bipartite graph on $2n$ vertices with minimum degree at least…
▽ More
Gyárfás and Lehel and independently Faudree and Schelp proved that in any 2-coloring of the edges of $K_{n,n}$ there exists a monochromatic path on at least $2\lceil n/2\rceil$ vertices, and this is tight. We prove a stability version of this result which holds even if the host graph is not complete; that is, if $G$ is a balanced bipartite graph on $2n$ vertices with minimum degree at least $(3/4+o(1))n$, then in every 2-coloring of the edges of $G$, either there exists a monochromatic cycle on at least $(1+o(1))n$ vertices, or the coloring of $G$ is close to an extremal coloring -- in which case $G$ has a monochromatic path on at least $2\lceil n/2\rceil$ vertices and a monochromatic cycle on at least $2\lfloor n/2\rfloor$ vertices. Furthermore, we determine an asymptotically tight bound on the length of a longest monochromatic cycle in a 2-colored balanced bipartite graph on $2n$ vertices with minimum degree $δn$ for all $0\leq δ\leq 1$.
△ Less
Submitted 13 June, 2018;
originally announced June 2018.
-
Monochromatic balanced components, matchings, and paths in multicolored complete bipartite graphs
Authors:
Louis DeBiasio,
András Gyárfás,
Robert A. Krueger,
Miklós Ruszinkó,
Gábor N. Sárközy
Abstract:
It is well-known that in every $r$-coloring of the edges of the complete bipartite graph $K_{n,n}$ there is a monochromatic connected component with at least ${2n\over r}$ vertices. It would be interesting to know whether we can additionally require that this large component be balanced; that is, is it true that in every $r$-coloring of $K_{n,n}$ there is a monochromatic component that meets both…
▽ More
It is well-known that in every $r$-coloring of the edges of the complete bipartite graph $K_{n,n}$ there is a monochromatic connected component with at least ${2n\over r}$ vertices. It would be interesting to know whether we can additionally require that this large component be balanced; that is, is it true that in every $r$-coloring of $K_{n,n}$ there is a monochromatic component that meets both sides in at least $n/r$ vertices?
Over forty years ago, Gyárfás and Lehel and independently Faudree and Schelp proved that any $2$-colored $K_{n,n}$ contains a monochromatic $P_n$. Very recently, Bucić, Letzter and Sudakov proved that every $3$-colored $K_{n,n}$ contains a monochromatic connected matching (a matching whose edges are in the same connected component) of size $\lceil n/3 \rceil$. So the answer is strongly "yes" for $1\leq r\leq 3$.
We provide a short proof of (a non-symmetric version of) the original question for $1\leq r\leq 3$; that is, every $r$-coloring of $K_{m,n}$ has a monochromatic component that meets each side in a $1/r$ proportion of its part size. Then, somewhat surprisingly, we show that the answer to the question is "no" for all $r\ge 4$. For instance, there are $4$-colorings of $K_{n,n}$ where the largest balanced monochromatic component has $n/5$ vertices in both partite classes (instead of $n/4$). Our constructions are based on lower bounds for the $r$-color bipartite Ramsey number of $P_4$, denoted $f(r)$, which is the smallest integer $\ell$ such that in every $r$-coloring of the edges of $K_{\ell,\ell}$ there is a monochromatic path on four vertices. Furthermore, combined with earlier results, we determine $f(r)$ for every value of $r$.
△ Less
Submitted 9 October, 2019; v1 submitted 11 April, 2018;
originally announced April 2018.
-
Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions
Authors:
Akshay Vij,
Rico Krueger
Abstract:
This study proposes a mixed logit model with multivariate nonparametric finite mixture distributions. The support of the distribution is specified as a high-dimensional grid over the coefficient space, with equal or unequal intervals between successive points along the same dimension; the location of each point on the grid and the probability mass at that point are model parameters that need to be…
▽ More
This study proposes a mixed logit model with multivariate nonparametric finite mixture distributions. The support of the distribution is specified as a high-dimensional grid over the coefficient space, with equal or unequal intervals between successive points along the same dimension; the location of each point on the grid and the probability mass at that point are model parameters that need to be estimated. The framework does not require the analyst to specify the shape of the distribution prior to model estimation, but can approximate any multivariate probability distribution function to any arbitrary degree of accuracy. The grid with unequal intervals, in particular, offers greater flexibility than existing multivariate nonparametric specifications, while requiring the estimation of a small number of additional parameters. An expectation maximization algorithm is developed for the estimation of these models. Multiple synthetic datasets and a case study on travel mode choice behavior are used to demonstrate the value of the model framework and estimation algorithm. Compared to extant models that incorporate random taste heterogeneity through continuous mixture distributions, the proposed model provides better out-of-sample predictive ability. Findings reveal significant differences in willingness to pay measures between the proposed model and extant specifications. The case study further demonstrates the ability of the proposed model to endogenously recover patterns of attribute non-attendance and choice set formation.
△ Less
Submitted 6 February, 2018;
originally announced February 2018.
-
A Dirichlet Process Mixture Model of Discrete Choice
Authors:
Rico Krueger,
Akshay Vij,
Taha H. Rashidi
Abstract:
We present a mixed multinomial logit (MNL) model, which leverages the truncated stick-breaking process representation of the Dirichlet process as a flexible nonparametric mixing distribution. The proposed model is a Dirichlet process mixture model and accommodates discrete representations of heterogeneity, like a latent class MNL model. Yet, unlike a latent class MNL model, the proposed discrete c…
▽ More
We present a mixed multinomial logit (MNL) model, which leverages the truncated stick-breaking process representation of the Dirichlet process as a flexible nonparametric mixing distribution. The proposed model is a Dirichlet process mixture model and accommodates discrete representations of heterogeneity, like a latent class MNL model. Yet, unlike a latent class MNL model, the proposed discrete choice model does not require the analyst to fix the number of mixture components prior to estimation, as the complexity of the discrete mixing distribution is inferred from the evidence. For posterior inference in the proposed Dirichlet process mixture model of discrete choice, we derive an expectation maximisation algorithm. In a simulation study, we demonstrate that the proposed model framework can flexibly capture differently-shaped taste parameter distributions. Furthermore, we empirically validate the model framework in a case study on motorists' route choice preferences and find that the proposed Dirichlet process mixture model of discrete choice outperforms a latent class MNL model and mixed MNL models with common parametric mixing distributions in terms of both in-sample fit and out-of-sample predictive ability. Compared to extant modelling approaches, the proposed discrete choice model substantially abbreviates specification searches, as it relies on less restrictive parametric assumptions and does not require the analyst to specify the complexity of the discrete mixing distribution prior to estimation.
△ Less
Submitted 19 January, 2018;
originally announced January 2018.
-
Hamiltonian cycles in $k$-partite graphs
Authors:
Louis DeBiasio,
Robert A. Krueger,
Dan Pritikin,
Eli Thompson
Abstract:
Chen, Faudree, Gould, Jacobson, and Lesniak determined the minimum degree threshold for which a balanced $k$-partite graph has a Hamiltonian cycle. We give an asymptotically tight minimum degree condition for Hamiltonian cycles in arbitrary $k$-partite graphs in which all parts have at most $n/2$ vertices (a necessary condition). To do this, we first prove a general result which both simplifies th…
▽ More
Chen, Faudree, Gould, Jacobson, and Lesniak determined the minimum degree threshold for which a balanced $k$-partite graph has a Hamiltonian cycle. We give an asymptotically tight minimum degree condition for Hamiltonian cycles in arbitrary $k$-partite graphs in which all parts have at most $n/2$ vertices (a necessary condition). To do this, we first prove a general result which both simplifies the process of checking whether a graph $G$ is a robust expander and gives useful structural information in the case when $G$ is not a robust expander. Then we use this result to prove that any $k$-partite graph satisfying the minimum degree condition is either a robust expander or else contains a Hamiltonian cycle directly.
△ Less
Submitted 9 October, 2019; v1 submitted 24 July, 2017;
originally announced July 2017.