-
Developments in Sheaf-Theoretic Models of Natural Language Ambiguities
Authors:
Kin Ian Lo,
Mehrnoosh Sadrzadeh,
Shane Mansfield
Abstract:
Sheaves are mathematical objects consisting of a base, which constitutes a topological space, and the data associated with each open set thereof, e.g. continuous functions defined on the open sets. Sheaves were originally used in algebraic topology and logic. Recently, they have also been used to model events such as physical experiments and natural language disambiguation processes. We extend the latter models from lexical ambiguities to discourse ambiguities arising from anaphora. To begin, we calculate a new measure of contextuality for a dataset of basic anaphoric discourses, resulting in a higher proportion of contextual models (82.9%) than previous work, which yielded only 3.17% contextual models. Then, we show how an extension of the natural language processing challenge known as the Winograd Schema, which involves anaphoric ambiguities, can be modelled on the Bell-CHSH scenario with a contextual fraction of 0.096.
Submitted 6 February, 2024;
originally announced February 2024.
-
Generalised Winograd Schema and its Contextuality
Authors:
Kin Ian Lo,
Mehrnoosh Sadrzadeh,
Shane Mansfield
Abstract:
Ambiguities in natural language give rise to probability distributions over interpretations. The distributions are often over multiple ambiguous words at a time; a multiplicity which makes them a suitable topic for sheaf-theoretic models of quantum contextuality. Previous research showed that different quantitative measures of contextuality correlate well with psycholinguistic research on lexical ambiguities. In this work, we focus on coreference ambiguities and investigate the Winograd Schema Challenge (WSC), a test proposed by Levesque in 2011 to evaluate the intelligence of machines. The WSC consists of a collection of multiple-choice questions that require disambiguating pronouns in sentences structured according to the Winograd schema, in a way that makes it difficult for machines to determine the correct referents but remains intuitive for human comprehension. In this study, we propose an approach that analogously models the Winograd schema as an experiment in quantum physics. However, we argue that the original Winograd Schema is inherently too simplistic to facilitate contextuality. We introduce a novel mechanism for generalising the schema, rendering it analogous to a Bell-CHSH measurement scenario. We report an instance of this generalised schema, complemented by the human judgements we gathered via a crowdsourcing platform. The resulting model violates the Bell-CHSH inequality by 0.192, thus exhibiting contextuality in a coreference resolution setting.
Submitted 31 August, 2023;
originally announced August 2023.
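As an illustration of what "violates the Bell-CHSH inequality by 0.192" means numerically, the following minimal sketch computes a CHSH violation from four tables of joint outcome probabilities. The model here is a toy mixture built for the purpose (a PR box with weight 0.096 mixed with a deterministic classical model), not the crowdsourced data from the paper; the names `CONTEXTS`, `correlator`, `chsh_violation`, and `mix` are hypothetical.

```python
# Toy CHSH computation. The probability tables are synthetic, chosen only
# to illustrate the arithmetic; they are NOT the paper's human judgements.
CONTEXTS = [("x1", "y1"), ("x1", "y2"), ("x2", "y1"), ("x2", "y2")]
OUTCOMES = [(0, 0), (0, 1), (1, 0), (1, 1)]


def correlator(table):
    """P(outcomes agree) - P(outcomes differ) for one measurement context."""
    return sum(p if a == b else -p for (a, b), p in table.items())


def chsh_violation(model):
    """Amount by which the best CHSH expression exceeds the classical bound 2."""
    e = [correlator(model[c]) for c in CONTEXTS]
    # Each CHSH expression flips the sign of exactly one correlator.
    best = max(abs(sum(e) - 2 * e_i) for e_i in e)
    return max(0.0, best - 2.0)


def mix(weight, model_a, model_b):
    """Convex mixture weight*model_a + (1 - weight)*model_b, context by context."""
    return {
        c: {o: weight * model_a[c].get(o, 0.0)
               + (1 - weight) * model_b[c].get(o, 0.0)
            for o in OUTCOMES}
        for c in CONTEXTS
    }


# PR box: perfectly correlated in three contexts, anti-correlated in the last.
pr_box = {
    c: ({(0, 1): 0.5, (1, 0): 0.5} if c == ("x2", "y2")
        else {(0, 0): 0.5, (1, 1): 0.5})
    for c in CONTEXTS
}
# Deterministic classical model: every measurement always yields outcome 0.
classical = {c: {(0, 0): 1.0} for c in CONTEXTS}

toy = mix(0.096, pr_box, classical)
print(round(chsh_violation(toy), 3))  # prints 0.192
```

In this toy mixture the PR-box weight (0.096) is by construction half of the violation (0.192), which matches the ratio between the contextual fraction and the violation quoted in the two abstracts above; whether the same decomposition applies to the authors' empirical model is not stated in this listing.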
-
Rank-heterogeneous Preference Models for School Choice
Authors:
Amel Awadelkarim,
Arjun Seshadri,
Itai Ashlagi,
Irene Lo,
Johan Ugander
Abstract:
School choice mechanism designers use discrete choice models to understand and predict families' preferences. The most widely-used choice model, the multinomial logit (MNL), is linear in school and/or household attributes. While the model is simple and interpretable, it assumes the ranked preference lists arise from a choice process that is uniform throughout the ranking, from top to bottom. In this work, we introduce two strategies for rank-heterogeneous choice modeling tailored for school choice. First, we adapt a context-dependent random utility model (CDM), considering down-rank choices as occurring in the context of earlier up-rank choices. Second, we consider stratifying the choice modeling by rank, regularizing rank-adjacent models towards one another when appropriate. Using data on household preferences from the San Francisco Unified School District (SFUSD) across multiple years, we show that the contextual models considerably improve our out-of-sample evaluation metrics across all rank positions over the non-contextual models in the literature. Meanwhile, stratifying the model by rank can yield more accurate first-choice predictions while down-rank predictions are relatively unimproved. These models provide performance upgrades that school choice researchers can adopt to improve predictions and counterfactual analyses.
Submitted 1 June, 2023;
originally announced June 2023.
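To make the baseline concrete, here is a minimal sketch of the rank-homogeneous multinomial logit that the abstract describes: a ranked list is treated as a sequence of choices from the remaining schools, with the same utilities used at every rank. The utilities and school names are invented for illustration, and the functions `mnl_probs` and `ranking_log_likelihood` are hypothetical helpers, not the authors' code.

```python
import math


def mnl_probs(utilities):
    """Multinomial logit choice probabilities (softmax over utilities)."""
    m = max(utilities.values())  # subtract the max for numerical stability
    weights = {k: math.exp(u - m) for k, u in utilities.items()}
    total = sum(weights.values())
    return {k: w / total for k, w in weights.items()}


def ranking_log_likelihood(utilities, ranking):
    """Log-likelihood of a ranked list under repeated MNL choice.

    The same utilities are used at every rank position, which is the
    "uniform throughout the ranking" assumption described in the abstract.
    """
    remaining = dict(utilities)
    log_lik = 0.0
    for school in ranking:
        log_lik += math.log(mnl_probs(remaining)[school])
        del remaining[school]  # a chosen school leaves the choice set
    return log_lik


# Hypothetical utilities for three schools (illustrative values only).
utilities = {"A": 1.0, "B": 0.5, "C": -0.2}
print(ranking_log_likelihood(utilities, ["A", "B", "C"]))
```

A rank-heterogeneous variant in the spirit of the abstract would let the utilities depend on the rank position or on the earlier choices; the sketch above deliberately does not.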
-
A Model of Anaphoric Ambiguities using Sheaf Theoretic Quantum-like Contextuality and BERT
Authors:
Kin Ian Lo,
Mehrnoosh Sadrzadeh,
Shane Mansfield
Abstract:
Ambiguities of natural language do not preclude us from using it, and context helps in getting ideas across. They nonetheless pose a key challenge to the development of competent machines to understand natural language and use it as humans do. Contextuality is an unparalleled phenomenon in quantum mechanics, where different mathematical formalisms have been put forward to understand and reason about it. In this paper, we construct a schema for anaphoric ambiguities that exhibits quantum-like contextuality. We use a recently developed criterion of sheaf-theoretic contextuality that is applicable to signalling models. We then take advantage of the neural word embedding engine BERT to instantiate the schema to natural language examples and extract probability distributions for the instances. As a result, plenty of sheaf-contextual examples were discovered in the natural language corpora BERT utilises. Our hope is that these examples will pave the way for future research and for finding ways to extend applications of quantum computing to natural language processing.
Submitted 11 August, 2022;
originally announced August 2022.
-
A Quantum Natural Language Processing Approach to Pronoun Resolution
Authors:
Hadi Wazni,
Kin Ian Lo,
Lachlan McPheat,
Mehrnoosh Sadrzadeh
Abstract:
We use the Lambek Calculus with soft sub-exponential modalities to model and reason about discourse relations such as anaphora and ellipsis. A semantics for this logic is obtained by using truncated Fock spaces, developed in our previous work. We depict these semantic computations via a new string diagram. The Fock Space semantics has the advantage that its terms are learnable from large corpora of data using machine learning and they can be experimented with on mainstream natural language tasks. Further, and thanks to an existing translation from vector spaces to quantum circuits, we can also learn these terms on quantum computers and their simulators, such as the IBMQ range. We extend the existing translation to Fock spaces and develop quantum circuit semantics for discourse relations. We then experiment with the IBMQ AerSimulations of these circuits in a definite pronoun resolution task, where the highest accuracies were recorded for models when the anaphora was resolved.
Submitted 10 August, 2022;
originally announced August 2022.
-
Decentralized Matching in a Probabilistic Environment
Authors:
Mobin Y. Jeloudar,
Irene Lo,
Tristan Pollner,
Amin Saberi
Abstract:
We consider a model for repeated stochastic matching where compatibility is probabilistic, is realized the first time agents are matched, and persists in the future. Such a model has applications in the gig economy, kidney exchange, and mentorship matching.
We ask whether a $decentralized$ matching process can approximate the optimal online algorithm. In particular, we consider a decentralized $stable$ $matching$ process where agents match with the most compatible partner who does not prefer matching with someone else, and known compatible pairs continue matching in all future rounds. We demonstrate that the above process provides a 0.316-approximation to the optimal online algorithm for matching on general graphs. We also provide a $\frac{1}{7}$-approximation for many-to-one bipartite matching, a $\frac{1}{11}$-approximation for capacitated matching on general graphs, and a $\frac{1}{2k}$-approximation for forming teams of up to $k$ agents. Our results rely on a novel coupling argument that decomposes the successful edges of the optimal online algorithm in terms of their round-by-round comparison with stable matching.
Submitted 12 June, 2021;
originally announced June 2021.
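A single round of the stable matching step described in the abstract can be sketched as follows, under simplifying assumptions that are not taken from the paper: compatibility is summarised by one symmetric weight per pair, all weights are distinct, and each agent prefers higher-weight partners. Under those assumptions, repeatedly matching the heaviest remaining pair yields a stable matching. The function names and the example weights are hypothetical.

```python
def greedy_stable_matching(weights):
    """Match pairs in decreasing order of (assumed symmetric) compatibility.

    `weights` maps a pair (u, v) to a compatibility score. With distinct
    scores, the heaviest remaining pair always prefers each other most,
    so matching greedily leaves no blocking pair.
    """
    matched = {}
    for (u, v), _ in sorted(weights.items(), key=lambda kv: -kv[1]):
        if u not in matched and v not in matched:
            matched[u] = v
            matched[v] = u
    return matched


def is_stable(weights, matched):
    """True if no pair would both strictly prefer each other to their partners."""
    def current_score(agent):
        partner = matched.get(agent)
        if partner is None:
            return float("-inf")  # unmatched agents accept any partner
        return weights.get((agent, partner), weights.get((partner, agent)))

    return all(
        not (w > current_score(u) and w > current_score(v))
        for (u, v), w in weights.items()
    )


# Illustrative compatibility scores on a 4-cycle of agents.
weights = {("a", "b"): 0.9, ("b", "c"): 0.8, ("c", "d"): 0.7, ("a", "d"): 0.2}
matching = greedy_stable_matching(weights)
print(matching)  # {'a': 'b', 'b': 'a', 'c': 'd', 'd': 'c'}
```

The repeated, probabilistic setting of the paper (compatibility realised on first match, known compatible pairs persisting across rounds) is not modelled here.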
-
Commitment on Volunteer Crowdsourcing Platforms: Implications for Growth and Engagement
Authors:
Irene Lo,
Vahideh Manshadi,
Scott Rodilitz,
Ali Shameli
Abstract:
Volunteer crowdsourcing platforms match volunteers with tasks which are often recurring. To ensure completion of such tasks, platforms frequently use a lever known as "adoption," which amounts to a commitment by the volunteer to repeatedly perform the task. Despite reducing match uncertainty, high levels of adoption can decrease the probability of forming new matches, which in turn can suppress growth. We study how platforms should manage this trade-off. Our research is motivated by a collaboration with Food Rescue U.S. (FRUS), a volunteer-based food recovery organization active in over 30 locations. For platforms such as FRUS, success crucially depends on volunteer engagement. Consequently, effectively utilizing non-monetary levers, such as adoption, is critical. Motivated by the volunteer management literature and our analysis of FRUS data, we develop a model for two-sided markets which repeatedly match volunteers with tasks. Our model incorporates match uncertainty as well as the negative impact of failing to match on future engagement. We study the platform's optimal policy for setting the adoption level to maximize the total discounted number of matches. We fully characterize the optimal myopic policy and show that it takes a simple form: depending on volunteer characteristics and market thickness, either allow for full adoption or disallow adoption. In the long run, we show that such a policy is either optimal or achieves a constant-factor approximation. Our finding is robust to incorporating heterogeneity in volunteer behavior. Our work sheds light on how two-sided platforms need to carefully control the double-edged impacts that commitment levers have on growth and engagement. A one-size-fits-all solution may not be effective, as the optimal design crucially depends on the characteristics of the volunteer population.
Submitted 15 July, 2021; v1 submitted 21 May, 2020;
originally announced May 2020.
-
Coloring Square-free Berge Graphs
Authors:
Maria Chudnovsky,
Irene Lo,
Frederic Maffray,
Nicolas Trotignon,
Kristina Vuskovic
Abstract:
We consider the class of Berge graphs that do not contain a chordless cycle of length $4$. We present a purely graph-theoretical algorithm that produces an optimal coloring in polynomial time for every graph in that class.
Submitted 8 June, 2022; v1 submitted 30 September, 2015;
originally announced September 2015.
-
The extremal function for disconnected minors
Authors:
Endre Csóka,
Irene Lo,
Sergey Norin,
Hehui Wu,
Liana Yepremyan
Abstract:
For a graph $H$ let $c(H)$ denote the supremum of $|E(G)|/|V(G)|$ taken over all non-null graphs $G$ not containing $H$ as a minor. We show that $$c(H) \leq \frac{|V(H)|+\mathrm{comp}(H)}{2}-1,$$ when $H$ is a union of cycles, verifying conjectures of Reed and Wood, and Harvey and Wood.
We derive the above result from a theorem which allows us to find two vertex disjoint subgraphs with prescribed densities in a sufficiently dense graph, which might be of independent interest.
Submitted 3 September, 2015;
originally announced September 2015.
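As a quick sanity check of the bound, consider two small instances, writing $K_3$ for a triangle and $2K_3$ for two vertex-disjoint triangles (this notation is introduced here for illustration and does not appear in the abstract). A single triangle has $|V(H)| = 3$ and $\mathrm{comp}(H) = 1$; two disjoint triangles have $|V(H)| = 6$ and $\mathrm{comp}(H) = 2$, so $$c(K_3) \leq \frac{3+1}{2}-1 = 1, \qquad c(2K_3) \leq \frac{6+2}{2}-1 = 3.$$ The first value is attained in the limit: graphs with no triangle minor are forests, and a tree on $n$ vertices has $|E(G)|/|V(G)| = (n-1)/n$, which tends to $1$.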
-
Misere Hackenbush Flowers
Authors:
Irene Y. Lo
Abstract:
We show that any disjunctive sum of Hackenbush Flowers $G$ has an evil twin $G^* \in \{G, G+*\}$ such that the outcomes of $G$ under normal and misère play are the same as the outcomes of $G^*$ under misère and normal play, respectively. We also show that, under misère play, any Green Hackenbush position that has a single edge incident with the ground is equivalent to a nim-heap.
Submitted 7 January, 2013; v1 submitted 24 December, 2012;
originally announced December 2012.
-
Some Bounds on the Rainbow Connection Number of 3-, 4- and 5-connected Graphs
Authors:
Irene Y. Lo
Abstract:
The rainbow connection number, $rc(G)$, of a connected graph $G$ is the minimum number of colors needed to color its edges so that every pair of vertices is connected by at least one path in which no two edges are colored the same. We show that for $\kappa=3$ or $\kappa=4$, every $\kappa$-connected graph $G$ on $n$ vertices with diameter $\frac{n}{\kappa}-c$ satisfies $rc(G) \leq \frac{n}{\kappa} + 15c + 18$. We also show that for every maximal planar graph $G$, $rc(G) \leq \frac{n}{\kappa} + 36$. This proves a conjecture of Li et al. for graphs with large diameter and maximal planar graphs.
Submitted 24 December, 2012;
originally announced December 2012.
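The definition of $rc(G)$ in the abstract can be checked directly on very small graphs. The sketch below is a brute-force illustration only (exponential in the number of edges) and is unrelated to the proof techniques of the paper; the function names are hypothetical, and the input graph is assumed to be connected with vertices labelled `0..n-1`.

```python
from itertools import combinations, product


def is_rainbow_connected(n, edges, colouring):
    """Check that every vertex pair is joined by a path with distinct edge colours."""
    adj = {v: [] for v in range(n)}
    for (u, v), c in zip(edges, colouring):
        adj[u].append((v, c))
        adj[v].append((u, c))

    def reachable(src, dst):
        # Search over states (vertex, set of colours used so far). A walk
        # with distinct colours can always be shortcut to a rainbow path.
        stack = [(src, frozenset())]
        seen = set(stack)
        while stack:
            v, used = stack.pop()
            if v == dst:
                return True
            for w, c in adj[v]:
                state = (w, used | {c})
                if c not in used and state not in seen:
                    seen.add(state)
                    stack.append(state)
        return False

    return all(reachable(u, v) for u, v in combinations(range(n), 2))


def rainbow_connection_number(n, edges):
    """Smallest k such that some k-colouring of the edges is rainbow connected."""
    for k in range(1, len(edges) + 1):
        for colouring in product(range(k), repeat=len(edges)):
            if is_rainbow_connected(n, edges, colouring):
                return k
    return None  # only reached for graphs without edges or disconnected input


cycle4 = [(0, 1), (1, 2), (2, 3), (3, 0)]
print(rainbow_connection_number(4, cycle4))  # prints 2
```

For the 4-cycle, one colour fails because opposite vertices are joined only by two-edge paths, while alternating two colours around the cycle works.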
-
Anomalous k-dependent spin splitting in wurtzite AlxGa1-xN/GaN heterostructures
Authors:
Ikai Lo,
M. H. Gau,
J. K. Tsai,
Y. L. Chen,
Z. J. Chang,
W. T. Wang,
J. C. Chiang,
T. Aggerstam
Abstract:
We have confirmed the k-dependent spin splitting in wurtzite Al$_x$Ga$_{1-x}$N/GaN heterostructures. The anomalous beating pattern in Shubnikov-de Haas measurements arises from the interference of Rashba and Dresselhaus spin-orbit interactions. The dominant mechanism for the k-dependent spin splitting at high values of k is attributed to the Dresselhaus term, which is enhanced by the $\Delta_{C1}$-$\Delta_{C3}$ coupling of the wurtzite band-folding effect.
Submitted 9 November, 2006; v1 submitted 15 September, 2006;
originally announced September 2006.
-
Study of two-subband population in Fe-doped AlxGa1-xN/GaN heterostructures by persistent photoconductivity effect
Authors:
Ikai Lo,
J. K. Tsai,
M. H. Gau,
Y. L. Chen,
Z. J. Chang,
W. T. Wang,
J. C. Chiang,
K. R. Wang,
Chun-Nan Chen,
T. Aggerstam
Abstract:
The electronic properties of Fe-doped Al$_{0.31}$Ga$_{0.69}$N/GaN heterostructures have been studied by Shubnikov-de Haas measurement. Two subbands of the two-dimensional electron gas at the hetero-interface were populated. After low-temperature illumination, the electron density increases from $11.99 \times 10^{12}$ cm$^{-2}$ to $13.40 \times 10^{12}$ cm$^{-2}$ for the first subband and from $0.66 \times 10^{12}$ cm$^{-2}$ to $0.94 \times 10^{12}$ cm$^{-2}$ for the second subband. The persistent photoconductivity effect (~13% increase) is mostly attributed to the Fe-related deep-donor level in the GaN layer. The second subband starts to populate when the first subband is filled at a density $n_1 = 9.40 \times 10^{12}$ cm$^{-2}$. We calculate the energy separation between the first and second subbands to be 105 meV.
Submitted 14 September, 2006;
originally announced September 2006.
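The quoted increase of about 13% is consistent with the change in total electron density, assuming the figure refers to the sum over both subbands (the abstract does not say so explicitly). In units of $10^{12}$ cm$^{-2}$: $$n_{\mathrm{before}} = 11.99 + 0.66 = 12.65, \qquad n_{\mathrm{after}} = 13.40 + 0.94 = 14.34, \qquad \frac{n_{\mathrm{after}} - n_{\mathrm{before}}}{n_{\mathrm{before}}} = \frac{1.69}{12.65} \approx 0.134.$$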
-
Wurtzite Effects on Spin Splitting of GaN/AlN Quantum Wells
Authors:
Ikai Lo,
W. T. Wang,
M. H. Gau,
S. F. Tsay,
J. C. Chiang
Abstract:
A new mechanism ($\Delta_{C1}$-$\Delta_{C3}$ coupling) is proposed to account for the spin splitting of wurtzite GaN, which originates from the intrinsic wurtzite effects (band folding and structure inversion asymmetry). The band-folding effect generates two conduction bands ($\Delta_{C1}$ and $\Delta_{C3}$), in which the p-wave probability changes dramatically as $k_z$ approaches the anti-crossing zone. The spin-splitting energy induced by the $\Delta_{C1}$-$\Delta_{C3}$ coupling and wurtzite structure inversion asymmetry is much larger than that evaluated from the traditional Rashba or Dresselhaus effects. When we apply the coupling to GaN/AlN quantum wells, we find that the spin-splitting energy is sensitively controllable by an electric field. Based on this mechanism, we propose a p-wave-enhanced spin-polarized field-effect transistor, made of In$_x$Ga$_{1-x}$N/In$_y$Al$_{1-y}$N, for spintronics applications.
Submitted 31 October, 2005;
originally announced October 2005.