Search | arXiv e-print repository

Learning Program Behavioral Models from Synthesized Input-Output Pairs

Authors: Tural Mammadov, Dietrich Klakow, Alexander Koller, Andreas Zeller

Abstract: We introduce Modelizer - a novel framework that, given a black-box program, learns a _model from its input/output behavior_ using _neural machine translation_. The resulting model _mocks_ the original program: Given an input, the model predicts the output that would have been produced by the program. However, the model is also _reversible_ - that is, the model can predict the input that would have produced a given output. Finally, the model is _differentiable_ and can be efficiently restricted to predict only a certain aspect of the program behavior. Modelizer uses _grammars_ to synthesize inputs and to parse the resulting outputs, allowing it to learn sequence-to-sequence associations between token streams. Other than input and output grammars, Modelizer only requires the ability to execute the program. The resulting models are _small_, requiring fewer than 6.3 million parameters for languages such as Markdown or HTML; and they are _accurate_, achieving up to 95.4% accuracy and a BLEU score of 0.98 with standard error 0.04 in mocking real-world applications. We foresee several _applications_ of these models, especially as the output of the program can be any aspect of program behavior. Besides mocking and predicting program behavior, the model can also synthesize inputs that are likely to produce a particular behavior, such as failures or coverage.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 42 pages, 6 figures, 8 tables

MSC Class: 68T07 (Primary); 68N30 (Secondary); 68Q42

ACM Class: D.2.5; D.2.7; I.2.6; F.1.1; F.4.3

arXiv:2407.04543 [pdf, other]

Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations

Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

Abstract: Models need appropriate inductive biases to effectively learn from small amounts of data and generalize systematically outside of the training distribution. While Transformers are highly versatile and powerful, they can still benefit from enhanced structural inductive biases for seq2seq tasks, especially those involving syntactic transformations, such as converting active to passive voice or semantic parsing. In this paper, we propose to strengthen the structural inductive bias of a Transformer by intermediate pre-training to perform synthetically generated syntactic transformations of dependency trees given a description of the transformation. Our experiments confirm that this helps with few-shot learning of syntactic tasks such as chunking, and also improves structural generalization for semantic parsing. Our analysis shows that the intermediate pre-training leads to attention heads that keep track of which syntactic transformation needs to be applied to which token, and that the model can leverage these attention heads on downstream tasks.

Submitted 5 July, 2024; originally announced July 2024.

arXiv:2407.01899 [pdf, other]

Scope-enhanced Compositional Semantic Parsing for DRT

Authors: Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos

Abstract: Discourse Representation Theory (DRT) distinguishes itself from other semantic representation frameworks by its ability to model complex semantic and discourse phenomena through structural nesting and variable binding. While seq2seq models hold the state of the art on DRT parsing, their accuracy degrades with the complexity of the sentence, and they sometimes struggle to produce well-formed DRT representations. We introduce the AMS parser, a compositional, neurosymbolic semantic parser for DRT. It rests on a novel mechanism for predicting quantifier scope. We show that the AMS parser reliably produces well-formed outputs and performs well on DRT parsing, especially on complex sentences.

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.18403 [pdf, other]

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Authors: Anna Bavaresco, Raffaella Bernardi, Leonardo Bertolazzi, Desmond Elliott, Raquel Fernández, Albert Gatt, Esam Ghaleb, Mario Giulianelli, Michael Hanna, Alexander Koller, André F. T. Martins, Philipp Mondorf, Vera Neplenbroek, Sandro Pezzelle, Barbara Plank, David Schlangen, Alessandro Suglia, Aditya K Surikuchi, Ece Takmaz, Alberto Testoni

Abstract: There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are conducted with proprietary models, this also raises concerns over reproducibility. We provide JUDGE-BENCH, a collection of 20 NLP datasets with human annotations, and comprehensively evaluate 11 current LLMs, covering both open-weight and proprietary models, for their ability to replicate the annotations. Our evaluations show that each LLM exhibits a large variance across datasets in its correlation to human judgments. We conclude that LLMs are not yet ready to systematically replace human judges in NLP.

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.11338 [pdf, other]

Fine-grained Controllable Text Generation through In-context Learning with Feedback

Authors: Sarubi Thillainathan, Alexander Koller

Abstract: We present a method for rewriting an input sentence to match specific values of nontrivial linguistic features, such as dependency depth. In contrast to earlier work, our method uses in-context learning rather than finetuning, making it applicable in use cases where data is sparse. We show that our model performs accurate rewrites and matches the state of the art on rewriting sentences to a specified school grade level.

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.08202 [pdf, other]

A Dialogue Game for Eliciting Balanced Collaboration

Authors: Isidora Jeknić, David Schlangen, Alexander Koller

Abstract: Collaboration is an integral part of human dialogue. Typical task-oriented dialogue games assign asymmetric roles to the participants, which limits their ability to elicit naturalistic role-taking in collaboration and its negotiation. We present a novel and simple online setup that favors balanced collaboration: a two-player 2D object placement game in which the players must negotiate the goal state themselves. We show empirically that human players exhibit a variety of role distributions, and that balanced collaboration improves task performance. We also present an LLM-based baseline agent which demonstrates that automatic playing of our game is an interesting challenge for artificial systems.

Submitted 11 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

arXiv:2401.09815 [pdf, other]

Simple and effective data augmentation for compositional generalization

Authors: Yuekun Yao, Alexander Koller

Abstract: Compositional generalization, the ability to predict complex meanings from training on simpler sentences, poses challenges for powerful pretrained seq2seq models. In this paper, we show that data augmentation methods that sample MRs and backtranslate them can be effective for compositional generalization, but only if we sample from the right distribution. Remarkably, sampling from a uniform distribution performs almost as well as sampling from the test distribution, and greatly outperforms earlier methods that sampled from the training distribution. We further conduct experiments to investigate the reason why this happens and where the benefit of such data augmentation methods comes from.

Submitted 18 January, 2024; originally announced January 2024.

arXiv:2311.09830 [pdf, other]

AutoPlanBench: Automatically generating benchmarks for LLM planners from PDDL

Authors: Katharina Stein, Daniel Fišer, Jörg Hoffmann, Alexander Koller

Abstract: LLMs are being increasingly used for planning-style tasks, but their capabilities for planning and reasoning are poorly understood. We present AutoPlanBench, a novel method for automatically converting planning benchmarks written in PDDL into textual descriptions and offer a benchmark dataset created with our method. We show that while the best LLM planners do well on some planning tasks, others remain out of reach of current methods.

Submitted 9 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

arXiv:2311.09422 [pdf, other]

Predicting generalization performance with correctness discriminators

Authors: Yuekun Yao, Alexander Koller

Abstract: The ability to predict an NLP model's accuracy on unseen, potentially out-of-distribution data is a prerequisite for trustworthiness. We present a novel model that establishes upper and lower bounds on the accuracy, without requiring gold labels for the unseen data. We achieve this by training a discriminator which predicts whether the output of a given sequence-to-sequence model is correct or not. We show across a variety of tagging, parsing, and semantic parsing tasks that the gold accuracy is reliably between the predicted upper and lower bounds, and that these bounds are remarkably close together.

Submitted 15 November, 2023; originally announced November 2023.

arXiv:2311.05772 [pdf, other]

ADaPT: As-Needed Decomposition and Planning with Language Models

Authors: Archiki Prasad, Alexander Koller, Mareike Hartmann, Peter Clark, Ashish Sabharwal, Mohit Bansal, Tushar Khot

Abstract: Large Language Models (LLMs) are increasingly being used for interactive decision-making tasks requiring planning and adapting to the environment. Recent works employ LLMs-as-agents in broadly two ways: iteratively determining the next action (iterative executors) or generating plans and executing sub-tasks using LLMs (plan-and-execute). However, these methods struggle with task complexity, as the inability to execute any sub-task may lead to task failure. To address these shortcomings, we introduce As-Needed Decomposition and Planning for complex Tasks (ADaPT), an approach that explicitly plans and decomposes complex sub-tasks as-needed, i.e., when the LLM is unable to execute them. ADaPT recursively decomposes sub-tasks to adapt to both task complexity and LLM capability. Our results demonstrate that ADaPT substantially outperforms established strong baselines, achieving success rates up to 28.3% higher in ALFWorld, 27% in WebShop, and 33% in TextCraft -- a novel compositional dataset that we introduce. Through extensive analysis, we illustrate the importance of multilevel decomposition and establish that ADaPT dynamically adjusts to the capabilities of the executor LLM as well as to task complexity.

Submitted 8 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

Comments: NAACL 2024 (findings) camera-ready. Project Page: https://allenai.github.io/adaptllm

arXiv:2310.15040 [pdf, other]

SLOG: A Structural Generalization Benchmark for Semantic Parsing

Authors: Bingzhi Li, Lucia Donatelli, Alexander Koller, Tal Linzen, Yuekun Yao, Najoung Kim

Abstract: The goal of compositional generalization benchmarks is to evaluate how well models generalize to new complex linguistic expressions. Existing benchmarks often focus on lexical generalization, the interpretation of novel lexical items in syntactic structures familiar from training; structural generalization tasks, where a model needs to interpret syntactic structures that are themselves unfamiliar from training, are often underrepresented, resulting in overly optimistic perceptions of how well models can generalize. We introduce SLOG, a semantic parsing dataset that extends COGS (Kim and Linzen, 2020) with 17 structural generalization cases. In our experiments, the generalization accuracy of Transformer models, including pretrained ones, only reaches 40.6%, while a structure-aware parser only achieves 70.8%. These results are far from the near-perfect accuracy existing models achieve on COGS, demonstrating the role of SLOG in foregrounding the large discrepancy between models' lexical and structural generalization capacities.

Submitted 23 October, 2023; originally announced October 2023.

Comments: Accepted to EMNLP 2023

arXiv:2310.01693 [pdf, other]

Closing the Curious Case of Neural Text Degeneration

Authors: Matthew Finlayson, John Hewitt, Alexander Koller, Swabha Swayamdipta, Ashish Sabharwal

Abstract: Despite their ubiquity in language generation, it remains unknown why truncation sampling heuristics like nucleus sampling are so effective. We provide a theoretical explanation for the effectiveness of truncation sampling by proving that truncation methods that discard tokens below some probability threshold (the most common type of truncation) can guarantee that all sampled tokens have nonzero true probability. However, thresholds are a coarse heuristic, and necessarily discard some tokens with nonzero true probability as well. In pursuit of a more precise sampling strategy, we show that we can leverage a known source of model errors, the softmax bottleneck, to prove that certain tokens have nonzero true probability, without relying on a threshold. Based on our findings, we develop an experimental truncation strategy and present pilot studies demonstrating the promise of this type of algorithm. Our evaluations show that our method outperforms its threshold-based counterparts under automatic and human evaluation metrics for low-entropy (i.e., close to greedy) open-ended text generation. Our theoretical findings and pilot experiments provide both insight into why truncation sampling works, and make progress toward more expressive sampling algorithms that better surface the generative capabilities of large language models.

Submitted 2 October, 2023; originally announced October 2023.

MSC Class: 68T50

ACM Class: I.2.7

arXiv:2310.00796 [pdf, other]

SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation

Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

Abstract: Strong inductive biases enable learning from little data and help generalization outside of the training distribution. Popular neural architectures such as Transformers lack strong structural inductive biases for seq2seq NLP tasks on their own. Consequently, they struggle with systematic generalization beyond the training distribution, e.g. with extrapolating to longer inputs, even when pre-trained on large amounts of text. We show how a structural inductive bias can be efficiently injected into a seq2seq model by pre-training it to simulate structural transformations on synthetic data. Specifically, we inject an inductive bias towards Finite State Transducers (FSTs) into a Transformer by pre-training it to simulate FSTs given their descriptions. Our experiments show that our method imparts the desired inductive bias, resulting in improved systematic generalization and better few-shot learning for FST-like tasks. Our analysis shows that fine-tuned models accurately capture the state dynamics of the unseen underlying FSTs, suggesting that the simulation process is internalized by the fine-tuned model.

Submitted 10 July, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

Comments: ACL 2024 camera-ready

arXiv:2306.12226 [pdf, ps, other]

The Hessian of surface tension characterises scaling limit of gradient models with non-convex energy

Authors: Stefan Adams, Andreas Koller

Abstract: We study the scaling limit of statistical mechanics models with non-convex Hamiltonians that are gradient perturbations of Gaussian measures. Characterising features of our gradient models are the imposed boundary tilt and the surface tension (free energy) as a function of tilt. In the regime of low temperatures and bounded tilt, we prove the scaling limit for macroscopic functions on the torus, and we show that the limit is a continuum Gaussian Free Field with covariance (diffusion) matrix given as the Hessian of surface tension. Our proof of this longstanding conjecture complements recent studies in [Hil16], [ABKM].

Submitted 21 June, 2023; originally announced June 2023.

MSC Class: 60K35; 82B20; 82B28

arXiv:2305.16954 [pdf, other]

Compositional Generalization without Trees using Multiset Tagging and Latent Permutations

Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

Abstract: Seq2seq models have been shown to struggle with compositional generalization in semantic parsing, i.e. generalizing to unseen compositions of phenomena that the model handles correctly in isolation. We phrase semantic parsing as a two-step process: we first tag each input token with a multiset of output tokens. Then we arrange the tokens into an output sequence using a new way of parameterizing and predicting permutations. We formulate predicting a permutation as solving a regularized linear program and we backpropagate through the solver. In contrast to prior work, our approach does not place a priori restrictions on possible permutations, making it very expressive. Our model outperforms pretrained seq2seq models and prior work on realistic semantic parsing tasks that require generalization to longer examples. We also outperform non-tree-based models on structural generalization on the COGS benchmark. For the first time, we show that a model without an inductive bias provided by trees achieves high accuracy on generalization to deeper recursion.

Submitted 26 May, 2023; originally announced May 2023.

Comments: ACL 2023

arXiv:2305.08414 [pdf, other]

What's the Meaning of Superhuman Performance in Today's NLU?

Authors: Simone Tedeschi, Johan Bos, Thierry Declerck, Jan Hajic, Daniel Hershcovich, Eduard H. Hovy, Alexander Koller, Simon Krek, Steven Schockaert, Rico Sennrich, Ekaterina Shutova, Roberto Navigli

Abstract: In the last five years, there has been a significant focus in Natural Language Processing (NLP) on developing larger Pretrained Language Models (PLMs) and introducing benchmarks such as SuperGLUE and SQuAD to measure their abilities in language understanding, reasoning, and reading comprehension. These PLMs have achieved impressive results on these benchmarks, even surpassing human performance in some cases. This has led to claims of superhuman capabilities and the provocative idea that certain tasks have been solved. In this position paper, we take a critical look at these claims and ask whether PLMs truly have superhuman abilities and what the current benchmarks are really evaluating. We show that these benchmarks have serious limitations affecting the comparison between humans and PLMs and provide recommendations for fairer and more transparent benchmarks.

Submitted 15 May, 2023; originally announced May 2023.

Comments: 9 pages, long paper at ACL 2023 proceedings

arXiv:2304.14399 [pdf, other]

We're Afraid Language Models Aren't Modeling Ambiguity

Authors: Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A. Smith, Yejin Choi

Abstract: Ambiguity is an intrinsic feature of natural language. Managing ambiguity is a key part of human language understanding, allowing us to anticipate misunderstanding as communicators and revise our interpretations as listeners. As language models (LMs) are increasingly employed as dialogue interfaces and writing aids, handling ambiguous language is critical to their success. We characterize ambiguity in a sentence by its effect on entailment relations with another sentence, and collect AmbiEnt, a linguist-annotated benchmark of 1,645 examples with diverse kinds of ambiguity. We design a suite of tests based on AmbiEnt, presenting the first evaluation of pretrained LMs to recognize ambiguity and disentangle possible meanings. We find that the task remains extremely challenging, including for GPT-4, whose generated disambiguations are considered correct only 32% of the time in human evaluation, compared to 90% for disambiguations in our dataset. Finally, to illustrate the value of ambiguity-sensitive tools, we show that a multilabel NLI model can flag political claims in the wild that are misleading due to ambiguity. We encourage the field to rediscover the importance of ambiguity for NLP.

Submitted 20 October, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

Comments: EMNLP 2023 camera-ready

arXiv:2210.13050 [pdf, other]

Structural generalization is hard for sequence-to-sequence models

Authors: Yuekun Yao, Alexander Koller

Abstract: Sequence-to-sequence (seq2seq) models have been successful across many NLP tasks, including ones that require predicting linguistic structure. However, recent work on compositional generalization has shown that seq2seq models achieve very low accuracy in generalizing to linguistic structures that were not seen in training. We present new evidence that this is a general limitation of seq2seq models that is present not just in semantic parsing, but also in syntactic parsing and in text-to-text tasks, and that this limitation can often be overcome by neurosymbolic models that have linguistic knowledge built in. We further report on some experiments that give initial answers on the reasons for these limitations.

Submitted 24 October, 2022; originally announced October 2022.

Comments: Accepted in EMNLP 2022

arXiv:2210.03183 [pdf, other]

Compositional Generalisation with Structured Reordering and Fertility Layers

Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

Abstract: Seq2seq models have been shown to struggle with compositional generalisation, i.e. generalising to new and potentially more complex structures than seen during training. Taking inspiration from grammar-based models that excel at compositional generalisation, we present a flexible end-to-end differentiable neural model that composes two structural operations: a fertility step, which we introduce in this work, and a reordering step based on previous work (Wang et al., 2021). To ensure differentiability, we use the expected value of each step. Our model outperforms seq2seq models by a wide margin on challenging compositional splits of realistic semantic parsing tasks that require generalisation to longer examples. It also compares favourably to other models targeting compositional generalisation.

Submitted 15 February, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

Comments: EACL 2023 camera-ready

ACM Class: I.2.7

arXiv:2202.11937 [pdf, other]

Compositional Generalization Requires Compositional Parsers

Authors: Pia Weißenhorn, Yuekun Yao, Lucia Donatelli, Alexander Koller

Abstract: A rapidly growing body of research on compositional generalization investigates the ability of a semantic parser to dynamically recombine linguistic elements seen in training into unseen sequences. We present a systematic comparison of sequence-to-sequence models and models guided by compositional principles on the recent COGS corpus (Kim and Linzen, 2020). Though seq2seq models can perform well on lexical tasks, they perform with near-zero accuracy on structural generalization tasks that require novel syntactic structures; this holds true even when they are trained to predict syntax instead of semantics. In contrast, compositional models achieve near-perfect accuracy on structural generalization; we present new results confirming this from the AM parser (Groschwitz et al., 2021). Our findings show structural generalization is a key measure of compositional generalization and requires models that are aware of complex structure.

Submitted 24 February, 2022; originally announced February 2022.

arXiv:2106.04398 [pdf, other]

Learning compositional structures for semantic graph parsing

Authors: Jonas Groschwitz, Meaghan Fowlie, Alexander Koller

Abstract: AM dependency parsing is a method for neural semantic graph parsing that exploits the principle of compositionality. While AM dependency parsers have been shown to be fast and accurate across several graphbanks, they require explicit annotations of the compositional tree structures for training. In the past, these were obtained using complex graphbank-specific heuristics written by experts. Here we show how they can instead be trained directly on the graphs with a neural latent-variable model, drastically reducing the amount and complexity of manual heuristics. We demonstrate that our model picks up on several linguistic phenomena on its own and achieves comparable accuracy to supervised training, greatly facilitating the use of AM dependency parsing for new sembanks.

Submitted 8 June, 2021; originally announced June 2021.

Comments: Accepted at the 5th Workshop on Structured Prediction for NLP (http://structuredprediction.github.io/SPNLP21)

arXiv:2010.03982 [pdf, other]

Generating Instructions at Different Levels of Abstraction

Authors: Arne Köhn, Julia Wichlacz, Álvaro Torralba, Daniel Höller, Jörg Hoffmann, Alexander Koller

Abstract: When generating technical instructions, it is often convenient to describe complex objects in the world at different levels of abstraction. A novice user might need an object explained piece by piece, while for an expert, talking about the complex object (e.g. a wall or railing) directly may be more succinct and efficient. We show how to generate building instructions at different levels of abstraction in Minecraft. We introduce the use of hierarchical planning to this end, a method from AI planning which can capture the structure of complex objects neatly. A crowdsourcing evaluation shows that the choice of abstraction level matters to users, and that an abstraction strategy which balances low-level and high-level object descriptions compares favorably to ones which don't.

Submitted 8 October, 2020; originally announced October 2020.

Comments: Accepted COLING 2020 long paper

arXiv:2009.07365 [pdf, other]

Fast semantic parsing with well-typedness guarantees

Authors: Matthias Lindemann, Jonas Groschwitz, Alexander Koller

Abstract: AM dependency parsing is a linguistically principled method for neural semantic parsing with high accuracy across multiple graphbanks. It relies on a type system that models semantic valency but makes existing parsers slow. We describe an A* parser and a transition-based parser for AM dependency parsing which guarantee well-typedness and improve parsing speed by up to 3 orders of magnitude, while maintaining or improving accuracy.

Submitted 6 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

Comments: Accepted at EMNLP 2020, camera-ready version

arXiv:2004.14236 [pdf, other]

Normalizing Compositional Structures Across Graphbanks

Authors: Lucia Donatelli, Jonas Groschwitz, Alexander Koller, Matthias Lindemann, Pia Weißenhorn

Abstract: The emergence of a variety of graph-based meaning representations (MRs) has sparked an important conversation about how to adequately represent semantic structure. These MRs exhibit structural differences that reflect different theoretical and design considerations, presenting challenges to uniform linguistic analysis and cross-framework semantic parsing. Here, we ask the question of which design differences between MRs are meaningful and semantically-rooted, and which are superficial. We present a methodology for normalizing discrepancies between MRs at the compositional level (Lindemann et al., 2019), finding that we can normalize the majority of divergent phenomena using linguistically-grounded rules. Our work significantly increases the match in compositional structure between MRs and improves multi-task learning (MTL) in a low-resource setting, demonstrating the usefulness of careful MR design analysis and comparison.

Submitted 30 April, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

Comments: 16 pages, 6 figures

arXiv:1906.11752 [pdf, other]

Semantic expressive capacity with bounded memory

Authors: Antoine Venant, Alexander Koller

Abstract: We investigate the capacity of mechanisms for compositional semantic parsing to describe relations between sentences and semantic representations. We prove that in order to represent certain relations, mechanisms which are syntactically projective must be able to remember an unbounded number of locations in the semantic representations, where nonprojective mechanisms need not. This is the first result of this kind, and has consequences both for grammar-based and for neural systems.

Submitted 27 June, 2019; originally announced June 2019.

Comments: Accepted at ACL 2019

arXiv:1906.11746 [pdf, other]

Compositional Semantic Parsing Across Graphbanks

Authors: Matthias Lindemann, Jonas Groschwitz, Alexander Koller

Abstract: Most semantic parsers that map sentences to graph-based meaning representations are hand-designed for specific graphbanks. We present a compositional neural semantic parser which achieves, for the first time, competitive accuracies across a diverse range of graphbanks. Incorporating BERT embeddings and multi-task learning improves the accuracy further, setting new states of the art on DM, PAS, PSD, AMR 2015 and EDS.

Submitted 13 July, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

Comments: Accepted at ACL 2019

arXiv:1806.10654 [pdf, other]

Generalized chart constraints for efficient PCFG and TAG parsing

Authors: Stefan Grünewald, Sophie Henning, Alexander Koller

Abstract: Chart constraints, which specify at which string positions a constituent may begin or end, have been shown to speed up chart parsers for PCFGs. We generalize chart constraints to more expressive grammar formalisms and describe a neural tagger which predicts chart constraints at very high precision. Our constraints accelerate both PCFG and TAG parsing, and combine effectively with other pruning techniques (coarse-to-fine and supertagging) for an overall speedup of two orders of magnitude, while improving accuracy.

Submitted 27 June, 2018; originally announced June 2018.

Journal ref: Proceedings of ACL 2018 (Short Papers)

arXiv:1806.05947 [pdf, other]

Discovering User Groups for Natural Language Generation

Authors: Nikos Engonopoulos, Christoph Teichmann, Alexander Koller

Abstract: We present a model which predicts how individual users of a dialog system understand and produce utterances based on user groups. In contrast to previous work, these user groups are not specified beforehand, but learned in training. We evaluate on two referring expression (RE) generation tasks; our experiments show that our model can identify user groups and learn how to most effectively talk to them, and can dynamically assign unseen users to the correct groups as they interact with the system.

Submitted 15 June, 2018; originally announced June 2018.

Comments: 9 pages, 7 Figures, Accepted for SIGDIAL 2018

arXiv:1805.11465 [pdf, other]

doi 10.18653/v1/P18-1170

AMR Dependency Parsing with a Typed Semantic Algebra

Authors: Jonas Groschwitz, Matthias Lindemann, Meaghan Fowlie, Mark Johnson, Alexander Koller

Abstract: We present a semantic parser for Abstract Meaning Representations which learns to parse strings into tree representations of the compositional structure of an AMR graph. This allows us to use standard neural techniques for supertagging and dependency tree parsing, constrained by a linguistically principled type system. We present two approximative decoding algorithms, which achieve state-of-the-art accuracy and outperform strong baselines.

Submitted 29 May, 2018; originally announced May 2018.

Comments: This paper will be presented at ACL 2018 (see https://acl2018.org/programme/papers/)

Journal ref: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2018

arXiv:1608.03854 [pdf, other]

doi 10.1038/nature20811

Spin-orbit coupled fermions in an optical lattice clock

Authors: S. Kolkowitz, S. L. Bromley, T. Bothwell, M. L. Wall, G. E. Marti, A. P. Koller, X. Zhang, A. M. Rey, J. Ye

Abstract: Engineered spin-orbit coupling (SOC) in cold atom systems can aid in the study of novel synthetic materials and complex condensed matter phenomena. Despite great advances, alkali atom SOC systems are hindered by heating from spontaneous emission, which limits the observation of many-body effects, motivating research into potential alternatives. Here we demonstrate that SOC can be engineered to occur naturally in a one-dimensional fermionic 87Sr optical lattice clock (OLC). In contrast to previous SOC experiments, in this work the SOC is both generated and probed using a direct ultra-narrow optical clock transition between two electronic orbital states. We use clock spectroscopy to prepare lattice band populations, internal electronic states, and quasimomenta, as well as to produce SOC dynamics. The exceptionally long lifetime of the excited clock state (160 s) eliminates decoherence and atom loss from spontaneous emission at all relevant experimental timescales, allowing subsequent momentum- and spin-resolved in situ probing of the SOC band structure and eigenstates. We utilize these capabilities to study Bloch oscillations, spin-momentum locking, and Van Hove singularities in the transition density of states. Our results lay the groundwork for the use of OLCs to probe novel SOC phases of matter.

Submitted 8 November, 2016; v1 submitted 12 August, 2016; originally announced August 2016.

Comments: 17 pages, 4 figures, and Methods section with 3 Extended Data figures. Reformatted from previous version, with reduced length and reduced number of citations to match journal format

arXiv:1601.01004 [pdf, other]

doi 10.1103/PhysRevLett.117.195302

Dynamics of interacting fermions in spin-dependent potentials

Authors: Andrew P. Koller, Michael L. Wall, Josh Mundinger, Ana Maria Rey

Abstract: Recent experiments with dilute trapped Fermi gases observed that weak interactions can drastically modify spin transport dynamics and give rise to robust collective effects including global demagnetization, macroscopic spin waves, spin segregation, and spin self-rephasing. In this work we develop a framework for studying the dynamics of weakly interacting fermionic gases following a spin-dependent change of the trapping potential which illuminates the interplay between spin, motion, Fermi statistics, and interactions. The key idea is the projection of the state of the system onto a set of lattice spin models defined on the single-particle mode space. Collective phenomena, including the global spreading of quantum correlations in real space, arise as a consequence of the long-ranged character of the spin model couplings. This approach achieves good agreement with prior measurements and suggests a number of directions for future experiments.

Submitted 7 December, 2016; v1 submitted 5 January, 2016; originally announced January 2016.

Comments: 5.5 pages and 4 figures + 8 pages and 6 figures supplemental material

Journal ref: Phys. Rev. Lett. 117, 195302 (2016)

arXiv:1509.05846 [pdf, other]

doi 10.1103/PhysRevLett.116.035301

Synthetic spin-orbit coupling in an optical lattice clock

Authors: Michael L. Wall, Andrew P. Koller, Shuming Li, Xibo Zhang, Nigel R. Cooper, Jun Ye, Ana Maria Rey

Abstract: We propose the use of optical lattice clocks operated with fermionic alkaline-earth-atoms to study spin-orbit coupling (SOC) in interacting many-body systems. The SOC emerges naturally during the clock interrogation when atoms are allowed to tunnel and accumulate a phase set by the ratio of the "magic" lattice wavelength to the clock transition wavelength. We demonstrate how standard protocols such as Rabi and Ramsey spectroscopy, that take advantage of the sub-Hertz resolution of state-of-the-art clock lasers, can perform momentum-resolved band tomography and determine SOC-induced $s$-wave collisions in nuclear spin polarized fermions. By adding a second counter-propagating clock beam a sliding superlattice can be implemented and used for controlled atom transport and as a probe of $p$ and $s$-wave interactions. The proposed spectroscopic probes provide clean and well-resolved signatures at current clock operating temperatures.

Submitted 19 September, 2015; originally announced September 2015.

Comments: 5.5 pages and 4 figures+6 pages and 1 figure supplemental material

Journal ref: Phys. Rev. Lett. 116, 035301 (2016)

arXiv:1506.02353 [pdf, other]

doi 10.1103/PhysRevA.92.033608

Demagnetization dynamics of non-interacting trapped fermions

Authors: Andrew P. Koller, Joshua Mundinger, Michael L. Wall, Ana Maria Rey

Abstract: Motivated by several experimental efforts to understand spin diffusion and transport in ultracold fermionic gases, we study the spin dynamics of initially spin-polarized ensembles of harmonically trapped non-interacting spin-1/2 fermionic atoms, subjected to a magnetic field gradient. We obtain simple analytic expressions for spin observables in the presence of both constant and linear magnetic field gradients, with and without a spin-echo pulse, and at zero and finite temperatures. The analysis shows the relevance of spin-motional coupling in the non-interacting regime where the demagnetization decay rate at short times can be faster than the experimentally measured rates in the strongly interacting regime under similar trapping conditions. Our calculations also show that particle motion limits the ability of a spin-echo pulse to remove the effect of magnetic field inhomogeneity, and that a spin-echo pulse can instead lead to an increased decay of magnetization at times comparable to the trapping period.

Submitted 8 June, 2015; originally announced June 2015.

Journal ref: Phys. Rev. A 92, 033608 (2015)

arXiv:1409.3234 [pdf, other]

doi 10.1103/PhysRevA.93.051601

Realizing Exactly Solvable SU(N) Magnets with Thermal Atoms

Authors: Michael E. Beverland, Gorjan Alagic, Michael J. Martin, Andrew P. Koller, Ana M. Rey, Alexey V. Gorshkov

Abstract: We show that $n$ thermal fermionic alkaline-earth atoms in a flat-bottom trap allow one to robustly implement a spin model displaying two symmetries: the $S_n$ symmetry that permutes atoms occupying different vibrational levels of the trap and the SU($N$) symmetry associated with $N$ nuclear spin states. The high symmetry makes the model exactly solvable, which, in turn, enables the analytic study of dynamical processes such as spin diffusion in this SU($N$) system. We also show how to use this system to generate entangled states that allow for Heisenberg-limited metrology. This highly symmetric spin model should be experimentally realizable even when the vibrational levels are occupied according to a high-temperature thermal or an arbitrary non-thermal distribution.

Submitted 5 August, 2016; v1 submitted 10 September, 2014; originally announced September 2014.

Comments: 12 pages, 5 figures (including supplemental materials)

Journal ref: Physical Review A 93 (5), 051601 (2016)

arXiv:1312.0887 [pdf, other]

doi 10.1103/PhysRevLett.112.123001

Beyond the Spin Model Approximation for Ramsey Spectroscopy

Authors: A. P. Koller, M. Beverland, A. V. Gorshkov, A. M. Rey

Abstract: Ramsey spectroscopy has become a powerful technique for probing non-equilibrium dynamics of internal (pseudospin) degrees of freedom of interacting systems. In many theoretical treatments, the key to understanding the dynamics has been to assume the external (motional) degrees of freedom are decoupled from the pseudospin degrees of freedom. Determining the validity of this approximation -- known as the spin model approximation -- is complicated, and has not been addressed in detail. Here we shed light in this direction by calculating Ramsey dynamics exactly for two interacting spin-1/2 particles in a harmonic trap. We focus on $s$-wave-interacting fermions in quasi-one and two-dimensional geometries. We find that in 1D the spin model assumption works well over a wide range of experimentally-relevant conditions, but can fail at time scales longer than those set by the mean interaction energy. Surprisingly, in 2D a modified version of the spin model is exact to first order in the interaction strength. This analysis is important for a correct interpretation of Ramsey spectroscopy and has broad applications ranging from precision measurements to quantum information and to fundamental probes of many-body systems.

Submitted 17 April, 2014; v1 submitted 3 December, 2013; originally announced December 2013.

Journal ref: Phys. Rev. Lett. 112, 123001 (2014)

arXiv:1309.0828 [pdf, ps, other]

doi 10.1103/PhysRevA.89.021601

Quenching to unitarity: Quantum dynamics in a 3D Bose gas

Authors: A. G. Sykes, J. P. Corson, J. P. D'Incao, A. P. Koller, C. H. Greene, A. M. Rey, K. R. A. Hazzard, J. L. Bohn

Abstract: We study the dynamics of a dilute Bose gas at zero temperature following a sudden quench of the scattering length from a noninteracting Bose condensate to unitarity (infinite scattering length). We apply three complementary approaches to understand the momentum distribution and loss rates. First, using a time-dependent variational ansatz for the many-body state, we calculate the dynamics of the momentum distribution. Second, we demonstrate that, at short times and large momenta compared to those set by the density, the physics can be well understood within a simple, analytic two-body model. We derive a quantitative prediction for the evolution of Tan's contact, which increases linearly at short times. We also study the three-body losses at finite densities. Consistent with experiments, we observe lifetimes which are long compared to the dynamics of large momentum modes.

Submitted 3 September, 2013; originally announced September 2013.

Comments: 4 pages, 3 figures

arXiv:1306.0300 [pdf, ps, other]

Emergence of Reflectionless Scattering from Linearizations of Integrable PDEs around Solitons

Authors: Andrew Koller, Zaijong Hwang, Maxim Olshanii

Abstract: We present four examples of integrable partial differential equations (PDEs) of mathematical physics that---when linearized around a stationary soliton---exhibit scattering without reflection at {\it all} energies. Starting from the most well-known and the most empirically relevant phenomenon of the transparency of one-dimensional bright bosonic solitons to Bogoliubov excitations, we proceed to the sine-Gordon, Korteweg-de Vries, and Liouville's equation whose stationary solitons also support our assertion. The proposed connection between integrability and reflectionless scattering seems to span at least two distinct paradigms of integrability: S-integrability in the first three cases, and C-integrability in the last one. We argue that the transparency of linearized integrable PDEs is necessary to ensure that they can support the transparency of stationary solitons in the original integrable PDEs. As contrasting cases, the analysis is further extended to cover two non-integrable systems: a sawtooth-Gordon and a $φ^4$ model.

Submitted 27 November, 2014; v1 submitted 3 June, 2013; originally announced June 2013.

Comments: General copy edits. Added studies on two non-integrable systems: a sawtooth-Gordon and a phi-4 model. 6 pages, 1 figure

Journal ref: J. Phys. A: Math. Theor. 48, 075203 (2015)

arXiv:1101.3596 [pdf, ps, other]

Minkowski and packing Dimension comparisons for sets with Reifenberg properties

Authors: Amos N. Koeller

Abstract: In Koeller \cite{koerprops} the twelve variants of the Reifenberg properties known to be instrumental in the theory of minimal surfaces were classified with respect to various Hausdorff measure based measure theoretic properties. The classification led to the consideration of fine geometric properties and a connection to fractal geometry. The current work develops this connection and extends the classification to consider Minkowski-dimension, packing dimension, measure, and rectifiability, and the equality of packing and Hausdorff measures with interesting results.

Submitted 18 January, 2011; originally announced January 2011.

MSC Class: primary 28A78 and 37F35; secondary 51M15

arXiv:1101.1532 [pdf, ps, other]

Outer measure preserving ergodic transformations generate the Carathéodory definition of measurable sets

Authors: Amos N. Koeller

Abstract: It is known that there are specific examples of ergodic transformations on measure spaces for which the calculation of the outer measure of transformation invariant sets leads to a condition closely resembling Carathéodory's condition for sets to be measurable. It is then natural to ask what functions are capable of `generating', that is leading to, the Carathéodory definition in the same way. The present work answers this question by showing that the property of generating Carathéodory's definition holds for the general class of outer measure preserving ergodic transformations on measure spaces. We further show that the previously found specific examples of functions generating Carathéodory's definition fall into this family of transformations.

Submitted 7 January, 2011; originally announced January 2011.

MSC Class: 28A05; 28D05

arXiv:1012.4275 [pdf, ps, other]

A classification of Reifenberg properties

Authors: Amos N. Koeller

Abstract: We define twelve variants of a Reifenberg's affine approximation property, which are known to be connected with the singular sets of minimal surfaces. With this motivation we investigate the regularity of the sets possessing these. We classify the properties with respect to whether $j$-dimensional Hausdorff dimension, locally finite $j$-dimensional Hausdorff measure or countable $j$-rectifiability hold. In showing that varying levels of regularity hold for the differing properties, quasi-self-similar sets, interesting in their own right, are constructed as counter examples. These counter examples also admit a connection to number theory via the use of the normal number theorem. Additionally, the intriguing result that such complexity in the counter examples is actually a necessity is shown.

Submitted 20 December, 2010; originally announced December 2010.

MSC Class: primary 28A78 and 37F35; secondary 51M15 and 53A10

arXiv:1012.2843 [pdf, ps, other]

doi 10.1103/PhysRevE.84.066601

Supersymmetric Quantum Mechanics and Solitons of the sine-Gordon and Nonlinear Schrödinger Equations

Authors: Andrew Koller, Maxim Olshanii

Abstract: We present a case demonstrating the connection between supersymmetric quantum mechanics (SUSY--QM), reflectionless scattering, and soliton solutions of integrable partial differential equations. We show that the members of a class of reflectionless Hamiltonians, namely, Akulin's Hamiltonians, are connected via supersymmetric chains to a potential-free Hamiltonian, explaining their reflectionless nature. While the reflectionless property in question has been mentioned in the literature for over two decades, the enabling algebraic mechanism was previously unknown. Our results indicate that the multi-soliton solutions of the sine-Gordon and nonlinear Schrödinger equations can be systematically generated via the supersymmetric chains connecting Akulin's Hamiltonians. Our findings also explain a well-known but little-understood effect in laser physics: when a two-level atom, initially in the ground state, is subjected to a laser pulse of the form $V(t) = (n\hbar/τ)/\cosh(t/τ)$, with $n$ being an integer and $τ$ being the pulse duration, it remains in the ground state after the pulse has been applied, for {\it any} choice of the laser detuning.

Submitted 30 December, 2011; v1 submitted 13 December, 2010; originally announced December 2010.

Comments: minor improvements | 5 pages, 3 figures

Journal ref: Phys. Rev. E 84, 066601 (2011)

arXiv:1012.0601 [pdf, ps, other]

On the singular set of mean curvature flows with Neumann free boundary conditions

Authors: Amos N. Koeller

Abstract: We consider $n$-dimensional hypersurfaces flowing by mean curvature flow with Neumann free boundary conditions supported on a smooth support surface. We show that the Hausdorff $n$-measure of the singular set is zero. In fact, we consider two types of interaction between the support and flowing surfaces. In the case of weaker interaction, we need make no further assumptions than in the case without boundary to achieve our result. In the case of stronger interaction, we need only make the additional assumption that $H_Σ>0$, that is, that the support surface be mean convex. We go on, in this case, to show that the result is not, in general, true without the mean convexity assumption.

Submitted 19 December, 2010; v1 submitted 2 December, 2010; originally announced December 2010.

Comments: Revised version, typos removed. Argumentation added in section 5

MSC Class: 53C44

arXiv:0711.1108 [pdf, ps, other]

Evolution of convex lens-shaped networks under curve shortening flow

Authors: Oliver C. Schnürer, Abderrahim Azouani, Marc Georgi, Juliette Hell, Nihar Jangle, Amos Koeller, Tobias Marxen, Sandra Ritthaler, Mariel Sáez, Felix Schulze, Brian Smith

Abstract: We consider convex symmetric lens-shaped networks in R^2 that evolve under curve shortening flow. We show that the enclosed convex domain shrinks to a point in finite time. Furthermore, after appropriate rescaling the evolving networks converge to a self-similarly shrinking network, which we prove to be unique in an appropriate class. We also include a classification result for some self-similarly shrinking networks.

Submitted 7 November, 2007; originally announced November 2007.

Comments: 29 pages, 5 figures

MSC Class: 53C44; 35B40

arXiv:math/0608528 [pdf, ps, other]

Approximately j-dimensional Koch type sets are potentially minimal surfaces

Authors: Amos N. Koeller

Abstract: We investigate the approximate j-dimensionality of the singularity sets of minimal surfaces prescribed by Simon. This leads to the classification of 8 variations of approximately j-dimensional surfaces in terms of dimension and locally finite Hausdorff measure. We show that the singularity sets must either be well behaved or essentially purely unrectifiable. Examples of the unrectifiable sets that could occur are constructed and generalised. They are shown to be pseudo-fractal and related to the well-known Koch sets. Various representations of the measure and dimension of the sets are shown, as well as a fine balance in the spiralling nature of the sets.

Submitted 21 August, 2006; originally announced August 2006.

Comments: 155 pages, Section 1 of the PhD thesis of the author

MSC Class: 28A78;28A80;49Q20;53A10

Showing 1–44 of 44 results for author: Koller, A