-
Learning Program Behavioral Models from Synthesized Input-Output Pairs
Authors:
Tural Mammadov,
Dietrich Klakow,
Alexander Koller,
Andreas Zeller
Abstract:
We introduce Modelizer - a novel framework that, given a black-box program, learns a _model from its input/output behavior_ using _neural machine translation_. The resulting model _mocks_ the original program: Given an input, the model predicts the output that would have been produced by the program. However, the model is also _reversible_ - that is, the model can predict the input that would have…
▽ More
We introduce Modelizer - a novel framework that, given a black-box program, learns a _model from its input/output behavior_ using _neural machine translation_. The resulting model _mocks_ the original program: Given an input, the model predicts the output that would have been produced by the program. However, the model is also _reversible_ - that is, the model can predict the input that would have produced a given output. Finally, the model is _differentiable_ and can be efficiently restricted to predict only a certain aspect of the program behavior.
Modelizer uses _grammars_ to synthesize inputs and to parse the resulting outputs, allowing it to learn sequence-to-sequence associations between token streams. Other than input and output grammars, Modelizer only requires the ability to execute the program.
The resulting models are _small_, requiring fewer than 6.3 million parameters for languages such as Markdown or HTML; and they are _accurate_, achieving up to 95.4% accuracy and a BLEU score of 0.98 with standard error 0.04 in mocking real-world applications. We foresee several _applications_ of these models, especially as the output of the program can be any aspect of program behavior. Besides mocking and predicting program behavior, the model can also synthesize inputs that are likely to produce a particular behavior, such as failures or coverage.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations
Authors:
Matthias Lindemann,
Alexander Koller,
Ivan Titov
Abstract:
Models need appropriate inductive biases to effectively learn from small amounts of data and generalize systematically outside of the training distribution. While Transformers are highly versatile and powerful, they can still benefit from enhanced structural inductive biases for seq2seq tasks, especially those involving syntactic transformations, such as converting active to passive voice or seman…
▽ More
Models need appropriate inductive biases to effectively learn from small amounts of data and generalize systematically outside of the training distribution. While Transformers are highly versatile and powerful, they can still benefit from enhanced structural inductive biases for seq2seq tasks, especially those involving syntactic transformations, such as converting active to passive voice or semantic parsing. In this paper, we propose to strengthen the structural inductive bias of a Transformer by intermediate pre-training to perform synthetically generated syntactic transformations of dependency trees given a description of the transformation. Our experiments confirm that this helps with few-shot learning of syntactic tasks such as chunking, and also improves structural generalization for semantic parsing. Our analysis shows that the intermediate pre-training leads to attention heads that keep track of which syntactic transformation needs to be applied to which token, and that the model can leverage these attention heads on downstream tasks.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Scope-enhanced Compositional Semantic Parsing for DRT
Authors:
Xiulin Yang,
Jonas Groschwitz,
Alexander Koller,
Johan Bos
Abstract:
Discourse Representation Theory (DRT) distinguishes itself from other semantic representation frameworks by its ability to model complex semantic and discourse phenomena through structural nesting and variable binding. While seq2seq models hold the state of the art on DRT parsing, their accuracy degrades with the complexity of the sentence, and they sometimes struggle to produce well-formed DRT re…
▽ More
Discourse Representation Theory (DRT) distinguishes itself from other semantic representation frameworks by its ability to model complex semantic and discourse phenomena through structural nesting and variable binding. While seq2seq models hold the state of the art on DRT parsing, their accuracy degrades with the complexity of the sentence, and they sometimes struggle to produce well-formed DRT representations. We introduce the AMS parser, a compositional, neurosymbolic semantic parser for DRT. It rests on a novel mechanism for predicting quantifier scope. We show that the AMS parser reliably produces well-formed outputs and performs well on DRT parsing, especially on complex sentences.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Authors:
Anna Bavaresco,
Raffaella Bernardi,
Leonardo Bertolazzi,
Desmond Elliott,
Raquel Fernández,
Albert Gatt,
Esam Ghaleb,
Mario Giulianelli,
Michael Hanna,
Alexander Koller,
André F. T. Martins,
Philipp Mondorf,
Vera Neplenbroek,
Sandro Pezzelle,
Barbara Plank,
David Schlangen,
Alessandro Suglia,
Aditya K Surikuchi,
Ece Takmaz,
Alberto Testoni
Abstract:
There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are conducted with proprietary models, this also raises concerns over reproducibility. We provide JUDGE-BENCH, a collection of 20 NLP datasets with human anno…
▽ More
There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are conducted with proprietary models, this also raises concerns over reproducibility. We provide JUDGE-BENCH, a collection of 20 NLP datasets with human annotations, and comprehensively evaluate 11 current LLMs, covering both open-weight and proprietary models, for their ability to replicate the annotations. Our evaluations show that each LLM exhibits a large variance across datasets in its correlation to human judgments. We conclude that LLMs are not yet ready to systematically replace human judges in NLP.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Fine-grained Controllable Text Generation through In-context Learning with Feedback
Authors:
Sarubi Thillainathan,
Alexander Koller
Abstract:
We present a method for rewriting an input sentence to match specific values of nontrivial linguistic features, such as dependency depth. In contrast to earlier work, our method uses in-context learning rather than finetuning, making it applicable in use cases where data is sparse. We show that our model performs accurate rewrites and matches the state of the art on rewriting sentences to a specif…
▽ More
We present a method for rewriting an input sentence to match specific values of nontrivial linguistic features, such as dependency depth. In contrast to earlier work, our method uses in-context learning rather than finetuning, making it applicable in use cases where data is sparse. We show that our model performs accurate rewrites and matches the state of the art on rewriting sentences to a specified school grade level.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
A Dialogue Game for Eliciting Balanced Collaboration
Authors:
Isidora Jeknić,
David Schlangen,
Alexander Koller
Abstract:
Collaboration is an integral part of human dialogue. Typical task-oriented dialogue games assign asymmetric roles to the participants, which limits their ability to elicit naturalistic role-taking in collaboration and its negotiation. We present a novel and simple online setup that favors balanced collaboration: a two-player 2D object placement game in which the players must negotiate the goal sta…
▽ More
Collaboration is an integral part of human dialogue. Typical task-oriented dialogue games assign asymmetric roles to the participants, which limits their ability to elicit naturalistic role-taking in collaboration and its negotiation. We present a novel and simple online setup that favors balanced collaboration: a two-player 2D object placement game in which the players must negotiate the goal state themselves. We show empirically that human players exhibit a variety of role distributions, and that balanced collaboration improves task performance. We also present an LLM-based baseline agent which demonstrates that automatic playing of our game is an interesting challenge for artificial systems.
△ Less
Submitted 11 July, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Simple and effective data augmentation for compositional generalization
Authors:
Yuekun Yao,
Alexander Koller
Abstract:
Compositional generalization, the ability to predict complex meanings from training on simpler sentences, poses challenges for powerful pretrained seq2seq models. In this paper, we show that data augmentation methods that sample MRs and backtranslate them can be effective for compositional generalization, but only if we sample from the right distribution. Remarkably, sampling from a uniform distri…
▽ More
Compositional generalization, the ability to predict complex meanings from training on simpler sentences, poses challenges for powerful pretrained seq2seq models. In this paper, we show that data augmentation methods that sample MRs and backtranslate them can be effective for compositional generalization, but only if we sample from the right distribution. Remarkably, sampling from a uniform distribution performs almost as well as sampling from the test distribution, and greatly outperforms earlier methods that sampled from the training distribution. We further conduct experiments to investigate the reason why this happens and where the benefit of such data augmentation methods come from.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
AutoPlanBench: Automatically generating benchmarks for LLM planners from PDDL
Authors:
Katharina Stein,
Daniel Fišer,
Jörg Hoffmann,
Alexander Koller
Abstract:
LLMs are being increasingly used for planning-style tasks, but their capabilities for planning and reasoning are poorly understood. We present AutoPlanBench, a novel method for automatically converting planning benchmarks written in PDDL into textual descriptions and offer a benchmark dataset created with our method. We show that while the best LLM planners do well on some planning tasks, others r…
▽ More
LLMs are being increasingly used for planning-style tasks, but their capabilities for planning and reasoning are poorly understood. We present AutoPlanBench, a novel method for automatically converting planning benchmarks written in PDDL into textual descriptions and offer a benchmark dataset created with our method. We show that while the best LLM planners do well on some planning tasks, others remain out of reach of current methods.
△ Less
Submitted 9 February, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Predicting generalization performance with correctness discriminators
Authors:
Yuekun Yao,
Alexander Koller
Abstract:
The ability to predict an NLP model's accuracy on unseen, potentially out-of-distribution data is a prerequisite for trustworthiness. We present a novel model that establishes upper and lower bounds on the accuracy, without requiring gold labels for the unseen data. We achieve this by training a discriminator which predicts whether the output of a given sequence-to-sequence model is correct or not…
▽ More
The ability to predict an NLP model's accuracy on unseen, potentially out-of-distribution data is a prerequisite for trustworthiness. We present a novel model that establishes upper and lower bounds on the accuracy, without requiring gold labels for the unseen data. We achieve this by training a discriminator which predicts whether the output of a given sequence-to-sequence model is correct or not. We show across a variety of tagging, parsing, and semantic parsing tasks that the gold accuracy is reliably between the predicted upper and lower bounds, and that these bounds are remarkably close together.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
ADaPT: As-Needed Decomposition and Planning with Language Models
Authors:
Archiki Prasad,
Alexander Koller,
Mareike Hartmann,
Peter Clark,
Ashish Sabharwal,
Mohit Bansal,
Tushar Khot
Abstract:
Large Language Models (LLMs) are increasingly being used for interactive decision-making tasks requiring planning and adapting to the environment. Recent works employ LLMs-as-agents in broadly two ways: iteratively determining the next action (iterative executors) or generating plans and executing sub-tasks using LLMs (plan-and-execute). However, these methods struggle with task complexity, as the…
▽ More
Large Language Models (LLMs) are increasingly being used for interactive decision-making tasks requiring planning and adapting to the environment. Recent works employ LLMs-as-agents in broadly two ways: iteratively determining the next action (iterative executors) or generating plans and executing sub-tasks using LLMs (plan-and-execute). However, these methods struggle with task complexity, as the inability to execute any sub-task may lead to task failure. To address these shortcomings, we introduce As-Needed Decomposition and Planning for complex Tasks (ADaPT), an approach that explicitly plans and decomposes complex sub-tasks as-needed, i.e., when the LLM is unable to execute them. ADaPT recursively decomposes sub-tasks to adapt to both task complexity and LLM capability. Our results demonstrate that ADaPT substantially outperforms established strong baselines, achieving success rates up to 28.3% higher in ALFWorld, 27% in WebShop, and 33% in TextCraft -- a novel compositional dataset that we introduce. Through extensive analysis, we illustrate the importance of multilevel decomposition and establish that ADaPT dynamically adjusts to the capabilities of the executor LLM as well as to task complexity.
△ Less
Submitted 8 April, 2024; v1 submitted 8 November, 2023;
originally announced November 2023.
-
SLOG: A Structural Generalization Benchmark for Semantic Parsing
Authors:
Bingzhi Li,
Lucia Donatelli,
Alexander Koller,
Tal Linzen,
Yuekun Yao,
Najoung Kim
Abstract:
The goal of compositional generalization benchmarks is to evaluate how well models generalize to new complex linguistic expressions. Existing benchmarks often focus on lexical generalization, the interpretation of novel lexical items in syntactic structures familiar from training; structural generalization tasks, where a model needs to interpret syntactic structures that are themselves unfamiliar…
▽ More
The goal of compositional generalization benchmarks is to evaluate how well models generalize to new complex linguistic expressions. Existing benchmarks often focus on lexical generalization, the interpretation of novel lexical items in syntactic structures familiar from training; structural generalization tasks, where a model needs to interpret syntactic structures that are themselves unfamiliar from training, are often underrepresented, resulting in overly optimistic perceptions of how well models can generalize. We introduce SLOG, a semantic parsing dataset that extends COGS (Kim and Linzen, 2020) with 17 structural generalization cases. In our experiments, the generalization accuracy of Transformer models, including pretrained ones, only reaches 40.6%, while a structure-aware parser only achieves 70.8%. These results are far from the near-perfect accuracy existing models achieve on COGS, demonstrating the role of SLOG in foregrounding the large discrepancy between models' lexical and structural generalization capacities.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Closing the Curious Case of Neural Text Degeneration
Authors:
Matthew Finlayson,
John Hewitt,
Alexander Koller,
Swabha Swayamdipta,
Ashish Sabharwal
Abstract:
Despite their ubiquity in language generation, it remains unknown why truncation sampling heuristics like nucleus sampling are so effective. We provide a theoretical explanation for the effectiveness of the truncation sampling by proving that truncation methods that discard tokens below some probability threshold (the most common type of truncation) can guarantee that all sampled tokens have nonze…
▽ More
Despite their ubiquity in language generation, it remains unknown why truncation sampling heuristics like nucleus sampling are so effective. We provide a theoretical explanation for the effectiveness of the truncation sampling by proving that truncation methods that discard tokens below some probability threshold (the most common type of truncation) can guarantee that all sampled tokens have nonzero true probability. However, thresholds are a coarse heuristic, and necessarily discard some tokens with nonzero true probability as well. In pursuit of a more precise sampling strategy, we show that we can leverage a known source of model errors, the softmax bottleneck, to prove that certain tokens have nonzero true probability, without relying on a threshold. Based on our findings, we develop an experimental truncation strategy and the present pilot studies demonstrating the promise of this type of algorithm. Our evaluations show that our method outperforms its threshold-based counterparts under automatic and human evaluation metrics for low-entropy (i.e., close to greedy) open-ended text generation. Our theoretical findings and pilot experiments provide both insight into why truncation sampling works, and make progress toward more expressive sampling algorithms that better surface the generative capabilities of large language models.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation
Authors:
Matthias Lindemann,
Alexander Koller,
Ivan Titov
Abstract:
Strong inductive biases enable learning from little data and help generalization outside of the training distribution. Popular neural architectures such as Transformers lack strong structural inductive biases for seq2seq NLP tasks on their own. Consequently, they struggle with systematic generalization beyond the training distribution, e.g. with extrapolating to longer inputs, even when pre-traine…
▽ More
Strong inductive biases enable learning from little data and help generalization outside of the training distribution. Popular neural architectures such as Transformers lack strong structural inductive biases for seq2seq NLP tasks on their own. Consequently, they struggle with systematic generalization beyond the training distribution, e.g. with extrapolating to longer inputs, even when pre-trained on large amounts of text. We show how a structural inductive bias can be efficiently injected into a seq2seq model by pre-training it to simulate structural transformations on synthetic data. Specifically, we inject an inductive bias towards Finite State Transducers (FSTs) into a Transformer by pre-training it to simulate FSTs given their descriptions. Our experiments show that our method imparts the desired inductive bias, resulting in improved systematic generalization and better few-shot learning for FST-like tasks. Our analysis shows that fine-tuned models accurately capture the state dynamics of the unseen underlying FSTs, suggesting that the simulation process is internalized by the fine-tuned model.
△ Less
Submitted 10 July, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
The Hessian of surface tension characterises scaling limit of gradient models with non-convex energy
Authors:
Stefan Adams,
Andreas Koller
Abstract:
We study the scaling limit of statistical mechanics models with non-convex Hamiltonians that are gradient perturbations of Gaussian measures. Characterising features of our gradient models are the imposed boundary tilt and the surface tension (free energy) as a function of tilt. In the regime of low temperatures and bounded tilt, we prove the scaling limit for macroscopic functions on the torus, a…
▽ More
We study the scaling limit of statistical mechanics models with non-convex Hamiltonians that are gradient perturbations of Gaussian measures. Characterising features of our gradient models are the imposed boundary tilt and the surface tension (free energy) as a function of tilt. In the regime of low temperatures and bounded tilt, we prove the scaling limit for macroscopic functions on the torus, and we show that the limit is a continuum Gaussian Free Field with covariance (diffusion) matrix given as the Hessian of surface tension. Our proof of this longstanding conjecture complements recent studies in [Hil16], [ABKM].
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
Compositional Generalization without Trees using Multiset Tagging and Latent Permutations
Authors:
Matthias Lindemann,
Alexander Koller,
Ivan Titov
Abstract:
Seq2seq models have been shown to struggle with compositional generalization in semantic parsing, i.e. generalizing to unseen compositions of phenomena that the model handles correctly in isolation.
We phrase semantic parsing as a two-step process: we first tag each input token with a multiset of output tokens. Then we arrange the tokens into an output sequence using a new way of parameterizing…
▽ More
Seq2seq models have been shown to struggle with compositional generalization in semantic parsing, i.e. generalizing to unseen compositions of phenomena that the model handles correctly in isolation.
We phrase semantic parsing as a two-step process: we first tag each input token with a multiset of output tokens. Then we arrange the tokens into an output sequence using a new way of parameterizing and predicting permutations. We formulate predicting a permutation as solving a regularized linear program and we backpropagate through the solver. In contrast to prior work, our approach does not place a priori restrictions on possible permutations, making it very expressive.
Our model outperforms pretrained seq2seq models and prior work on realistic semantic parsing tasks that require generalization to longer examples. We also outperform non-tree-based models on structural generalization on the COGS benchmark. For the first time, we show that a model without an inductive bias provided by trees achieves high accuracy on generalization to deeper recursion.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
What's the Meaning of Superhuman Performance in Today's NLU?
Authors:
Simone Tedeschi,
Johan Bos,
Thierry Declerck,
Jan Hajic,
Daniel Hershcovich,
Eduard H. Hovy,
Alexander Koller,
Simon Krek,
Steven Schockaert,
Rico Sennrich,
Ekaterina Shutova,
Roberto Navigli
Abstract:
In the last five years, there has been a significant focus in Natural Language Processing (NLP) on develo** larger Pretrained Language Models (PLMs) and introducing benchmarks such as SuperGLUE and SQuAD to measure their abilities in language understanding, reasoning, and reading comprehension. These PLMs have achieved impressive results on these benchmarks, even surpassing human performance in…
▽ More
In the last five years, there has been a significant focus in Natural Language Processing (NLP) on develo** larger Pretrained Language Models (PLMs) and introducing benchmarks such as SuperGLUE and SQuAD to measure their abilities in language understanding, reasoning, and reading comprehension. These PLMs have achieved impressive results on these benchmarks, even surpassing human performance in some cases. This has led to claims of superhuman capabilities and the provocative idea that certain tasks have been solved. In this position paper, we take a critical look at these claims and ask whether PLMs truly have superhuman abilities and what the current benchmarks are really evaluating. We show that these benchmarks have serious limitations affecting the comparison between humans and PLMs and provide recommendations for fairer and more transparent benchmarks.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
We're Afraid Language Models Aren't Modeling Ambiguity
Authors:
Alisa Liu,
Zhaofeng Wu,
Julian Michael,
Alane Suhr,
Peter West,
Alexander Koller,
Swabha Swayamdipta,
Noah A. Smith,
Ye** Choi
Abstract:
Ambiguity is an intrinsic feature of natural language. Managing ambiguity is a key part of human language understanding, allowing us to anticipate misunderstanding as communicators and revise our interpretations as listeners. As language models (LMs) are increasingly employed as dialogue interfaces and writing aids, handling ambiguous language is critical to their success. We characterize ambiguit…
▽ More
Ambiguity is an intrinsic feature of natural language. Managing ambiguity is a key part of human language understanding, allowing us to anticipate misunderstanding as communicators and revise our interpretations as listeners. As language models (LMs) are increasingly employed as dialogue interfaces and writing aids, handling ambiguous language is critical to their success. We characterize ambiguity in a sentence by its effect on entailment relations with another sentence, and collect AmbiEnt, a linguist-annotated benchmark of 1,645 examples with diverse kinds of ambiguity. We design a suite of tests based on AmbiEnt, presenting the first evaluation of pretrained LMs to recognize ambiguity and disentangle possible meanings. We find that the task remains extremely challenging, including for GPT-4, whose generated disambiguations are considered correct only 32% of the time in human evaluation, compared to 90% for disambiguations in our dataset. Finally, to illustrate the value of ambiguity-sensitive tools, we show that a multilabel NLI model can flag political claims in the wild that are misleading due to ambiguity. We encourage the field to rediscover the importance of ambiguity for NLP.
△ Less
Submitted 20 October, 2023; v1 submitted 27 April, 2023;
originally announced April 2023.
-
Structural generalization is hard for sequence-to-sequence models
Authors:
Yuekun Yao,
Alexander Koller
Abstract:
Sequence-to-sequence (seq2seq) models have been successful across many NLP tasks, including ones that require predicting linguistic structure. However, recent work on compositional generalization has shown that seq2seq models achieve very low accuracy in generalizing to linguistic structures that were not seen in training. We present new evidence that this is a general limitation of seq2seq models…
▽ More
Sequence-to-sequence (seq2seq) models have been successful across many NLP tasks, including ones that require predicting linguistic structure. However, recent work on compositional generalization has shown that seq2seq models achieve very low accuracy in generalizing to linguistic structures that were not seen in training. We present new evidence that this is a general limitation of seq2seq models that is present not just in semantic parsing, but also in syntactic parsing and in text-to-text tasks, and that this limitation can often be overcome by neurosymbolic models that have linguistic knowledge built in. We further report on some experiments that give initial answers on the reasons for these limitations.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Compositional Generalisation with Structured Reordering and Fertility Layers
Authors:
Matthias Lindemann,
Alexander Koller,
Ivan Titov
Abstract:
Seq2seq models have been shown to struggle with compositional generalisation, i.e. generalising to new and potentially more complex structures than seen during training. Taking inspiration from grammar-based models that excel at compositional generalisation, we present a flexible end-to-end differentiable neural model that composes two structural operations: a fertility step, which we introduce in…
▽ More
Seq2seq models have been shown to struggle with compositional generalisation, i.e. generalising to new and potentially more complex structures than seen during training. Taking inspiration from grammar-based models that excel at compositional generalisation, we present a flexible end-to-end differentiable neural model that composes two structural operations: a fertility step, which we introduce in this work, and a reordering step based on previous work (Wang et al., 2021). To ensure differentiability, we use the expected value of each step. Our model outperforms seq2seq models by a wide margin on challenging compositional splits of realistic semantic parsing tasks that require generalisation to longer examples. It also compares favourably to other models targeting compositional generalisation.
△ Less
Submitted 15 February, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Compositional Generalization Requires Compositional Parsers
Authors:
Pia Weißenhorn,
Yuekun Yao,
Lucia Donatelli,
Alexander Koller
Abstract:
A rapidly growing body of research on compositional generalization investigates the ability of a semantic parser to dynamically recombine linguistic elements seen in training into unseen sequences. We present a systematic comparison of sequence-to-sequence models and models guided by compositional principles on the recent COGS corpus (Kim and Linzen, 2020). Though seq2seq models can perform well o…
▽ More
A rapidly growing body of research on compositional generalization investigates the ability of a semantic parser to dynamically recombine linguistic elements seen in training into unseen sequences. We present a systematic comparison of sequence-to-sequence models and models guided by compositional principles on the recent COGS corpus (Kim and Linzen, 2020). Though seq2seq models can perform well on lexical tasks, they perform with near-zero accuracy on structural generalization tasks that require novel syntactic structures; this holds true even when they are trained to predict syntax instead of semantics. In contrast, compositional models achieve near-perfect accuracy on structural generalization; we present new results confirming this from the AM parser (Groschwitz et al., 2021). Our findings show structural generalization is a key measure of compositional generalization and requires models that are aware of complex structure.
△ Less
Submitted 24 February, 2022;
originally announced February 2022.
-
Learning compositional structures for semantic graph parsing
Authors:
Jonas Groschwitz,
Meaghan Fowlie,
Alexander Koller
Abstract:
AM dependency parsing is a method for neural semantic graph parsing that exploits the principle of compositionality. While AM dependency parsers have been shown to be fast and accurate across several graphbanks, they require explicit annotations of the compositional tree structures for training. In the past, these were obtained using complex graphbank-specific heuristics written by experts. Here w…
▽ More
AM dependency parsing is a method for neural semantic graph parsing that exploits the principle of compositionality. While AM dependency parsers have been shown to be fast and accurate across several graphbanks, they require explicit annotations of the compositional tree structures for training. In the past, these were obtained using complex graphbank-specific heuristics written by experts. Here we show how they can instead be trained directly on the graphs with a neural latent-variable model, drastically reducing the amount and complexity of manual heuristics. We demonstrate that our model picks up on several linguistic phenomena on its own and achieves comparable accuracy to supervised training, greatly facilitating the use of AM dependency parsing for new sembanks.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
Generating Instructions at Different Levels of Abstraction
Authors:
Arne Köhn,
Julia Wichlacz,
Álvaro Torralba,
Daniel Höller,
Jörg Hoffmann,
Alexander Koller
Abstract:
When generating technical instructions, it is often convenient to describe complex objects in the world at different levels of abstraction. A novice user might need an object explained piece by piece, while for an expert, talking about the complex object (e.g. a wall or railing) directly may be more succinct and efficient. We show how to generate building instructions at different levels of abstra…
▽ More
When generating technical instructions, it is often convenient to describe complex objects in the world at different levels of abstraction. A novice user might need an object explained piece by piece, while for an expert, talking about the complex object (e.g. a wall or railing) directly may be more succinct and efficient. We show how to generate building instructions at different levels of abstraction in Minecraft. We introduce the use of hierarchical planning to this end, a method from AI planning which can capture the structure of complex objects neatly. A crowdsourcing evaluation shows that the choice of abstraction level matters to users, and that an abstraction strategy which balances low-level and high-level object descriptions compares favorably to ones which don't.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Fast semantic parsing with well-typedness guarantees
Authors:
Matthias Lindemann,
Jonas Groschwitz,
Alexander Koller
Abstract:
AM dependency parsing is a linguistically principled method for neural semantic parsing with high accuracy across multiple graphbanks. It relies on a type system that models semantic valency but makes existing parsers slow. We describe an A* parser and a transition-based parser for AM dependency parsing which guarantee well-typedness and improve parsing speed by up to 3 orders of magnitude, while…
▽ More
AM dependency parsing is a linguistically principled method for neural semantic parsing with high accuracy across multiple graphbanks. It relies on a type system that models semantic valency but makes existing parsers slow. We describe an A* parser and a transition-based parser for AM dependency parsing which guarantee well-typedness and improve parsing speed by up to 3 orders of magnitude, while maintaining or improving accuracy.
△ Less
Submitted 6 October, 2020; v1 submitted 15 September, 2020;
originally announced September 2020.
-
Normalizing Compositional Structures Across Graphbanks
Authors:
Lucia Donatelli,
Jonas Groschwitz,
Alexander Koller,
Matthias Lindemann,
Pia Weißenhorn
Abstract:
The emergence of a variety of graph-based meaning representations (MRs) has sparked an important conversation about how to adequately represent semantic structure. These MRs exhibit structural differences that reflect different theoretical and design considerations, presenting challenges to uniform linguistic analysis and cross-framework semantic parsing. Here, we ask the question of which design…
▽ More
The emergence of a variety of graph-based meaning representations (MRs) has sparked an important conversation about how to adequately represent semantic structure. These MRs exhibit structural differences that reflect different theoretical and design considerations, presenting challenges to uniform linguistic analysis and cross-framework semantic parsing. Here, we ask the question of which design differences between MRs are meaningful and semantically-rooted, and which are superficial. We present a methodology for normalizing discrepancies between MRs at the compositional level (Lindemann et al., 2019), finding that we can normalize the majority of divergent phenomena using linguistically-grounded rules. Our work significantly increases the match in compositional structure between MRs and improves multi-task learning (MTL) in a low-resource setting, demonstrating the usefulness of careful MR design analysis and comparison.
△ Less
Submitted 30 April, 2020; v1 submitted 29 April, 2020;
originally announced April 2020.
-
Semantic expressive capacity with bounded memory
Authors:
Antoine Venant,
Alexander Koller
Abstract:
We investigate the capacity of mechanisms for compositional semantic parsing to describe relations between sentences and semantic representations.
We prove that in order to represent certain relations, mechanisms which are syntactically projective must be able to remember an unbounded number of locations in the semantic representations, where nonprojective mechanisms need not.
This is the firs…
▽ More
We investigate the capacity of mechanisms for compositional semantic parsing to describe relations between sentences and semantic representations.
We prove that in order to represent certain relations, mechanisms which are syntactically projective must be able to remember an unbounded number of locations in the semantic representations, where nonprojective mechanisms need not.
This is the first result of this kind, and has consequences both for grammar-based and for neural systems.
△ Less
Submitted 27 June, 2019;
originally announced June 2019.
-
Compositional Semantic Parsing Across Graphbanks
Authors:
Matthias Lindemann,
Jonas Groschwitz,
Alexander Koller
Abstract:
Most semantic parsers that map sentences to graph-based meaning representations are hand-designed for specific graphbanks. We present a compositional neural semantic parser which achieves, for the first time, competitive accuracies across a diverse range of graphbanks. Incorporating BERT embeddings and multi-task learning improves the accuracy further, setting new states of the art on DM, PAS, PSD…
▽ More
Most semantic parsers that map sentences to graph-based meaning representations are hand-designed for specific graphbanks. We present a compositional neural semantic parser which achieves, for the first time, competitive accuracies across a diverse range of graphbanks. Incorporating BERT embeddings and multi-task learning improves the accuracy further, setting new states of the art on DM, PAS, PSD, AMR 2015 and EDS.
△ Less
Submitted 13 July, 2019; v1 submitted 27 June, 2019;
originally announced June 2019.
-
Generalized chart constraints for efficient PCFG and TAG parsing
Authors:
Stefan Grünewald,
Sophie Henning,
Alexander Koller
Abstract:
Chart constraints, which specify at which string positions a constituent may begin or end, have been shown to speed up chart parsers for PCFGs. We generalize chart constraints to more expressive grammar formalisms and describe a neural tagger which predicts chart constraints at very high precision. Our constraints accelerate both PCFG and TAG parsing, and combine effectively with other pruning tec…
▽ More
Chart constraints, which specify at which string positions a constituent may begin or end, have been shown to speed up chart parsers for PCFGs. We generalize chart constraints to more expressive grammar formalisms and describe a neural tagger which predicts chart constraints at very high precision. Our constraints accelerate both PCFG and TAG parsing, and combine effectively with other pruning techniques (coarse-to-fine and supertagging) for an overall speedup of two orders of magnitude, while improving accuracy.
△ Less
Submitted 27 June, 2018;
originally announced June 2018.
-
Discovering User Groups for Natural Language Generation
Authors:
Nikos Engonopoulos,
Christoph Teichmann,
Alexander Koller
Abstract:
We present a model which predicts how individual users of a dialog system understand and produce utterances based on user groups. In contrast to previous work, these user groups are not specified beforehand, but learned in training. We evaluate on two referring expression (RE) generation tasks; our experiments show that our model can identify user groups and learn how to most effectively talk to t…
▽ More
We present a model which predicts how individual users of a dialog system understand and produce utterances based on user groups. In contrast to previous work, these user groups are not specified beforehand, but learned in training. We evaluate on two referring expression (RE) generation tasks; our experiments show that our model can identify user groups and learn how to most effectively talk to them, and can dynamically assign unseen users to the correct groups as they interact with the system.
△ Less
Submitted 15 June, 2018;
originally announced June 2018.
-
AMR Dependency Parsing with a Typed Semantic Algebra
Authors:
Jonas Groschwitz,
Matthias Lindemann,
Meaghan Fowlie,
Mark Johnson,
Alexander Koller
Abstract:
We present a semantic parser for Abstract Meaning Representations which learns to parse strings into tree representations of the compositional structure of an AMR graph. This allows us to use standard neural techniques for supertagging and dependency tree parsing, constrained by a linguistically principled type system. We present two approximative decoding algorithms, which achieve state-of-the-ar…
▽ More
We present a semantic parser for Abstract Meaning Representations which learns to parse strings into tree representations of the compositional structure of an AMR graph. This allows us to use standard neural techniques for supertagging and dependency tree parsing, constrained by a linguistically principled type system. We present two approximative decoding algorithms, which achieve state-of-the-art accuracy and outperform strong baselines.
△ Less
Submitted 29 May, 2018;
originally announced May 2018.
-
Spin-orbit coupled fermions in an optical lattice clock
Authors:
S. Kolkowitz,
S. L. Bromley,
T. Bothwell,
M. L. Wall,
G. E. Marti,
A. P. Koller,
X. Zhang,
A. M. Rey,
J. Ye
Abstract:
Engineered spin-orbit coupling (SOC) in cold atom systems can aid in the study of novel synthetic materials and complex condensed matter phenomena. Despite great advances, alkali atom SOC systems are hindered by heating from spontaneous emission, which limits the observation of many-body effects, motivating research into potential alternatives. Here we demonstrate that SOC can be engineered to occ…
▽ More
Engineered spin-orbit coupling (SOC) in cold atom systems can aid in the study of novel synthetic materials and complex condensed matter phenomena. Despite great advances, alkali atom SOC systems are hindered by heating from spontaneous emission, which limits the observation of many-body effects, motivating research into potential alternatives. Here we demonstrate that SOC can be engineered to occur naturally in a one-dimensional fermionic 87Sr optical lattice clock (OLC). In contrast to previous SOC experiments, in this work the SOC is both generated and probed using a direct ultra-narrow optical clock transition between two electronic orbital states. We use clock spectroscopy to prepare lattice band populations, internal electronic states, and quasimomenta, as well as to produce SOC dynamics. The exceptionally long lifetime of the excited clock state (160 s) eliminates decoherence and atom loss from spontaneous emission at all relevant experimental timescales, allowing subsequent momentum- and spin-resolved in situ probing of the SOC band structure and eigenstates. We utilize these capabilities to study Bloch oscillations, spin-momentum locking, and Van Hove singularities in the transition density of states. Our results lay the groundwork for the use of OLCs to probe novel SOC phases of matter.
△ Less
Submitted 8 November, 2016; v1 submitted 12 August, 2016;
originally announced August 2016.
-
Dynamics of interacting fermions in spin-dependent potentials
Authors:
Andrew P. Koller,
Michael L. Wall,
Josh Mundinger,
Ana Maria Rey
Abstract:
Recent experiments with dilute trapped Fermi gases observed that weak interactions can drastically modify spin transport dynamics and give rise to robust collective effects including global demagnetization, macroscopic spin waves, spin segregation, and spin self-rephasing. In this work we develop a framework for studying the dynamics of weakly interacting fermionic gases following a spin-dependent…
▽ More
Recent experiments with dilute trapped Fermi gases observed that weak interactions can drastically modify spin transport dynamics and give rise to robust collective effects including global demagnetization, macroscopic spin waves, spin segregation, and spin self-rephasing. In this work we develop a framework for studying the dynamics of weakly interacting fermionic gases following a spin-dependent change of the trap** potential which illuminates the interplay between spin, motion, Fermi statistics, and interactions. The key idea is the projection of the state of the system onto a set of lattice spin models defined on the single-particle mode space. Collective phenomena, including the global spreading of quantum correlations in real space, arise as a consequence of the long-ranged character of the spin model couplings. This approach achieves good agreement with prior measurements and suggests a number of directions for future experiments.
△ Less
Submitted 7 December, 2016; v1 submitted 5 January, 2016;
originally announced January 2016.
-
Synthetic spin-orbit coupling in an optical lattice clock
Authors:
Michael L. Wall,
Andrew P. Koller,
Shuming Li,
Xibo Zhang,
Nigel R. Cooper,
Jun Ye,
Ana Maria Rey
Abstract:
We propose the use of optical lattice clocks operated with fermionic alkaline-earth-atoms to study spin-orbit coupling (SOC) in interacting many-body systems. The SOC emerges naturally during the clock interrogation when atoms are allowed to tunnel and accumulate a phase set by the ratio of the "magic" lattice wavelength to the clock transition wavelength. We demonstrate how standard protocols suc…
▽ More
We propose the use of optical lattice clocks operated with fermionic alkaline-earth-atoms to study spin-orbit coupling (SOC) in interacting many-body systems. The SOC emerges naturally during the clock interrogation when atoms are allowed to tunnel and accumulate a phase set by the ratio of the "magic" lattice wavelength to the clock transition wavelength. We demonstrate how standard protocols such as Rabi and Ramsey spectroscopy, that take advantage of the sub-Hertz resolution of state-of-the-art clock lasers, can perform momentum-resolved band tomography and determine SOC-induced $s$-wave collisions in nuclear spin polarized fermions. By adding a second counter-propagating clock beam a sliding superlattice can be implemented and used for controlled atom transport and as a probe of $p$ and $s$-wave interactions. The proposed spectroscopic probes provide clean and well-resolved signatures at current clock operating temperatures.
△ Less
Submitted 19 September, 2015;
originally announced September 2015.
-
Demagnetization dynamics of non-interacting trapped fermions
Authors:
Andrew P. Koller,
Joshua Mundinger,
Michael L. Wall,
Ana Maria Rey
Abstract:
Motivated by several experimental efforts to understand spin diffusion and transport in ultracold fermionic gases, we study the spin dynamics of initially spin-polarized ensembles of harmonically trapped non-interacting spin-1/2 fermionic atoms, subjected to a magnetic field gradient. We obtain simple analytic expressions for spin observables in the presence of both constant and linear magnetic fi…
▽ More
Motivated by several experimental efforts to understand spin diffusion and transport in ultracold fermionic gases, we study the spin dynamics of initially spin-polarized ensembles of harmonically trapped non-interacting spin-1/2 fermionic atoms, subjected to a magnetic field gradient. We obtain simple analytic expressions for spin observables in the presence of both constant and linear magnetic field gradients, with and without a spin-echo pulse, and at zero and finite temperatures. The analysis shows the relevance of spin-motional coupling in the non-interacting regime where the demagnetization decay rate at short times can be faster than the experimentally measured rates in the strongly interacting regime under similar trap** conditions. Our calculations also show that particle motion limits the ability of a spin-echo pulse to remove the effect of magnetic field inhomogeneity, and that a spin-echo pulse can instead lead to an increased decay of magnetization at times comparable to the trap** period.
△ Less
Submitted 8 June, 2015;
originally announced June 2015.
-
Realizing Exactly Solvable SU(N) Magnets with Thermal Atoms
Authors:
Michael E. Beverland,
Gorjan Alagic,
Michael J. Martin,
Andrew P. Koller,
Ana M. Rey,
Alexey V. Gorshkov
Abstract:
We show that $n$ thermal fermionic alkaline-earth atoms in a flat-bottom trap allow one to robustly implement a spin model displaying two symmetries: the $S_n$ symmetry that permutes atoms occupying different vibrational levels of the trap and the SU($N$) symmetry associated with $N$ nuclear spin states. The high symmetry makes the model exactly solvable, which, in turn, enables the analytic study…
▽ More
We show that $n$ thermal fermionic alkaline-earth atoms in a flat-bottom trap allow one to robustly implement a spin model displaying two symmetries: the $S_n$ symmetry that permutes atoms occupying different vibrational levels of the trap and the SU($N$) symmetry associated with $N$ nuclear spin states. The high symmetry makes the model exactly solvable, which, in turn, enables the analytic study of dynamical processes such as spin diffusion in this SU($N$) system. We also show how to use this system to generate entangled states that allow for Heisenberg-limited metrology. This highly symmetric spin model should be experimentally realizable even when the vibrational levels are occupied according to a high-temperature thermal or an arbitrary non-thermal distribution.
△ Less
Submitted 5 August, 2016; v1 submitted 10 September, 2014;
originally announced September 2014.
-
Beyond the Spin Model Approximation for Ramsey Spectroscopy
Authors:
A. P. Koller,
M. Beverland,
A. V. Gorshkov,
A. M. Rey
Abstract:
Ramsey spectroscopy has become a powerful technique for probing non-equilibrium dynamics of internal (pseudospin) degrees of freedom of interacting systems. In many theoretical treatments, the key to understanding the dynamics has been to assume the external (motional) degrees of freedom are decoupled from the pseudospin degrees of freedom. Determining the validity of this approximation -- known a…
▽ More
Ramsey spectroscopy has become a powerful technique for probing non-equilibrium dynamics of internal (pseudospin) degrees of freedom of interacting systems. In many theoretical treatments, the key to understanding the dynamics has been to assume the external (motional) degrees of freedom are decoupled from the pseudospin degrees of freedom. Determining the validity of this approximation -- known as the spin model approximation -- is complicated, and has not been addressed in detail. Here we shed light in this direction by calculating Ramsey dynamics exactly for two interacting spin-1/2 particles in a harmonic trap. We focus on $s$-wave-interacting fermions in quasi-one and two-dimensional geometries. We find that in 1D the spin model assumption works well over a wide range of experimentally-relevant conditions, but can fail at time scales longer than those set by the mean interaction energy. Surprisingly, in 2D a modified version of the spin model is exact to first order in the interaction strength. This analysis is important for a correct interpretation of Ramsey spectroscopy and has broad applications ranging from precision measurements to quantum information and to fundamental probes of many-body systems.
△ Less
Submitted 17 April, 2014; v1 submitted 3 December, 2013;
originally announced December 2013.
-
Quenching to unitarity: Quantum dynamics in a 3D Bose gas
Authors:
A. G. Sykes,
J. P. Corson,
J. P. D'Incao,
A. P. Koller,
C. H. Greene,
A. M. Rey,
K. R. A. Hazzard,
J. L. Bohn
Abstract:
We study the dynamics of a dilute Bose gas at zero temperature following a sudden quench of the scattering length from a noninteracting Bose condensate to unitarity (infinite scattering length). We apply three complementary approaches to understand the momentum distribution and loss rates. First, using a time-dependent variational ansatz for the many-body state, we calculate the dynamics of the mo…
▽ More
We study the dynamics of a dilute Bose gas at zero temperature following a sudden quench of the scattering length from a noninteracting Bose condensate to unitarity (infinite scattering length). We apply three complementary approaches to understand the momentum distribution and loss rates. First, using a time-dependent variational ansatz for the many-body state, we calculate the dynamics of the momentum distribution. Second, we demonstrate that, at short times and large momenta compared to those set by the density, the physics can be well understood within a simple, analytic two-body model. We derive a quantitative prediction for the evolution of Tan's contact, which increases linearly at short times. We also study the three-body losses at finite densities. Consistent with experiments, we observe lifetimes which are long compared to the dynamics of large momentum modes.
△ Less
Submitted 3 September, 2013;
originally announced September 2013.
-
Emergence of Reflectionless Scattering from Linearizations of Integrable PDEs around Solitons
Authors:
Andrew Koller,
Zaijong Hwang,
Maxim Olshanii
Abstract:
We present four examples of integrable partial differential equations (PDEs) of mathematical physics that---when linearized around a stationary soliton---exhibit scattering without reflection at {\it all} energies. Starting from the most well-known and the most empirically relevant phenomenon of the transparency of one-dimensional bright bosonic solitons to Bogoliubov excitations, we proceed to th…
▽ More
We present four examples of integrable partial differential equations (PDEs) of mathematical physics that---when linearized around a stationary soliton---exhibit scattering without reflection at {\it all} energies. Starting from the most well-known and the most empirically relevant phenomenon of the transparency of one-dimensional bright bosonic solitons to Bogoliubov excitations, we proceed to the sine-Gordon, Korteweg-de Vries, and Liouville's equation whose stationary solitons also support our assertion. The proposed connection between integrability and reflectionless scattering seems to span at least two distinct paradigms of integrability: S-integrability in the first three cases, and C-integrability in the last one. We argue that the transparency of linearized integrable PDEs is necessary to ensure that they can support the transparency of stationary solitons in the original integrable PDEs. As contrasting cases, the analysis is further extended to cover two non-integrable systems: a sawtooth-Gordon and a $φ^4$ model.
△ Less
Submitted 27 November, 2014; v1 submitted 3 June, 2013;
originally announced June 2013.
-
Minkowski and packing Dimension comparisons for sets with Reifenberg properties
Authors:
Amos N. Koeller
Abstract:
In Koeller \cite{koerprops} the twelve variants of the Reifenberg properties known to be instrumental in the theory of minimal surfaces were classified with respect to various Hausdorff measure based measure theoretic properties. The classification lead to the consideration of fine geometric properties and a connection to fractal geometry. The current work develops this connection and extends the…
▽ More
In Koeller \cite{koerprops} the twelve variants of the Reifenberg properties known to be instrumental in the theory of minimal surfaces were classified with respect to various Hausdorff measure based measure theoretic properties. The classification lead to the consideration of fine geometric properties and a connection to fractal geometry. The current work develops this connection and extends the classification to consider Minkowski-dimension, packing dimension, measure, and rectifiability, and the equality of packing and Hausdorff measures with interesting results.
△ Less
Submitted 18 January, 2011;
originally announced January 2011.
-
Outer measure preserving ergodic transformations generate the Carathéodory definition of measurable sets
Authors:
Amos N. Koeller
Abstract:
It is known that there are specific examples of ergodic transformations on measure spaces for which the calculation of the outer measure of transformation invariant sets leads to a condition closely resembling Carathéodory's condition for sets to be measurable. It is then natural to ask what functions are capable of `generating', that is leading to, the Carathéodory definition in the same way. The…
▽ More
It is known that there are specific examples of ergodic transformations on measure spaces for which the calculation of the outer measure of transformation invariant sets leads to a condition closely resembling Carathéodory's condition for sets to be measurable. It is then natural to ask what functions are capable of `generating', that is leading to, the Carathéodory definition in the same way. The present work answers this question by showing that the property of generating Carathéodory's definition holds for the general class of outer measure preserving ergodic transformations on measure spaces. We further show that the previously found specific examples of functions generating Carathéodory's definition fall into this family of transformations.
△ Less
Submitted 7 January, 2011;
originally announced January 2011.
-
A classification of Reifenberg properties
Authors:
Amos N. Koeller
Abstract:
We define twelve variants of a Reifenberg's affine approximation property, which are known to be connected with the singular sets of minimal surfaces. With this motivation we investigate the regularity of the sets possessing these. We classify the properties with respect to whether $j$-dimensional Hausdorff dimension, locally finite $j$-dimensional Hausdorff measure or countable $j$-rectifiability…
▽ More
We define twelve variants of a Reifenberg's affine approximation property, which are known to be connected with the singular sets of minimal surfaces. With this motivation we investigate the regularity of the sets possessing these. We classify the properties with respect to whether $j$-dimensional Hausdorff dimension, locally finite $j$-dimensional Hausdorff measure or countable $j$-rectifiability hold. In showing that varying levels of regularity hold for the differing properties, quasi-self-similar sets, interesting in their own right, are constructed as counter examples. These counter examples also admit a connection to number theory via the use of the normal number theorem. Additionally, the intriguing result that such complexity in the counter examples is actually a necessity is shown.
△ Less
Submitted 20 December, 2010;
originally announced December 2010.
-
Supersymmetric Quantum Mechanics and Solitons of the sine-Gordon and Nonlinear Schrödinger Equations
Authors:
Andrew Koller,
Maxim Olshanii
Abstract:
We present a case demonstrating the connection between supersymmetric quantum mechanics (SUSY--QM), reflectionless scattering, and soliton solutions of integrable partial differential equations. We show that the members of a class of reflectionless Hamiltonians, namely, Akulin's Hamiltonians, are connected via supersymmetric chains to a potential-free Hamiltonian, explaining their reflectionless n…
▽ More
We present a case demonstrating the connection between supersymmetric quantum mechanics (SUSY--QM), reflectionless scattering, and soliton solutions of integrable partial differential equations. We show that the members of a class of reflectionless Hamiltonians, namely, Akulin's Hamiltonians, are connected via supersymmetric chains to a potential-free Hamiltonian, explaining their reflectionless nature. While the reflectionless property in question has been mentioned in the literature for over two decades, the enabling algebraic mechanism was previously unknown. Our results indicate that the multi-solition solutions of the sine-Gordon and nonlinear Schrödinger equations can be systematically generated via the supersymmetric chains connecting Akulin's Hamiltonians. Our findings also explain a well-known but little-understood effect in laser physics: when a two-level atom, initially in the ground state, is subjected to a laser pulse of the form $V(t) = (n\hbar/τ)/\cosh(t/τ)$, with $n$ being an integer and $τ$ being the pulse duration, it remains in the ground state after the pulse has been applied, for {\it any} choice of the laser detuning.
△ Less
Submitted 30 December, 2011; v1 submitted 13 December, 2010;
originally announced December 2010.
-
On the singular set of mean curvature flows with Neumann free boundary conditions
Authors:
Amos N. Koeller
Abstract:
We consider $n$-dimensional hypersurfaces flowing by mean curvature flow with Neumann free boundary conditions supported on a smooth support surface. We show that the Hausdorff $n$-measure of the singular set is zero. In fact, we consider two types of interaction between the support and flowing surfaces. In the case of weaker interaction, we need make no further assumptions than in the case withou…
▽ More
We consider $n$-dimensional hypersurfaces flowing by mean curvature flow with Neumann free boundary conditions supported on a smooth support surface. We show that the Hausdorff $n$-measure of the singular set is zero. In fact, we consider two types of interaction between the support and flowing surfaces. In the case of weaker interaction, we need make no further assumptions than in the case without boundary to achieve our result. In the case of stronger interaction, we need only make the additional assumption that $H_Σ>0$, that is, that the support surface be mean convex. We go on, in this case, to show that the result is not, in general, true without the mean convexity assumption.
△ Less
Submitted 19 December, 2010; v1 submitted 2 December, 2010;
originally announced December 2010.
-
Evolution of convex lens-shaped networks under curve shortening flow
Authors:
Oliver C. Schnürer,
Abderrahim Azouani,
Marc Georgi,
Juliette Hell,
Nihar Jangle,
Amos Koeller,
Tobias Marxen,
Sandra Ritthaler,
Mariel Sáez,
Felix Schulze,
Brian Smith
Abstract:
We consider convex symmetric lens-shaped networks in R^2 that evolve under curve shortening flow. We show that the enclosed convex domain shrinks to a point in finite time. Furthermore, after appropriate rescaling the evolving networks converge to a self-similarly shrinking network, which we prove to be unique in an appropriate class. We also include a classification result for some self-similar…
▽ More
We consider convex symmetric lens-shaped networks in R^2 that evolve under curve shortening flow. We show that the enclosed convex domain shrinks to a point in finite time. Furthermore, after appropriate rescaling the evolving networks converge to a self-similarly shrinking network, which we prove to be unique in an appropriate class. We also include a classification result for some self-similarly shrinking networks.
△ Less
Submitted 7 November, 2007;
originally announced November 2007.
-
Approximately j-dimensional Koch type sets are potentially minimal surfaces
Authors:
Amos N. Koeller
Abstract:
We investigate the approximate j-dimensionality of the singularity sets of minimal surfaces prescribed by Simon. This leads to the clasification of 8 variations of approximately j-dimensional surfacs in terms of dimension and locally finite Hausdorff measure. We show that the singularity sets must either be well behaved or essentially purely unrectifiable.
Examples of the unrectifiable sets th…
▽ More
We investigate the approximate j-dimensionality of the singularity sets of minimal surfaces prescribed by Simon. This leads to the clasification of 8 variations of approximately j-dimensional surfacs in terms of dimension and locally finite Hausdorff measure. We show that the singularity sets must either be well behaved or essentially purely unrectifiable.
Examples of the unrectifiable sets that could occur are constructed and generalised. They are shown to be pseudo-fractal and related to the well known Koch Sets. Various representations of the measure and dimension of the sets are shown, as well as a fine balance in the spiralling nature of the sets.
△ Less
Submitted 21 August, 2006;
originally announced August 2006.