-
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration
Authors:
Shangbin Feng,
Taylor Sorensen,
Yuhan Liu,
Jillian Fisher,
Chan Young Park,
Yejin Choi,
Yulia Tsvetkov
Abstract:
While existing alignment paradigms have been integral in developing large language models (LLMs), LLMs often learn an averaged human preference and struggle to model diverse preferences across cultures, demographics, and communities. We propose Modular Pluralism, a modular framework based on multi-LLM collaboration for pluralistic alignment: it "plugs into" a base LLM a pool of smaller but specialized community LMs, where models collaborate in distinct modes to flexibly support three modes of pluralism: Overton, steerable, and distributional. Modular Pluralism is uniquely compatible with black-box LLMs and offers the modular control of adding new community LMs for previously underrepresented communities. We evaluate Modular Pluralism with six tasks and four datasets featuring questions/instructions with value-laden and perspective-informed responses. Extensive experiments demonstrate that Modular Pluralism advances the three pluralism objectives across six black-box and open-source LLMs. Further analysis reveals that LLMs are generally faithful to the inputs from smaller community LLMs, allowing seamless patching by adding a new community LM to better cover previously underrepresented communities.
Submitted 22 June, 2024;
originally announced June 2024.
-
Calculating Bayesian evidence for inflationary models using CONNECT
Authors:
Camilla T. G. Sørensen,
Steen Hannestad,
Andreas Nygaard,
Thomas Tram
Abstract:
Bayesian evidence is a standard method used for comparing the ability of different models to fit available data and is used extensively in cosmology. However, since the evidence calculation involves performing an integral of the likelihood function over the entire space of model parameters, this can be prohibitively expensive in terms of both CPU and time consumption. For example, in the simplest $Λ$CDM model and using CMB data from the Planck satellite, the dimensionality of the model space is over 30 (typically 6 cosmological parameters and 28 nuisance parameters). Even the simplest possible model requires $\mathcal{O}(10^6)$ calls to an Einstein--Boltzmann solver such as CLASS or CAMB and takes several days.
Here we present calculations of Bayesian evidence using the CONNECT framework to calculate cosmological observables. We demonstrate that we can achieve results comparable to those obtained using Einstein--Boltzmann solvers, but at a minute fraction of the computational cost. As a test case, we then go on to compute Bayesian evidence ratios for a selection of slow-roll inflationary models.
In the setup presented here, the total computation time is completely dominated by the likelihood function calculation which now becomes the main bottleneck for increasing computation speed.
Submitted 6 June, 2024;
originally announced June 2024.
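For readers unfamiliar with the quantity named in the abstract above, the standard textbook definition of Bayesian evidence can be written as follows. The symbols ($\mathcal{Z}$, $\mathcal{L}$, $\pi$, $\theta$, $D$, $M$) are notation assumed here for illustration and are not taken from the listing.

```latex
% Evidence of model M given data D: the likelihood integrated over the prior.
\mathcal{Z}(D \mid M) = \int \mathcal{L}(D \mid \theta, M)\, \pi(\theta \mid M)\, \mathrm{d}\theta

% Evidence ratio (Bayes factor) used to compare two models M_1 and M_2.
B_{12} = \frac{\mathcal{Z}(D \mid M_1)}{\mathcal{Z}(D \mid M_2)}
```

The integral runs over every model parameter, which is why a parameter space of more than 30 dimensions makes the evaluation expensive.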
-
CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting
Authors:
Huihan Li,
Liwei Jiang,
Jena D. Huang,
Hyunwoo Kim,
Sebastin Santy,
Taylor Sorensen,
Bill Yuchen Lin,
Nouha Dziri,
Xiang Ren,
Yejin Choi
Abstract:
As the utilization of large language models (LLMs) has proliferated worldwide, it is crucial for them to have adequate knowledge and fair representation for diverse global cultures. In this work, we uncover culture perceptions of three SOTA models on 110 countries and regions on 8 culture-related topics through culture-conditioned generations, and extract symbols from these generations that are associated with each culture by the LLM. We discover that culture-conditioned generations consist of linguistic "markers" that distinguish marginalized cultures from default cultures. We also discover that LLMs have an uneven degree of diversity in the culture symbols, and that cultures from different geographic regions have different presence in LLMs' culture-agnostic generation. Our findings promote further research in studying the knowledge and fairness of global culture perception in LLMs. Code and data can be found at: https://github.com/huihanlhh/Culture-Gen/
Submitted 26 April, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Observations on Large-Scale Attenuation Effects in a 26 GHz Urban Micro-Cell Environment
Authors:
Alejandro Ramírez-Arroyo,
Troels B. Sørensen,
Peter Beltoft,
Henrik Christiansen,
Juan F. Valenzuela-Valdés,
Preben Mogensen
Abstract:
This letter presents a measurement campaign carried out in an FR2 urban outdoor environment in a live experimental network deployment. The radio propagation analysis from a physical perspective at 26 GHz is essential for the correct deployment and dimensioning of future communication networks. This study summarises and evaluates some of the typical effects encountered in a communications scenario, such as penetration losses in a building, losses due to vegetation or the human body, and diffraction/scattering propagation around corners in a street canyon-like environment, given an FR2 live network.
Submitted 8 April, 2024;
originally announced April 2024.
-
A Roadmap to Pluralistic Alignment
Authors:
Taylor Sorensen,
Jared Moore,
Jillian Fisher,
Mitchell Gordon,
Niloofar Mireshghallah,
Christopher Michael Rytting,
Andre Ye,
Liwei Jiang,
Ximing Lu,
Nouha Dziri,
Tim Althoff,
Yejin Choi
Abstract:
With increased power and prevalence of AI systems, it is ever more critical that AI systems are designed to serve all, i.e., people with diverse values and perspectives. However, aligning models to serve pluralistic human values remains an open research question. In this piece, we propose a roadmap to pluralistic alignment, specifically using language models as a test bed. We identify and formalize three possible ways to define and operationalize pluralism in AI systems: 1) Overton pluralistic models that present a spectrum of reasonable responses; 2) Steerably pluralistic models that can steer to reflect certain perspectives; and 3) Distributionally pluralistic models that are well-calibrated to a given population in distribution. We also propose and formalize three possible classes of pluralistic benchmarks: 1) Multi-objective benchmarks, 2) Trade-off steerable benchmarks, which incentivize models to steer to arbitrary trade-offs, and 3) Jury-pluralistic benchmarks which explicitly model diverse human ratings. We use this framework to argue that current alignment techniques may be fundamentally limited for pluralistic AI; indeed, we highlight empirical evidence, both from our own experiments and from other work, that standard alignment procedures might reduce distributional pluralism in models, motivating the need for further research on pluralistic alignment.
Submitted 7 February, 2024;
originally announced February 2024.
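The notion of a distributionally pluralistic model in the abstract above (one that is well-calibrated to a given population in distribution) can be illustrated with a small sketch. The abstract does not name a metric; Jensen-Shannon divergence is an assumption chosen here for illustration, and the example distributions are hypothetical.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in nats for discrete distributions."""
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, and 0 only when p and q are identical."""
    mixture = [(a + b) / 2 for a, b in zip(p, q)]
    return 0.5 * kl_divergence(p, mixture) + 0.5 * kl_divergence(q, mixture)


# Hypothetical answer distributions over three options for one survey question.
population = [0.5, 0.3, 0.2]        # share of people choosing each option
model_spread = [0.45, 0.35, 0.20]   # model that spreads probability like the population
model_peaked = [0.98, 0.01, 0.01]   # model that collapses onto the majority answer

spread_gap = js_divergence(population, model_spread)
peaked_gap = js_divergence(population, model_peaked)
```

Under this assumed measure, a smaller divergence means the model's answer distribution sits closer to the population's, so `model_spread` would count as better calibrated than `model_peaked`.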
-
LeftoverLocals: Listening to LLM Responses Through Leaked GPU Local Memory
Authors:
Tyler Sorensen,
Heidy Khlaaf
Abstract:
This paper describes LeftoverLocals: a vulnerability that allows data recovery from GPU memory created by another process on Apple, Qualcomm, and AMD GPUs. LeftoverLocals impacts the security posture of GPU applications, with particular significance to LLMs and ML models that run on impacted GPUs. By recovering local memory, an optimized GPU memory region, we built a PoC where an attacker can listen into another user's interactive LLM session (e.g., llama.cpp) across process or container boundaries.
Submitted 29 January, 2024;
originally announced January 2024.
-
Interferometric Single-Shot Parity Measurement in an InAs-Al Hybrid Device
Authors:
Morteza Aghaee,
Alejandro Alcaraz Ramirez,
Zulfi Alam,
Rizwan Ali,
Mariusz Andrzejczuk,
Andrey Antipov,
Mikhail Astafev,
Amin Barzegar,
Bela Bauer,
Jonathan Becker,
Umesh Kumar Bhaskar,
Alex Bocharov,
Srini Boddapati,
David Bohn,
Jouri Bommer,
Leo Bourdet,
Arnaud Bousquet,
Samuel Boutin,
Lucas Casparis,
Benjamin James Chapman,
Sohail Chatoor,
Anna Wulff Christensen,
Cassandra Chua,
Patrick Codd,
William Cole
, et al. (137 additional authors not shown)
Abstract:
The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostructures with a gate-defined nanowire. The interferometer is formed by tunnel-coupling the proximitized nanowire to quantum dots. The nanowire causes a state-dependent shift of these quantum dots' quantum capacitance of up to 1 fF. Our quantum capacitance measurements show flux h/2e-periodic bimodality with a signal-to-noise ratio of 1 in 3.7 $μ$s at optimal flux values. From the time traces of the quantum capacitance measurements, we extract a dwell time in the two associated states that is longer than 1 ms at in-plane magnetic fields of approximately 2 T. These results are consistent with a measurement of the fermion parity encoded in a pair of Majorana zero modes that are separated by approximately 3 $μ$m and subjected to a low rate of poisoning by non-equilibrium quasiparticles. The large capacitance shift and long poisoning time enable a parity measurement error probability of 1%.
Submitted 2 April, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Authors:
Peter West,
Ronan Le Bras,
Taylor Sorensen,
Bill Yuchen Lin,
Liwei Jiang,
Ximing Lu,
Khyathi Chandu,
Jack Hessel,
Ashutosh Baheti,
Chandra Bhagavatula,
Yejin Choi
Abstract:
We present NovaCOMET, an open commonsense knowledge model, that combines the best aspects of knowledge and general task models. Compared to previous knowledge models, NovaCOMET allows open-format relations enabling direct application to reasoning tasks; compared to general task models like Flan-T5, it explicitly centers knowledge, enabling superior performance for commonsense reasoning.
NovaCOMET leverages the knowledge of opaque proprietary models to create an open knowledge pipeline. First, knowledge is symbolically distilled into NovATOMIC, a publicly-released discrete knowledge graph which can be audited, critiqued, and filtered. Next, we train NovaCOMET on NovATOMIC by fine-tuning an open-source pretrained model. NovaCOMET uses an open-format training objective, replacing the fixed relation sets of past knowledge models, enabling arbitrary structures within the data to serve as inputs or outputs.
The resulting generation model, optionally augmented with human annotation, matches or exceeds comparable open task models like Flan-T5 on a range of commonsense generation tasks. NovaCOMET serves as a counterexample to the contemporary focus on instruction tuning only, demonstrating a distinct advantage to explicitly modeling commonsense knowledge as well.
Submitted 10 December, 2023;
originally announced December 2023.
-
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
Authors:
Taylor Sorensen,
Liwei Jiang,
Jena Hwang,
Sydney Levine,
Valentina Pyatkin,
Peter West,
Nouha Dziri,
Ximing Lu,
Kavel Rao,
Chandra Bhagavatula,
Maarten Sap,
John Tasioulas,
Yejin Choi
Abstract:
Human values are crucial to human decision-making. Value pluralism is the view that multiple correct values may be held in tension with one another (e.g., when considering lying to a friend to protect their feelings, how does one balance honesty with friendship?). As statistical learners, AI systems fit to averages by default, washing out these potentially irreducible value conflicts. To improve AI systems to better reflect value pluralism, the first-order challenge is to explore the extent to which AI systems can model pluralistic human values, rights, and duties as well as their interaction.
We introduce ValuePrism, a large-scale dataset of 218k values, rights, and duties connected to 31k human-written situations. ValuePrism's contextualized values are generated by GPT-4 and deemed high-quality by human annotators 91% of the time. We conduct a large-scale study with annotators across diverse social and demographic backgrounds to try to understand whose values are represented.
With ValuePrism, we build Kaleido, an open, light-weight, and structured language-based multi-task model that generates, explains, and assesses the relevance and valence (i.e., support or oppose) of human values, rights, and duties within a specific context. Humans prefer the sets of values output by our system over the teacher GPT-4, finding them more accurate and with broader coverage. In addition, we demonstrate that Kaleido can help explain variability in human decision-making by outputting contrasting values. Finally, we show that Kaleido's representations transfer to other philosophical frameworks and datasets, confirming the benefit of an explicit, modular, and interpretable approach to value pluralism. We hope that our work will serve as a step to making more explicit the implicit values behind human decision-making and to steering AI systems to make decisions that are more in accordance with them.
Submitted 2 April, 2024; v1 submitted 1 September, 2023;
originally announced September 2023.
-
FR2 5G Networks for Industrial Scenarios: Experimental Characterization and Beam Management Procedures in Operational Conditions
Authors:
Alejandro Ramírez-Arroyo,
Melisa López,
Ignacio Rodríguez,
Troels B. Sørensen,
Samantha Caporal del Barrio,
Pablo Padilla,
Juan F. Valenzuela-Valdés,
Preben Mogensen
Abstract:
Industrial environments constitute a challenge in terms of radio propagation due to the presence of machinery and the mobility of the different agents, especially at mmWave bands. This paper presents an experimental evaluation of an FR2 5G network deployed in an operational factory scenario at 26 GHz. The experimental characterization, performed with autonomous mobile robots that self-navigate the industrial lab, leads to the analysis of the received power along the factory and the evaluation of reference path gain models. The proposed assessment deeply analyzes the physical layer of the communication network under operational conditions. Thus, two different network configurations are assessed by measuring the power received in the entire factory, providing a comparison between deployments. Additionally, beam management procedures, such as beam recovery, beam sweeping or beam switching, are analyzed since they are crucial in environments where mobile agents are involved. They aim for a zero interruption approach based on reliable communications. The results analysis shows that beam recovery procedures can switch to an alternative serving beam with power losses of less than 1.6 dB on average. Beam sweeping analysis demonstrates the prevalence of the direct component in Line-of-Sight conditions despite the strong scattering component and large-scale fading in the environment.
Submitted 8 May, 2024; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Towards Coding Social Science Datasets with Language Models
Authors:
Christopher Michael Rytting,
Taylor Sorensen,
Lisa Argyle,
Ethan Busby,
Nancy Fulda,
Joshua Gubler,
David Wingate
Abstract:
Researchers often rely on humans to code (label, annotate, etc.) large sets of texts. This kind of human coding forms an important part of social science research, yet the coding process is both resource intensive and highly variable from application to application. In some cases, efforts to automate this process have achieved human-level accuracies, but to achieve this, these attempts frequently rely on thousands of hand-labeled training examples, which makes them inapplicable to small-scale research studies and costly for large ones. Recent advances in a specific kind of artificial intelligence tool - language models (LMs) - provide a solution to this problem. Work in computer science makes it clear that LMs are able to classify text, without the cost (in financial terms and human effort) of alternative methods. To demonstrate the possibilities of LMs in this area of political science, we use GPT-3, one of the most advanced LMs, as a synthetic coder and compare it to human coders. We find that GPT-3 can match the performance of typical human coders and offers benefits over other machine learning methods of coding text. We find this across a variety of domains using very different coding procedures. This provides exciting evidence that language models can serve as a critical advance in the coding of open-ended texts in a variety of applications.
Submitted 3 June, 2023;
originally announced June 2023.
-
Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing
Authors:
Jaehun Jung,
Peter West,
Liwei Jiang,
Faeze Brahman,
Ximing Lu,
Jillian Fisher,
Taylor Sorensen,
Yejin Choi
Abstract:
We present Impossible Distillation, a novel framework for paraphrasing and sentence summarization, that distills a high-quality dataset and model from a low-quality teacher that itself cannot perform these tasks. Unlike prior works that rely on an extreme-scale teacher model (e.g., GPT3) or task-specific architecture, we hypothesize and verify the paraphrastic proximity intrinsic to pre-trained LMs (e.g., GPT2), where paraphrases occupy a proximal subspace in the LM distribution. By identifying and distilling generations from these subspaces, Impossible Distillation produces a high-quality dataset and model even from GPT2-scale LMs. We evaluate our method on multiple benchmarks spanning unconstrained / syntax-controlled paraphrase generation and sentence summarization. Our model with 770M parameters consistently outperforms strong baselines, including models distilled from ChatGPT, and sometimes, even ChatGPT itself. Also, we find that our distilled dataset from 1.5B LMs exhibits higher diversity and fidelity than up to 13 times larger datasets.
Submitted 5 April, 2024; v1 submitted 26 May, 2023;
originally announced May 2023.
-
Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models
Authors:
David Wingate,
Mohammad Shoeybi,
Taylor Sorensen
Abstract:
We explore the idea of compressing the prompts used to condition language models, and show that compressed prompts can retain a substantive amount of information about the original prompt. For severely compressed prompts, while fine-grained information is lost, abstract information and general sentiments can be retained with surprisingly few parameters, which can be useful in the context of decode-time algorithms for controllability and toxicity reduction. We explore contrastive conditioning to steer language model generation towards desirable text and away from undesirable text, and find that some complex prompts can be effectively compressed into a single token to guide generation. We also show that compressed prompts are largely compositional, and can be constructed such that they can be used to control independent aspects of generated text.
Submitted 6 October, 2022;
originally announced October 2022.
-
An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels
Authors:
Taylor Sorensen,
Joshua Robinson,
Christopher Michael Rytting,
Alexander Glenn Shaw,
Kyle Jeffrey Rogers,
Alexia Pauline Delorey,
Mahmoud Khalil,
Nancy Fulda,
David Wingate
Abstract:
Pre-trained language models derive substantial linguistic and factual knowledge from the massive corpora on which they are trained, and prompt engineering seeks to align these models to specific tasks. Unfortunately, existing prompt engineering methods require significant amounts of labeled data, access to model parameters, or both. We introduce a new method for selecting prompt templates without labeled examples and without direct access to the model. Specifically, over a set of candidate templates, we choose the template that maximizes the mutual information between the input and the corresponding model output. Across 8 datasets representing 7 distinct NLP tasks, we show that when a template has high mutual information, it also has high accuracy on the task. On the largest model, selecting prompts with our method gets 90% of the way from the average prompt accuracy to the best prompt accuracy and requires no ground truth labels.
Submitted 21 March, 2022;
originally announced March 2022.
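The selection rule described in the abstract above (choose the template that maximizes mutual information between input and model output) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: it assumes that for each template we already have the model's output distribution over a fixed set of answer options for each unlabeled input, and it uses the standard identity I(X; Y) = H(Y) - H(Y|X). The template names and probabilities are hypothetical.

```python
import math


def entropy(dist):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in dist if p > 0)


def mutual_information(output_dists):
    """Estimate I(X; Y) = H(Y) - H(Y|X).

    output_dists holds one distribution p(y | x_i) per unlabeled input x_i,
    all over the same answer options. Inputs are weighted uniformly.
    """
    n = len(output_dists)
    k = len(output_dists[0])
    # Marginal p(y): the average of the per-input output distributions.
    marginal = [sum(d[j] for d in output_dists) / n for j in range(k)]
    # Conditional entropy H(Y|X): the average per-input entropy.
    conditional = sum(entropy(d) for d in output_dists) / n
    return entropy(marginal) - conditional


def select_template(dists_by_template):
    """Return the template name with the highest estimated mutual information."""
    return max(dists_by_template, key=lambda name: mutual_information(dists_by_template[name]))


# Hypothetical output distributions over two answer options for three inputs.
dists_by_template = {
    "template_a": [[0.9, 0.1], [0.1, 0.9], [0.8, 0.2]],  # confident and varied
    "template_b": [[0.5, 0.5], [0.6, 0.4], [0.5, 0.5]],  # near-uniform everywhere
}
best = select_template(dists_by_template)
```

No labels appear anywhere in the computation: a template scores highly when the model is confident on each input while its answers still vary across inputs.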
-
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Authors:
Kaustubh D. Dhole,
Varun Gangal,
Sebastian Gehrmann,
Aadesh Gupta,
Zhenhao Li,
Saad Mahamood,
Abinaya Mahendiran,
Simon Mille,
Ashish Shrivastava,
Samson Tan,
Tongshuang Wu,
Jascha Sohl-Dickstein,
Jinho D. Choi,
Eduard Hovy,
Ondrej Dusek,
Sebastian Ruder,
Sajant Anand,
Nagender Aneja,
Rabin Banjade,
Lisa Barthe,
Hanna Behnke,
Ian Berlot-Attwell,
Connor Boyle,
Caroline Brun,
Marco Antonio Sobrevilla Cabezudo
, et al. (101 additional authors not shown)
Abstract:
Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data splits according to specific features). We describe the framework and an initial set of 117 transformations and 23 filters for a variety of natural language tasks. We demonstrate the efficacy of NL-Augmenter by using several of its transformations to analyze the robustness of popular natural language models. The infrastructure, datacards and robustness analysis results are available publicly on the NL-Augmenter repository (https://github.com/GEM-benchmark/NL-Augmenter).
Submitted 11 October, 2022; v1 submitted 5 December, 2021;
originally announced December 2021.
-
Signaling Design for Cooperative Resource Allocation and its Impact to Reliability
Authors:
Rasmus Liborius Bruun,
C. Santiago Morejón García,
Troels B. Sørensen,
Nuno K. Pratas,
Tatiana Kozlova Madsen,
Preben Mogensen
Abstract:
Decentralized cooperative resource allocation schemes for robotic swarms are essential to enable high reliability in high throughput data exchanges. These cooperative schemes require control signaling with the aim to avoid half-duplex problems at the receiver and mitigate interference. We propose two cooperative resource allocation schemes, device sequential and group scheduling, and introduce a control signaling design. We observe that failure in the reception of these control signals leads to non-cooperative behavior and to significant performance degradation. The causes of these failures are identified and specific countermeasures are proposed and evaluated. We compare the proposed resource allocation schemes against the NR sidelink mode 2 resource allocation and show that even though signaling has an important impact on the resource allocation performance, our proposed device sequential and group scheduling resource allocation schemes improve reliability by an order of magnitude compared to sidelink mode 2.
Submitted 15 September, 2022; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Specifying and Testing GPU Workgroup Progress Models
Authors:
Tyler Sorensen,
Lucas F. Salvador,
Harmit Raval,
Hugues Evrard,
John Wickerson,
Margaret Martonosi,
Alastair F. Donaldson
Abstract:
As GPU availability has increased and programming support has matured, a wider variety of applications are being ported to these platforms. Many parallel applications contain fine-grained synchronization idioms; as such, their correct execution depends on a degree of relative forward progress between threads (or thread groups). Unfortunately, many GPU programming specifications say almost nothing about relative forward progress guarantees between workgroups. Although prior work has proposed a spectrum of plausible progress models for GPUs, cross-vendor specifications have yet to commit to any model.
This work is a collection of tools experimental data to aid specification designers when considering forward progress guarantees in programming frameworks. As a foundation, we formalize a small parallel programming language that captures the essence of fine-grained synchronization. We then provide a means of formally specifying a progress model, and develop a termination oracle that decides whether a given program is guaranteed to eventually terminate with respect to a given progress model. Next, we formalize a constraint for concurrent programs that require relative forward progress to terminate. Using this constraint, we synthesize a large set of 483 progress litmus tests. Combined with the termination oracle, this allows us to determine the expected status of each litmus test -- i.e. whether it is guaranteed eventual termination -- under various progress models. We present a large experimental campaign running the litmus tests across 8 GPUs from 5 different vendors. Our results highlight that GPUs have significantly different termination behaviors under our test suite. Most notably, we find that Apple and ARM GPUs do not support the linear occupancy-bound model, an intuitive progress model defined by prior work and hypothesized to describe the workgroup schedulers of existing GPUs.
Submitted 13 September, 2021;
originally announced September 2021.
-
Age and helium content of the open cluster NGC 6791 from multiple eclipsing binary members. III. Constraints from a subgiant
Authors:
K. Brogaard,
F. Grundahl,
E. L. Sandquist,
D. Slumstrup,
M. L. Jensen,
J. B. Thomsen,
J. H. Jørgensen,
J. R. Larsen,
S. T. Bjørn,
C. T. G. Sørensen,
H. Bruntt,
T. Arentoft,
S. Frandsen,
J. Jessen-Hansen,
J. A. Orosz,
R. Mathieu,
A. Geller,
N. Ryde,
D. Stello,
S. Meibom,
I. Platais
Abstract:
Models of stellar structure and evolution can be constrained using accurate measurements of the parameters of eclipsing binary members of open clusters. Multiple binary stars provide the means to tighten the constraints and, in turn, to improve the precision and accuracy of the age estimate of the host cluster. In the previous two papers of this series, we have demonstrated the use of measurements of multiple eclipsing binaries in the old open cluster NGC 6791 to set tighter constraints on the properties of stellar models than was previously possible, thereby improving both the accuracy and precision of the cluster age. We identify and measure the properties of a non-eclipsing cluster member, V56, in NGC 6791 and demonstrate how this provides additional model constraints that support and strengthen our previous findings. We analyse multi-epoch spectra of V56 from FLAMES in conjunction with the existing photometry and measurements of eclipsing binaries in NGC 6791. The parameters of the V56 components are found to be $M_{\rm p}=1.103\pm 0.008 M_{\odot}$ and $M_{\rm s}=0.974\pm 0.007 M_{\odot}$, $R_{\rm p}=1.764\pm0.099 R_{\odot}$ and $R_{\rm s}=1.045\pm0.057 R_{\odot}$, $T_{\rm eff,p}=5447\pm125$ K and $T_{\rm eff,s}=5552\pm125$ K, and surface [Fe/H]=$+0.29\pm0.06$ assuming that they have the same abundance. The derived properties strengthen our previous best estimate of the cluster age of $8.3\pm0.3$ Gyr and the mass of stars on the lower red giant branch (RGB), which is $M_{\rm RGB} = 1.15\pm0.02M_{\odot}$ for NGC 6791. These numbers therefore continue to serve as verification points for other methods of age and mass measures, such as asteroseismology.
Submitted 29 April, 2021;
originally announced April 2021.
-
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images
Authors:
Yongwan Lim,
Asterios Toutios,
Yannick Bliesener,
Ye Tian,
Sajan Goud Lingala,
Colin Vaz,
Tanner Sorensen,
Miran Oh,
Sarah Harper,
Weiyi Chen,
Yoonjeong Lee,
Johannes Töger,
Mairym Lloréns Montesserin,
Caitlin Smith,
Bianca Godinez,
Louis Goldstein,
Dani Byrd,
Krishna S. Nayak,
Shrikanth S. Narayanan
Abstract:
Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is, however, limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically-relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 subjects performing linguistically motivated speech tasks, alongside the corresponding first-ever public domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each subject.
Submitted 15 February, 2021;
originally announced February 2021.
-
The MosaicSim Simulator (Full Technical Report)
Authors:
Opeoluwa Matthews,
Aninda Manocha,
Davide Giri,
Marcelo Orenes-Vera,
Esin Tureci,
Tyler Sorensen,
Tae Jun Ham,
Juan L. Aragón,
Luca P. Carloni,
Margaret Martonosi
Abstract:
As Moore's Law has slowed and Dennard Scaling has ended, architects are increasingly turning to heterogeneous parallelism and domain-specific hardware-software co-designs. These trends present new challenges for simulation-based performance assessments that are central to early-stage architectural exploration. Simulators must be lightweight to support rich heterogeneous combinations of general purpose cores and specialized processing units. They must also support agile exploration of hardware-software co-design, i.e. changes in the programming model, compiler, ISA, and specialized hardware.
To meet these challenges, we introduce MosaicSim, a lightweight, modular simulator for heterogeneous systems, offering accuracy and agility designed specifically for hardware-software co-design explorations. By integrating the LLVM toolchain, MosaicSim enables efficient modeling of instruction dependencies and flexible additions across the stack. Its modularity also allows the composition and integration of different hardware components. We first demonstrate that MosaicSim captures architectural bottlenecks in applications, and accurately models both scaling trends in a multicore setting and accelerator behavior. We then present two case-studies where MosaicSim enables straightforward design space explorations for emerging systems, i.e. data science application acceleration and heterogeneous parallel architectures.
Submitted 15 April, 2020;
originally announced April 2020.
-
Derivation of Fitts' law from the Task Dynamics model of speech production
Authors:
Tanner Sorensen,
Adam Lammert,
Louis Goldstein,
Shrikanth Narayanan
Abstract:
Fitts' law is a linear equation relating movement time to an index of movement difficulty. The recent finding that Fitts' law applies to voluntary movement of the vocal tract raises the question of whether the theory of speech production implies Fitts' law. The present letter establishes a theoretical connection between Fitts' law and the Task Dynamics model of speech production. We derive a variant of Fitts' law where the intercept and slope are functions of the parameters of the Task Dynamics model and the index of difficulty is a product logarithm, or Lambert W function, rather than a logarithm.
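For orientation, the classical form of Fitts' law referred to above can be written as follows. This is the standard textbook formulation, given here only as background; the abstract does not state the derived intercept, slope, or exact index of difficulty, so none of those are reproduced. The symbols $a$ (intercept), $b$ (slope), $d$ (movement distance), and $w$ (target width) are conventional choices and are not notation taken from the paper.

$$ MT = a + b \cdot ID, \qquad ID = \log_2\!\left(\frac{2d}{w}\right) $$

The product logarithm mentioned in the abstract is the Lambert W function, defined implicitly by $W(z)\,e^{W(z)} = z$.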
Submitted 17 March, 2020; v1 submitted 14 January, 2020;
originally announced January 2020.
-
Do Your Cores Play Nicely? A Portable Framework for Multi-core Interference Tuning and Analysis
Authors:
Dan Iorga,
Tyler Sorensen,
Alastair F. Donaldson
Abstract:
Multi-core architectures can be leveraged to allow independent processes to run in parallel. However, due to resources shared across cores, such as caches, distinct processes may interfere with one another, e.g. affecting execution time. Analysing the extent of this interference is difficult due to: (1) the diversity of modern architectures, which may contain different implementations of shared resources, and (2) the complex nature of modern processors, in which interference might arise due to subtle interactions. To address this, we propose a black-box auto-tuning approach that searches for processes that are effective at causing slowdowns for a program when executed in parallel. Such slowdowns provide lower bounds on worst-case execution time; an important metric in systems with real-time constraints.
Our approach considers a set of parameterised "enemy" processes and "victim" programs, each targeting a shared resource. The autotuner searches for enemy process parameters that are effective at causing slowdowns in the victim programs. The idea is that victim programs behave as a proxy for shared resource usage of arbitrary programs. We evaluate our approach on: 5 different chips; 3 resources (cache, memory bus, and main memory); and consider several search strategies and slowdown metrics. Using enemy processes tuned per chip, we evaluate the slowdowns on the autobench and coremark benchmark suites and show that our method is able to achieve slowdowns in 98% of benchmark/chip combinations and provide similar results to manually written enemy processes.
Submitted 13 September, 2018;
originally announced September 2018.
-
Pointwise estimates on derivatives of Coulombic wave functions and their electron densities
Authors:
Søren Fournais,
Thomas Østergaard Sørensen
Abstract:
We prove a priori bounds for all derivatives of non-relativistic Coulombic eigenfunctions, involving negative powers of the distance to the singularities of the many-body potential. We use these to derive bounds for all derivatives of the corresponding one-electron densities, involving negative powers of the distance from the nuclei. The results are both natural and optimal, as seen from the ground state of Hydrogen.
Submitted 9 March, 2018;
originally announced March 2018.
-
The Semantics of Transactions and Weak Memory in x86, Power, ARM, and C++
Authors:
Nathan Chong,
Tyler Sorensen,
John Wickerson
Abstract:
Weak memory models provide a complex, system-centric semantics for concurrent programs, while transactional memory (TM) provides a simpler, programmer-centric semantics. Both have been studied in detail, but their combined semantics is not well understood. This is problematic because such widely-used architectures and languages as x86, Power, and C++ all support TM, and all have weak memory models.
Our work aims to clarify the interplay between weak memory and TM by extending existing axiomatic weak memory models (x86, Power, ARMv8, and C++) with new rules for TM. Our formal models are backed by automated tooling that enables (1) the synthesis of tests for validating our models against existing implementations and (2) the model-checking of TM-related transformations, such as lock elision and compiling C++ transactions to hardware. A key finding is that a proposed TM extension to ARMv8 currently being considered within ARM Research is incompatible with lock elision without sacrificing portability or performance.
Submitted 16 April, 2018; v1 submitted 13 October, 2017;
originally announced October 2017.
-
Cooperative Kernels: GPU Multitasking for Blocking Algorithms (Extended Version)
Authors:
Tyler Sorensen,
Hugues Evrard,
Alastair F. Donaldson
Abstract:
There is growing interest in accelerating irregular data-parallel algorithms on GPUs. These algorithms are typically blocking, so they require fair scheduling. But GPU programming models (e.g. OpenCL) do not mandate fair scheduling, and GPU schedulers are unfair in practice. Current approaches avoid this issue by exploiting scheduling quirks of today's GPUs in a manner that does not allow the GPU to be shared with other workloads (such as graphics rendering tasks). We propose cooperative kernels, an extension to the traditional GPU programming model geared towards writing blocking algorithms. Workgroups of a cooperative kernel are fairly scheduled, and multitasking is supported via a small set of language extensions through which the kernel and scheduler cooperate. We describe a prototype implementation of a cooperative kernel framework implemented in OpenCL 2.0 and evaluate our approach by porting a set of blocking GPU applications to cooperative kernels and examining their performance under multitasking. Our prototype exploits no vendor-specific hardware, driver or compiler support, thus our results provide a lower bound on the efficiency with which cooperative kernels can be implemented in practice.
Submitted 6 July, 2017;
originally announced July 2017.
-
Coulomb potentials and Taylor expansions in Time-Dependent Density Functional Theory
Authors:
Søren Fournais,
Jonas Lampart,
Mathieu Lewin,
Thomas Østergaard Sørensen
Abstract:
We investigate when Taylor expansions can be used to prove the Runge-Gross Theorem, which is at the foundation of Time-Dependent Density Functional Theory (TDDFT). We start with a general analysis of the conditions for the Runge-Gross argument, especially the time-differentiability of the density. The latter should be questioned in the presence of singular (e.g. Coulomb) potentials. Then, we show that a singular potential in a one-body operator considerably decreases the class of time-dependent external potentials to which the original argument can be applied. A two-body singularity has an even stronger impact and an external potential is essentially incompatible with it. For the Coulomb interaction and all reasonable initial many-body states, the Taylor expansion only exists to a finite order, except for constant external potentials. Therefore, high-order Taylor expansions are not the right tool to study atoms and molecules in TDDFT.
Submitted 29 June, 2016; v1 submitted 7 March, 2016;
originally announced March 2016.
-
Computation of Stackelberg Equilibria of Finite Sequential Games
Authors:
Branislav Bosansky,
Simina Branzei,
Kristoffer Arnsfelt Hansen,
Peter Bro Miltersen,
Troels Bjerre Sorensen
Abstract:
The Stackelberg equilibrium solution concept describes optimal strategies to commit to: Player 1 (termed the leader) publicly commits to a strategy and Player 2 (termed the follower) plays a best response to this strategy (ties are broken in favor of the leader). We study Stackelberg equilibria in finite sequential games (or extensive-form games) and provide new exact algorithms, approximate algorithms, and hardness results for several classes of these sequential games.
Submitted 23 August, 2016; v1 submitted 28 July, 2015;
originally announced July 2015.
-
Bayesian linear mixed models using Stan: A tutorial for psychologists, linguists, and cognitive scientists
Authors:
Tanner Sorensen,
Shravan Vasishth
Abstract:
With the arrival of the R packages nlme and lme4, linear mixed models (LMMs) have come to be widely used in experimentally-driven areas like psychology, linguistics, and cognitive science. This tutorial provides a practical introduction to fitting LMMs in a Bayesian framework using the probabilistic programming language Stan. We choose Stan (rather than WinBUGS or JAGS) because it provides an elegant and scalable framework for fitting models in most of the standard applications of LMMs. We ease the reader into fitting increasingly complex LMMs, first using a two-condition repeated measures self-paced reading study, followed by a more complex $2\times 2$ repeated measures factorial design that can be generalized to much more complex designs.
Submitted 20 June, 2015;
originally announced June 2015.
-
Timeability of Extensive-Form Games
Authors:
Sune K. Jakobsen,
Troels B. Sørensen,
Vincent Conitzer
Abstract:
Extensive-form games constitute the standard representation scheme for games with a temporal component. But do all extensive-form games correspond to protocols that we can implement in the real world? We often rule out games with imperfect recall, which prescribe that an agent forget something that she knew before. In this paper, we show that even some games with perfect recall can be problematic to implement. Specifically, we show that if the agents have a sense of time passing (say, access to a clock), then some extensive-form games can no longer be implemented; no matter how we attempt to time the game, some information will leak to the agents that they are not supposed to have. We say such a game is not exactly timeable. We provide easy-to-check necessary and sufficient conditions for a game to be exactly timeable. Most of the technical depth of the paper concerns how to approximately time games, which we show can always be done, though it may require large amounts of time. Specifically, we show that for some games the time required to approximately implement the game grows as a power tower of height proportional to the number of players and with a parameter that measures the precision of the approximation at the top of the power tower. In practice, that makes the games untimeable. Besides the conceptual contribution to game theory, we believe our methodology can have applications to preventing information leakage in security protocols.
Submitted 11 February, 2015;
originally announced February 2015.
-
The complexity of approximating a trembling hand perfect equilibrium of a multi-player game in strategic form
Authors:
Kousha Etessami,
Kristoffer Arnsfelt Hansen,
Peter Bro Miltersen,
Troels Bjerre Sorensen
Abstract:
We consider the task of computing an approximation of a trembling hand perfect equilibrium for an n-player game in strategic form, n >= 3. We show that this task is complete for the complexity class FIXP_a. In particular, the task is polynomial time equivalent to the task of computing an approximation of a Nash equilibrium in strategic form games with three (or more) players.
Submitted 5 August, 2014;
originally announced August 2014.
-
Stability of an upwind Petrov Galerkin discretization of convection diffusion equations
Authors:
Snorre H. Christiansen,
Tore G. Halvorsen,
Torquil M. Sørensen
Abstract:
We study a numerical method for convection diffusion equations, in the regime of small viscosity. It can be described as an exponentially fitted conforming Petrov-Galerkin method. We identify norms for which we have both continuity and an inf-sup condition, which are uniform in mesh-width and viscosity, up to a logarithm, as long as the viscosity is smaller than the mesh-width or the crosswind diffusion is smaller than the streamline diffusion. The analysis allows for the formation of a boundary layer.
Submitted 22 February, 2016; v1 submitted 2 June, 2014;
originally announced June 2014.
-
Real analyticity of solutions to Schrödinger equations involving fractional Laplacians
Authors:
Anna Dall'Acqua,
Søren Fournais,
Thomas Østergaard Sørensen,
Edgardo Stockmeyer
Abstract:
We prove analyticity of solutions in $\mathbb{R}^{n}$, $n\ge1$, to certain nonlocal linear Schrödinger equations with analytic potentials.
Submitted 6 November, 2013; v1 submitted 18 September, 2012;
originally announced September 2012.
-
Approximate Well-supported Nash Equilibria below Two-thirds
Authors:
John Fearnley,
Paul W. Goldberg,
Rahul Savani,
Troels Bjerre Sørensen
Abstract:
In an epsilon-Nash equilibrium, a player can gain at most epsilon by changing his behaviour. Recent work has addressed the question of how best to compute epsilon-Nash equilibria, and for what values of epsilon a polynomial-time algorithm exists. An epsilon-well-supported Nash equilibrium (epsilon-WSNE) has the additional requirement that any strategy that is used with non-zero probability by a player must have payoff at most epsilon less than the best response. A recent algorithm of Kontogiannis and Spirakis shows how to compute a 2/3-WSNE in polynomial time, for bimatrix games. Here we introduce a new technique that leads to an improvement to the worst-case approximation guarantee.
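The well-supported condition described above can be checked directly for a given bimatrix game and strategy profile. The following is a minimal Python sketch of that check only, not the authors' algorithm for finding such a profile; the function name `is_epsilon_wsne`, the list-of-rows matrix layout, and the small numerical tolerance `tol` are assumptions made for illustration.

```python
def is_epsilon_wsne(R, C, x, y, eps, tol=1e-12):
    """Check the epsilon-well-supported Nash condition for a bimatrix game.

    R, C : payoff matrices (lists of rows) for the row and column player.
    x, y : mixed strategies of the row and column player.
    eps  : allowed shortfall from the best-response payoff.
    tol  : numerical slack (an assumption of this sketch).
    """
    m, n = len(R), len(R[0])

    # Expected payoff of every pure strategy against the opponent's mix.
    row_payoffs = [sum(R[i][j] * y[j] for j in range(n)) for i in range(m)]
    col_payoffs = [sum(C[i][j] * x[i] for i in range(m)) for j in range(n)]
    best_row, best_col = max(row_payoffs), max(col_payoffs)

    # Every strategy played with non-zero probability must pay at most
    # eps less than the best response.
    row_ok = all(row_payoffs[i] >= best_row - eps - tol
                 for i in range(m) if x[i] > 0)
    col_ok = all(col_payoffs[j] >= best_col - eps - tol
                 for j in range(n) if y[j] > 0)
    return row_ok and col_ok
```

For example, in matching pennies with payoffs in [0,1], the uniform profile passes with `eps = 0`, whereas a pure profile passes only once `eps` reaches 1.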
Submitted 2 December, 2014; v1 submitted 3 April, 2012;
originally announced April 2012.
-
Comment on "The Stochastic Nonlinear Schrödinger Equation in $H^{1}$"
Authors:
Torquil Macdonald Sørensen
Abstract:
The paper "The Stochastic Nonlinear Schrödinger Equation in $H^{1}$" \cite{debouard2003} gives an existence proof for a stochastic nonlinear Schrödinger equation with multiplicative noise. We point out two mistakes that draw the validity of the proof into question.
Submitted 25 February, 2012;
originally announced February 2012.
-
Levy process simulation by stochastic step functions
Authors:
Torquil Macdonald Sørensen,
Fred Espen Benth
Abstract:
We study a Monte Carlo algorithm for simulation of probability distributions based on stochastic step functions, and compare to the traditional Metropolis/Hastings method. Unlike the latter, the step function algorithm can produce an uncorrelated Markov chain. We apply this method to the simulation of Levy processes, for which simulation of uncorrelated jumps is essential.
We perform numerical tests consisting of simulation from probability distributions, as well as simulation of Levy process paths. The Levy processes include a jump-diffusion with a Gaussian Levy measure, as well as jump-diffusion approximations of the infinite activity NIG and CGMY processes.
To increase efficiency of the step function method, and to decrease correlations in the Metropolis/Hastings method, we introduce adaptive hybrid algorithms which employ uncorrelated draws from an adaptive discrete distribution defined on a space of subdivisions of the Levy measure space.
The nonzero correlations in Metropolis/Hastings simulations result in heavy tails for the Levy process distribution at any fixed time. This problem is eliminated in the step function approach. In each case of the Gaussian, NIG and CGMY processes, we compare the distribution at t=1 with exact results and note the superiority of the step function approach.
Submitted 11 October, 2011;
originally announced October 2011.
-
Simplicial gauge theory on spacetime
Authors:
Tore Gunnar Halvorsen,
Torquil Macdonald Sørensen
Abstract:
We define a discrete gauge-invariant Yang-Mills-Higgs action on spacetime simplicial meshes. The formulation is a generalization of classical lattice gauge theory, and we prove consistency of the action in the sense of approximation theory. In addition, we perform numerical tests of convergence towards exact continuum results for several choices of gauge fields in pure gauge theory.
Submitted 7 July, 2011;
originally announced July 2011.
-
Simplicial gauge theory and quantum gauge theory simulation
Authors:
Tore Gunnar Halvorsen,
Torquil Macdonald Sørensen
Abstract:
We propose a general formulation of simplicial lattice gauge theory inspired by the finite element method. Numerical tests of convergence towards continuum results are performed for several SU(2) gauge fields. Additionally, we perform simplicial Monte Carlo quantum gauge field simulations involving measurements of the action as well as differently sized Wilson loops as functions of $β$.
Submitted 7 July, 2011;
originally announced July 2011.
-
Real analyticity away from the nucleus of pseudorelativistic Hartree-Fock orbitals
Authors:
Anna Dall'Acqua,
Søren Fournais,
Thomas Østergaard Sørensen,
Edgardo Stockmeyer
Abstract:
We prove that the Hartree-Fock orbitals of pseudorelativistic atoms, that is, atoms where the kinetic energy of the electrons is given by the pseudorelativistic operator $\sqrt{-\Delta+1}-1$, are real analytic away from the origin. As a consequence, the quantum mechanical ground state of such atoms is never a Hartree-Fock state.
Our proof is inspired by the classical proof of analyticity by nested balls of Morrey and Nirenberg. However, the technique has to be adapted to take care of the non-local pseudodifferential operator, the singularity of the potential at the origin, and the non-linear terms in the equation.
Submitted 4 November, 2011; v1 submitted 25 March, 2011;
originally announced March 2011.
-
Path coalitional games
Authors:
Haris Aziz,
Troels Bjerre Sørensen
Abstract:
We present a general framework to model strategic aspects and stable and fair resource allocations in networks via variants and generalizations of path coalitional games. In these games, a coalition of edges or vertices is successful if it can enable an s-t path. We present polynomial-time algorithms to compute and verify least core payoffs of cost-based generalizations of path coalitional games and their duals, thereby settling a number of open problems. The least core payoffs of path coalitional games are completely characterized and a polynomial-time algorithm for computing the nucleolus of edge path coalitional games on undirected series-parallel graphs is presented.
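The success condition for a coalition stated above (a coalition wins if it can enable an s-t path) can be illustrated with a short reachability check. This is a minimal Python sketch for the edge version on an undirected graph only; the function name `coalition_wins` and the representation of a coalition as a list of vertex pairs are assumptions for illustration, and the sketch does not cover the cost-based generalizations, the least core, or the nucleolus computations the abstract refers to.

```python
from collections import deque


def coalition_wins(edges, s, t):
    """Return True if the undirected edge coalition `edges` contains an s-t path.

    edges : iterable of (u, v) pairs owned by the coalition.
    s, t  : source and target vertices.
    """
    # Build an adjacency map from the coalition's edges only.
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    # Breadth-first search from s, stopping as soon as t is reached.
    seen, queue = {s}, deque([s])
    while queue:
        u = queue.popleft()
        if u == t:
            return True
        for w in adj.get(u, ()):
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return False
```

In the simple (0/1) game, the value of a coalition is then 1 when `coalition_wins` returns True and 0 otherwise.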
Submitted 27 April, 2011; v1 submitted 16 March, 2011;
originally announced March 2011.
-
On the Approximation Performance of Fictitious Play in Finite Games
Authors:
Paul W. Goldberg,
Rahul Savani,
Troels Bjerre Sorensen,
Carmine Ventre
Abstract:
We study the performance of Fictitious Play, when used as a heuristic for finding an approximate Nash equilibrium of a 2-player game. We exhibit a class of 2-player games having payoffs in the range [0,1] that show that Fictitious Play fails to find a solution having an additive approximation guarantee significantly better than 1/2. Our construction shows that for n × n games, in the worst case both players may perpetually have mixed strategies whose payoffs fall short of the best response by an additive quantity 1/2 - O(1/n^(1-delta)) for arbitrarily small delta. We also show an essentially matching upper bound of 1/2 - O(1/n).
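For readers unfamiliar with the heuristic, the following is a minimal sketch of Fictitious Play on a bimatrix game, together with the additive approximation error that the abstract refers to. It is not taken from the paper: the function names, the choice of initial actions (both players start with action 0) and the tie-breaking rule (lowest index) are assumptions made for illustration only.

```python
def fictitious_play(A, B, rounds):
    """Run Fictitious Play on a bimatrix game.

    A[r][c] is the row player's payoff and B[r][c] the column player's payoff.
    Returns the empirical mixed strategies (x, y) after `rounds` rounds.
    """
    m, n = len(A), len(A[0])
    row_counts = [0] * m
    col_counts = [0] * n
    i, j = 0, 0  # assumed initial actions; the choice is arbitrary
    for _ in range(rounds):
        row_counts[i] += 1
        col_counts[j] += 1
        # Each player best-responds to the opponent's empirical action counts.
        row_payoffs = [sum(A[r][c] * col_counts[c] for c in range(n)) for r in range(m)]
        col_payoffs = [sum(B[r][c] * row_counts[r] for r in range(m)) for c in range(n)]
        # max() returns the first maximiser, so ties go to the lowest index.
        i = max(range(m), key=lambda r: row_payoffs[r])
        j = max(range(n), key=lambda c: col_payoffs[c])
    x = [k / rounds for k in row_counts]
    y = [k / rounds for k in col_counts]
    return x, y


def approximation_error(A, B, x, y):
    """Largest additive gain either player could obtain by switching to a best response."""
    m, n = len(A), len(A[0])
    row_value = sum(x[r] * A[r][c] * y[c] for r in range(m) for c in range(n))
    col_value = sum(x[r] * B[r][c] * y[c] for r in range(m) for c in range(n))
    best_row = max(sum(A[r][c] * y[c] for c in range(n)) for r in range(m))
    best_col = max(sum(B[r][c] * x[r] for r in range(m)) for c in range(n))
    return max(best_row - row_value, best_col - col_value)


if __name__ == "__main__":
    # Matching pennies rescaled to payoffs in [0, 1] (an illustrative game, not the paper's construction).
    A = [[1, 0], [0, 1]]
    B = [[0, 1], [1, 0]]
    x, y = fictitious_play(A, B, 10000)
    print(x, y, approximation_error(A, B, x, y))
```

On this small constant-sum example the empirical strategies approach the uniform mix and the error shrinks; the abstract's lower bound concerns a different, specially constructed class of n × n games on which the error stays close to 1/2.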
Submitted 19 March, 2011; v1 submitted 5 March, 2011;
originally announced March 2011.
-
Analytic structure of solutions to multiconfiguration equations
Authors:
Søren Fournais,
Maria Hoffmann-Ostenhof,
Thomas Hoffmann-Ostenhof,
Thomas Østergaard Sørensen
Abstract:
We study the regularity at the positions of the (fixed) nuclei of solutions to (non-relativistic) multiconfiguration equations (including Hartree-Fock) of Coulomb systems. We prove the following: Let {phi_1,...,phi_M} be any solution to the rank-M multiconfiguration equations for a molecule with L fixed nuclei at R_1,...,R_L in R^3. Then, for any j in {1,...,M} and k in {1,...,L}, there exists a neighbourhood U_{j,k} in R^3 of R_k, and functions phi^{(1)}_{j,k}, phi^{(2)}_{j,k}, real analytic in U_{j,k}, such that phi_j(x) = phi^{(1)}_{j,k}(x) + |x - R_k| phi^{(2)}_{j,k}(x), x in U_{j,k}. A similar result holds for the corresponding electron density. The proof uses the Kustaanheimo-Stiefel transformation, as applied earlier by the authors to the study of the eigenfunctions of the Schrödinger operator of atoms and molecules near two-particle coalescence points.
Submitted 8 December, 2008;
originally announced December 2008.
-
The electron densities of pseudorelativistic eigenfunctions are smooth away from the nuclei
Authors:
Søren Fournais,
Thomas Østergaard Sørensen
Abstract:
We consider a pseudorelativistic model of atoms and molecules, where the kinetic energy of the electrons is given by $\sqrt{p^2+m^2}-m$. In this model the eigenfunctions are generally not even bounded, however, we prove that the corresponding one-electron densities are smooth away from the nuclei.
Submitted 27 November, 2008;
originally announced November 2008.
-
The relativistic Scott correction for atoms and molecules
Authors:
Jan Philip Solovej,
Thomas Østergaard Sørensen,
Wolfgang L. Spitzer
Abstract:
We prove the first correction to the leading Thomas-Fermi energy for the ground state energy of atoms and molecules in a model where the kinetic energy of the electrons is treated relativistically. The leading Thomas-Fermi energy, established in [Sorensen], as well as the correction given here are of semi-classical nature. Our result on atoms and molecules is proved from a general semi-classical estimate for relativistic operators with potentials with Coulomb-like singularities. This semi-classical estimate is obtained using the coherent state calculus introduced in [Solovej-Spitzer]. The paper contains a unified treatment of the relativistic as well as the non-relativistic case.
Submitted 15 August, 2008;
originally announced August 2008.
-
Approximability and parameterized complexity of minmax values
Authors:
Kristoffer Arnsfelt Hansen,
Thomas Dueholm Hansen,
Peter Bro Miltersen,
Troels Bjerre Sørensen
Abstract:
We consider approximating the minmax value of a multi-player game in strategic form. Tightening recent bounds by Borgs et al., we observe that approximating the value with a precision of epsilon log n digits (for any constant epsilon > 0) is NP-hard, where n is the size of the game. On the other hand, approximating the value with a precision of c log log n digits (for any constant c >= 1) can be done in quasi-polynomial time. We consider the parameterized complexity of the problem, with the parameter being the number of pure strategies k of the player for which the minmax value is computed. We show that if there are three players, k=2 and there are only two possible rational payoffs, the minmax value is a rational number and can be computed exactly in linear time. In the general case, we show that the value can be approximated with any polynomial number of digits of accuracy in time n^(O(k)). On the other hand, we show that minmax value approximation is W[1]-hard and hence not likely to be fixed parameter tractable. Concretely, we show that if k-CLIQUE requires time n^(Omega(k)) then so does minmax value computation.
Submitted 26 June, 2008;
originally announced June 2008.
-
Analytic structure of many-body Coulombic wave functions
Authors:
Søren Fournais,
Maria Hoffmann-Ostenhof,
Thomas Hoffmann-Ostenhof,
Thomas Østergaard Sørensen
Abstract:
We investigate the analytic structure of solutions of non-relativistic Schrödinger equations describing Coulombic many-particle systems. We prove the following: Let psi(x) with x=(x_1,...,x_N) in R^{3N} denote an N-electron wavefunction of such a system with one nucleus fixed at the origin. Then in a neighbourhood of a coalescence point, for which x_1=0 and the other electron coordinates do not coincide, and differ from 0, psi can be represented locally as psi(x) = psi^(1)(x) + |x_1|psi^(2)(x) with psi^(1), psi^(2) real analytic. A similar representation holds near two-electron coalescence points. The Kustaanheimo-Stiefel transform and analytic hypoellipticity play an essential role in the proof.
Submitted 5 June, 2008;
originally announced June 2008.
-
Simple Recursive Games
Authors:
Daniel Andersson,
Kristoffer Arnsfelt Hansen,
Peter Bro Miltersen,
Troels Bjerre Sorensen
Abstract:
We define the class of "simple recursive games". A simple recursive game is defined as a simple stochastic game (a notion due to Anne Condon), except that we allow arbitrary real payoffs but disallow moves of chance. We study the complexity of solving simple recursive games and obtain an almost-linear time comparison-based algorithm for computing an equilibrium of such a game. The existence of a linear time comparison-based algorithm remains an open problem.
Submitted 7 November, 2007;
originally announced November 2007.
-
Hartree-Fock theory for pseudorelativistic atoms
Authors:
Anna Dall'Acqua,
Thomas Østergaard Sørensen,
Edgardo Stockmeyer
Abstract:
We study the Hartree-Fock model for pseudorelativistic atoms, that is, atoms where the kinetic energy of the electrons is given by the pseudorelativistic operator \sqrt{(pc)^2+(mc^2)^2}-mc^2. We prove the existence of a Hartree-Fock minimizer, and prove regularity away from the nucleus and pointwise exponential decay of the corresponding orbitals.
Submitted 31 July, 2007;
originally announced July 2007.
-
Positivity and lower bounds to the decay of the atomic one-electron density
Authors:
Søren Fournais,
Maria Hoffmann-Ostenhof,
Thomas Hoffmann-Ostenhof,
Thomas Østergaard Sørensen
Abstract:
We investigate properties of the spherically averaged atomic one-electron density rho~(r). For a rho~ which stems from a physical ground state we prove that rho~ > 0. We also give exponentially decreasing lower bounds to rho~ in the case when the eigenvalue is below the corresponding essential spectrum.
Submitted 8 August, 2006;
originally announced August 2006.
-
Third derivative of the one-electron density at the nucleus
Authors:
Søren Fournais,
Maria Hoffmann-Ostenhof,
Thomas Østergaard Sørensen
Abstract:
We study electron densities of eigenfunctions of atomic Schroedinger operators. We prove the existence of rho~'''(0), the third derivative of the spherically averaged atomic density rho~ at the nucleus. For eigenfunctions with corresponding eigenvalue below the essential spectrum we obtain the bound rho~'''(0) \leq -(7/12)Z^3 rho~(0), where Z denotes the nuclear charge. This bound is optimal.
Submitted 4 July, 2006;
originally announced July 2006.
-
On the convergence of eigenfunctions to threshold energy states
Authors:
Thomas Østergaard Sørensen,
Edgardo Stockmeyer
Abstract:
We prove the convergence in certain weighted spaces in momentum space of eigenfunctions of H = T-lambda*V as the energy goes to an energy threshold. We do this for three choices of kinetic energy T, namely the non-relativistic Schrödinger operator, the pseudorelativistic operator $\sqrt{-\Delta+m^2}-m$, and the Dirac operator.
Submitted 9 May, 2006; v1 submitted 6 April, 2006;
originally announced April 2006.