-
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Authors:
Duarte M. Alves,
José Pombal,
Nuno M. Guerreiro,
Pedro H. Martins,
João Alves,
Amin Farajian,
Ben Peters,
Ricardo Rei,
Patrick Fernandes,
Sweta Agrawal,
Pierre Colombo,
José G. C. de Souza,
André F. T. Martins
Abstract:
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa…
▽ More
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final model surpasses open alternatives on several tasks relevant to translation workflows and is competitive with general-purpose closed LLMs. To facilitate future research, we release the Tower models, our specialization dataset, an evaluation framework for LLMs focusing on the translation ecosystem, and a collection of model generations, including ours, on our benchmark.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
CroissantLLM: A Truly Bilingual French-English Language Model
Authors:
Manuel Faysse,
Patrick Fernandes,
Nuno M. Guerreiro,
António Loison,
Duarte M. Alves,
Caio Corro,
Nicolas Boizard,
João Alves,
Ricardo Rei,
Pedro H. Martins,
Antoni Bigata Casademunt,
François Yvon,
André F. T. Martins,
Gautier Viaud,
Céline Hudelot,
Pierre Colombo
Abstract:
We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a cust…
▽ More
We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a custom tokenizer, and bilingual finetuning datasets. We release the training dataset, notably containing a French split with manually curated, high-quality, and varied data sources. To assess performance outside of English, we craft a novel benchmark, FrenchBench, consisting of an array of classification and generation tasks, covering various orthogonal aspects of model performance in the French Language. Additionally, rooted in transparency and to foster further Large Language Model research, we release codebases, and dozens of checkpoints across various model sizes, training data distributions, and training steps, as well as fine-tuned Chat models, and strong translation models. We evaluate our model through the FMTI framework, and validate 81 % of the transparency criteria, far beyond the scores of even most open initiatives. This work enriches the NLP landscape, breaking away from previous English-centric work in order to strengthen our understanding of multilinguality in language models.
△ Less
Submitted 29 March, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Authors:
Patrick Fernandes,
Aman Madaan,
Emmy Liu,
António Farinhas,
Pedro Henrique Martins,
Amanda Bertsch,
José G. C. de Souza,
Shuyan Zhou,
Tongshuang Wu,
Graham Neubig,
André F. T. Martins
Abstract:
Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod…
▽ More
Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving models. This survey aims to provide an overview of the recent research that has leveraged human feedback to improve natural language generation. First, we introduce an encompassing formalization of feedback, and identify and organize existing research into a taxonomy following this formalization. Next, we discuss how feedback can be described by its format and objective, and cover the two approaches proposed to use feedback (either for training or decoding): directly using the feedback or training feedback models. We also discuss existing datasets for human-feedback data collection, and concerns surrounding feedback collection. Finally, we provide an overview of the nascent field of AI feedback, which exploits large language models to make judgments based on a set of principles and minimize the need for human intervention.
△ Less
Submitted 31 May, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Percolation in two-species antagonistic random sequential adsorption in two dimensions
Authors:
Paulo H. L. Martins,
Ronald Dickman,
Robert M. Ziff
Abstract:
We consider two-species random sequential adsorption (RSA) in which species A and B adsorb randomly on a lattice with the restriction that opposite species cannot occupy nearest-neighbor sites. When the probability $x_A$ of choosing an A particle for an adsorption trial reaches a critical value $0.626441(1)$, the A species percolates and/or the blocked sites X (those with at least one A and one B…
▽ More
We consider two-species random sequential adsorption (RSA) in which species A and B adsorb randomly on a lattice with the restriction that opposite species cannot occupy nearest-neighbor sites. When the probability $x_A$ of choosing an A particle for an adsorption trial reaches a critical value $0.626441(1)$, the A species percolates and/or the blocked sites X (those with at least one A and one B nearest neighbor) percolate. Analysis of the size-distribution exponent $τ$, the wrap** probabilities, and the excess cluster number shows that the percolation transition is consistent with that of ordinary percolation. We obtain an exact result for the low $x_B = 1 - x_A$ jamming behavior: $θ_A = 1 - x_B +b_2 x_B^2+\mathcal{O}(x_B^3)$, $θ_B = x_B/(z+1)+\mathcal{O}(x_B^2)$ for a $z$-coordinated lattice, where $θ_A$ and $θ_B$ are respectively the saturation coverages of species A and B. We also show how differences between wrap** probabilities of A and X clusters, as well as differences in the number of A and X clusters, can be used to find the transition point accurately. For the one-dimensional case a three-site approximation appears to provide exact results for the coverages.
△ Less
Submitted 8 November, 2022;
originally announced November 2022.
-
Efficient Methods for Natural Language Processing: A Survey
Authors:
Marcos Treviso,
Ji-Ung Lee,
Tianchu Ji,
Betty van Aken,
Qingqing Cao,
Manuel R. Ciosici,
Michael Hassid,
Kenneth Heafield,
Sara Hooker,
Colin Raffel,
Pedro H. Martins,
André F. T. Martins,
Jessica Zosa Forde,
Peter Milder,
Edwin Simpson,
Noam Slonim,
Jesse Dodge,
Emma Strubell,
Niranjan Balasubramanian,
Leon Derczynski,
Iryna Gurevych,
Roy Schwartz
Abstract:
Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require few…
▽ More
Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require fewer resources to achieve similar results. This survey synthesizes and relates current methods and findings in efficient NLP. We aim to provide both guidance for conducting NLP under limited resources, and point towards promising research directions for develo** more efficient methods.
△ Less
Submitted 24 March, 2023; v1 submitted 31 August, 2022;
originally announced September 2022.
-
Chunk-based Nearest Neighbor Machine Translation
Authors:
Pedro Henrique Martins,
Zita Marinho,
André F. T. Martins
Abstract:
Semi-parametric models, which augment generation with retrieval, have led to impressive results in language modeling and machine translation, due to their ability to retrieve fine-grained information from a datastore of examples. One of the most prominent approaches, $k$NN-MT, exhibits strong domain adaptation capabilities by retrieving tokens from domain-specific datastores \citep{khandelwal2020n…
▽ More
Semi-parametric models, which augment generation with retrieval, have led to impressive results in language modeling and machine translation, due to their ability to retrieve fine-grained information from a datastore of examples. One of the most prominent approaches, $k$NN-MT, exhibits strong domain adaptation capabilities by retrieving tokens from domain-specific datastores \citep{khandelwal2020nearest}. However, $k$NN-MT requires an expensive retrieval operation for every single generated token, leading to a very low decoding speed (around 8 times slower than a parametric model). In this paper, we introduce a \textit{chunk-based} $k$NN-MT model which retrieves chunks of tokens from the datastore, instead of a single token. We propose several strategies for incorporating the retrieved chunks into the generation process, and for selecting the steps at which the model needs to search for neighbors in the datastore. Experiments on machine translation in two settings, static and ``on-the-fly'' domain adaptation, show that the chunk-based $k$NN-MT model leads to significant speed-ups (up to 4 times) with only a small drop in translation quality.
△ Less
Submitted 7 November, 2022; v1 submitted 24 May, 2022;
originally announced May 2022.
-
Efficient Machine Translation Domain Adaptation
Authors:
Pedro Henrique Martins,
Zita Marinho,
André F. T. Martins
Abstract:
Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training the entire or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving exam…
▽ More
Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training the entire or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving examples from an in-domain datastore (Khandelwal et al., 2021). A drawback of these retrieval-augmented models, however, is that they tend to be substantially slower. In this paper, we explore several approaches to speed up nearest neighbor machine translation. We adapt the methods recently proposed by He et al. (2021) for language modeling, and introduce a simple but effective caching strategy that avoids performing retrieval when similar contexts have been seen before. Translation quality and runtimes for several domains show the effectiveness of the proposed solutions.
△ Less
Submitted 26 April, 2022;
originally announced April 2022.
-
$\infty$-former: Infinite Memory Transformer
Authors:
Pedro Henrique Martins,
Zita Marinho,
André F. T. Martins
Abstract:
Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length. While variations of efficient transformers have been proposed, they all have a finite memory capacity and are forced to drop old information. In this paper, we propose the $\infty$-former, which extends the vanilla transformer with an unbounded long-t…
▽ More
Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length. While variations of efficient transformers have been proposed, they all have a finite memory capacity and are forced to drop old information. In this paper, we propose the $\infty$-former, which extends the vanilla transformer with an unbounded long-term memory. By making use of a continuous-space attention mechanism to attend over the long-term memory, the $\infty$-former's attention complexity becomes independent of the context length, trading off memory length with precision. In order to control where precision is more important, $\infty$-former maintains "sticky memories" being able to model arbitrarily long contexts while kee** the computation budget fixed. Experiments on a synthetic sorting task, language modeling, and document grounded dialogue generation demonstrate the $\infty$-former's ability to retain information from long sequences.
△ Less
Submitted 25 March, 2022; v1 submitted 1 September, 2021;
originally announced September 2021.
-
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Authors:
Sebastian Gehrmann,
Tosin Adewumi,
Karmanya Aggarwal,
Pawan Sasanka Ammanamanchi,
Aremu Anuoluwapo,
Antoine Bosselut,
Khyathi Raghavi Chandu,
Miruna Clinciu,
Dipanjan Das,
Kaustubh D. Dhole,
Wanyu Du,
Esin Durmus,
Ondřej Dušek,
Chris Emezue,
Varun Gangal,
Cristina Garbacea,
Tatsunori Hashimoto,
Yufang Hou,
Yacine Jernite,
Harsh Jhamtani,
Yangfeng Ji,
Shailza Jolly,
Mihir Kale,
Dhruv Kumar,
Faisal Ladhak
, et al. (31 additional authors not shown)
Abstract:
We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it…
▽ More
We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it challenging to identify the limitations of current models and opportunities for progress. Addressing this limitation, GEM provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested. Regular updates to the benchmark will help NLG research become more multilingual and evolve the challenge alongside models. This paper serves as the description of the data for which we are organizing a shared task at our ACL 2021 Workshop and to which we invite the entire NLG community to participate.
△ Less
Submitted 1 April, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
An entropic simulational study of the spin-$1$ Baxter-Wu model in a crystal field
Authors:
L. N. Jorge,
P. H. L. Martins,
C. J. DaSilva,
L. S. Ferreira,
A. A. Caparica
Abstract:
We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in a crystal field using entropic sampling simulations with the joint density of states. We obtain the temperature-crystal field phase diagram, which includes a tetracritical line ending at a pentacritical point. A finite-size scaling analysis of the maximum of the specific heat, while changing the crystal field a…
▽ More
We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in a crystal field using entropic sampling simulations with the joint density of states. We obtain the temperature-crystal field phase diagram, which includes a tetracritical line ending at a pentacritical point. A finite-size scaling analysis of the maximum of the specific heat, while changing the crystal field anisotropy, is used to obtain a precise location of the pentacritical point. Our results give the critical temperature and crystal field as $T_{pc}=0.98030(10)$ and $D_{pc}=1.68288(62)$. We also detect that at the first-order region of the phase diagram, the specific heat exhibits a double peak structure as in the Schottky-like anomaly, which is associated with an order-disorder transition.
△ Less
Submitted 29 August, 2020;
originally announced August 2020.
-
Sparse Text Generation
Authors:
Pedro Henrique Martins,
Zita Marinho,
André F. T. Martins
Abstract:
Current state-of-the-art text generators build on powerful language models such as GPT-2, achieving impressive performance. However, to avoid degenerate text, they require sampling from a modified softmax, via temperature parameters or ad-hoc truncation techniques, as in top-$k$ or nucleus sampling. This creates a mismatch between training and testing conditions. In this paper, we use the recently…
▽ More
Current state-of-the-art text generators build on powerful language models such as GPT-2, achieving impressive performance. However, to avoid degenerate text, they require sampling from a modified softmax, via temperature parameters or ad-hoc truncation techniques, as in top-$k$ or nucleus sampling. This creates a mismatch between training and testing conditions. In this paper, we use the recently introduced entmax transformation to train and sample from a natively sparse language model, avoiding this mismatch. The result is a text generator with favorable performance in terms of fluency and consistency, fewer repetitions, and n-gram diversity closer to human text. In order to evaluate our model, we propose three new metrics for comparing sparse or truncated distributions: $ε$-perplexity, sparsemax score, and Jensen-Shannon divergence. Human-evaluated experiments in story completion and dialogue generation show that entmax sampling leads to more engaging and coherent stories and conversations.
△ Less
Submitted 5 October, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Sparse and Structured Visual Attention
Authors:
Pedro Henrique Martins,
Vlad Niculae,
Zita Marinho,
André Martins
Abstract:
Visual attention mechanisms are widely used in multimodal tasks, as visual question answering (VQA). One drawback of softmax-based attention mechanisms is that they assign some probability mass to all image regions, regardless of their adjacency structure and of their relevance to the text. In this paper, to better link the image structure with the text, we replace the traditional softmax attentio…
▽ More
Visual attention mechanisms are widely used in multimodal tasks, as visual question answering (VQA). One drawback of softmax-based attention mechanisms is that they assign some probability mass to all image regions, regardless of their adjacency structure and of their relevance to the text. In this paper, to better link the image structure with the text, we replace the traditional softmax attention mechanism with two alternative sparsity-promoting transformations: sparsemax, which is able to select only the relevant regions (assigning zero weight to the rest), and a newly proposed Total-Variation Sparse Attention (TVmax), which further encourages the joint selection of adjacent spatial locations. Experiments in VQA show gains in accuracy as well as higher similarity to human attention, which suggests better interpretability.
△ Less
Submitted 8 July, 2021; v1 submitted 13 February, 2020;
originally announced February 2020.
-
Joint Learning of Named Entity Recognition and Entity Linking
Authors:
Pedro Henrique Martins,
Zita Marinho,
André F. T. Martins
Abstract:
Named entity recognition (NER) and entity linking (EL) are two fundamentally related tasks, since in order to perform EL, first the mentions to entities have to be detected. However, most entity linking approaches disregard the mention detection part, assuming that the correct mentions have been previously detected. In this paper, we perform joint learning of NER and EL to leverage their relatedne…
▽ More
Named entity recognition (NER) and entity linking (EL) are two fundamentally related tasks, since in order to perform EL, first the mentions to entities have to be detected. However, most entity linking approaches disregard the mention detection part, assuming that the correct mentions have been previously detected. In this paper, we perform joint learning of NER and EL to leverage their relatedness and obtain a more robust and generalisable system. For that, we introduce a model inspired by the Stack-LSTM approach (Dyer et al., 2015). We observe that, in fact, doing multi-task learning of NER and EL improves the performance in both tasks when comparing with models trained with individual objectives. Furthermore, we achieve results competitive with the state-of-the-art in both NER and EL.
△ Less
Submitted 18 July, 2019;
originally announced July 2019.
-
A deep learning approach for understanding natural language commands for mobile service robots
Authors:
Pedro Henrique Martins,
Luís Custódio,
Rodrigo Ventura
Abstract:
Using natural language to give instructions to robots is challenging, since natural language understanding is still largely an open problem. In this paper we address this problem by restricting our attention to commands modeled as one action, plus arguments (also known as slots). For action detection (also called intent detection) and slot filling various architectures of Recurrent Neural Networks…
▽ More
Using natural language to give instructions to robots is challenging, since natural language understanding is still largely an open problem. In this paper we address this problem by restricting our attention to commands modeled as one action, plus arguments (also known as slots). For action detection (also called intent detection) and slot filling various architectures of Recurrent Neural Networks and Long Short Term Memory (LSTM) networks were evaluated, having LSTMs achieved a superior accuracy. As the action requested may not fall within the robots capabilities, a Support Vector Machine(SVM) is used to determine whether it is or not. For the input of the neural networks, several word embedding algorithms were compared. Finally, to implement the system in a robot, a ROS package is created using a SMACH state machine. The proposed system is then evaluated both using well-known datasets and benchmarks in the context of domestic service robots.
△ Less
Submitted 9 July, 2018;
originally announced July 2018.
-
Adsorption of flexible polymer chains on a surface: Effects of different solvent conditions
Authors:
P. H. L. Martins,
J. A. Plascak,
M. Bachmann
Abstract:
Polymer chains undergoing a continuous adsorption-desorption transition are studied through extensive computer simulations. A three-dimensional self-avoiding walk lattice model of a polymer chain grafted onto a surface has been treated for different solvent conditions. We have used an advanced contact-density chain-growth algorithm, in which the density of contacts can be directly obtained. From t…
▽ More
Polymer chains undergoing a continuous adsorption-desorption transition are studied through extensive computer simulations. A three-dimensional self-avoiding walk lattice model of a polymer chain grafted onto a surface has been treated for different solvent conditions. We have used an advanced contact-density chain-growth algorithm, in which the density of contacts can be directly obtained. From this quantity, the order parameter and its fourth-order Binder cumulant are computed, as well as the corresponding critical exponents and the adsorption-desorption transition temperature. As the number of configurations with a given number of surface contacts and monomer-monomer contacts is independent of the temperature and solvent conditions, it can be easily applied to get results for different solvent parameter values without the need of any extra simulations. In analogy to continuous magnetic phase transitions, finite-size-scaling methods have been employed. Quite good results for the critical properties and phase diagram of very long single polymer chains have been obtained by properly taking into account the effects of corrections to scaling. The study covers all solvent effects, going from the limit of {\it super-self-avoiding walks}, characterized by effective monomer-monomer repulsion, to poor solvent conditions that enable the formation of compact polymer structures.
△ Less
Submitted 25 May, 2018;
originally announced May 2018.
-
Solvent-Dependent Critical Properties of Polymer Adsorption
Authors:
J. A. Plascak,
Paulo H. L. Martins,
Michael Bachmann
Abstract:
Advanced chain-growth computer simulation methodologies have been employed for a systematic statistical analysis of the critical behavior of a polymer adsorbing at a substrate. We use finitesize scaling techniques to investigate the solvent-quality dependence of critical exponents, critical temperature, and the structure of the phase diagram. Our study covers all solvent effects from the limit of…
▽ More
Advanced chain-growth computer simulation methodologies have been employed for a systematic statistical analysis of the critical behavior of a polymer adsorbing at a substrate. We use finitesize scaling techniques to investigate the solvent-quality dependence of critical exponents, critical temperature, and the structure of the phase diagram. Our study covers all solvent effects from the limit of super-self-avoiding walks, characterized by effective monomer-monomer repulsion, to poor solvent conditions that enable the formation of compact polymer structures. The results significantly benefit from taking into account corrections to scaling.
△ Less
Submitted 7 May, 2017;
originally announced May 2017.
-
Probability distribution of the order parameter in the directed percolation universality class
Authors:
P. H. L. Martins
Abstract:
The probability distributions of the order parameter for two models in the directed percolation universality class were evaluated. Monte Carlo simulations have been performed for the one-dimensional generalized contact process and the Domany-Kinzel cellular automaton. In both cases, the density of active sites was chosen as the order parameter. The criticality of those models was obtained by solel…
▽ More
The probability distributions of the order parameter for two models in the directed percolation universality class were evaluated. Monte Carlo simulations have been performed for the one-dimensional generalized contact process and the Domany-Kinzel cellular automaton. In both cases, the density of active sites was chosen as the order parameter. The criticality of those models was obtained by solely using the corresponding probability distribution function. It has been shown that the present method, which has been successfully employed in treating equilibrium systems, is indeed also useful in the study of nonequilibrium phase transitions.
△ Less
Submitted 9 September, 2012;
originally announced September 2012.
-
Probability Distribution Function of the Order Parameter: Mixing Fields and Universality
Authors:
J. A. Plascak,
P. H. L. Martins
Abstract:
We briefly review the use of the order parameter probability distribution function as a useful tool to obtain the critical properties of statistical mechanical models using computer Monte Carlo simulations. Some simple discrete spin magnetic systems on a lattice, such as Ising, general spin-$S$ Blume-Capel and Baxter-Wu, $Q$-state Potts, among other models, will be considered as examples. The impo…
▽ More
We briefly review the use of the order parameter probability distribution function as a useful tool to obtain the critical properties of statistical mechanical models using computer Monte Carlo simulations. Some simple discrete spin magnetic systems on a lattice, such as Ising, general spin-$S$ Blume-Capel and Baxter-Wu, $Q$-state Potts, among other models, will be considered as examples. The importance and the necessity of the role of mixing fields in asymmetric magnetic models will be discussed in more detail, as well as the corresponding distributions of the extensive conjugate variables.
△ Less
Submitted 9 September, 2012;
originally announced September 2012.
-
Probability distribution of the order parameter
Authors:
P. H. L. Martins,
J. A. Plascak
Abstract:
The probability distribution of the order parameter is exploited in order to obtain the criticality of magnetic systems. Monte Carlo simulations have been employed by using single spin flip Metropolis algorithm aided by finite-size scaling and histogram reweighting techniques. A method is proposed to obtain this probability distribution even when the transition temperature of the model is unknow…
▽ More
The probability distribution of the order parameter is exploited in order to obtain the criticality of magnetic systems. Monte Carlo simulations have been employed by using single spin flip Metropolis algorithm aided by finite-size scaling and histogram reweighting techniques. A method is proposed to obtain this probability distribution even when the transition temperature of the model is unknown. A test is performed on the two-dimensional spin-1/2 and spin-1 Ising model and the results show that the present procedure can be quite efficient and accurate to describe the criticality of the system.
△ Less
Submitted 9 April, 2004;
originally announced April 2004.
-
Percolation model for structural phase transitions in Li$_{1-x}$H$_x$IO$_3$ mixed crystals
Authors:
P. H. L. Martins,
J. A. Plascak,
M. A. Pimenta
Abstract:
A percolation model is proposed to explain the structural phase transitions found in Li$_{1-x}$H$_x$IO$_3$ mixed crystals as a function of the concentration parameter $x$. The percolation thresholds are obtained from Monte Carlo simulations on the specific lattices occupied by lithium atoms and hydrogen bonds. The theoretical results strongly suggest that percolating lithium vacancies and hydrog…
▽ More
A percolation model is proposed to explain the structural phase transitions found in Li$_{1-x}$H$_x$IO$_3$ mixed crystals as a function of the concentration parameter $x$. The percolation thresholds are obtained from Monte Carlo simulations on the specific lattices occupied by lithium atoms and hydrogen bonds. The theoretical results strongly suggest that percolating lithium vacancies and hydrogen bonds are indeed responsible for the solid solution observed in the experimental range $0.22 < x < 0.36$.
△ Less
Submitted 9 April, 2004;
originally announced April 2004.
-
Percolation on two- and three-dimensional lattices
Authors:
P. H. L. Martins,
J. A. Plascak
Abstract:
In this work we apply a highly efficient Monte Carlo algorithm recently proposed by Newman and Ziff to treat percolation problems. The site and bond percolation are studied on a number of lattices in two and three dimensions. Quite good results for the wrap** probabilities, correlation length critical exponent and critical concentration are obtained for the square, simple cubic, HCP and hexago…
▽ More
In this work we apply a highly efficient Monte Carlo algorithm recently proposed by Newman and Ziff to treat percolation problems. The site and bond percolation are studied on a number of lattices in two and three dimensions. Quite good results for the wrap** probabilities, correlation length critical exponent and critical concentration are obtained for the square, simple cubic, HCP and hexagonal lattices by using relatively small systems. We also confirm the universal aspect of the wrap** probabilities regarding site and bond dilution.
△ Less
Submitted 1 April, 2003;
originally announced April 2003.