Search | arXiv e-print repository

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa… ▽ More While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final model surpasses open alternatives on several tasks relevant to translation workflows and is competitive with general-purpose closed LLMs. To facilitate future research, we release the Tower models, our specialization dataset, an evaluation framework for LLMs focusing on the translation ecosystem, and a collection of model generations, including ours, on our benchmark. △ Less

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.00786 [pdf, other]

CroissantLLM: A Truly Bilingual French-English Language Model

Authors: Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, António Loison, Duarte M. Alves, Caio Corro, Nicolas Boizard, João Alves, Ricardo Rei, Pedro H. Martins, Antoni Bigata Casademunt, François Yvon, André F. T. Martins, Gautier Viaud, Céline Hudelot, Pierre Colombo

Abstract: We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a cust… ▽ More We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a custom tokenizer, and bilingual finetuning datasets. We release the training dataset, notably containing a French split with manually curated, high-quality, and varied data sources. To assess performance outside of English, we craft a novel benchmark, FrenchBench, consisting of an array of classification and generation tasks, covering various orthogonal aspects of model performance in the French Language. Additionally, rooted in transparency and to foster further Large Language Model research, we release codebases, and dozens of checkpoints across various model sizes, training data distributions, and training steps, as well as fine-tuned Chat models, and strong translation models. We evaluate our model through the FMTI framework, and validate 81 % of the transparency criteria, far beyond the scores of even most open initiatives. This work enriches the NLP landscape, breaking away from previous English-centric work in order to strengthen our understanding of multilinguality in language models. △ Less

Submitted 29 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

arXiv:2305.00955 [pdf, other]

Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

Authors: Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

Abstract: Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod… ▽ More Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving models. This survey aims to provide an overview of the recent research that has leveraged human feedback to improve natural language generation. First, we introduce an encompassing formalization of feedback, and identify and organize existing research into a taxonomy following this formalization. Next, we discuss how feedback can be described by its format and objective, and cover the two approaches proposed to use feedback (either for training or decoding): directly using the feedback or training feedback models. We also discuss existing datasets for human-feedback data collection, and concerns surrounding feedback collection. Finally, we provide an overview of the nascent field of AI feedback, which exploits large language models to make judgments based on a set of principles and minimize the need for human intervention. △ Less

Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

Comments: Work in Progress

arXiv:2211.04622 [pdf, ps, other]

doi 10.1103/PhysRevE.107.024104

Percolation in two-species antagonistic random sequential adsorption in two dimensions

Authors: Paulo H. L. Martins, Ronald Dickman, Robert M. Ziff

Abstract: We consider two-species random sequential adsorption (RSA) in which species A and B adsorb randomly on a lattice with the restriction that opposite species cannot occupy nearest-neighbor sites. When the probability $x_A$ of choosing an A particle for an adsorption trial reaches a critical value $0.626441(1)$, the A species percolates and/or the blocked sites X (those with at least one A and one B… ▽ More We consider two-species random sequential adsorption (RSA) in which species A and B adsorb randomly on a lattice with the restriction that opposite species cannot occupy nearest-neighbor sites. When the probability $x_A$ of choosing an A particle for an adsorption trial reaches a critical value $0.626441(1)$, the A species percolates and/or the blocked sites X (those with at least one A and one B nearest neighbor) percolate. Analysis of the size-distribution exponent $τ$, the wrap** probabilities, and the excess cluster number shows that the percolation transition is consistent with that of ordinary percolation. We obtain an exact result for the low $x_B = 1 - x_A$ jamming behavior: $θ_A = 1 - x_B +b_2 x_B^2+\mathcal{O}(x_B^3)$, $θ_B = x_B/(z+1)+\mathcal{O}(x_B^2)$ for a $z$-coordinated lattice, where $θ_A$ and $θ_B$ are respectively the saturation coverages of species A and B. We also show how differences between wrap** probabilities of A and X clusters, as well as differences in the number of A and X clusters, can be used to find the transition point accurately. For the one-dimensional case a three-site approximation appears to provide exact results for the coverages. △ Less

Submitted 8 November, 2022; originally announced November 2022.

arXiv:2209.00099 [pdf, other]

Efficient Methods for Natural Language Processing: A Survey

Authors: Marcos Treviso, Ji-Ung Lee, Tianchu Ji, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Colin Raffel, Pedro H. Martins, André F. T. Martins, Jessica Zosa Forde, Peter Milder, Edwin Simpson, Noam Slonim, Jesse Dodge, Emma Strubell, Niranjan Balasubramanian, Leon Derczynski, Iryna Gurevych, Roy Schwartz

Abstract: Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require few… ▽ More Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require fewer resources to achieve similar results. This survey synthesizes and relates current methods and findings in efficient NLP. We aim to provide both guidance for conducting NLP under limited resources, and point towards promising research directions for develo** more efficient methods. △ Less

Submitted 24 March, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

Comments: Accepted at TACL, pre publication version

arXiv:2205.12230 [pdf, other]

Chunk-based Nearest Neighbor Machine Translation

Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Abstract: Semi-parametric models, which augment generation with retrieval, have led to impressive results in language modeling and machine translation, due to their ability to retrieve fine-grained information from a datastore of examples. One of the most prominent approaches, $k$NN-MT, exhibits strong domain adaptation capabilities by retrieving tokens from domain-specific datastores \citep{khandelwal2020n… ▽ More Semi-parametric models, which augment generation with retrieval, have led to impressive results in language modeling and machine translation, due to their ability to retrieve fine-grained information from a datastore of examples. One of the most prominent approaches, $k$NN-MT, exhibits strong domain adaptation capabilities by retrieving tokens from domain-specific datastores \citep{khandelwal2020nearest}. However, $k$NN-MT requires an expensive retrieval operation for every single generated token, leading to a very low decoding speed (around 8 times slower than a parametric model). In this paper, we introduce a \textit{chunk-based} $k$NN-MT model which retrieves chunks of tokens from the datastore, instead of a single token. We propose several strategies for incorporating the retrieved chunks into the generation process, and for selecting the steps at which the model needs to search for neighbors in the datastore. Experiments on machine translation in two settings, static and ``on-the-fly'' domain adaptation, show that the chunk-based $k$NN-MT model leads to significant speed-ups (up to 4 times) with only a small drop in translation quality. △ Less

Submitted 7 November, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

arXiv:2204.12608 [pdf, other]

Efficient Machine Translation Domain Adaptation

Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Abstract: Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training the entire or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving exam… ▽ More Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training the entire or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving examples from an in-domain datastore (Khandelwal et al., 2021). A drawback of these retrieval-augmented models, however, is that they tend to be substantially slower. In this paper, we explore several approaches to speed up nearest neighbor machine translation. We adapt the methods recently proposed by He et al. (2021) for language modeling, and introduce a simple but effective caching strategy that avoids performing retrieval when similar contexts have been seen before. Translation quality and runtimes for several domains show the effectiveness of the proposed solutions. △ Less

Submitted 26 April, 2022; originally announced April 2022.

Comments: Workshop Semiparametric Methods in NLP: Decoupling Logic from Knowledge

arXiv:2109.00301 [pdf, other]

$\infty$-former: Infinite Memory Transformer

Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Abstract: Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length. While variations of efficient transformers have been proposed, they all have a finite memory capacity and are forced to drop old information. In this paper, we propose the $\infty$-former, which extends the vanilla transformer with an unbounded long-t… ▽ More Transformers are unable to model long-term memories effectively, since the amount of computation they need to perform grows with the context length. While variations of efficient transformers have been proposed, they all have a finite memory capacity and are forced to drop old information. In this paper, we propose the $\infty$-former, which extends the vanilla transformer with an unbounded long-term memory. By making use of a continuous-space attention mechanism to attend over the long-term memory, the $\infty$-former's attention complexity becomes independent of the context length, trading off memory length with precision. In order to control where precision is more important, $\infty$-former maintains "sticky memories" being able to model arbitrarily long contexts while kee** the computation budget fixed. Experiments on a synthetic sorting task, language modeling, and document grounded dialogue generation demonstrate the $\infty$-former's ability to retain information from long sequences. △ Less

Submitted 25 March, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

Comments: ACL 2022

arXiv:2102.01672 [pdf, other]

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

Authors: Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak , et al. (31 additional authors not shown)

Abstract: We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it… ▽ More We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it challenging to identify the limitations of current models and opportunities for progress. Addressing this limitation, GEM provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested. Regular updates to the benchmark will help NLG research become more multilingual and evolve the challenge alongside models. This paper serves as the description of the data for which we are organizing a shared task at our ACL 2021 Workshop and to which we invite the entire NLG community to participate. △ Less

Submitted 1 April, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

arXiv:2008.13037 [pdf, ps, other]

doi 10.1016/j.physa.2021.126071

An entropic simulational study of the spin-$1$ Baxter-Wu model in a crystal field

Authors: L. N. Jorge, P. H. L. Martins, C. J. DaSilva, L. S. Ferreira, A. A. Caparica

Abstract: We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in a crystal field using entropic sampling simulations with the joint density of states. We obtain the temperature-crystal field phase diagram, which includes a tetracritical line ending at a pentacritical point. A finite-size scaling analysis of the maximum of the specific heat, while changing the crystal field a… ▽ More We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in a crystal field using entropic sampling simulations with the joint density of states. We obtain the temperature-crystal field phase diagram, which includes a tetracritical line ending at a pentacritical point. A finite-size scaling analysis of the maximum of the specific heat, while changing the crystal field anisotropy, is used to obtain a precise location of the pentacritical point. Our results give the critical temperature and crystal field as $T_{pc}=0.98030(10)$ and $D_{pc}=1.68288(62)$. We also detect that at the first-order region of the phase diagram, the specific heat exhibits a double peak structure as in the Schottky-like anomaly, which is associated with an order-disorder transition. △ Less

Submitted 29 August, 2020; originally announced August 2020.

Comments: 7 pages. 7 figures

arXiv:2004.02644 [pdf, other]

Sparse Text Generation

Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Abstract: Current state-of-the-art text generators build on powerful language models such as GPT-2, achieving impressive performance. However, to avoid degenerate text, they require sampling from a modified softmax, via temperature parameters or ad-hoc truncation techniques, as in top-$k$ or nucleus sampling. This creates a mismatch between training and testing conditions. In this paper, we use the recently… ▽ More Current state-of-the-art text generators build on powerful language models such as GPT-2, achieving impressive performance. However, to avoid degenerate text, they require sampling from a modified softmax, via temperature parameters or ad-hoc truncation techniques, as in top-$k$ or nucleus sampling. This creates a mismatch between training and testing conditions. In this paper, we use the recently introduced entmax transformation to train and sample from a natively sparse language model, avoiding this mismatch. The result is a text generator with favorable performance in terms of fluency and consistency, fewer repetitions, and n-gram diversity closer to human text. In order to evaluate our model, we propose three new metrics for comparing sparse or truncated distributions: $ε$-perplexity, sparsemax score, and Jensen-Shannon divergence. Human-evaluated experiments in story completion and dialogue generation show that entmax sampling leads to more engaging and coherent stories and conversations. △ Less

Submitted 5 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

arXiv:2002.05556 [pdf, other]

Sparse and Structured Visual Attention

Authors: Pedro Henrique Martins, Vlad Niculae, Zita Marinho, André Martins

Abstract: Visual attention mechanisms are widely used in multimodal tasks, as visual question answering (VQA). One drawback of softmax-based attention mechanisms is that they assign some probability mass to all image regions, regardless of their adjacency structure and of their relevance to the text. In this paper, to better link the image structure with the text, we replace the traditional softmax attentio… ▽ More Visual attention mechanisms are widely used in multimodal tasks, as visual question answering (VQA). One drawback of softmax-based attention mechanisms is that they assign some probability mass to all image regions, regardless of their adjacency structure and of their relevance to the text. In this paper, to better link the image structure with the text, we replace the traditional softmax attention mechanism with two alternative sparsity-promoting transformations: sparsemax, which is able to select only the relevant regions (assigning zero weight to the rest), and a newly proposed Total-Variation Sparse Attention (TVmax), which further encourages the joint selection of adjacent spatial locations. Experiments in VQA show gains in accuracy as well as higher similarity to human attention, which suggests better interpretability. △ Less

Submitted 8 July, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

arXiv:1907.08243 [pdf, other]

Joint Learning of Named Entity Recognition and Entity Linking

Authors: Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Abstract: Named entity recognition (NER) and entity linking (EL) are two fundamentally related tasks, since in order to perform EL, first the mentions to entities have to be detected. However, most entity linking approaches disregard the mention detection part, assuming that the correct mentions have been previously detected. In this paper, we perform joint learning of NER and EL to leverage their relatedne… ▽ More Named entity recognition (NER) and entity linking (EL) are two fundamentally related tasks, since in order to perform EL, first the mentions to entities have to be detected. However, most entity linking approaches disregard the mention detection part, assuming that the correct mentions have been previously detected. In this paper, we perform joint learning of NER and EL to leverage their relatedness and obtain a more robust and generalisable system. For that, we introduce a model inspired by the Stack-LSTM approach (Dyer et al., 2015). We observe that, in fact, doing multi-task learning of NER and EL improves the performance in both tasks when comparing with models trained with individual objectives. Furthermore, we achieve results competitive with the state-of-the-art in both NER and EL. △ Less

Submitted 18 July, 2019; originally announced July 2019.

arXiv:1807.03053 [pdf, other]

A deep learning approach for understanding natural language commands for mobile service robots

Authors: Pedro Henrique Martins, Luís Custódio, Rodrigo Ventura

Abstract: Using natural language to give instructions to robots is challenging, since natural language understanding is still largely an open problem. In this paper we address this problem by restricting our attention to commands modeled as one action, plus arguments (also known as slots). For action detection (also called intent detection) and slot filling various architectures of Recurrent Neural Networks… ▽ More Using natural language to give instructions to robots is challenging, since natural language understanding is still largely an open problem. In this paper we address this problem by restricting our attention to commands modeled as one action, plus arguments (also known as slots). For action detection (also called intent detection) and slot filling various architectures of Recurrent Neural Networks and Long Short Term Memory (LSTM) networks were evaluated, having LSTMs achieved a superior accuracy. As the action requested may not fall within the robots capabilities, a Support Vector Machine(SVM) is used to determine whether it is or not. For the input of the neural networks, several word embedding algorithms were compared. Finally, to implement the system in a robot, a ROS package is created using a SMACH state machine. The proposed system is then evaluated both using well-known datasets and benchmarks in the context of domestic service robots. △ Less

Submitted 9 July, 2018; originally announced July 2018.

arXiv:1805.11459 [pdf, ps, other]

doi 10.1063/1.5027270

Adsorption of flexible polymer chains on a surface: Effects of different solvent conditions

Authors: P. H. L. Martins, J. A. Plascak, M. Bachmann

Abstract: Polymer chains undergoing a continuous adsorption-desorption transition are studied through extensive computer simulations. A three-dimensional self-avoiding walk lattice model of a polymer chain grafted onto a surface has been treated for different solvent conditions. We have used an advanced contact-density chain-growth algorithm, in which the density of contacts can be directly obtained. From t… ▽ More Polymer chains undergoing a continuous adsorption-desorption transition are studied through extensive computer simulations. A three-dimensional self-avoiding walk lattice model of a polymer chain grafted onto a surface has been treated for different solvent conditions. We have used an advanced contact-density chain-growth algorithm, in which the density of contacts can be directly obtained. From this quantity, the order parameter and its fourth-order Binder cumulant are computed, as well as the corresponding critical exponents and the adsorption-desorption transition temperature. As the number of configurations with a given number of surface contacts and monomer-monomer contacts is independent of the temperature and solvent conditions, it can be easily applied to get results for different solvent parameter values without the need of any extra simulations. In analogy to continuous magnetic phase transitions, finite-size-scaling methods have been employed. Quite good results for the critical properties and phase diagram of very long single polymer chains have been obtained by properly taking into account the effects of corrections to scaling. The study covers all solvent effects, going from the limit of {\it super-self-avoiding walks}, characterized by effective monomer-monomer repulsion, to poor solvent conditions that enable the formation of compact polymer structures. △ Less

Submitted 25 May, 2018; originally announced May 2018.

Comments: 10 pages, 11 figures. arXiv admin note: text overlap with arXiv:1705.02645

Journal ref: The Journal of Chemical Physics 148, 204901 (2018)

arXiv:1705.02645 [pdf, ps, other]

doi 10.1103/PhysRevE.95.050501

Solvent-Dependent Critical Properties of Polymer Adsorption

Authors: J. A. Plascak, Paulo H. L. Martins, Michael Bachmann

Abstract: Advanced chain-growth computer simulation methodologies have been employed for a systematic statistical analysis of the critical behavior of a polymer adsorbing at a substrate. We use finitesize scaling techniques to investigate the solvent-quality dependence of critical exponents, critical temperature, and the structure of the phase diagram. Our study covers all solvent effects from the limit of… ▽ More Advanced chain-growth computer simulation methodologies have been employed for a systematic statistical analysis of the critical behavior of a polymer adsorbing at a substrate. We use finitesize scaling techniques to investigate the solvent-quality dependence of critical exponents, critical temperature, and the structure of the phase diagram. Our study covers all solvent effects from the limit of super-self-avoiding walks, characterized by effective monomer-monomer repulsion, to poor solvent conditions that enable the formation of compact polymer structures. The results significantly benefit from taking into account corrections to scaling. △ Less

Submitted 7 May, 2017; originally announced May 2017.

Comments: 6 pages, 5 figures

Journal ref: Physical Review E 95, 050501(R) (2017)

arXiv:1209.1818 [pdf, ps, other]

doi 10.1103/PhysRevE.85.041110

Probability distribution of the order parameter in the directed percolation universality class

Authors: P. H. L. Martins

Abstract: The probability distributions of the order parameter for two models in the directed percolation universality class were evaluated. Monte Carlo simulations have been performed for the one-dimensional generalized contact process and the Domany-Kinzel cellular automaton. In both cases, the density of active sites was chosen as the order parameter. The criticality of those models was obtained by solel… ▽ More The probability distributions of the order parameter for two models in the directed percolation universality class were evaluated. Monte Carlo simulations have been performed for the one-dimensional generalized contact process and the Domany-Kinzel cellular automaton. In both cases, the density of active sites was chosen as the order parameter. The criticality of those models was obtained by solely using the corresponding probability distribution function. It has been shown that the present method, which has been successfully employed in treating equilibrium systems, is indeed also useful in the study of nonequilibrium phase transitions. △ Less

Submitted 9 September, 2012; originally announced September 2012.

Comments: 6 pages, 4 figures

Journal ref: P. H. L. Martins, Phys. Rev. E 85, 041110 (2012)

arXiv:1209.1815 [pdf, ps, other]

doi 10.1016/j.cpc.2012.09.014

Probability Distribution Function of the Order Parameter: Mixing Fields and Universality

Authors: J. A. Plascak, P. H. L. Martins

Abstract: We briefly review the use of the order parameter probability distribution function as a useful tool to obtain the critical properties of statistical mechanical models using computer Monte Carlo simulations. Some simple discrete spin magnetic systems on a lattice, such as Ising, general spin-$S$ Blume-Capel and Baxter-Wu, $Q$-state Potts, among other models, will be considered as examples. The impo… ▽ More We briefly review the use of the order parameter probability distribution function as a useful tool to obtain the critical properties of statistical mechanical models using computer Monte Carlo simulations. Some simple discrete spin magnetic systems on a lattice, such as Ising, general spin-$S$ Blume-Capel and Baxter-Wu, $Q$-state Potts, among other models, will be considered as examples. The importance and the necessity of the role of mixing fields in asymmetric magnetic models will be discussed in more detail, as well as the corresponding distributions of the extensive conjugate variables. △ Less

Submitted 9 September, 2012; originally announced September 2012.

Comments: 14 pages, 13 figures, accepted for publication (Computer Physics Communications)

arXiv:cond-mat/0404231 [pdf, ps, other]

doi 10.1590/S0103-97332004000300021

Probability distribution of the order parameter

Authors: P. H. L. Martins, J. A. Plascak

Abstract: The probability distribution of the order parameter is exploited in order to obtain the criticality of magnetic systems. Monte Carlo simulations have been employed by using single spin flip Metropolis algorithm aided by finite-size scaling and histogram reweighting techniques. A method is proposed to obtain this probability distribution even when the transition temperature of the model is unknow… ▽ More The probability distribution of the order parameter is exploited in order to obtain the criticality of magnetic systems. Monte Carlo simulations have been employed by using single spin flip Metropolis algorithm aided by finite-size scaling and histogram reweighting techniques. A method is proposed to obtain this probability distribution even when the transition temperature of the model is unknown. A test is performed on the two-dimensional spin-1/2 and spin-1 Ising model and the results show that the present procedure can be quite efficient and accurate to describe the criticality of the system. △ Less

Submitted 9 April, 2004; originally announced April 2004.

Comments: 5 pages, 7 figures, to appear in Braz. J. Phys. 34, June 2004

arXiv:cond-mat/0404230 [pdf, ps, other]

doi 10.1103/PhysRevB.69.092107

Percolation model for structural phase transitions in Li$_{1-x}$H$_x$IO$_3$ mixed crystals

Authors: P. H. L. Martins, J. A. Plascak, M. A. Pimenta

Abstract: A percolation model is proposed to explain the structural phase transitions found in Li$_{1-x}$H$_x$IO$_3$ mixed crystals as a function of the concentration parameter $x$. The percolation thresholds are obtained from Monte Carlo simulations on the specific lattices occupied by lithium atoms and hydrogen bonds. The theoretical results strongly suggest that percolating lithium vacancies and hydrog… ▽ More A percolation model is proposed to explain the structural phase transitions found in Li$_{1-x}$H$_x$IO$_3$ mixed crystals as a function of the concentration parameter $x$. The percolation thresholds are obtained from Monte Carlo simulations on the specific lattices occupied by lithium atoms and hydrogen bonds. The theoretical results strongly suggest that percolating lithium vacancies and hydrogen bonds are indeed responsible for the solid solution observed in the experimental range $0.22 < x < 0.36$. △ Less

Submitted 9 April, 2004; originally announced April 2004.

Comments: 4 pages, 2 figures

Journal ref: Phys. Rev. B 69, 092107 (2004)

arXiv:cond-mat/0304024 [pdf, ps, other]

doi 10.1103/PhysRevE.67.046119

Percolation on two- and three-dimensional lattices

Authors: P. H. L. Martins, J. A. Plascak

Abstract: In this work we apply a highly efficient Monte Carlo algorithm recently proposed by Newman and Ziff to treat percolation problems. The site and bond percolation are studied on a number of lattices in two and three dimensions. Quite good results for the wrap** probabilities, correlation length critical exponent and critical concentration are obtained for the square, simple cubic, HCP and hexago… ▽ More In this work we apply a highly efficient Monte Carlo algorithm recently proposed by Newman and Ziff to treat percolation problems. The site and bond percolation are studied on a number of lattices in two and three dimensions. Quite good results for the wrap** probabilities, correlation length critical exponent and critical concentration are obtained for the square, simple cubic, HCP and hexagonal lattices by using relatively small systems. We also confirm the universal aspect of the wrap** probabilities regarding site and bond dilution. △ Less

Submitted 1 April, 2003; originally announced April 2003.

Comments: 15 pages, 6 figures, 3 tables

Journal ref: Phys. Rev. E 67, 046119 (2003)

Showing 1–21 of 21 results for author: Martins, P H