-
Reducing hallucination in structured outputs via Retrieval-Augmented Generation
Authors:
Patrice Béchard,
Orlando Marquez Ayala
Abstract:
A common and fundamental limitation of Generative AI (GenAI) is its propensity to hallucinate. While large language models (LLM) have taken the world by storm, without eliminating or at least reducing hallucinations, real-world GenAI systems may face challenges in user adoption. In the process of deploying an enterprise application that produces workflows based on natural language requirements, we…
▽ More
A common and fundamental limitation of Generative AI (GenAI) is its propensity to hallucinate. While large language models (LLM) have taken the world by storm, without eliminating or at least reducing hallucinations, real-world GenAI systems may face challenges in user adoption. In the process of deploying an enterprise application that produces workflows based on natural language requirements, we devised a system leveraging Retrieval Augmented Generation (RAG) to greatly improve the quality of the structured output that represents such workflows. Thanks to our implementation of RAG, our proposed system significantly reduces hallucinations in the output and improves the generalization of our LLM in out-of-domain settings. In addition, we show that using a small, well-trained retriever encoder can reduce the size of the accompanying LLM, thereby making deployments of LLM-based systems less resource-intensive.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
A fugacity-based Lattice Boltzmann method for multicomponent multiphase systems
Authors:
Muzammil Soomro,
Luis F. Ayala,
Cheng Peng,
Orlando M. Ayala
Abstract:
The free energy model can extend the Lattice Boltzmann method to multiphase systems. However, there is a lack of models capable of simulating multicomponent multiphase fluids with partial miscibility. In addition, existing models cannot be generalized to honor thermodynamic information provided by any multicomponent equation of state of choice. In this paper, we introduce a free energy Lattice Bol…
▽ More
The free energy model can extend the Lattice Boltzmann method to multiphase systems. However, there is a lack of models capable of simulating multicomponent multiphase fluids with partial miscibility. In addition, existing models cannot be generalized to honor thermodynamic information provided by any multicomponent equation of state of choice. In this paper, we introduce a free energy Lattice Boltzmann model where the forcing term is determined by the fugacity of the species, the thermodynamic property that connects species partial pressure to chemical potential calculations. By doing so, we are able to carry out multicomponent multiphase simulations of partially miscible fluids and generalize the methodology for use with any multicomponent equation of state of interest. We test this fugacity-based Lattice Boltzmann method for the cases of vapor-liquid equilibrium for two and three-component mixtures in various temperature and pressure conditions. We demonstrate that the model is able to reliably reproduce phase densities and compositions as predicted by multicomponent thermodynamics and can reproduce different characteristic pressure-composition and temperature-composition envelopes with a high degree of accuracy. We also demonstrate that the model can offer accurate predictions under dynamic conditions.
△ Less
Submitted 20 December, 2022;
originally announced December 2022.
-
Azimuth: Systematic Error Analysis for Text Classification
Authors:
Gabrielle Gauthier-Melançon,
Orlando Marquez Ayala,
Lindsay Brin,
Chris Tyler,
Frédéric Branchaud-Charron,
Joseph Marinier,
Karine Grande,
Di Le
Abstract:
We present Azimuth, an open-source and easy-to-use tool to perform error analysis for text classification. Compared to other stages of the ML development cycle, such as model training and hyper-parameter tuning, the process and tooling for the error analysis stage are less mature. However, this stage is critical for the development of reliable and trustworthy AI systems. To make error analysis mor…
▽ More
We present Azimuth, an open-source and easy-to-use tool to perform error analysis for text classification. Compared to other stages of the ML development cycle, such as model training and hyper-parameter tuning, the process and tooling for the error analysis stage are less mature. However, this stage is critical for the development of reliable and trustworthy AI systems. To make error analysis more systematic, we propose an approach comprising dataset analysis and model quality assessment, which Azimuth facilitates. We aim to help AI practitioners discover and address areas where the model does not generalize by leveraging and integrating a range of ML techniques, such as saliency maps, similarity, uncertainty, and behavioral analyses, all in one tool. Our code and documentation are available at github.com/servicenow/azimuth.
△ Less
Submitted 18 December, 2022; v1 submitted 15 December, 2022;
originally announced December 2022.
-
A thermodynamically consistent pseudo-potential lattice Boltzmann model for multi-component, multiphase, partially miscible mixtures
Authors:
Cheng Peng,
Luis F. Ayala,
Orlando M. Ayala
Abstract:
Current multi-component, multiphase pseudo-potential lattice Boltzmann models have thermodynamic inconsistencies that prevent them to correctly predict the thermodynamic phase behavior of partially miscible multi-component mixtures, such as hydrocarbon mixtures. This paper identifies these inconsistencies and attempts to design a thermodynamically consistent multi-component, multiphase pseudo-pote…
▽ More
Current multi-component, multiphase pseudo-potential lattice Boltzmann models have thermodynamic inconsistencies that prevent them to correctly predict the thermodynamic phase behavior of partially miscible multi-component mixtures, such as hydrocarbon mixtures. This paper identifies these inconsistencies and attempts to design a thermodynamically consistent multi-component, multiphase pseudo-potential lattice Boltzmann model that allows mass transfer across the phase interfaces and is capable to predict the phase behavior of typically partially miscible hydrocarbon mixtures. The designed model defines the total interaction force for the entire phase and split the force into individual components. Through a properly derived force split factor associated with the volatility of each component, the model can achieve precise thermodynamic consistency in multi-component hydrocarbon mixtures, which is described by the iso-fugacity rule.
△ Less
Submitted 24 November, 2020; v1 submitted 20 September, 2019;
originally announced September 2019.
-
Isotropy and spurious currents in pseudo-potential multiphase lattice Boltzmann models
Authors:
Cheng Peng,
Luis F. Ayala,
Orlando M. Ayala,
Lian-** Wang
Abstract:
The spurious currents observed in multiphase flow simulations with pseudo-potential lattice Boltzmann (LB) models are usually understood to be the result of the lack of isotropy of the model-generated interaction force between phases. Remedies have been proposed to utilize larger stencils to compute the interaction force with higher orders of isotropy. In this short communication, we point out the…
▽ More
The spurious currents observed in multiphase flow simulations with pseudo-potential lattice Boltzmann (LB) models are usually understood to be the result of the lack of isotropy of the model-generated interaction force between phases. Remedies have been proposed to utilize larger stencils to compute the interaction force with higher orders of isotropy. In this short communication, we point out the incompleteness in the current understanding and propose a new consistent implementation to more effectively suppress the spurious currents. We also demonstrate theoretically that certain low-level spurious currents cannot be eliminated by increasing isotropy if the local hydrostatic balance inside the diffuse interface is not established in the LB models.
△ Less
Submitted 12 June, 2019;
originally announced June 2019.
-
A comparative study of immersed boundary method and interpolated bounce-back scheme for no-slip boundary treatment in the lattice Boltzmann method: Part II, turbulent flows
Authors:
Cheng Peng,
Orlando M. Ayala,
Jorge César Brändle de Motta,
Lian-** Wang
Abstract:
In the first part of this study, we compared the performances of two categories of no-slip boundary treatments, i.e., the interpolated bounce-back schemes and the immersed boundary methods in a series of laminar flow simulations within the lattice Boltzmann method. In this second part, these boundary treatments are further compared in the simulations of turbulent flows with complex geometry to pro…
▽ More
In the first part of this study, we compared the performances of two categories of no-slip boundary treatments, i.e., the interpolated bounce-back schemes and the immersed boundary methods in a series of laminar flow simulations within the lattice Boltzmann method. In this second part, these boundary treatments are further compared in the simulations of turbulent flows with complex geometry to provide a next-level assessment of these schemes. Two non-trivial turbulent flow problems, a fully developed turbulent pipe flow at a low Reynolds number, and a decaying homogeneous isotropic turbulent flow laden with a large number of resolved spherical particles are considered. The major problem of the immersed boundary method revealed by the present study is its incapability in computing the local velocity gradients inside the diffused interface, which can result in significantly underestimated dissipation rate and viscous diffusion locally near the particle surfaces. Otherwise, both categories of the no-slip boundary treatments are able to provide accurate results for most of turbulent statistics in both the carrier and dispersed phases, provided that sufficient grid resolutions are used.
△ Less
Submitted 12 June, 2019;
originally announced June 2019.
-
A comparative study of immersed boundary method and interpolated bounce-back scheme for no-slip boundary treatment in the lattice Boltzmann method: Part I, laminar flows
Authors:
Cheng Peng,
Orlando M. Ayala,
Lian-** Wang
Abstract:
The interpolated bounce-back scheme and the immersed boundary method are the two most popular algorithms in treating a no-slip boundary on curved surfaces in the lattice Boltzmann method. While those algorithms are frequently implemented in the numerical simulations involving complex geometries, such as particle-laden flows, their performances are seldom compared systematically over the same local…
▽ More
The interpolated bounce-back scheme and the immersed boundary method are the two most popular algorithms in treating a no-slip boundary on curved surfaces in the lattice Boltzmann method. While those algorithms are frequently implemented in the numerical simulations involving complex geometries, such as particle-laden flows, their performances are seldom compared systematically over the same local quantities within the same context. In this paper, we present a systematic comparative investigation on some frequently used and most state-of-the-art interpolated bounce-back schemes and immersed boundary methods, based on both theoretical analyses and numerical simulations of four selected 2D and 3D laminar flow problems. Our analyses show that immersed boundary methods (IBM) typically yield a first-order accuracy when the regularized delta-function is employed to interpolate velocity from the Eulerian to Lagrangian mesh, and the resulting boundary force back to the Eulerian mesh. This first order in accuracy for IBM is observed for both the local velocity and hydrodynamic force/torque, apparently different from the second-order accuracy sometimes claimed in the literature. Another serious problem of immersed boundary methods is that the local stress within the diffused fluid-solid interface tends to be significantly underestimated. On the other hand, the interpolated bounce-back generally possesses a second-order accuracy for velocity, hydrodynamic force/torque, and local stress field. The main disadvantage of the interpolated bounce-back schemes is its higher level of fluctuations in the calculated hydrodynamic force/torque when a solid object moves across the grid lines. General guidelines are also provided for the necessary grid resolutions in the two approaches in order to accurately simulate flows over a solid particle.
△ Less
Submitted 12 June, 2019;
originally announced June 2019.