Search | arXiv e-print repository

doi 10.5121/ijci.2024.130202

Improving the Capabilities of Large Language Model Based Marketing Analytics Copilots With Semantic Search And Fine-Tuning

Authors: Yilin Gao, Sai Kumar Arava, Yancheng Li, James W. Snyder Jr

Abstract: Artificial intelligence (AI) is widely deployed to solve problems related to marketing attribution and budget optimization. However, AI models can be quite complex, and it can be difficult to understand model workings and insights without extensive implementation teams. In principle, recently developed large language models (LLMs), like GPT-4, can be deployed to provide marketing insights, reducing the time and effort required to make critical decisions. In practice, there are substantial challenges that need to be overcome to reliably use such models. We focus on domain-specific question-answering, SQL generation needed for data retrieval, and tabular analysis and show how a combination of semantic search, prompt engineering, and fine-tuning can be applied to dramatically improve the ability of LLMs to execute these tasks accurately. We compare both proprietary models, like GPT-4, and open-source models, like Llama-2-70b, as well as various embedding methods. These models are tested on sample use cases specific to marketing mix modeling and attribution.

Submitted 15 April, 2024; originally announced April 2024.

Comments: 16 pages, 5 figures, presented at the 2nd International Conference on NLP & AI (NLPAI 2024)

ACM Class: I.2.1; I.2.7

Journal ref: International Journal on Cybernetics & Informatics (IJCI), vol. 13, no. 2, pp. 15-31, Apr. 2024

arXiv:2311.00224 [pdf, other]

Domain decomposition-based coupling of physics-informed neural networks via the Schwarz alternating method

Authors: Will Snyder, Irina Tezaur, Christopher Wentland

Abstract: Physics-informed neural networks (PINNs) are appealing data-driven tools for solving and inferring solutions to nonlinear partial differential equations (PDEs). Unlike traditional neural networks (NNs), which train only on solution data, a PINN incorporates a PDE's residual into its loss function and trains to minimize the said residual at a set of collocation points in the solution domain. This paper explores the use of the Schwarz alternating method as a means to couple PINNs with each other and with conventional numerical models (i.e., full order models, or FOMs, obtained via the finite element, finite difference or finite volume methods) following a decomposition of the physical domain. It is well-known that training a PINN can be difficult when the PDE solution has steep gradients. We investigate herein the use of domain decomposition and the Schwarz alternating method as a means to accelerate the PINN training phase. Within this context, we explore different approaches for imposing Dirichlet boundary conditions within each subdomain PINN: weakly through the loss and/or strongly through a solution transformation. As a numerical example, we consider the one-dimensional steady state advection-diffusion equation in the advection-dominated (high Peclet) regime. Our results suggest that the convergence of the Schwarz method is strongly linked to the choice of boundary condition implementation within the PINNs being coupled. Surprisingly, strong enforcement of the Schwarz boundary conditions does not always lead to a faster convergence of the method. While it is not clear from our preliminary study that the PINN-PINN coupling via the Schwarz alternating method accelerates PINN convergence in the advection-dominated regime, it reveals that PINN training can be improved substantially for Peclet numbers as high as 1e6 by performing a PINN-FOM coupling.

Submitted 31 October, 2023; originally announced November 2023.

Report number: SAND2023-11869O

arXiv:2302.06075 [pdf, other]

A Graphical Point Process Framework for Understanding Removal Effects in Multi-Touch Attribution

Authors: Jun Tao, Qian Chen, James W. Snyder Jr., Arava Sai Kumar, Amirhossein Meisami, Lingzhou Xue

Abstract: Marketers employ various online advertising channels to reach customers, and they are particularly interested in attribution for measuring the degree to which individual touchpoints contribute to an eventual conversion. The availability of individual customer-level path-to-purchase data and the increasing number of online marketing channels and types of touchpoints bring new challenges to this fundamental problem. We aim to tackle the attribution problem with finer granularity by conducting attribution at the path level. To this end, we develop a novel graphical point process framework to study the direct conversion effects and the full relational structure among numerous types of touchpoints simultaneously. Utilizing the temporal point process of conversion and the graphical structure, we further propose graphical attribution methods to allocate proper path-level conversion credit, called the attribution score, to individual touchpoints or corresponding channels for each customer's path to purchase. Our proposed attribution methods consider the attribution score as the removal effect, and we use the rigorous probabilistic definition to derive two types of removal effects. We examine the performance of our proposed methods in extensive simulation studies and compare their performance with commonly used attribution models. We also demonstrate the performance of the proposed methods in a real-world attribution application.

Submitted 12 February, 2023; originally announced February 2023.

Comments: 38 pages, 10 figures

arXiv:2202.14017 [pdf, other]

Reduced Order Model Closures: A Brief Tutorial

Authors: William Snyder, Changhong Mou, Honghu Liu, Omer San, Raffaella De Vita, Traian Iliescu

Abstract: In this paper, we present a brief tutorial on reduced order model (ROM) closures. First, we carefully motivate the need for ROM closure modeling in under-resolved simulations. Then, we construct step by step the ROM closure model by extending the classical Galerkin framework to the spaces of resolved and unresolved scales. Finally, we develop the data-driven variational multiscale ROM closure and then we test it in fluid flow simulations. Our tutorial on ROM closures is structured as a sequence of questions and answers, and is aimed at first year graduate students and advanced undergraduate students. Our goal is not to explain the "how," but the "why." That is, we carefully explain the principles used to develop ROM closures, without focusing on particular approaches. Furthermore, we try to keep the technical details to a minimum and describe the general ideas in broad terms while citing appropriate references for details.

Submitted 28 February, 2022; originally announced February 2022.

arXiv:2007.00117 [pdf]

doi 10.1021/acs.nanolett.0c03263

Growth kinetics and atomistic mechanisms of native oxidation of ZrS$_x$Se$_{2-x}$ and MoS$_2$ crystals

Authors: Seong Soon Jo, Akshay Singh, Liqiu Yang, Subodh C. Tiwari, Sungwook Hong, Aravind Krishnamoorthy, Maria Gabriela Sales, Sean M. Oliver, Joshua Fox, Randal L. Cavalero, David W. Snyder, Patrick M. Vora, Stephen J. McDonnell, Priya Vashishta, Rajiv K. Kalia, Aiichiro Nakano, Rafael Jaramillo

Abstract: A thorough understanding of native oxides is essential for designing semiconductor devices. Here we report a study of the rate and mechanisms of spontaneous oxidation of bulk single crystals of ZrS$_x$Se$_{2-x}$ alloys and MoS$_2$. ZrS$_x$Se$_{2-x}$ alloys oxidize rapidly, and the oxidation rate increases with Se content. Oxidation of basal surfaces is initiated by favorable O$_2$ adsorption and proceeds by a mechanism of Zr-O bond switching, that collapses the van der Waals gaps, and is facilitated by progressive redox transitions of the chalcogen. The rate-limiting process is the formation and out-diffusion of SO$_2$. In contrast, MoS$_2$ basal surfaces are stable due to unfavorable oxygen adsorption. Our results provide insight and quantitative guidance for designing and processing semiconductor devices based on ZrS$_x$Se$_{2-x}$ and MoS$_2$, and identify the atomistic-scale mechanisms of bonding and phase transformations in layered materials with competing anions.

Submitted 16 November, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

arXiv:1605.05707 [pdf, other]

Ultrafast Isomerization in Acetylene Dication: To Be or Not To Be

Authors: Zheng Li, Ludger Inhester, Chelsea Liekhus-Schmaltz, Basile Curchod, James William Snyder Jr., Nikita Medvedev, James Cryan, Timur Osipov, Stefan Pabst, Oriol Vendrell, Phil Bucksbaum, Todd Martinez

Abstract: Experimental evidence has pointed toward the existence of ultrafast proton migration and isomerization as a key process for acetylene and its ions, however the actual mechanism for ultrafast isomerization of the acetylene [HCCH]2+ to vinylidene [H2CC]2+ dication remains nebulous. Theoretical studies show a high potential barrier of over 2eV for the isomerization pathways on the low lying dicationic states, implying that the corresponding isomerization should take picoseconds or even longer according to transition state theory. However a recent experiment at a femtosecond X-ray free electron laser (XFEL) [Nature Commun. 6, 8199 (2015)] suggests that large amplitude hydrogen migration proceeds on a sub-100 femtosecond time scale. In order to resolve the contradiction, we present a complete theoretical study of the dynamics of acetylene dication produced by Auger decay after X-ray photoionization of the carbon atom K shell. We find that isomerization does not occur on the sub-100 fs timescale and is not required to explain the time-resolved Coulomb imaging experiment. This study resolves the seeming contradiction between experiment and theory concerning the isomerization time scale in acetylene dication. This work calls for careful interpretation of structural information from the widely applied Coulomb momentum imaging method but also points out its strengths in mapping out momentum dispersion dynamics even when structural variation is minor.

Submitted 7 September, 2017; v1 submitted 18 May, 2016; originally announced May 2016.

Comments: 38 pages, 16 figures

arXiv:1310.1870 [pdf]

doi 10.1557/jmr.2013.323

Prospects of Direct Growth Boron Nitride Films as Substrates for Graphene Electronics

Authors: Michael S. Bresnehan, Matthew J. Hollander, Maxwell Wetherington, Ke Wang, Takahira Miyagi, Gregory Pastir, David W. Snyder, Jamie J. Gengler, Andrey A. Voevodin, William C. Mitchel, Joshua A. Robinson

Abstract: We present a route for direct growth of boron nitride via a polyborazylene to h-BN conversion process. This two-step growth process ultimately leads to a >25x reduction in the RMS surface roughness of h-BN films when compared to a high temperature growth on Al2O3(0001) and Si(111) substrates. Additionally, the stoichiometry is shown to be highly dependent on the initial polyborazylene deposition temperature. Importantly, CVD graphene transferred to direct-grown boron nitride films on Al2O3 at 400°C results in a >1.5x and >2.5x improvement in mobility compared to CVD graphene transferred to Al2O3 and SiO2 substrates, respectively, which is attributed to the combined reduction of remote charged impurity scattering and surface roughness scattering. Simulation of mobility versus carrier concentration confirms the importance of limiting the introduction of charged impurities in the h-BN film and highlights the importance of these results in producing optimized h-BN substrates for high performance graphene and TMD devices.

Submitted 7 October, 2013; originally announced October 2013.

Comments: To be published in the Journal of Materials Research, Focus Issue: Graphene and Beyond

arXiv:1103.6052 [pdf, other]

Internal Constraints of the Trifocal Tensor

Authors: Stuart B. Heinrich, Wesley E. Snyder

Abstract: The fundamental matrix and trifocal tensor are convenient algebraic representations of the epipolar geometry of two and three view configurations, respectively. The estimation of these entities is central to most reconstruction algorithms, and a solid understanding of their properties and constraints is therefore very important. The fundamental matrix has 1 internal constraint which is well understood, whereas the trifocal tensor has 8 independent algebraic constraints. The internal tensor constraints can be represented in many ways, although there is only one minimal and sufficient set of 8 constraints known. In this paper, we derive a second set of minimal and sufficient constraints that is simpler. We also show how this can be used in a new parameterization of the trifocal tensor. We hope that this increased understanding of the internal constraints may lead to improved algorithms for estimating the trifocal tensor, although the primary contribution is an improved theoretical understanding.

Submitted 30 March, 2011; originally announced March 2011.

arXiv:1103.5808 [pdf, other]

Improved Edge Awareness in Discontinuity Preserving Smoothing

Authors: Stuart B. Heinrich, Wesley E. Snyder

Abstract: Discontinuity preserving smoothing is a fundamentally important procedure that is useful in a wide variety of image processing contexts. It is directly useful for noise reduction, and frequently used as an intermediate step in higher level algorithms. For example, it can be particularly useful in edge detection and segmentation. Three well known algorithms for discontinuity preserving smoothing are nonlinear anisotropic diffusion, bilateral filtering, and mean shift filtering. Although slight differences make them each better suited to different tasks, all are designed to preserve discontinuities while smoothing. However, none of them satisfy this goal perfectly: they each have exception cases in which smoothing may occur across hard edges. The principal contribution of this paper is the identification of a property we call edge awareness that should be satisfied by any discontinuity preserving smoothing algorithm. This constraint can be incorporated into existing algorithms to improve quality, and usually has negligible changes in runtime performance and/or complexity. We present modifications necessary to augment diffusion and mean shift, as well as a new formulation of the bilateral filter that unifies the spatial and range spaces to achieve edge awareness.

Submitted 29 March, 2011; originally announced March 2011.

Showing 1–9 of 9 results for author: Snyder, W