-
An Automated SQL Query Grading System Using An Attention-Based Convolutional Neural Network
Authors:
Donald R. Schwartz,
Pablo Rivas
Abstract:
Grading SQL queries can be a time-consuming, tedious, and challenging task, especially as the number of student submissions increases. Several systems have been introduced in an attempt to mitigate these challenges, but those systems have their own limitations. This paper describes our novel approach to automating the process of grading SQL queries. Unlike previous approaches, we use a unique convolutional neural network architecture with a parameter-sharing approach across different machine learning tasks, which enables the architecture to induce different knowledge representations of the data and increases its potential for understanding SQL statements.
Submitted 22 June, 2024;
originally announced June 2024.
-
A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing
Authors:
Nurul Rafi,
Pablo Rivas
Abstract:
Research in neural models inspired by the mammalian visual cortex has led to many spiking neural networks, such as pulse-coupled neural networks (PCNNs). These models are oscillating, spatio-temporal models stimulated with images to produce several time-based responses. This paper reviews the state of the art in PCNNs, covering their mathematical formulation, variants, and other simplifications found in the literature. We present several applications in which PCNN architectures have successfully addressed some fundamental image processing and computer vision challenges, including image segmentation, edge detection, medical imaging, image fusion, image compression, object recognition, and remote sensing. Results achieved in these applications suggest that the PCNN architecture generates useful perceptual information relevant to a wide variety of computer vision tasks.
Submitted 31 May, 2024;
originally announced June 2024.
-
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Authors:
Ernesto Quevedo,
Jorge Yero,
Rachel Koerner,
Pablo Rivas,
Tomas Cerny
Abstract:
Concerns regarding the propensity of Large Language Models (LLMs) to produce inaccurate outputs, also known as hallucinations, have escalated. Detecting them is vital for ensuring the reliability of applications relying on LLM-generated content. Current methods often demand substantial resources and rely on extensive LLMs, or employ supervised learning with multidimensional features or intricate linguistic and semantic analyses that are difficult to reproduce and that largely depend on using the same LLM that hallucinated. This paper introduces a supervised learning approach employing two simple classifiers that use only four numerical features derived from token and vocabulary probabilities obtained from other LLM evaluators, which need not be the same as the LLM that generated the text. The method yields promising results, surpassing state-of-the-art outcomes in multiple tasks across three different benchmarks. Additionally, we provide a comprehensive examination of the strengths and weaknesses of our approach, highlighting the significance of the features utilized and the LLM employed as an evaluator. We have released our code publicly at https://github.com/Baylor-AI/HalluDetect.
Submitted 29 May, 2024;
originally announced May 2024.
-
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
Authors:
Corinne Aars,
Lauren Adams,
Xiaokan Tian,
Zhaoyu Wang,
Colton Wismer,
Jason Wu,
Pablo Rivas,
Korn Sooksatra,
Matthew Fendt
Abstract:
This study presents the development and evaluation of a ByT5-based multilingual translation model tailored for translating the Bible into underrepresented languages. Utilizing the comprehensive Johns Hopkins University Bible Corpus, we trained the model to capture the intricate nuances of character-based and morphologically rich languages. Our results, measured by the BLEU score and supplemented with sample translations, suggest the model can improve accessibility to sacred texts. It effectively handles the distinctive biblical lexicon and structure, thus bridging the linguistic divide. The study also discusses the model's limitations and suggests pathways for future enhancements, focusing on expanding access to sacred literature across linguistic boundaries.
Submitted 30 May, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
On the Challenges of Creating Datasets for Analyzing Commercial Sex Advertisements to Assess Human Trafficking Risk and Organized Activity
Authors:
Pablo Rivas,
Tomas Cerny,
Alejandro Rodriguez Perez,
Javier Turek,
Laurie Giddens,
Gisela Bichler,
Stacie Petter
Abstract:
Our study addresses the challenges of building datasets to understand the risks associated with organized activities and human trafficking through commercial sex advertisements. These challenges include data scarcity, rapid obsolescence, and privacy concerns. Traditional approaches, which are not automated and are difficult to reproduce, fall short in addressing these issues. We have developed a reproducible and automated methodology to analyze five million advertisements. In the process, we identified further challenges in dataset creation within this sensitive domain. This paper presents a streamlined methodology to assist researchers in constructing effective datasets for combating organized crime, allowing them to focus on advancing detection technologies.
Submitted 22 May, 2024;
originally announced May 2024.
-
On Adversarial Examples for Text Classification by Perturbing Latent Representations
Authors:
Korn Sooksatra,
Bikram Khanal,
Pablo Rivas
Abstract:
Recently, with the advancement of deep learning, several applications in text classification have advanced significantly. However, this improvement comes at a cost because deep learning is vulnerable to adversarial examples. This weakness indicates that deep learning is not very robust. Fortunately, the input of a text classifier is discrete, which can protect the classifier from state-of-the-art attacks. Nonetheless, previous works have generated black-box attacks that successfully manipulate the discrete values of the input to find adversarial examples. Therefore, instead of changing the discrete values, we transform the input into its real-valued embedding vector and perform state-of-the-art white-box attacks on it. Then, we convert the perturbed embedding vector back into text and call the result an adversarial example. In summary, we create a framework that measures the robustness of a text classifier by using the gradients of the classifier.
Submitted 6 May, 2024;
originally announced May 2024.
-
Is ReLU Adversarially Robust?
Authors:
Korn Sooksatra,
Greg Hamerly,
Pablo Rivas
Abstract:
The efficacy of deep learning models has been called into question by the presence of adversarial examples. Addressing the vulnerability of deep learning models to adversarial examples is crucial for ensuring their continued development and deployment. In this work, we focus on the role of rectified linear unit (ReLU) activation functions in the generation of adversarial examples. ReLU functions are commonly used in deep learning models because they facilitate the training process. However, our empirical analysis demonstrates that ReLU functions are not robust against adversarial examples. We propose a modified version of the ReLU function, which improves robustness against adversarial examples. Our results are supported by an experiment, which confirms the effectiveness of our proposed modification. Additionally, we demonstrate that applying adversarial training to our customized model further enhances its robustness compared to a general model.
Submitted 6 May, 2024;
originally announced May 2024.
-
A Review on Machine Learning Algorithms for Dust Aerosol Detection using Satellite Data
Authors:
Nurul Rafi,
Pablo Rivas
Abstract:
Dust storms are associated with certain respiratory illnesses across different areas of the world. Researchers have devoted time and resources to studying the elements surrounding dust storm phenomena. This paper reviews the efforts of those who have investigated dust aerosols with machine learning-based approaches, using data from sensors onboard satellites. We have reviewed the most common issues surrounding dust aerosol modeling using different datasets and different sensors from a historical perspective. Our findings suggest that multi-spectral approaches based on linear and non-linear combinations of spectral bands are some of the most successful for visualization and quantitative analysis; however, when researchers have leveraged machine learning, performance has improved and new opportunities to solve unique problems have arisen.
Submitted 14 April, 2024;
originally announced April 2024.
-
A Modified Depolarization Approach for Efficient Quantum Machine Learning
Authors:
Bikram Khanal,
Pablo Rivas
Abstract:
Quantum Computing in the Noisy Intermediate-Scale Quantum (NISQ) era has shown promising applications in machine learning, optimization, and cryptography. Despite the progress, challenges persist due to system noise, errors, and decoherence that complicate the simulation of quantum systems. The depolarization channel is a standard tool for simulating a quantum system's noise. However, modeling such noise for practical applications is computationally expensive when we have limited hardware resources, as is the case in the NISQ era. We propose a modified representation for a single-qubit depolarization channel with two Kraus operators based only on X and Z Pauli matrices. Our approach reduces the computational complexity from six to four matrix multiplications per execution of a channel. Experiments on a Quantum Machine Learning (QML) model on the Iris dataset across various circuit depths and depolarization rates validate that our approach maintains the model's accuracy while improving efficiency. This simplified noise model enables more scalable simulations of quantum circuits under depolarization, advancing capabilities in the NISQ era.
Submitted 10 April, 2024;
originally announced April 2024.
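For context on the operation count mentioned in the abstract above, here is a minimal sketch of the standard (textbook) single-qubit depolarizing channel, ρ → (1 − p)ρ + (p/3)(XρX + YρY + ZρZ). It is not the modified two-operator representation proposed in the paper, whose exact form is not given in this listing. The function name `depolarize` and the use of NumPy are assumptions made for illustration only.

```python
import numpy as np

# Pauli matrices (standard definitions)
X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)


def depolarize(rho, p):
    """Standard single-qubit depolarizing channel (hypothetical helper name).

    rho -> (1 - p) * rho + (p / 3) * (X rho X + Y rho Y + Z rho Z)

    Each of the three Pauli terms costs two matrix products, which gives
    the six multiplications per channel application cited as the baseline.
    """
    pauli_sum = X @ rho @ X + Y @ rho @ Y + Z @ rho @ Z
    return (1 - p) * rho + (p / 3) * pauli_sum


# Usage: apply the channel to the pure state |0><0| with p = 0.3
rho0 = np.array([[1, 0], [0, 0]], dtype=complex)
rho_noisy = depolarize(rho0, 0.3)
```

The output remains a valid density matrix (unit trace), with some population moved from |0⟩ to |1⟩; a representation that needs fewer Pauli products per application would lower the per-channel cost in a simulator.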
-
Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements
Authors:
Alejandro Rodriguez Perez,
Pablo Rivas
Abstract:
This project tackles the pressing issue of human trafficking in online C2C marketplaces through advanced Natural Language Processing (NLP) techniques. We introduce a novel methodology for generating pseudo-labeled datasets with minimal supervision, serving as a rich resource for training state-of-the-art NLP models. Focusing on tasks like Human Trafficking Risk Prediction (HTRP) and Organized Activity Detection (OAD), we employ cutting-edge Transformer models for analysis. A key contribution is the implementation of an interpretability framework using Integrated Gradients, providing explainable insights crucial for law enforcement. This work not only fills a critical gap in the literature but also offers a scalable, machine learning-driven approach to combat human exploitation online. It serves as a foundation for future research and practical applications, emphasizing the role of machine learning in addressing complex social issues.
Submitted 21 November, 2023;
originally announced November 2023.
-
Dynamics and Control of Bubble-Propelled Microrobots
Authors:
David P. Rivas,
Max Sokolich,
Harrison Muller,
Sambeeta Das
Abstract:
Having the advantage of being relatively fast and powerful, as well as readily fabricated, spherical bubble-propelled microrobots are particularly well suited for applications such as cargo delivery, micromanipulation, and biological or environmental remediation. However, there have been limited examples of control and manipulation with these microrobots and few studies on their dynamics. Here we investigate the bubble formation and dynamics of both hemispherically coated Janus microrobots and GLAD "patchy" microrobots, which not only provide an interesting comparison but also exhibit useful properties in their own right. Specifically, we find that the patchy microrobots tend to produce smaller bubbles and undergo smoother motion, properties that are beneficial for applications such as precise micromanipulation. We demonstrate manipulation and assembly of passive spheres on a substrate as well as at an air-liquid interface. We also characterize the propulsion and bubble formation of both types of microrobots and find that previously proposed theories insufficiently describe their motion and bubble bursting mechanism. Additionally, we observe that the microrobots, which reside at the air-liquid interface, demonstrate positive gravitaxis towards the droplet edges, which we attribute to a torque resulting from opposing downward and buoyant forces on the microrobot.
Submitted 24 March, 2022;
originally announced March 2022.
-
Driven Topological Transitions in Active Nematic Films
Authors:
David P. Rivas,
Tyler N. Shendruk,
Robert R. Henry,
Daniel H. Reich,
Robert L. Leheny
Abstract:
The topological properties of many materials are central to their behavior, with the dynamics of topological defects being particularly important to intrinsically out-of-equilibrium, active materials. In this paper, local manipulation of the ordering, dynamics, and topological properties of microtubule-based extensile active nematic films is demonstrated in a joint experimental and simulation study. Hydrodynamic stresses created by magnetically actuated rotation of disk-shaped colloids in proximity to the films compete with internal stresses in the active nematic, enabling local control of the motion of the +1/2 charge topological defects that are intrinsic to spontaneously turbulent active films. Sufficiently large applied stresses drive the formation of +1 charge topological vortices in the director field through the merger of two +1/2 defects. The directed motion of the defects is accompanied by ordering of the vorticity and velocity of the active flows within the film that is qualitatively unlike the response of passive viscous films. Many features of the film's response to the disk are captured by Lattice Boltzmann simulations, leading to insight into the anomalous viscoelastic nature of the active nematic. The topological vortex formation is accompanied by a rheological instability in the film that leads to a significant increase in the flow velocities. Comparison of the velocity profile in the vicinity of the vortex with fluid-dynamics calculations provides an estimate of the film viscosity.
Submitted 31 October, 2019;
originally announced October 2019.