-
Evaluating the Uncanny Valley Effect in Dark Colored Skin Virtual Humans
Authors:
Victor Araujo,
Angelo Brandelli Costa,
Soraia Raupp Musse
Abstract:
With the rapid advancement of technology, the design of virtual humans has led to a very realistic user experience, such as in movies, video games, and simulations. As a result, virtual humans are becoming increasingly similar to real humans. However, following the Uncanny Valley (UV) theory, users tend to feel discomfort when watching entities with anthropomorphic traits that differ from real humans. This phenomenon is related to social identity theory, where the observer looks for something familiar. In Computer Graphics (CG), techniques used to create virtual humans with dark skin tones often rely on approaches initially developed for rendering characters with white skin tones. Furthermore, most CG characters portrayed in various media, including movies and games, predominantly exhibit white skin tones. Consequently, it is pertinent to explore people's perceptions regarding different groups of virtual humans. Thus, this paper aims to examine and evaluate the human perception of CG characters from different media, comparing two types of skin colors. The findings indicate that individuals felt more comfortable and perceived less realism when watching characters with dark colored skin than those with white colored skin. Our central hypothesis is that dark colored characters, rendered with classically developed algorithms, are considered more cartoon-like than realistic and are placed to the left of the Valley in the UV chart.
Submitted 11 December, 2023;
originally announced December 2023.
-
Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need
Authors:
Cheng Peng,
Xi Yang,
Aokun Chen,
Zehao Yu,
Kaleb E Smith,
Anthony B Costa,
Mona G Flores,
Jiang Bian,
Yonghui Wu
Abstract:
Objective: To solve major clinical natural language processing (NLP) tasks using a unified text-to-text learning architecture based on a generative large language model (LLM) via prompt tuning. Methods: We formulated 7 key clinical NLP tasks as text-to-text learning and solved them using one unified generative clinical LLM, GatorTronGPT, developed using GPT-3 architecture and trained with up to 20 billion parameters. We adopted soft prompts (i.e., trainable vectors) with frozen LLM, where the LLM parameters were not updated (i.e., frozen) and only the vectors of soft prompts were updated, known as prompt tuning. We added additional soft prompts as a prefix to the input layer, which were optimized during the prompt tuning. We evaluated the proposed method using 7 clinical NLP tasks and compared them with previous task-specific solutions based on Transformer models. Results and Conclusion: The proposed approach achieved state-of-the-art performance for 5 out of 7 major clinical NLP tasks using one unified generative LLM. Our approach outperformed previous task-specific transformer models by ~3% for concept extraction and 7% for relation extraction applied to social determinants of health, 3.4% for clinical concept normalization, 3.4~10% for clinical abbreviation disambiguation, and 5.5~9% for natural language inference. Our approach also outperformed a previously developed prompt-based machine reading comprehension (MRC) model, GatorTron-MRC, for clinical concept and relation extraction. The proposed approach can deliver the "one model for all" promise from training to deployment using a unified generative LLM.
Submitted 10 December, 2023;
originally announced December 2023.
-
Domain Knowledge Distillation from Large Language Model: An Empirical Study in the Autonomous Driving Domain
Authors:
Yun Tang,
Antonio A. Bruto da Costa,
Jason Zhang,
Irvine Patrick,
Siddartha Khastgir,
Paul Jennings
Abstract:
Engineering knowledge-based (or expert) systems require extensive manual effort and domain knowledge. As Large Language Models (LLMs) are trained using an enormous amount of cross-domain knowledge, it becomes possible to automate such engineering processes. This paper presents an empirical automation and semi-automation framework for domain knowledge distillation using prompt engineering and the LLM ChatGPT. We assess the framework empirically in the autonomous driving domain and present our key observations. In our implementation, we construct the domain knowledge ontology by "chatting" with ChatGPT. The key finding is that while fully automated domain ontology construction is possible, human supervision and early intervention typically improve efficiency and output quality as they lessen the effects of response randomness and the butterfly effect. We, therefore, also develop a web-based distillation assistant enabling supervision and flexible intervention at runtime. We hope our findings and tools could inspire future research toward revolutionizing the engineering of knowledge-based systems across application domains.
Submitted 17 July, 2023;
originally announced July 2023.
-
A Study of Generative Large Language Model for Medical Research and Healthcare
Authors:
Cheng Peng,
Xi Yang,
Aokun Chen,
Kaleb E Smith,
Nima PourNejatian,
Anthony B Costa,
Cheryl Martin,
Mona G Flores,
Ying Zhang,
Tanja Magoc,
Gloria Lipori,
Duane A Mitchell,
Naykky S Ospina,
Mustafa M Ahmed,
William R Hogan,
Elizabeth A Shenkman,
Yi Guo,
Jiang Bian,
Yonghui Wu
Abstract:
There is enormous enthusiasm and concern about using large language models (LLMs) in healthcare, yet current assumptions are all based on general-purpose LLMs such as ChatGPT. This study develops a clinical generative LLM, GatorTronGPT, using 277 billion words of mixed clinical and English text with a GPT-3 architecture of 20 billion parameters. GatorTronGPT improves biomedical natural language processing for medical research. Synthetic NLP models trained using GatorTronGPT-generated text outperform NLP models trained using real-world clinical text. A physicians' Turing test using a 1 (worst) to 9 (best) scale shows that there is no significant difference in linguistic readability (p = 0.22; 6.57 of GatorTronGPT compared with 6.93 of human) and clinical relevance (p = 0.91; 7.0 of GatorTronGPT compared with 6.97 of human) and that physicians cannot differentiate them (p < 0.001). This study provides insights on the opportunities and challenges of LLMs for medical research and healthcare.
Submitted 22 May, 2023;
originally announced May 2023.
-
Ensemble learning techniques for intrusion detection system in the context of cybersecurity
Authors:
Andricson Abeline Moreira,
Carlos A. C. Tojeiro,
Carlos J. Reis,
Gustavo Henrique Massaro,
Igor Andrade Brito,
Kelton A. P. da Costa
Abstract:
Recently, there has been an interest in improving the resources available in Intrusion Detection System (IDS) techniques. In this sense, several studies related to cybersecurity show that the environment invasions and information kidnapping are increasingly recurrent and complex. The criticality of the business involving operations in an environment using computing resources does not allow the vulnerability of the information. Cybersecurity has taken on a dimension within the universe of indispensable technology in corporations, and the prevention of risks of invasions into the environment is dealt with daily by Security teams. Thus, the main objective of the study was to investigate the Ensemble Learning technique using the Stacking method, supported by the Support Vector Machine (SVM) and k-Nearest Neighbour (kNN) algorithms aiming at an optimization of the results for DDoS attack detection. For this, the Intrusion Detection System concept was used with the application of the Data Mining and Machine Learning Orange tool to obtain better results.
Submitted 21 December, 2022;
originally announced December 2022.
-
Can gender categorization influence the perception of animated virtual humans?
Authors:
V. Araujo,
D. Schaffer,
A. B. Costa,
S. R. Musse
Abstract:
Animations have become increasingly realistic with the evolution of Computer Graphics (CG). In particular, human models and behaviors were represented through animated virtual humans, sometimes with a high level of realism. Gender is a characteristic that is related to human identification, so that virtual humans assigned to a specific gender have, in general, stereotyped representations through movements, clothes, hair and colors, in order to be understood by users as desired by designers. An important area of study is finding out whether participants' perceptions change depending on how a virtual human is visually presented. Findings in this area can help the industry to guide the modeling and animation of virtual humans to deliver the expected impact to the audience. In this paper, we reproduce, through CG, a perceptual study that aims to assess gender bias in relation to a simulated baby. In the original study, two groups of people watched the same video of a baby reacting to the same stimuli, but one group was told the baby was female and the other group was told the same baby was male, producing different perceptions. The results of our study with virtual babies were similar to the findings with real babies. They show that people's emotional responses change depending on the character's gender attribute; in this case, the only difference was the baby's name. Our research indicates that just informing participants of the name of a virtual human can be enough to create a gender perception that impacts the participant's emotional answer.
Submitted 3 August, 2022;
originally announced August 2022.
-
Explaining Outcomes of Multi-Party Dialogues using Causal Learning
Authors:
Priyanka Sinha,
Pabitra Mitra,
Antonio Anastasio Bruto da Costa,
Nikolaos Kekatos
Abstract:
Multi-party dialogues are common in enterprise social media on technical as well as non-technical topics. The outcome of a conversation may be positive or negative. It is important to analyze why a dialogue ends with a particular sentiment from the point of view of conflict analysis as well as future collaboration design. We propose an explainable time series mining algorithm for such analysis. A dialogue is represented as an attributed time series of occurrences of keywords, EMPATH categories, and inferred sentiments at various points in its progress. A special decision tree, with decision metrics that take into account temporal relationships between dialogue events, is used for predicting the cause of the outcome sentiment. Interpretable rules mined from the classifier are used to explain the prediction. Experimental results are presented for the enterprise social media posts in a large company.
Submitted 3 May, 2021;
originally announced May 2021.
-
Detecting Personality and Emotion Traits in Crowds from Video Sequences
Authors:
Rodolfo Migon Favaretto,
Paulo Knob,
Soraia Raupp Musse,
Felipe Vilanova,
Ângelo Brandelli Costa
Abstract:
This paper presents a methodology to detect personality and basic emotion characteristics of crowds in video sequences. Firstly, individuals are detected and tracked, then groups are recognized and characterized. Such information is then mapped to OCEAN dimensions, used to find out personality and emotion in videos, based on OCC emotion models. Although it is a clear challenge to validate our results with real-life experiments, we evaluate our method against the available literature regarding OCEAN values of different countries and also the emergent personal distance among people. Hence, such analysis also refers to the cultural differences of each country. Our results indicate that this model generates coherent information when compared to data provided in available literature, as shown in qualitative and quantitative results.
Submitted 26 April, 2021;
originally announced April 2021.
-
Quantitative Corner Case Feature Analysis of Hybrid Automata with ForFET$^{SMT}$
Authors:
Antonio Anastasio Bruto da Costa,
Pallab Dasgupta,
Nikolaos Kekatos
Abstract:
The analysis and verification of hybrid automata (HA) models against rich formal properties can be a challenging task. Existing methods and tools can mainly reason whether a given property is satisfied or violated. However, such qualitative answers might not provide sufficient information about the model behaviors. This paper presents the ForFET$^{SMT}$ tool which can be used to reason quantitatively about such properties. It employs feature automata and can evaluate quantitative property corners of HA. ForFET$^{SMT}$ uses two third-party formal verification tools as its backbone: the SpaceEx reachability tool and the SMT solver dReach/dReal. Herein, we describe the design and implementation of ForFET$^{SMT}$ and present its functionalities and modules. To improve the usability of the tool for non-expert users, we also provide a list of quantitative property templates.
Submitted 30 December, 2020;
originally announced January 2021.
-
Recurrence in Dense-time AMS Assertions
Authors:
Sayandeep Sanyal,
Antonio Anastasio Bruto da Costa,
Pallab Dasgupta
Abstract:
The notion of recurrence over continuous or dense time, as required for expressing Analog and Mixed-Signal (AMS) behaviours, is fundamentally different from what is offered by the recurrence operators of SystemVerilog Assertions (SVA). This article introduces the formal semantics of recurrence over dense time and provides a methodology for the runtime verification of such properties using interval arithmetic. Our property language extends SVA with dense real-time intervals and predicates containing real-valued signals. We provide a tool kit which interfaces with off-the-shelf EDA tools through standard VPI.
Submitted 17 November, 2020;
originally announced November 2020.
-
Investigating Cultural Aspects in the Fundamental Diagram using Convolutional Neural Networks and Simulation
Authors:
Rodolfo M. Favaretto,
Roberto R. Santos,
Marcio Ballotin,
Paulo Knob,
Soraia R. Musse,
Felipe Vilanova,
Angelo B. Costa
Abstract:
This paper presents a study regarding group behavior in a controlled experiment focused on differences in an important attribute that varies across cultures (personal space) in two countries: Brazil and Germany. In order to coherently compare Germany and Brazil with the same population and the same task, we performed the pedestrian Fundamental Diagram (FD) experiment in Brazil, as performed in Germany. We use CNNs to detect and track people in video sequences. With this data, we use Voronoi Diagrams to find out the neighbor relation among people and then compute the walking distances to find out the personal spaces. Based on personal space analyses, we found that people's behavior is more similar in high-density populations and varies more in low and medium densities. So, we focused our study on cultural differences between the two countries in low and medium densities. Results indicate that personal space analyses can be a relevant feature in order to understand cultural aspects in video sequences. In addition to the cultural differences, we also investigate the personality model in crowds, using OCEAN. We also propose a way to simulate the FD experiment for other countries using the OCEAN psychological traits model as input. The results for the simulated countries were consistent with the literature.
Submitted 30 September, 2020;
originally announced October 2020.
-
A Software to Detect OCC Emotion, Big-Five Personality and Hofstede Cultural Dimensions of Pedestrians from Video Sequences
Authors:
Rodolfo Migon Favaretto,
Victor Araujo,
Soraia Raupp Musse,
Felipe Vilanova,
Angelo Brandelli Costa
Abstract:
This paper presents a video analysis application to detect personality, emotion and cultural aspects from pedestrians in video sequences, along with a visualizer of features. The proposed model considers a series of characteristics of the pedestrians and the crowd, such as number and size of groups, distances, speeds, among others, and performs the mapping of these characteristics in personalities, emotions and cultural aspects, considering the Cultural Dimensions of Hofstede (HCD), the Big-Five Personality Model (OCEAN) and the OCC Emotional Model. The main hypothesis is that there is a relationship between so-called intrinsic human variables (such as emotion) and the way people behave in space and time. The software was tested in a set of videos from different countries and results seem promising in order to identify these three different levels of psychological traits in the filmed sequences. In addition, the data of the people present in the videos can be seen in a crowd viewer.
Submitted 18 August, 2019;
originally announced August 2019.
-
Learning Temporal Causal Sequence Relationships from Real-Time Time-Series
Authors:
Antonio Anastasio Bruto da Costa,
Pallab Dasgupta
Abstract:
We aim to mine temporal causal sequences that explain observed events (consequents) in time-series traces. Causal explanations of key events in a time-series have applications in design debugging, anomaly detection, planning, root-cause analysis and many more. We make use of decision trees and interval arithmetic to mine sequences that explain defining events in the time-series. We propose modified decision tree construction metrics to handle the non-determinism introduced by the temporal dimension. The mined sequences are expressed in a readable temporal logic language that is easy to interpret. The application of the proposed methodology is illustrated through various examples.
Submitted 24 January, 2021; v1 submitted 29 May, 2019;
originally announced May 2019.
-
How much do you perceive this? An analysis on perceptions of geometric features, personalities and emotions in virtual humans (Extended Version)
Authors:
Victor Araujo,
Rodolfo Migon Favaretto,
Paulo Knob,
Soraia Raupp Musse,
Felipe Vilanova,
Angelo Brandelli Costa
Abstract:
This work aims to evaluate people's perception of geometric features, personality and emotion characteristics in virtual humans. For this, we use as a basis a dataset containing the tracking files of pedestrians captured from spontaneous videos and visualize them as identical virtual humans. The goal is to focus on their behavior and avoid distraction by other features. In addition to tracking files containing their positions, the dataset also contains pedestrian emotions and personalities detected using Computer Vision and Pattern Recognition techniques. We proceed with our analysis in order to answer the question of whether subjects can perceive geometric features such as distances and speeds, as well as emotions and personalities, in video sequences when pedestrians are represented by virtual humans. Regarding the participants, 73 people volunteered for the experiment. The analysis was divided into two parts: i) evaluation of the perception of geometric characteristics, such as density, angular variation, distances and speeds, and ii) evaluation of personality and emotion perceptions. Results indicate that, even without explaining to the participants the concepts of each personality or emotion and how they were calculated (considering geometric characteristics), in most cases participants perceived the personality and emotion expressed by the virtual agents, in accordance with the available ground truth.
Submitted 24 April, 2019;
originally announced April 2019.
-
Using Big Five Personality Model to Detect Cultural Aspects in Crowds
Authors:
Rodolfo Migon Favaretto,
Leandro Dihl,
Soraia Raupp Musse,
Felipe Vilanova,
Angelo Brandelli Costa
Abstract:
The use of information technology in the study of human behavior is a subject of great scientific interest. Cultural and personality aspects are factors that influence how people interact with one another in a crowd. This paper presents a methodology to detect cultural characteristics of crowds in video sequences. Based on filmed sequences, pedestrians are detected, tracked and characterized. Such information is then used to find out cultural differences in those videos, based on the Big-five personality model. Regarding cultural differences of each country, results indicate that this model generates coherent information when compared to data provided in literature.
Submitted 5 March, 2019;
originally announced March 2019.
-
Confounding variables can degrade generalization performance of radiological deep learning models
Authors:
John R. Zech,
Marcus A. Badgeley,
Manway Liu,
Anthony B. Costa,
Joseph J. Titano,
Eric K. Oermann
Abstract:
Early results in using convolutional neural networks (CNNs) on x-rays to diagnose disease have been promising, but it has not yet been shown that models trained on x-rays from one hospital or one group of hospitals will work equally well at different hospitals. Before these tools are used for computer-aided diagnosis in real-world clinical settings, we must verify their ability to generalize across a variety of hospital systems. A cross-sectional design was used to train and evaluate pneumonia screening CNNs on 158,323 chest x-rays from NIH (n=112,120 from 30,805 patients), Mount Sinai (n=42,396 from 12,904 patients), and Indiana (n=3,807 from 3,683 patients). In 3 of 5 natural comparisons, performance on chest x-rays from outside hospitals was significantly lower than on held-out x-rays from the original hospital systems. CNNs were able to detect where an x-ray was acquired (hospital system, hospital department) with extremely high accuracy and calibrate predictions accordingly. The performance of CNNs in diagnosing diseases on x-rays may reflect not only their ability to identify disease-specific imaging findings on x-rays, but also their ability to exploit confounding information. Estimates of CNN performance based on test data from hospital systems used for model training may overstate their likely real-world performance.
Submitted 12 July, 2018; v1 submitted 1 July, 2018;
originally announced July 2018.
-
Formal Feature Interpretation of Hybrid Systems
Authors:
Antonio Anastasio Bruto da Costa,
Goran Frehse,
Pallab Dasgupta
Abstract:
In current practice a formal analysis of hybrid system models is assertion-based. The work presented here is based on features that look beyond functional correctness toward a quantitative evaluation of behavioral attributes. A feature defines a real-valued evaluation function over a specific set of traces. This paper describes an improved method for the interpretation of features over hybrid automata models. It further demonstrates how satisfiability modulo theory solvers can be used for extracting behavioral traces corresponding to corner cases of a feature. Results are demonstrated on examples from the control and circuit domains.
Submitted 22 February, 2019; v1 submitted 2 November, 2017;
originally announced November 2017.