-
Who Writes the Review, Human or AI?
Authors:
Panagiotis C. Theocharopoulos,
Spiros V. Georgakopoulos,
Sotiris K. Tasoulis,
Vassilis P. Plagianakos
Abstract:
With the increasing use of Artificial Intelligence in Natural Language Processing, concerns have been raised regarding the detection of AI-generated text in various domains. This study aims to investigate this issue by proposing a methodology to accurately distinguish AI-generated and human-written book reviews. Our approach utilizes transfer learning, enabling the model to identify generated text across different topics while improving its ability to detect variations in writing style and vocabulary. To evaluate the effectiveness of the proposed methodology, we developed a dataset consisting of real book reviews and AI-generated reviews using the recently proposed Vicuna open-source language model. The experimental results demonstrate that it is feasible to detect the original source of text, achieving an accuracy rate of 96.86%. Our efforts are oriented toward the exploration of the capabilities and limitations of Large Language Models in the context of text identification. Expanding our knowledge in these aspects will be valuable for effectively navigating similar models in the future and ensuring the integrity and authenticity of human-generated content.
Submitted 30 May, 2024;
originally announced May 2024.
-
Detection of Fake Generated Scientific Abstracts
Authors:
Panagiotis C. Theocharopoulos,
Panagiotis Anagnostou,
Anastasia Tsoukala,
Spiros V. Georgakopoulos,
Sotiris K. Tasoulis,
Vassilis P. Plagianakos
Abstract:
The widespread adoption of Large Language Models and publicly available ChatGPT has marked a significant turning point in the integration of Artificial Intelligence into people's everyday lives. The academic community has taken notice of these technological advancements and has expressed concerns regarding the difficulty of discriminating between what is real and what is artificially generated. Thus, researchers have been working on developing effective systems to identify machine-generated text. In this study, we utilize the GPT-3 model to generate scientific paper abstracts through Artificial Intelligence and explore various text representation methods when combined with Machine Learning models with the aim of identifying machine-written text. We analyze the models' performance and address several research questions that arise during the analysis of the results. By conducting this research, we shed light on the capabilities and limitations of Artificial Intelligence generated text.
Submitted 12 April, 2023;
originally announced April 2023.
-
Supervised Dimensionality Reduction and Image Classification Utilizing Convolutional Autoencoders
Authors:
Ioannis A. Nellas,
Sotiris K. Tasoulis,
Vassilis P. Plagianakos,
Spiros V. Georgakopoulos
Abstract:
The joint optimization of the reconstruction and classification error is a hard non-convex problem, especially when a nonlinear mapping is utilized. To overcome this obstacle, a novel optimization strategy is proposed, in which a Convolutional Autoencoder for dimensionality reduction and a classifier composed of a Fully Connected Network are combined to simultaneously produce supervised dimensionality reduction and predictions. This methodology also proved greatly beneficial in enforcing explainability of deep learning architectures. Additionally, the resulting Latent Space, optimized for the classification task, can be utilized to improve traditional, interpretable classification algorithms. The experimental results showed that the proposed methodology achieved competitive results against state-of-the-art deep learning methods, while being much more efficient in terms of parameter count. Finally, it was empirically justified that the proposed methodology introduces advanced explainability regarding not only the data structure, through the produced latent space, but also the classification behaviour.
Submitted 3 November, 2022; v1 submitted 25 August, 2022;
originally announced August 2022.
-
Real Time Sentiment Change Detection of Twitter Data Streams
Authors:
Sotiris K. Tasoulis,
Aristidis G. Vrahatis,
Spiros V. Georgakopoulos,
Vassilis P. Plagianakos
Abstract:
In the past few years, there has been a huge growth in Twitter sentiment analysis, which has already provided a fair amount of research on sentiment detection of public opinion among Twitter users. Given that Twitter messages are generated constantly at dizzying rates, a huge volume of streaming data is created, and thus there is an imperative need for accurate methods for knowledge discovery and mining of this information. Although there exists a plethora of Twitter sentiment analysis methods in the recent literature, researchers have shifted, as expected, to real-time sentiment identification on Twitter streaming data. A major challenge is to deal with the Big Data challenges arising in Twitter streaming applications concerning both Volume and Velocity. Under this perspective, in this paper, a methodological approach based on open source tools is provided for real-time detection of changes in sentiment that is ultra efficient with respect to both memory consumption and computational cost. This is achieved by iteratively collecting tweets in real time and discarding them immediately after they are processed. For this purpose, we employ the Lexicon approach for sentiment characterizations, while change detection is achieved through appropriate control charts that do not require historical information. We believe that the proposed methodology provides the trigger for a potential large-scale monitoring of threads in an attempt to discover fake news spread or propaganda efforts in their early stages. Our experimental real-time analysis based on a recent hashtag provides evidence that the proposed approach can detect meaningful sentiment changes across a hashtag's lifetime.
Submitted 2 April, 2018;
originally announced April 2018.
-
Convolutional Neural Networks for Toxic Comment Classification
Authors:
Spiros V. Georgakopoulos,
Sotiris K. Tasoulis,
Aristidis G. Vrahatis,
Vassilis P. Plagianakos
Abstract:
A flood of information is produced on a daily basis through global Internet usage, arising from the on-line interactive communications among users. While this situation contributes significantly to the quality of human life, unfortunately it involves enormous dangers, since on-line texts with high toxicity can cause personal attacks, on-line harassment and bullying behaviors. This has drawn the attention of both the industrial and research communities in the last few years, and there have been several attempts to identify an efficient model for on-line toxic comment prediction. However, these steps are still in their infancy and new approaches and frameworks are required. In parallel, the constant data explosion makes the construction of new machine learning computational tools for managing this information an imperative need. Thankfully, advances in hardware, cloud computing and big data management allow the development of Deep Learning approaches, which have shown very promising performance so far. For text classification in particular, the use of Convolutional Neural Networks (CNN) has recently been proposed, approaching text analytics in a modern manner that emphasizes the structure of words in a document. In this work, we employ this approach to discover toxic comments in a large pool of documents provided by a current Kaggle competition regarding Wikipedia's talk page edits. To justify this decision, we choose to compare CNNs against the traditional bag-of-words approach for text analysis combined with a selection of algorithms proven to be very effective in text classification. The reported results provide enough evidence that CNNs enhance toxic comment classification, reinforcing research interest in this direction.
Submitted 27 February, 2018;
originally announced February 2018.
-
Laser patterned polymer/nanotube composite electrodes for nanowire transistors on flexible substrates
Authors:
Kiron Prabha Rajeev,
Michael Beliatis,
Stamatis Georgakopoulos,
Vlad Stolojan,
John Underwood,
Maxim Shkunov
Abstract:
Fabrication techniques such as laser patterning offer excellent potential for low cost and large area device fabrication. Conductive polymers can be used to replace expensive metallic inks such as silver and gold nanoparticles for printing technology. Electrical conductivity of the polymers can be improved by blending with carbon nanotubes. In this work, formulations of acid functionalised multiwall carbon nanotubes (f-MWCNT) and poly(ethylenedioxythiophene) [PEDOT]:polystyrene sulphonate [PSS] were processed, and thin films were prepared on plastic substrates. Conductivity of PEDOT:PSS increased almost four orders of magnitude after adding f-MWCNT. Work function of PEDOT:PSS/f-MWCNT films was ~0.5 eV higher as compared to the work function of pure PEDOT:PSS films, determined by Kelvin probe method. Field-effect transistors source-drain electrodes were prepared on PET plastic substrates where PEDOT:PSS/f-MWCNT were patterned using laser ablation at 44 mJ/pulse energy to define 36 micron electrode separation. Silicon nanowires were deposited using dielectrophoresis alignment technique to bridge the PEDOT:PSS/f-MWCNT laser patterned electrodes. Finally, top-gated nanowire field effect transistors were completed by depositing parylene C as polymer gate dielectric and gold as the top-gate electrode. Transistor characteristics showed p-type conduction with excellent gate electrode coupling, with an ON/OFF ratio of ~200. Thereby, we demonstrate the feasibility of using high workfunction, printable PEDOT:PSS/MWCNT composite inks for patterning source/drain electrodes for nanowire transistors on flexible substrates.
Submitted 18 November, 2017;
originally announced November 2017.
-
Measurements of the Generalized Electric and Magnetic Polarizabilities of the Proton at Low Q^2 Using the VCS Reaction
Authors:
P. Bourgeois,
Y. Sato,
J. Shaw,
R. Alarcon,
A. M. Bernstein,
W. Bertozzi,
T. Botto,
J. Calarco,
F. Casagrande,
M. O. Distler,
K. Dow,
M. Farkondeh,
S. Georgakopoulos,
S. Gilad,
R. Hicks,
M. Holtrop,
A. Hotta,
X. Jiang,
A. Karabarbounis,
J. Kirkpatrick,
S. Kowalski,
R. Milner,
R. Miskimen,
I. Nakagawa,
C. N. Papanicolas
, et al. (12 additional authors not shown)
Abstract:
The mean square polarizability radii of the proton have been measured for the first time in a virtual Compton scattering experiment performed at the MIT-Bates out-of-plane scattering facility. Response functions and polarizabilities obtained from a dispersion analysis of the data at Q^2=0.06 GeV^2/c^2 are in agreement with O(p^3) heavy baryon chiral perturbation theory. The data support the dominance of mesonic effects in the polarizabilities, and the increase of beta with increasing Q^2 is evidence for the cancellation of long-range diamagnetism by short-range paramagnetism from the pion cloud.
Submitted 10 May, 2006;
originally announced May 2006.