Search | arXiv e-print repository

Organic or Diffused: Can We Distinguish Human Art from AI-generated Images?

Authors: Anna Yoo Jeong Ha, Josephine Passananti, Ronik Bhaskar, Shawn Shan, Reid Southen, Haitao Zheng, Ben Y. Zhao

Abstract: The advent of generative AI images has completely disrupted the art world. Distinguishing AI generated images from human art is a challenging problem whose impact is growing over time. A failure to address this problem allows bad actors to defraud individuals paying a premium for human art and companies whose stated policies forbid AI imagery. It is also critical for content owners to establish co… ▽ More The advent of generative AI images has completely disrupted the art world. Distinguishing AI generated images from human art is a challenging problem whose impact is growing over time. A failure to address this problem allows bad actors to defraud individuals paying a premium for human art and companies whose stated policies forbid AI imagery. It is also critical for content owners to establish copyright, and for model trainers interested in curating training data in order to avoid potential model collapse. There are several different approaches to distinguishing human art from AI images, including classifiers trained by supervised learning, research tools targeting diffusion models, and identification by professional artists using their knowledge of artistic techniques. In this paper, we seek to understand how well these approaches can perform against today's modern generative models in both benign and adversarial settings. We curate real human art across 7 styles, generate matching images from 5 generative models, and apply 8 detectors (5 automated detectors and 3 different human groups including 180 crowdworkers, 4000+ professional artists, and 13 expert artists experienced at detecting AI). Both Hive and expert artists do very well, but make mistakes in different ways (Hive is weaker against adversarial perturbations while Expert artists produce higher false positives). We believe these weaknesses will remain as models continue to evolve, and use our data to demonstrate why a combined team of human and automated detectors provides the best combination of accuracy and robustness. △ Less

Submitted 2 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

arXiv:2202.10456 [pdf, other]

Feasibility Study of Multi-Site Split Learning for Privacy-Preserving Medical Systems under Data Imbalance Constraints in COVID-19, X-Ray, and Cholesterol Dataset

Authors: Yoo Jeong Ha, Gusang Lee, Minjae Yoo, Soyi Jung, Seehwan Yoo, Joongheon Kim

Abstract: It seems as though progressively more people are in the race to upload content, data, and information online; and hospitals haven't neglected this trend either. Hospitals are now at the forefront for multi-site medical data sharing to provide groundbreaking advancements in the way health records are shared and patients are diagnosed. Sharing of medical data is essential in modern medical research.… ▽ More It seems as though progressively more people are in the race to upload content, data, and information online; and hospitals haven't neglected this trend either. Hospitals are now at the forefront for multi-site medical data sharing to provide groundbreaking advancements in the way health records are shared and patients are diagnosed. Sharing of medical data is essential in modern medical research. Yet, as with all data sharing technology, the challenge is to balance improved treatment with protecting patient's personal information. This paper provides a novel split learning algorithm coined the term, "multi-site split learning", which enables a secure transfer of medical data between multiple hospitals without fear of exposing personal data contained in patient records. It also explores the effects of varying the number of end-systems and the ratio of data-imbalance on the deep learning performance. A guideline for the most optimal configuration of split learning that ensures privacy of patient data whilst achieving performance is empirically given. We argue the benefits of our multi-site split learning algorithm, especially regarding the privacy preserving factor, using CT scans of COVID-19 patients, X-ray bone scans, and cholesterol level medical data. △ Less

Submitted 20 February, 2022; originally announced February 2022.

arXiv:2111.11856 [pdf, other]

Spatio-Temporal Split Learning for Autonomous Aerial Surveillance using Urban Air Mobility (UAM) Networks

Authors: Yoo Jeong Ha, Soyi Jung, Jae-Hyun Kim, Marco Levorato, Joongheon Kim

Abstract: Autonomous surveillance unmanned aerial vehicles (UAVs) are deployed to observe the streets of the city for any suspicious activities. This paper utilizes surveillance UAVs for the purpose of detecting the presence of a fire in the streets. An extensive database is collected from UAV surveillance drones. With the aid of artificial intelligence (AI), fire stations can swiftly identify the presence… ▽ More Autonomous surveillance unmanned aerial vehicles (UAVs) are deployed to observe the streets of the city for any suspicious activities. This paper utilizes surveillance UAVs for the purpose of detecting the presence of a fire in the streets. An extensive database is collected from UAV surveillance drones. With the aid of artificial intelligence (AI), fire stations can swiftly identify the presence of a fire emerging in the neighborhood. Spatio-temporal split learning is applied to this scenario to preserve privacy and globally train a fire classification model. Fires are hazardous natural disasters that can spread very quickly. Swift identification of fire is required to deploy firefighters to the scene. In order to do this, strong communication between the UAV and the central server where the deep learning process occurs is required. Improving communication resilience is integral to enhancing a safe experience on the roads. Therefore, this paper explores the adequate number of clients and data ratios for split learning in this UAV setting, as well as the required network infrastructure. △ Less

Submitted 14 November, 2021; originally announced November 2021.

arXiv:2108.10147 [pdf, other]

Spatio-Temporal Split Learning for Privacy-Preserving Medical Platforms: Case Studies with COVID-19 CT, X-Ray, and Cholesterol Data

Authors: Yoo Jeong Ha, Minjae Yoo, Gusang Lee, Soyi Jung, Sae Won Choi, Joongheon Kim, Seehwan Yoo

Abstract: Machine learning requires a large volume of sample data, especially when it is used in high-accuracy medical applications. However, patient records are one of the most sensitive private information that is not usually shared among institutes. This paper presents spatio-temporal split learning, a distributed deep neural network framework, which is a turning point in allowing collaboration among pri… ▽ More Machine learning requires a large volume of sample data, especially when it is used in high-accuracy medical applications. However, patient records are one of the most sensitive private information that is not usually shared among institutes. This paper presents spatio-temporal split learning, a distributed deep neural network framework, which is a turning point in allowing collaboration among privacy-sensitive organizations. Our spatio-temporal split learning presents how distributed machine learning can be efficiently conducted with minimal privacy concerns. The proposed split learning consists of a number of clients and a centralized server. Each client has only has one hidden layer, which acts as the privacy-preserving layer, and the centralized server comprises the other hidden layers and the output layer. Since the centralized server does not need to access the training data and trains the deep neural network with parameters received from the privacy-preserving layer, privacy of original data is guaranteed. We have coined the term, spatio-temporal split learning, as multiple clients are spatially distributed to cover diverse datasets from different participants, and we can temporally split the learning process, detaching the privacy preserving layer from the rest of the learning process to minimize privacy breaches. This paper shows how we can analyze the medical data whilst ensuring privacy using our proposed multi-site spatio-temporal split learning algorithm on Coronavirus Disease-19 (COVID-19) chest Computed Tomography (CT) scans, MUsculoskeletal RAdiographs (MURA) X-ray images, and cholesterol levels. △ Less

Submitted 20 August, 2021; originally announced August 2021.

arXiv:2107.08628 [pdf, other]

Secure Aerial Surveillance using Split Learning

Authors: Yoo Jeong Ha, Minjae Yoo, Soohyun Park, Soyi Jung, Joongheon Kim

Abstract: Personal monitoring devices such as cyclist helmet cameras to record accidents or dash cams to catch collisions have proliferated, with more companies producing smaller and compact recording gadgets. As these devices are becoming a part of citizens' everyday arsenal, concerns over the residents' privacy are progressing. Therefore, this paper presents SASSL, a secure aerial surveillance drone using… ▽ More Personal monitoring devices such as cyclist helmet cameras to record accidents or dash cams to catch collisions have proliferated, with more companies producing smaller and compact recording gadgets. As these devices are becoming a part of citizens' everyday arsenal, concerns over the residents' privacy are progressing. Therefore, this paper presents SASSL, a secure aerial surveillance drone using split learning to classify whether there is a presence of a fire on the streets. This innovative split learning method transfers CCTV footage captured with a drone to a nearby server to run a deep neural network to detect a fire's presence in real-time without exposing the original data. We devise a scenario where surveillance UAVs roam around the suburb, recording any unnatural behavior. The UAV can process the recordings through its on-mobile deep neural network system or transfer the information to a server. Due to the resource limitations of mobile UAVs, the UAV does not have the capacity to run an entire deep neural network on its own. This is where the split learning method comes in handy. The UAV runs the deep neural network only up to the first hidden layer and sends only the feature map to the cloud server, where the rest of the deep neural network is processed. By ensuring that the learning process is divided between the UAV and the server, the privacy of raw data is secured while the UAV does not overexert its minimal resources. △ Less

Submitted 19 July, 2021; originally announced July 2021.

arXiv:2010.05440 [pdf]

Using Empirical Trajectory Data to Design Connected Autonomous Vehicle Controllers for Traffic Stabilization

Authors: Yujie Li, Sikai Chen, Runjia Du, Paul Young Joun Ha, Jiqian Dong, Samuel Labi

Abstract: Emerging transportation technologies offer unprecedented opportunities to improve the efficiency of the transportation system from the perspectives of energy consumption, congestion, and emissions. One of these technologies is connected and autonomous vehicles (CAVs). With the prospective duality of operations of CAVs and human driven vehicles in the same roadway space (also referred to as a mixed… ▽ More Emerging transportation technologies offer unprecedented opportunities to improve the efficiency of the transportation system from the perspectives of energy consumption, congestion, and emissions. One of these technologies is connected and autonomous vehicles (CAVs). With the prospective duality of operations of CAVs and human driven vehicles in the same roadway space (also referred to as a mixed stream), CAVs are expected to address a variety of traffic problems particularly those that are either caused or exacerbated by the heterogeneous nature of human driving. In efforts to realize such specific benefits of CAVs in mixed-stream traffic, it is essential to understand and simulate the behavior of human drivers in such environments, and microscopic traffic flow (MTF) models can be used to carry out this task. By hel** to comprehend the fundamental dynamics of traffic flow, MTF models serve as a powerful approach to assess the impacts of such flow in terms of safety, stability, and efficiency. In this paper, we seek to calibrate MTF models based on empirical trajectory data as basis of not only understanding traffic dynamics such as traffic instabilities, but ultimately using CAVs to mitigate stop-and-go wave propagation. The paper therefore duly considers the heterogeneity and uncertainty associated with human driving behavior in order to calibrate the dynamics of each HDV. Also, the paper designs the CAV controllers based on the microscopic HDV models that are calibrated in real time. The data for the calibration is from the Next Generation SIMulation (NGSIM) trajectory datasets. The results are encouraging, as they indicate the efficacy of the designed controller to significantly improve not only the stability of the mixed traffic stream but also the safety of both CAVs and HDVs in the traffic stream. △ Less

Submitted 11 October, 2020; originally announced October 2020.

Comments: TRB 2021 Annual Meeting

arXiv:2010.05439 [pdf]

A Cooperative Control Framework for CAV Lane Change in a Mixed Traffic Environment

Authors: Runjia Du, Sikai Chen, Yujie Li, Jiqian Dong, Paul Young Joun Ha, Samuel Labi

Abstract: In preparing for connected and autonomous vehicles (CAVs), a worrisome aspect is the transition era which will be characterized by mixed traffic (where CAVs and human-driven vehicles (HDVs) share the roadway). Consistent with expectations that CAVs will improve road safety, on-road CAVs may adopt rather conservative control policies, and this will likely cause HDVs to unduly exploit CAV conservati… ▽ More In preparing for connected and autonomous vehicles (CAVs), a worrisome aspect is the transition era which will be characterized by mixed traffic (where CAVs and human-driven vehicles (HDVs) share the roadway). Consistent with expectations that CAVs will improve road safety, on-road CAVs may adopt rather conservative control policies, and this will likely cause HDVs to unduly exploit CAV conservativeness by driving in ways that imperil safety. A context of this situation is lane-changing by the CAV. Without cooperation from other vehicles in the traffic stream, it can be extremely unsafe for the CAV to change lanes under dense, high-speed traffic conditions. The cooperation of neighboring vehicles is indispensable. To address this issue, this paper develops a control framework where connected HDVs and CAV can cooperate to facilitate safe and efficient lane changing by the CAV. Throughout the lane-change process, the safety of not only the CAV but also of all neighboring vehicles, is ensured through a collision avoidance mechanism in the control framework. The overall traffic flow efficiency is analyzed in terms of the ambient level of CHDV-CAV cooperation. The analysis outcomes are including the CAVs lane-change feasibility, the overall duration of the lane change. Lane change is a major source of traffic disturbance at multi-lane highways that impair their traffic flow efficiency. In providing a control framework for lane change in mixed traffic, this study shows how CHDV-CAV cooperation could help enhancing system efficiency. △ Less

Submitted 11 October, 2020; originally announced October 2020.

Comments: TRB 2021 Annual Meeting

arXiv:2010.05437 [pdf]

A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a Graphic Convolution Q Network

Authors: Jiqian Dong, Sikai Chen, Paul Young Joun Ha, Yujie Li, Samuel Labi

Abstract: Connected Autonomous Vehicle (CAV) Network can be defined as a collection of CAVs operating at different locations on a multilane corridor, which provides a platform to facilitate the dissemination of operational information as well as control instructions. Cooperation is crucial in CAV operating systems since it can greatly enhance operation in terms of safety and mobility, and high-level coopera… ▽ More Connected Autonomous Vehicle (CAV) Network can be defined as a collection of CAVs operating at different locations on a multilane corridor, which provides a platform to facilitate the dissemination of operational information as well as control instructions. Cooperation is crucial in CAV operating systems since it can greatly enhance operation in terms of safety and mobility, and high-level cooperation between CAVs can be expected by jointly plan and control within CAV network. However, due to the highly dynamic and combinatory nature such as dynamic number of agents (CAVs) and exponentially growing joint action space in a multiagent driving task, achieving cooperative control is NP hard and cannot be governed by any simple rule-based methods. In addition, existing literature contains abundant information on autonomous driving's sensing technology and control logic but relatively little guidance on how to fuse the information acquired from collaborative sensing and build decision processor on top of fused information. In this paper, a novel Deep Reinforcement Learning (DRL) based approach combining Graphic Convolution Neural Network (GCN) and Deep Q Network (DQN), namely Graphic Convolution Q network (GCQ) is proposed as the information fusion module and decision processor. The proposed model can aggregate the information acquired from collaborative sensing and output safe and cooperative lane changing decisions for multiple CAVs so that individual intention can be satisfied even under a highly dynamic and partially observed mixed traffic. The proposed algorithm can be deployed on centralized control infrastructures such as road-side units (RSU) or cloud platforms to improve the CAV operation. △ Less

Submitted 11 October, 2020; originally announced October 2020.

Comments: TRB 2021 Annual Meeting

arXiv:2010.05436 [pdf]

Leveraging the Capabilities of Connected and Autonomous Vehicles and Multi-Agent Reinforcement Learning to Mitigate Highway Bottleneck Congestion

Authors: Paul Young Joun Ha, Sikai Chen, Jiqian Dong, Runjia Du, Yujie Li, Samuel Labi

Abstract: Active Traffic Management strategies are often adopted in real-time to address such sudden flow breakdowns. When queuing is imminent, Speed Harmonization (SH), which adjusts speeds in upstream traffic to mitigate traffic showckwaves downstream, can be applied. However, because SH depends on driver awareness and compliance, it may not always be effective in mitigating congestion. The use of multiag… ▽ More Active Traffic Management strategies are often adopted in real-time to address such sudden flow breakdowns. When queuing is imminent, Speed Harmonization (SH), which adjusts speeds in upstream traffic to mitigate traffic showckwaves downstream, can be applied. However, because SH depends on driver awareness and compliance, it may not always be effective in mitigating congestion. The use of multiagent reinforcement learning for collaborative learning, is a promising solution to this challenge. By incorporating this technique in the control algorithms of connected and autonomous vehicle (CAV), it may be possible to train the CAVs to make joint decisions that can mitigate highway bottleneck congestion without human driver compliance to altered speed limits. In this regard, we present an RL-based multi-agent CAV control model to operate in mixed traffic (both CAVs and human-driven vehicles (HDVs)). The results suggest that even at CAV percent share of corridor traffic as low as 10%, CAVs can significantly mitigate bottlenecks in highway traffic. Another objective was to assess the efficacy of the RL-based controller vis-à-vis that of the rule-based controller. In addressing this objective, we duly recognize that one of the main challenges of RL-based CAV controllers is the variety and complexity of inputs that exist in the real world, such as the information provided to the CAV by other connected entities and sensed information. These translate as dynamic length inputs which are difficult to process and learn from. For this reason, we propose the use of Graphical Convolution Networks (GCN), a specific RL technique, to preserve information network topology and corresponding dynamic length inputs. We then use this, combined with Deep Deterministic Policy Gradient (DDPG), to carry out multi-agent training for congestion mitigation using the CAV controllers. △ Less

Submitted 11 October, 2020; originally announced October 2020.

Comments: TRB 20201 Annual Meeting

arXiv:2008.04351 [pdf]

Leveraging Vehicle Connectivity and Autonomy to Stabilize Flow in Mixed Traffic Conditions: Accounting for Human-driven Vehicle Driver Behavioral Heterogeneity and Perception-reaction Time Delay

Authors: Yujie Li, Sikai Chen, Paul Young Joun Ha, Jiqian Dong, Aaron Steinfeld, Samuel Labi

Abstract: The erratic nature of human driving tends to trigger undesired waves that amplify as successive driver reactions propagate from the errant vehicle to vehicles upstream. Known as phantom jams, this phenomenon has been identified in the literature as one of the main causes of traffic congestion. This paper is based on the premise that vehicle automation and connectivity can help mitigate such jams.… ▽ More The erratic nature of human driving tends to trigger undesired waves that amplify as successive driver reactions propagate from the errant vehicle to vehicles upstream. Known as phantom jams, this phenomenon has been identified in the literature as one of the main causes of traffic congestion. This paper is based on the premise that vehicle automation and connectivity can help mitigate such jams. In the paper, we design a controller for use in a connected and autonomous vehicle (CAV) to stabilize the flow of human-driven vehicles (HDVs) that are upstream of the CAV, and consequently to lower collision risk in the upstream traffic environment. In modeling the HDV dynamics in the mixed traffic stream, we duly consider HDV driver heterogeneity and the time delays associated with their perception reaction time. We can find that the maximum number of HDVs that a CAV can stabilize is lower when human drivers potential time delay and heterogeneity are considered, compared to the scenario where such are not considered. This result suggests that heterogeneity and time delay in HDV behavior impairs the CAVs capability to stabilize traffic. Therefore, in designing CAV controllers for traffic stabilization, it is essential to consider such uncertainty-related conditions. In our demonstration, we also show that the designed controller can significantly improve both the stability of the mixed traffic stream and the safety of both CAVs and HDVs in the stream. The results are useful for real-time calibration of the model parameters that characterize HDV movements in the mixed stream. △ Less

Submitted 17 August, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

Showing 1–10 of 10 results for author: Ha, Y J