-
Is Your AI Truly Yours? Leveraging Blockchain for Copyrights, Provenance, and Lineage
Authors:
Yilin Sai,
Qin Wang,
Guangsheng Yu,
H. M. N. Dilum Bandara,
Shi** Chen
Abstract:
As Artificial Intelligence (AI) integrates into diverse areas, particularly in content generation, ensuring rightful ownership and ethical use becomes paramount. AI service providers are expected to prioritize responsibly sourcing training data and obtaining licenses from data owners. However, existing studies primarily center on safeguarding static copyrights, which simply treats metadata/dataset…
▽ More
As Artificial Intelligence (AI) integrates into diverse areas, particularly in content generation, ensuring rightful ownership and ethical use becomes paramount. AI service providers are expected to prioritize responsibly sourcing training data and obtaining licenses from data owners. However, existing studies primarily center on safeguarding static copyrights, which simply treats metadata/datasets as non-fungible items with transferable/trading capabilities, neglecting the dynamic nature of training procedures that can shape an ongoing trajectory.
In this paper, we present \textsc{IBis}, a blockchain-based framework tailored for AI model training workflows. \textsc{IBis} integrates on-chain registries for datasets, licenses and models, alongside off-chain signing services to facilitate collaboration among multiple participants. Our framework addresses concerns regarding data and model provenance and copyright compliance. \textsc{IBis} enables iterative model retraining and fine-tuning, and offers flexible license checks and renewals. Further, \textsc{IBis} provides APIs designed for seamless integration with existing contract management software, minimizing disruptions to established model training processes. We implement \textsc{IBis} using Daml on the Canton blockchain. Evaluation results showcase the feasibility and scalability of \textsc{IBis} across varying numbers of users, datasets, models, and licenses.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Maximizing NFT Incentives: References Make You Rich
Authors:
Guangsheng Yu,
Qin Wang,
Caijun Sun,
Lam Duc Nguyen,
H. M. N. Dilum Bandara,
Shi** Chen
Abstract:
In this paper, we study how to optimize existing Non-Fungible Token (NFT) incentives. Upon exploring a large number of NFT-related standards and real-world projects, we come across an unexpected finding. That is, the current NFT incentive mechanisms, often organized in an isolated and one-time-use fashion, tend to overlook their potential for scalable organizational structures.
We propose, analy…
▽ More
In this paper, we study how to optimize existing Non-Fungible Token (NFT) incentives. Upon exploring a large number of NFT-related standards and real-world projects, we come across an unexpected finding. That is, the current NFT incentive mechanisms, often organized in an isolated and one-time-use fashion, tend to overlook their potential for scalable organizational structures.
We propose, analyze, and implement a novel reference incentive model, which is inherently structured as a Directed Acyclic Graph (DAG)-based NFT network. This model aims to maximize connections (or references) between NFTs, enabling each isolated NFT to expand its network and accumulate rewards derived from subsequent or subscribed ones. We conduct both theoretical and practical analyses of the model, demonstrating its optimal utility.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
NetDiffus: Network Traffic Generation by Diffusion Models through Time-Series Imaging
Authors:
Nirhoshan Sivaroopan,
Dumindu Bandara,
Chamara Madarasingha,
Guilluame Jourjon,
Anura Jayasumana,
Kanchana Thilakarathna
Abstract:
Network data analytics are now at the core of almost every networking solution. Nonetheless, limited access to networking data has been an enduring challenge due to many reasons including complexity of modern networks, commercial sensitivity, privacy and regulatory constraints. In this work, we explore how to leverage recent advancements in Diffusion Models (DM) to generate synthetic network traff…
▽ More
Network data analytics are now at the core of almost every networking solution. Nonetheless, limited access to networking data has been an enduring challenge due to many reasons including complexity of modern networks, commercial sensitivity, privacy and regulatory constraints. In this work, we explore how to leverage recent advancements in Diffusion Models (DM) to generate synthetic network traffic data. We develop an end-to-end framework - NetDiffus that first converts one-dimensional time-series network traffic into two-dimensional images, and then synthesizes representative images for the original data. We demonstrate that NetDiffus outperforms the state-of-the-art traffic generation methods based on Generative Adversarial Networks (GANs) by providing 66.4% increase in fidelity of the generated data and 18.1% increase in downstream machine learning tasks. We evaluate NetDiffus on seven diverse traffic traces and show that utilizing synthetic data significantly improves traffic fingerprinting, anomaly detection and traffic classification.
△ Less
Submitted 23 September, 2023;
originally announced October 2023.
-
Blockchain-Empowered Trustworthy Data Sharing: Fundamentals, Applications, and Challenges
Authors:
Linh T. Nguyen,
Lam Duc Nguyen,
Thong Hoang,
Dilum Bandara,
Qin Wang,
Qinghua Lu,
Xiwei Xu,
Liming Zhu,
Petar Popovski,
Shi** Chen
Abstract:
Various data-sharing platforms have emerged with the growing public demand for open data and legislation mandating certain data to remain open. Most of these platforms remain opaque, leading to many questions about data accuracy, provenance and lineage, privacy implications, consent management, and the lack of fair incentives for data providers. With their transparency, immutability, non-repudiati…
▽ More
Various data-sharing platforms have emerged with the growing public demand for open data and legislation mandating certain data to remain open. Most of these platforms remain opaque, leading to many questions about data accuracy, provenance and lineage, privacy implications, consent management, and the lack of fair incentives for data providers. With their transparency, immutability, non-repudiation, and decentralization properties, blockchains could not be more apt to answer these questions and enhance trust in a data-sharing platform. However, blockchains are not good at handling the four Vs of big data (i.e., volume, variety, velocity, and veracity) due to their limited performance, scalability, and high cost. Given many related works proposes blockchain-based trustworthy data-sharing solutions, there is increasing confusion and difficulties in understanding and selecting these technologies and platforms in terms of their sharing mechanisms, sharing services, quality of services, and applications. In this paper, we conduct a comprehensive survey on blockchain-based data-sharing architectures and applications to fill the gap. First, we present the foundations of blockchains and discuss the challenges of current data-sharing techniques. Second, we focus on the convergence of blockchain and data sharing to give a clear picture of this landscape and propose a reference architecture for blockchain-based data sharing. Third, we discuss the industrial applications of blockchain-based data sharing, ranging from healthcare and smart grid to transportation and decarbonization. For each application, we provide lessons learned for the deployment of Blockchain-based data sharing. Finally, we discuss research challenges and open research directions.
△ Less
Submitted 11 March, 2023;
originally announced March 2023.
-
A Tale of Two Cities: Data and Configuration Variances in Robust Deep Learning
Authors:
Guanqin Zhang,
Jiankun Sun,
Feng Xu,
H. M. N. Dilum Bandara,
Shi** Chen,
Yulei Sui,
Tim Menzies
Abstract:
Deep neural networks (DNNs), are widely used in many industries such as image recognition, supply chain, medical diagnosis, and autonomous driving. However, prior work has shown the high accuracy of a DNN model does not imply high robustness (i.e., consistent performances on new and future datasets) because the input data and external environment (e.g., software and model configurations) for a dep…
▽ More
Deep neural networks (DNNs), are widely used in many industries such as image recognition, supply chain, medical diagnosis, and autonomous driving. However, prior work has shown the high accuracy of a DNN model does not imply high robustness (i.e., consistent performances on new and future datasets) because the input data and external environment (e.g., software and model configurations) for a deployed model are constantly changing. Hence, ensuring the robustness of deep learning is not an option but a priority to enhance business and consumer confidence. Previous studies mostly focus on the data aspect of model variance. In this article, we systematically summarize DNN robustness issues and formulate them in a holistic view through two important aspects, i.e., data and software configuration variances in DNNs. We also provide a predictive framework to generate representative variances (counterexamples) by considering both data and configurations for robust learning through the lens of search-based optimization.
△ Less
Submitted 25 November, 2022; v1 submitted 17 November, 2022;
originally announced November 2022.
-
Patterns for Blockchain-Based Payment Applications
Authors:
Qinghua Lu,
Xiwei Xu,
H. M. N. Dilum Bandara,
Shi** Chen,
Liming Zhu
Abstract:
As the killer application of blockchain technology, blockchain-based payments have attracted extensive attention ranging from hobbyists to corporates to regulatory bodies. Blockchain facilitates fast, secure, and cross-border payments without the need for intermediaries such as banks. Because blockchain technology is still emerging, systematically organised knowledge providing a holistic and compr…
▽ More
As the killer application of blockchain technology, blockchain-based payments have attracted extensive attention ranging from hobbyists to corporates to regulatory bodies. Blockchain facilitates fast, secure, and cross-border payments without the need for intermediaries such as banks. Because blockchain technology is still emerging, systematically organised knowledge providing a holistic and comprehensive view on designing payment applications that use blockchain is yet to be established. If such knowledge could be established in the form of a set of blockchain-specific patterns, architects could use those patterns in designing a payment application that leverages blockchain. Therefore, in this paper, we first identify a token's lifecycle and then present 12 patterns that cover critical aspects in enabling the state transitions of a token in blockchain-based payment applications. The lifecycle and the annotated patterns provide a payment-focused systematic view of system interactions and a guide to effective use of the patterns.
△ Less
Submitted 17 August, 2021; v1 submitted 19 February, 2021;
originally announced February 2021.
-
Real-Time Monitoring and Driver Feedback to Promote Fuel Efficient Driving
Authors:
Sandareka Wickramanayake,
H. M. N Dilum Bandara,
Nishal A. Samarasekara
Abstract:
Improving the fuel efficiency of vehicles is imperative to reduce costs and protect the environment. While the efficient engine and vehicle designs, as well as intelligent route planning, are well-known solutions to enhance the fuel efficiency, research has also demonstrated that the adoption of fuel-efficient driving behaviors could lead to further savings. In this work, we propose a novel framew…
▽ More
Improving the fuel efficiency of vehicles is imperative to reduce costs and protect the environment. While the efficient engine and vehicle designs, as well as intelligent route planning, are well-known solutions to enhance the fuel efficiency, research has also demonstrated that the adoption of fuel-efficient driving behaviors could lead to further savings. In this work, we propose a novel framework to promote fuel-efficient driving behaviors through real-time automatic monitoring and driver feedback. In this framework, a random-forest based classification model developed using historical data to identifies fuel-inefficient driving behaviors. The classifier considers driver-dependent parameters such as speed and acceleration/deceleration pattern, as well as environmental parameters such as traffic, road topography, and weather to evaluate the fuel efficiency of one-minute driving events. When an inefficient driving action is detected, a fuzzy logic inference system is used to determine what the driver should do to maintain fuel-efficient driving behavior. The decided action is then conveyed to the driver via a smartphone in a non-intrusive manner. Using a dataset from a long-distance bus, we demonstrate that the proposed classification model yields an accuracy of 85.2% while increasing the fuel efficiency up to 16.4%.
△ Less
Submitted 3 July, 2020;
originally announced July 2020.
-
An Analysis of Data Driven, Decision-Making Capabilities of Managers in Banks
Authors:
M. Shazmin Marikar,
H. M. N. Dilum Bandara
Abstract:
Organizations are adopting data analytics and Business Intelligence (BI) tools to gain insights from the past data, forecast future events, and to get timely and reliable information for decision making. While the tools are becoming mature, affordable, and more comfortable to use, it is also essential to understand whether the contemporary managers and leaders are ready for Data-Driven Decision Ma…
▽ More
Organizations are adopting data analytics and Business Intelligence (BI) tools to gain insights from the past data, forecast future events, and to get timely and reliable information for decision making. While the tools are becoming mature, affordable, and more comfortable to use, it is also essential to understand whether the contemporary managers and leaders are ready for Data-Driven Decision Making (DDDM). We explore the extent the Decision Makers (DMs) utilize data and tools, as well as their ability to interpret various forms of outputs from tools and to apply those insights to gain competitive advantage. Our methodology was based on a qualitative survey, where we interviewed 12 DMs of six commercial banks in Sri Lanka at the branch, regional, and CTO, CIO, and Head of IT levels. We identified that on many occasions, DMs' intuition overrules the DDDM due to uncertainty, lack of trust, knowledge, and risk-taking. Moreover, it was identified that the quality of visualizations has a significant impact on the use of intuition by overruling DDDM. We further provide a set of recommendations on the adoption of BI tools and how to overcome the struggles faced while performing DDDM.
△ Less
Submitted 3 July, 2020;
originally announced July 2020.
-
Patterns for Blockchain Data Migration
Authors:
HMN Dilum Bandara,
Xiwei Xu,
Ingo Weber
Abstract:
With the rapid evolution of technological, economic, and regulatory landscapes, contemporary blockchain platforms are all but certain to undergo major changes. Therefore, the applications that rely on them will eventually need to migrate from one blockchain instance to another to remain competitive and secure, as well as to enhance the business process, performance, cost efficiency, privacy, and r…
▽ More
With the rapid evolution of technological, economic, and regulatory landscapes, contemporary blockchain platforms are all but certain to undergo major changes. Therefore, the applications that rely on them will eventually need to migrate from one blockchain instance to another to remain competitive and secure, as well as to enhance the business process, performance, cost efficiency, privacy, and regulatory compliance. However, the differences in data and smart contract representations, modes of hosting, transaction fees, as well as the need to preserve consistency, immutability, and data provenance introduce unique challenges over database migration. We first present a set of blockchain migration scenarios and data fidelity levels using an illustrative example. We then present a set of migration patterns to address those scenarios and the above data management challenges. Finally, we demonstrate how the effort, cost, and risk of migration could be minimized by choosing a suitable set of data migration patterns, data fidelity level, and proactive system design. Practical considerations and research challenges are also highlighted.
△ Less
Submitted 25 May, 2021; v1 submitted 1 June, 2019;
originally announced June 2019.
-
Collaborative Applications over Peer-to-Peer Systems - Challenges and Solutions
Authors:
H. M. N. Dilum Bandara,
Anura P. Jayasumana
Abstract:
Emerging collaborative Peer-to-Peer (P2P) systems require discovery and utilization of diverse, multi-attribute, distributed, and dynamic groups of resources to achieve greater tasks beyond conventional file and processor cycle sharing. Collaborations involving application specific resources and dynamic quality of service goals are stressing current P2P architectures. Salient features and desirabl…
▽ More
Emerging collaborative Peer-to-Peer (P2P) systems require discovery and utilization of diverse, multi-attribute, distributed, and dynamic groups of resources to achieve greater tasks beyond conventional file and processor cycle sharing. Collaborations involving application specific resources and dynamic quality of service goals are stressing current P2P architectures. Salient features and desirable characteristics of collaborative P2P systems are highlighted. Resource advertising, selecting, matching, and binding, the critical phases in these systems, and their associated challenges are reviewed using examples from distributed collaborative adaptive sensing systems, cloud computing, and mobile social networks. State-of-the-art resource discovery/aggregation solutions are compared with respect to their architecture, lookup overhead, load balancing, etc., to determine their ability to meet the goals and challenges of each critical phase. Incentives, trust, privacy, and security issues are also discussed, as they will ultimately determine the success of a collaborative P2P system. Open issues and research opportunities that are essential to achieve the true potential of collaborative P2P systems are discussed.
△ Less
Submitted 7 July, 2012; v1 submitted 3 July, 2012;
originally announced July 2012.