-
Co-evolution of RDF Datasets
Authors:
Sidra Faisal,
Kemele M. Endris,
Saeedeh Shekarpour,
Sören Auer
Abstract:
Linking Data initiatives have fostered the publication of a large number of RDF datasets in the Linked Open Data (LOD) cloud, as well as the development of query processing infrastructures to access these data in a federated fashion. However, experimental studies have shown that the availability of LOD datasets cannot always be ensured, so RDF data replication is required for reliable federated query frameworks. Although it enhances data availability, RDF data replication requires synchronization and conflict resolution when replicas and source datasets are allowed to change over time; that is, co-evolution management is needed to ensure consistency. In this paper, we tackle the problem of RDF data co-evolution and devise an approach for conflict resolution during co-evolution of RDF datasets. Our proposed approach is property-oriented and allows for exploiting semantics about RDF properties during co-evolution management. The quality of our approach is empirically evaluated in different scenarios on the DBpedia-live dataset. Experimental results suggest that the proposed techniques have a positive impact on the quality of data in source datasets and replicas.
Submitted 28 March, 2016; v1 submitted 20 January, 2016;
originally announced January 2016.
-
Interest-based RDF Update Propagation
Authors:
Kemele M. Endris,
Sidra Faisal,
Fabrizio Orlandi,
Sören Auer,
Simon Scerri
Abstract:
Many LOD datasets, such as DBpedia and LinkedGeoData, are voluminous and process large numbers of requests from diverse applications. Many data products and services rely on full or partial local LOD replications to ensure faster querying and processing. While such replicas enhance the flexibility of information sharing and integration infrastructures, they also introduce data duplication with all the associated undesirable consequences. Given the evolving nature of the original and authoritative datasets, keeping replicas consistent and up to date requires frequent replacements at great cost. In this paper, we introduce an approach for interest-based RDF update propagation, which propagates only the interesting parts of updates from the source to the target dataset. Effectively, this enables remote applications to "subscribe" to relevant datasets and consistently reflect the necessary changes locally without the need to frequently replace the entire dataset (or a relevant subset). Our approach is based on a formal definition of graph-pattern-based interest expressions, which are used to filter the interesting parts of updates from the source. We implement the approach in the iRap framework and perform a comprehensive evaluation based on DBpedia Live updates to confirm the validity and value of our approach.
Submitted 26 May, 2015;
originally announced May 2015.
-
Measuring Fatigue of Soldiers in Wireless Body Area Sensor Networks
Authors:
N. Javaid,
S. Faisal,
Z. A. Khan,
D. Nayab,
M. Zahid
Abstract:
Wireless Body Area Sensor Networks (WBASNs) consist of on-body or in-body sensors placed on the human body for health monitoring. Energy conservation of these sensors, while guaranteeing a required level of performance, is a challenging task. Energy-efficient routing schemes are designed to extend network lifetime. In this paper, we propose a routing protocol for measuring the fatigue of a soldier. Three sensors attached to the soldier's body monitor specific parameters. Our proposed protocol is event-driven and considers three scenarios for measuring the fatigue of a soldier. We evaluate our proposed work in terms of network lifetime, throughput, remaining energy of sensors, and fatigue of a soldier.
Submitted 27 July, 2013;
originally announced July 2013.
-
Z-SEP: Zonal-Stable Election Protocol for Wireless Sensor Networks
Authors:
S. Faisal,
N. Javaid,
A. Javaid,
M. A. Khan,
S. H. Bouk,
Z. A. Khan
Abstract:
Wireless Sensor Networks (WSNs) consist of thousands of sensor nodes, with restricted energy, that co-operate to accomplish a sensing task. Various routing protocols have been designed for transmission in WSNs. In this paper, we propose a hybrid routing protocol, the Zonal-Stable Election Protocol (Z-SEP), for heterogeneous WSNs. In this protocol, some nodes transmit data directly to the base station, while others use a clustering technique to send data to the base station, as in SEP. We implemented Z-SEP and compared it with the traditional Low-Energy Adaptive Clustering Hierarchy (LEACH) and SEP. Simulation results show that Z-SEP improves the stability period and throughput compared with existing protocols such as LEACH and SEP.
Submitted 24 March, 2013; v1 submitted 21 March, 2013;
originally announced March 2013.
-
Hash in a Flash: Hash Tables for Solid State Devices
Authors:
Tyler Clemons,
S. M. Faisal,
Shirish Tatikonda,
Charu Aggarwal,
Srinivasan Parthasarathy
Abstract:
In recent years, information retrieval algorithms have taken center stage for extracting important data from ever larger datasets. Advances in hardware technology have led to the increasingly widespread use of flash storage devices. Such devices have clear benefits over traditional hard drives in terms of access latency, bandwidth, and random access capabilities, particularly when reading data. There are, however, some interesting trade-offs to consider when leveraging the advanced features of such devices. On a relative scale, writing to such devices can be expensive. This is because typical flash devices (NAND technology) are updated in blocks. A minor update to a given block requires the entire block to be erased, followed by a rewriting of the block. On the other hand, sequential writes can be two orders of magnitude faster than random writes. In addition, random writes shorten the life of the flash drive, since each block can support only a limited number of erasures. TF-IDF can be implemented using a counting hash table. In general, hash tables are a particularly challenging case for the flash drive because this data structure is inherently dependent upon the randomness of the hash function, as opposed to the spatial locality of the data. This makes it difficult to avoid the random writes incurred during the construction of the counting hash table for TF-IDF. In this paper, we study the design landscape for the development of a hash table for flash storage devices. We demonstrate how to effectively design a hash table with two related hash functions, one of which exhibits a data placement property with respect to the other. Specifically, we focus on three designs based on this general philosophy and evaluate the trade-offs among them along the axes of query performance, insert and update times, and I/O time through an implementation of the TF-IDF algorithm.
Submitted 19 November, 2012;
originally announced November 2012.
-
Elastic Fidelity: Trading-off Computational Accuracy for Energy Reduction
Authors:
Sourya Roy,
Tyler Clemons,
S M Faisal,
Ke Liu,
Nikos Hardavellas,
Srinivasan Parthasarathy
Abstract:
Power dissipation and energy consumption have become some of the most important problems in the design of processors today. This is especially true in power-constrained environments, such as embedded and mobile computing. While lowering the operational voltage can reduce power consumption, there are limits imposed at design time, beyond which hardware components experience faulty operation. Moreover, the decrease in feature size has led to higher susceptibility to process variations, leading to reliability issues and lowering yield. However, not all computations and all data in a workload need to maintain 100% fidelity. In this paper, we explore the idea of employing functional or storage units that let go of the conservative guardbands imposed on the design to guarantee reliable execution. Rather, these units exhibit Elastic Fidelity by judiciously lowering the voltage to trade off reliable execution for power consumption based on the error guarantees required by the executing code. By estimating the accuracy required by each computational segment of a workload, and steering each computation to different functional and storage units, Elastic Fidelity Computing obtains power and energy savings while reaching the reliability targets required by each computational segment. Our preliminary results indicate that even with conservative estimates, Elastic Fidelity can reduce the power and energy consumption of a processor by 11-13% when executing applications involving human perception that are typically included in modern mobile platforms, such as audio, image, and video decoding.
Submitted 17 November, 2011;
originally announced November 2011.