-
Negative Impact and Probable Management Policy of E-Waste in Bangladesh
Authors:
S. M. Rezaul Karim,
Shariful Islam Sharif,
Md. Anisur Rahman Anik
Abstract:
Due to the wider usage of electrical and electronic products throughout the country, electronic waste (E-waste) is growing and creating problems. E-waste contains toxic materials which may cause severe health risks and environmental damage. Bangladesh produces almost 2.7 million metric tons of E-waste per year and remains at high risk. This paper identifies the problem and suggests proper management of E-waste.
Submitted 18 September, 2018;
originally announced September 2018.
-
Exploring Scientific Application Performance Using Large Scale Object Storage
Authors:
Steven Wei-der Chien,
Stefano Markidis,
Rami Karim,
Erwin Laure,
Sai Narasimhamurthy
Abstract:
One of the major performance and scalability bottlenecks in large scientific applications is parallel reading and writing to supercomputer I/O systems. The usage of parallel file systems and the consistency requirements of POSIX, to which all the traditional HPC parallel I/O interfaces adhere, limit the scalability of scientific applications. Object storage is a widely used storage technology in cloud computing and is increasingly proposed for HPC workloads to improve the current scalability and performance of I/O in scientific applications. While object storage is a promising technology, it is still unclear how scientific applications will use object storage and what the main performance benefits will be. This work addresses these questions by emulating an object store used by a traditional scientific application and evaluating potential performance benefits. We show that scientific applications can benefit from the usage of object storage on large scales.
Submitted 6 July, 2018;
originally announced July 2018.
-
Convolutional Embedded Networks for Population Scale Clustering and Bio-ancestry Inferencing
Authors:
Md. Rezaul Karim,
Michael Cochez,
Achille Zappa,
Ratnesh Sahay,
Oya Beyan,
Dietrich-Rebholz Schuhmann,
Stefan Decker
Abstract:
The study of genetic variants (GVs) can help find correlating population groups to identify cohorts that are predisposed to common diseases and explain differences in disease susceptibility and how patients react to drugs. Machine learning algorithms are increasingly being applied to identify interacting GVs to understand their complex phenotypic traits. Since the performance of a learning algorithm depends not only on the size and nature of the data but also on the quality of the underlying representation, deep neural networks (DNNs) can learn non-linear mappings that transform GV data into representations that are friendlier to clustering and classification than manual feature selection. In this paper, we propose convolutional embedded networks in which we combine two DNN architectures, called convolutional embedded clustering (CEC) and convolutional autoencoder (CAE) classifier, for clustering individuals and predicting geographic ethnicity based on GVs, respectively. We employed CAE-based representation learning on 95 million GVs from the 1000 Genomes and Simons Genome Diversity projects. Quantitative and qualitative analyses with a focus on accuracy and scalability show that our approach outperforms state-of-the-art approaches such as VariantSpark and ADMIXTURE. In particular, CEC can cluster targeted population groups in 22 hours with an adjusted Rand index of 0.915, a normalized mutual information of 0.92, and a clustering accuracy of 89%. The CAE classifier, in turn, can predict the geographic ethnicity of unknown samples with an F1 score and Matthews correlation coefficient (MCC) of 0.9004 and 0.8245, respectively. To provide interpretations of the predictions, we identify significant biomarkers using gradient boosted trees (GBT) and SHAP. Overall, our approach is transparent and faster than the baseline methods, and scalable for 5% to 100% of the full human genome.
Submitted 19 April, 2020; v1 submitted 30 May, 2018;
originally announced May 2018.
-
A Wideband Self-Consistent Disk-Averaged Spectrum of Jupiter Near 30 GHz and Its Implications for NH$_{3}$ Saturation in the Upper Troposphere
Authors:
Ramsey L. Karim,
David DeBoer,
Imke de Pater,
Garrett K. Keating
Abstract:
We present a new set of measurements obtained with the Combined Array for Research in Millimeter-wave Astronomy (CARMA) of Jupiter's microwave thermal emission near the 1.3 cm ammonia (NH$_{3}$) absorption band. We use these observations to investigate the ammonia mole fraction in the upper troposphere, near $0.3 < P < 2$ bar, based on radiative transfer modeling. We find that the ammonia mole fraction must be $\sim2.4\times 10^{-4}$ below the NH$_{3}$ ice cloud, i.e., at $0.8 < P < 8$ bar, in agreement with results by de Pater et al. (2001, 2016a). We find the NH$_{3}$ cloud-forming region between $0.3 < P < 0.8$ bar to be globally sub-saturated by 55% on average, in accordance with the result in Gibson et al. (2005). Although our data are not very sensitive to the region above the cloud layer, we are able to set an upper limit of $2.4\times 10^{-7}$ on the mole fraction here, a factor of $\sim$10 above the saturated vapor curve.
Submitted 23 January, 2018;
originally announced January 2018.
-
CardiacNET: Segmentation of Left Atrium and Proximal Pulmonary Veins from MRI Using Multi-View CNN
Authors:
Aliasghar Mortazi,
Rashed Karim,
Kawal Rhode,
Jeremy Burt,
Ulas Bagci
Abstract:
Anatomical and biophysical modeling of the left atrium (LA) and proximal pulmonary veins (PPVs) is important for clinical management of several cardiac diseases. Magnetic resonance imaging (MRI) allows qualitative assessment of LA and PPVs through visualization. However, there is a strong need for an advanced image segmentation method to be applied to cardiac MRI for quantitative analysis of LA and PPVs. In this study, we address this unmet clinical need by exploring a new deep learning-based segmentation strategy for quantification of LA and PPVs with high accuracy and heightened efficiency. Our approach is based on a multi-view convolutional neural network (CNN) with an adaptive fusion strategy and a new loss function that allows fast and more accurate convergence of the backpropagation-based optimization. After training our network from scratch using more than 60K 2D MRI images (slices), we evaluated our segmentation strategy on the STACOM 2013 cardiac segmentation challenge benchmark. Qualitative and quantitative evaluations, obtained from the segmentation challenge, indicate that the proposed method achieved state-of-the-art sensitivity (90%), specificity (99%), precision (94%), and efficiency levels (10 seconds on GPU, and 7.5 minutes on CPU).
Submitted 19 May, 2017; v1 submitted 17 May, 2017;
originally announced May 2017.
-
Decision Support for Increasing the Efficiency of Crowdsourced Software Development
Authors:
Muhammad Rezaul Karim,
David Messinger,
Ye Yang,
Guenther Ruhe
Abstract:
Crowdsourced software development (CSD) offers a series of specified tasks to a large crowd of trustworthy software workers. Topcoder is a leading platform to manage the whole process of CSD. While increasingly accepted as a realistic option for software development, preliminary analysis on Topcoder's software crowd worker behaviors reveals an alarming task-quitting rate of 82.9%. In addition, a substantial number of tasks do not receive any successful submission.
In this paper, we report on a methodology to improve the efficiency of CSD. We apply massive data analytics and machine learning to (i) perform a comparative analysis of alternative techniques for predicting the likelihood of winners and quitters for each task, (ii) significantly reduce the amount of non-succeeding development effort in registered but inappropriate tasks, (iii) identify and rank the most qualified registered workers for each task, and (iv) provide reliable prediction of tasks at risk of not receiving any successful submission.
Our results and analysis show that the Random Forest (RF) based predictive technique performs best among the alternative techniques studied. Applying RF, the tasks recommended to workers can reduce the amount of non-succeeding development effort to a great extent. On average, over a period of 30 days, the savings are 3.5 and 4.6 person-days per registered task for experienced and inexperienced workers, respectively. For the task-related recommendations of workers, we can accurately recommend at least 1 actual winner among the top-ranked workers, in particular 94.07% of the time among the top-2 recommended workers for each task. Finally, we can predict, with more than 80% F-measure, the tasks likely not to receive any submission, thus triggering timely corrective actions from CSD platforms or task requesters.
Submitted 13 October, 2016;
originally announced October 2016.
-
A novel and effective scoring scheme for structure classification and pairwise similarity measurement
Authors:
Rezaul Karim,
Md. Momin Al Aziz,
Swakkhar Shatabda,
M. Sohel Rahman
Abstract:
A protein's tertiary structure defines its functions, classification and binding sites. Similar structural characteristics between two proteins often lead to similar functional characteristics. Determining structural similarity accurately in real time is a crucial research issue. In this paper, we present a novel and effective scoring scheme that depends on novel features extracted from protein alpha carbon distance matrices. Our scoring scheme is inspired by pattern recognition and computer vision. Our method is significantly better than the current state-of-the-art methods in terms of family match of pairs of protein structures and other statistical measurements. The effectiveness of our method is tested on standard benchmark structures. A web service is available at http://research.buet.ac.bd:8080/Comograd/score.html where you can get the similarity measurement score between two protein structures based on our method.
Submitted 4 October, 2016;
originally announced October 2016.
-
CoMOGrad and PHOG: From Computer Vision to Fast and Accurate Protein Tertiary Structure Retrieval
Authors:
Rezaul Karim,
Mohd. Momin Al Aziz,
Swakkhar Shatabda,
M. Sohel Rahman,
Md. Abul Kashem Mia,
Farhana Zaman,
Salman Rakin
Abstract:
Due to advancements in technology, the number of entries in the structural database of proteins is increasing day by day. Methods for retrieving protein tertiary structures from this large database are the key to comparative analysis of structures, which plays an important role in understanding proteins and their function. In this paper, we present fast and accurate methods for the retrieval of proteins from a large database with tertiary structures similar to a query protein. Our proposed methods borrow ideas from the field of computer vision. The speed and accuracy of our methods come from the two newly introduced features, the co-occurrence matrix of the oriented gradient and the pyramid histogram of oriented gradient, and from the use of Euclidean distance as the distance measure. Experimental results clearly indicate the superiority of our approach in both running time and accuracy. Our method is readily available for use from this website: http://research.buet.ac.bd:8080/Comograd/.
Submitted 2 September, 2014;
originally announced September 2014.
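The retrieval step described in the abstract above (ranking database structures by Euclidean distance between feature vectors) can be sketched as follows. This is only an illustration under stated assumptions: the function name `retrieve`, the protein identifiers, and the three-dimensional toy vectors are hypothetical, and the real CoMOGrad and PHOG features are not computed here.

```python
import math

def retrieve(query, database, k=3):
    """Return the k database keys whose feature vectors are closest to `query`.

    `database` maps a (hypothetical) structure identifier to a feature vector;
    closeness is plain Euclidean distance, as the abstract describes.
    """
    ranked = sorted(database.items(), key=lambda item: math.dist(query, item[1]))
    return [name for name, _ in ranked[:k]]

# Toy feature vectors standing in for real descriptors (assumed values).
toy_database = {
    "protA": (0.0, 0.0, 1.0),
    "protB": (0.9, 0.1, 0.0),
    "protC": (1.0, 0.0, 0.0),
}
nearest = retrieve((1.0, 0.0, 0.1), toy_database, k=2)
```

A linear scan like this costs one distance computation per database entry; whether the authors use an index or other acceleration is not stated in the abstract.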
-
A New Approach to Constraint Weight Learning for Variable Ordering in CSPs
Authors:
Muhammad Rezaul Karim
Abstract:
A Constraint Satisfaction Problem (CSP) is a framework used for modeling and solving constrained problems. Tree-search algorithms like backtracking try to construct a solution to a CSP by selecting the variables of the problem one after another. The order in which these algorithms select the variables potentially has a significant impact on the search performance. Various heuristics have been proposed for choosing a good variable ordering. Many powerful variable ordering heuristics weigh the constraints first and then utilize the weights to select a good order of the variables. Constraint weights are basically employed to identify global bottlenecks in a CSP.
In this paper, we propose a new approach for learning weights for the constraints using a competitive coevolutionary Genetic Algorithm (GA). Weights learned by the coevolutionary GA later help to make better choices for the first few variables in a search. In the competitive coevolutionary GA, constraints and candidate solutions for a CSP evolve together through an inverse fitness interaction process. We have conducted experiments on several random, quasi-random and patterned instances to measure the efficiency of the proposed approach. The results and analysis show that the proposed approach is good at learning weights to distinguish the hard constraints for quasi-random instances and forced satisfiable random instances generated with the Model RB. For other types of instances, RNDI still seems to be the best approach, as our experiments show.
Submitted 25 December, 2013;
originally announced December 2013.
-
Speaker Identification using MFCC-Domain Support Vector Machine
Authors:
S. M. Kamruzzaman,
A. N. M. Rezaul Karim,
Md. Saiful Islam,
Md. Emdadul Haque
Abstract:
Speech recognition and speaker identification are important for authentication and verification for security purposes, but they are difficult to achieve. Speaker identification methods can be divided into text-independent and text-dependent. This paper presents a technique for text-dependent speaker identification using an MFCC-domain support vector machine (SVM). In this work, mel-frequency cepstrum coefficients (MFCCs) and their statistical distribution properties are used as features, which will be inputs to the neural network. This work first uses the sequential minimal optimization (SMO) learning technique for SVM, which improves performance over the traditional Chunking and Osuna techniques. The cepstrum coefficients representing the speaker characteristics of a speech segment are computed by nonlinear filter bank analysis and discrete cosine transform. The speaker identification ability and convergence speed of the SVMs are investigated for different combinations of features. Extensive experimental results on several samples show the effectiveness of the proposed approach.
Submitted 25 September, 2010;
originally announced September 2010.
-
Wrapper/TAM Co-Optimization and constrained Test Scheduling for SOCs Using Rectangle Bin Packing
Authors:
Hafiz Md. Hasan Babu,
Md. Rafiqul Islam,
Muhammad Rezaul Karim,
Abdullah Al Mahmud,
Md. Saiful Islam
Abstract:
This paper describes an integrated framework for SOC test automation. This framework is based on a new approach for Wrapper/TAM co-optimization based on rectangle packing, considering the diagonal length of the rectangles to emphasize both the TAM width required by a core and its corresponding testing time. In this paper, an efficient algorithm has been proposed to construct wrappers that reduce testing time for cores. Rectangle packing has been used to develop an integrated scheduling algorithm that incorporates power constraints in the test schedule. The test power consumption is important to consider since exceeding the system's power limit might damage the system.
Submitted 26 August, 2010;
originally announced August 2010.
-
Wrapper/TAM Co-Optimization and Test Scheduling for SOCs Using Rectangle Bin Packing Considering Diagonal Length of Rectangles
Authors:
Md. Rafiqul Islam,
Muhammad Rezaul Karim,
Abdullah Al Mahmud,
Md. Saiful Islam,
Hafiz Md. Hasan Babu
Abstract:
This paper describes an integrated framework for SOC test automation. This framework is based on a new approach for Wrapper/TAM co-optimization based on rectangle packing, considering the diagonal length of the rectangles to emphasize both the TAM width required by a core and its corresponding testing time. In this paper, we propose an efficient algorithm to construct wrappers that reduce testing time for cores. We then use rectangle packing to develop an integrated scheduling algorithm that incorporates power constraints in the test schedule. The test power consumption is important to consider since exceeding the system's power limit might damage the system.
Submitted 26 August, 2010;
originally announced August 2010.
-
Sorting Network for Reversible Logic Synthesis
Authors:
Md. Saiful Islam,
Md. Rafiqul Islam,
Abdullah Al Mahmud,
Muhammad Rezaul Karim
Abstract:
In this paper, we have introduced an algorithm to implement a sorting network for reversible logic synthesis based on swapping bit strings. The algorithm first constructs a network in terms of n*n Toffoli gates read from left to right. The number of gates in the circuit produced by our algorithm is then reduced by template matching and removing useless gates from the network. We have also compared the efficiency of the proposed method with the existing ones.
Submitted 22 August, 2010;
originally announced August 2010.
-
Building Toffoli Network for Reversible Logic Synthesis Based on Swapping Bit Strings
Authors:
Hafiz Md. Hasaan Babu,
Md. Saiful Islam,
Md. Rafiqul Islam,
Lafifa Jamal,
Abu Ahmed Ferdaus,
Muhammad Rezaul Karim,
Abdullah Al Mahmud
Abstract:
In this paper, we have designed and implemented a sorting network for reversible logic circuit synthesis in terms of n*n Toffoli gates. The algorithm presented in this paper constructs a Toffoli network based on swapping bit strings. Reduction rules are then applied by simple template matching and removing useless gates from the network. Random selection of bit strings and reduction of control inputs are used to minimize both the number of gates and gate width. The method produces near-optimal results for up to 3-input 3-output circuits.
Submitted 19 August, 2010;
originally announced August 2010.
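For readers unfamiliar with the building block named in the abstract above, the following minimal sketch shows what a generalized Toffoli gate does and why a left-to-right cascade of such gates is reversible. It is not the authors' synthesis algorithm: the function names, the gate encoding as `(controls, target)` pairs, and the three-gate swap example are assumptions chosen for illustration.

```python
from itertools import product

def toffoli(bits, controls, target):
    """Flip bits[target] if and only if every control bit is 1."""
    out = list(bits)
    if all(bits[c] == 1 for c in controls):
        out[target] ^= 1
    return tuple(out)

def truth_table(n, gates):
    """Push every n-bit input through a cascade of gates, read left to right."""
    table = {}
    for bits in product((0, 1), repeat=n):
        state = bits
        for controls, target in gates:
            state = toffoli(state, controls, target)
        table[bits] = state
    return table

# Three single-control Toffoli (CNOT) gates swap the two bits of a 2-bit string.
swap_gates = [((0,), 1), ((1,), 0), ((0,), 1)]
table = truth_table(2, swap_gates)

# A circuit is reversible when its truth table is a permutation of the inputs.
is_reversible = len(set(table.values())) == len(table)
```

Because each gate is its own inverse, running the same cascade in reverse order undoes the circuit, which is the property that synthesis methods of this kind rely on.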
-
Variable Block Carry Skip Logic using Reversible Gates
Authors:
Md. Rafiqul Islam,
Md. Saiful Islam,
Muhammad Rezaul Karim,
Abdullah Al Mahmud,
Hafiz Md. Hasan Babu
Abstract:
Reversible circuits have applications in digital signal processing, computer graphics, quantum computation and cryptography. In this paper, a generalized k*k reversible gate family is proposed and a 3*3 gate of the family is discussed. Inverter, AND, OR, NAND, NOR, and EXOR gates can be realized by this gate. Implementation of a full-adder circuit using two such 3*3 gates is given. This full-adder circuit contains only two reversible gates and produces no extra garbage outputs. The proposed full-adder circuit is efficient in terms of gate count, garbage outputs and quantum cost. A 4-bit carry skip adder is designed using this full-adder circuit and a variable block carry skip adder is discussed. The equations required to evaluate these adders are presented.
Submitted 19 August, 2010;
originally announced August 2010.
-
Efficient Wrapper/TAM Co-Optimization for SOC Using Rectangle Packing
Authors:
Md. Rafiqul Islam,
Muhammad Rezaul Karim,
Abdullah Al Mahmud,
Md. Saiful Islam,
Hafiz Md. Hasan Babu
Abstract:
The testing time for a system-on-chip (SOC) largely depends on the design of test wrappers and the test access mechanism (TAM). Wrapper/TAM co-optimization is therefore necessary to minimize SOC testing time. In this paper, we propose an efficient algorithm to construct wrappers that reduce testing time for cores. We further propose a new approach for wrapper/TAM co-optimization based on two-dimensional rectangle packing. This approach considers the diagonal length of the rectangles to emphasize both the TAM width required by a core and its corresponding testing time.
Submitted 19 August, 2010;
originally announced August 2010.
-
Clustering of Content Supporting Computer Mediated Courseware Development
Authors:
G. M. M. Bashir,
M. J. Hossain,
M. R. Karim
Abstract:
Computer Mediated Courseware (CMC) has so far been developed for individual courses using single or multiple textbooks. A group of courseware can be developed using multiple textbooks, and in this case it is necessary to cluster the contents of different books to form a generalized clustered content. No work has been found that develops courseware by applying generalized clustered content. We have proposed a clustering of content supporting computer mediated courseware development, based on data mining techniques, to construct a hierarchical general structure of a group of courseware by combining the individual structures of a set of books. The clustering will help the courseware developer to dynamically allocate contents to develop different courses using a group of books. The authors have applied this methodology to different levels of courses on databases. The methodology is generalized and can be applied to any other courses.
Submitted 26 April, 2010;
originally announced April 2010.