-
On-chip lateral Si:Te PIN photodiodes for room-temperature detection in the telecom optical wavelength bands
Authors:
Mohd Saif Shaikh,
Shuyu Wen,
Mircea-Traian Catuneanu,
Mao Wang,
Artur Erbe,
Slawomir Prucnal,
Lars Rebohle,
Shengqiang Zhou,
Kambiz Jamshidi,
Manfred Helm,
Yonder Berencén
Abstract:
Photonic integrated circuits require photodetectors that operate at room temperature with sensitivity at telecom wavelengths and are suitable for integration with planar complementary-metal-oxide-semiconductor (CMOS) technology. Silicon hyperdoped with deep-level impurities is a promising material for silicon infrared detectors because of its strong room-temperature photoresponse in the short-wave…
▽ More
Photonic integrated circuits require photodetectors that operate at room temperature with sensitivity at telecom wavelengths and are suitable for integration with planar complementary-metal-oxide-semiconductor (CMOS) technology. Silicon hyperdoped with deep-level impurities is a promising material for silicon infrared detectors because of its strong room-temperature photoresponse in the short-wavelength infrared region caused by the creation of an impurity band within the silicon band gap. In this work, we present the first experimental demonstration of lateral Te-hyperdoped Si PIN photodetectors operating at room temperature in the optical telecom bands. We provide a detailed description of the fabrication process, working principle, and performance of the photodiodes, including their key figure of merits. Our results are promising for the integration of active and passive photonic elements on a single Si chip, leveraging the advantages of planar CMOS technology.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Mid- and far-infrared localized surface plasmon resonances in chalcogen-hyperdoped silicon
Authors:
Mao Wang,
Ye Yu,
Slawomir Prucnal,
Yonder Berencén,
Mohd Saif Shaikh,
Lars Rebohle,
Muhammad Bilal Khan,
Vitaly Zviagin,
René Hübner,
Alexej Pashkin,
Artur Erbe,
Yordan M. Georgiev,
Marius Grundmann,
Manfred Helm,
Robert Kirchner,
Shengqiang Zhou
Abstract:
Plasmonic sensing in the infrared region employs the direct interaction of the vibrational fingerprints of molecules with the plasmonic resonances, creating surface-enhanced sensing platforms that are superior than the traditional spectroscopy. However, the standard noble metals used for plasmonic resonances suffer from high radiative losses as well as fabrication challenges, such as tuning the sp…
▽ More
Plasmonic sensing in the infrared region employs the direct interaction of the vibrational fingerprints of molecules with the plasmonic resonances, creating surface-enhanced sensing platforms that are superior than the traditional spectroscopy. However, the standard noble metals used for plasmonic resonances suffer from high radiative losses as well as fabrication challenges, such as tuning the spectral resonance positions into mid- to far-infrared regions, and the compatibility issue with the existing complementary metal-oxide-semiconductor (CMOS) manufacturing platform. Here, we demonstrate the occurrence of mid-infrared localized surface plasmon resonances (LSPR) in thin Si films hyperdoped with the known deep-level impurity tellurium. We show that the mid-infrared LSPR can be further enhanced and spectrally extended to the far-infrared range by fabricating two-dimensional arrays of micrometer-sized antennas in a Te-hyperdoped Si chip. Since Te-hyperdoped Si can also work as an infrared photodetector, we believe that our results will unlock the route toward the direct integration of plasmonic sensors with the one-chip CMOS platform, greatly advancing the possibility of mass manufacturing of high-performance plasmonic sensing systems.
△ Less
Submitted 7 October, 2022;
originally announced October 2022.
-
Human Computations in Citizen Crowds: A Knowledge Management Solution Framework
Authors:
Nadeem Kafi,
Zubair Ahmed Shaikh,
Muhammad Shahid Shaikh
Abstract:
KG (Knowledge Generation) and understanding have traditionally been a Human-centric activity. KE (Knowledge Engineering) and KM (Knowledge Management) have tried to augment human knowledge on two separate planes: the first deals with machine interpretation of knowledge while the later explore interactions in human networks for KG and understanding. However, both remain computer-centric. Crowdsourc…
▽ More
KG (Knowledge Generation) and understanding have traditionally been a Human-centric activity. KE (Knowledge Engineering) and KM (Knowledge Management) have tried to augment human knowledge on two separate planes: the first deals with machine interpretation of knowledge while the later explore interactions in human networks for KG and understanding. However, both remain computer-centric. Crowdsourced HC (Human Computations) have recently utilized human cognition and memory to generate diverse knowledge streams on specific tasks, which are mostly easy for humans to solve but remain challenging for machine algorithms. Literature shows little work on KM frameworks for citizen crowds, which gather input from the diverse category of Humans, organize that knowledge concerning tasks and knowledge categories and recreate new knowledge as a computer-centric activity. In this paper, we present an attempt to create a framework by implementing a simple solution, called ExamCheck, to focus on the generation of knowledge, feedback on that knowledge and recording the results of that knowledge in academic settings. Our solution, based on HC, shows that a structured KM framework can address a complex problem in a context that is important for participants themselves.
△ Less
Submitted 27 November, 2020;
originally announced November 2020.
-
Document clustering using graph based document representation with constraints
Authors:
Muhammad Rafi,
Farnaz Amin,
Mohammad Shahid Shaikh
Abstract:
Document clustering is an unsupervised approach in which a large collection of documents (corpus) is subdivided into smaller, meaningful, identifiable, and verifiable sub-groups (clusters). Meaningful representation of documents and implicitly identifying the patterns, on which this separation is performed, is the challenging part of document clustering. We have proposed a document clustering tech…
▽ More
Document clustering is an unsupervised approach in which a large collection of documents (corpus) is subdivided into smaller, meaningful, identifiable, and verifiable sub-groups (clusters). Meaningful representation of documents and implicitly identifying the patterns, on which this separation is performed, is the challenging part of document clustering. We have proposed a document clustering technique using graph based document representation with constraints. A graph data structure can easily capture the non-linear relationships of nodes, document contains various feature terms that can be non-linearly connected hence a graph can easily represents this information. Constrains, are explicit conditions for document clustering where background knowledge is use to set the direction for Linking or Not-Linking a set of documents for a target clusters, thus guiding the clustering process. We deemed clustering is an ill-define problem, there can be many clustering results. Background knowledge can be used to drive the clustering algorithm in the right direction. We have proposed three different types of constraints, Instance level, corpus level and cluster level constraints. A new algorithm Constrained HAC is also proposed which will incorporate Instance level constraints as prior knowledge; it will guide the clustering process leading to better results. Extensive set of experiments have been performed on both synthetic and standard document clustering datasets, results are compared on standard clustering measures like: purity, entropy and F-measure. Results clearly establish that our proposed approach leads to improvement in cluster quality.
△ Less
Submitted 4 December, 2014;
originally announced December 2014.
-
An improved semantic similarity measure for document clustering based on topic maps
Authors:
Muhammad Rafi,
Mohammad Shahid Shaikh
Abstract:
A major computational burden, while performing document clustering, is the calculation of similarity measure between a pair of documents. Similarity measure is a function that assigns a real number between 0 and 1 to a pair of documents, depending upon the degree of similarity between them. A value of zero means that the documents are completely dissimilar whereas a value of one indicates that the…
▽ More
A major computational burden, while performing document clustering, is the calculation of similarity measure between a pair of documents. Similarity measure is a function that assigns a real number between 0 and 1 to a pair of documents, depending upon the degree of similarity between them. A value of zero means that the documents are completely dissimilar whereas a value of one indicates that the documents are practically identical. Traditionally, vector-based models have been used for computing the document similarity. The vector-based models represent several features present in documents. These approaches to similarity measures, in general, cannot account for the semantics of the document. Documents written in human languages contain contexts and the words used to describe these contexts are generally semantically related. Motivated by this fact, many researchers have proposed seman-tic-based similarity measures by utilizing text annotation through external thesauruses like WordNet (a lexical database). In this paper, we define a semantic similarity measure based on documents represented in topic maps. Topic maps are rapidly becoming an industrial standard for knowledge representation with a focus for later search and extraction. The documents are transformed into a topic map based coded knowledge and the similarity between a pair of documents is represented as a correlation between the common patterns (sub-trees). The experimental studies on the text mining datasets reveal that this new similarity measure is more effective as compared to commonly used similarity measures in text clustering.
△ Less
Submitted 17 March, 2013;
originally announced March 2013.
-
A comparison of SVM and RVM for Document Classification
Authors:
Muhammad Rafi,
Mohammad Shahid Shaikh
Abstract:
Document classification is a task of assigning a new unclassified document to one of the predefined set of classes. The content based document classification uses the content of the document with some weighting criteria to assign it to one of the predefined classes. It is a major task in library science, electronic document management systems and information sciences. This paper investigates docum…
▽ More
Document classification is a task of assigning a new unclassified document to one of the predefined set of classes. The content based document classification uses the content of the document with some weighting criteria to assign it to one of the predefined classes. It is a major task in library science, electronic document management systems and information sciences. This paper investigates document classification by using two different classification techniques (1) Support Vector Machine (SVM) and (2) Relevance Vector Machine (RVM). SVM is a supervised machine learning technique that can be used for classification task. In its basic form, SVM represents the instances of the data into space and tries to separate the distinct classes by a maximum possible wide gap (hyper plane) that separates the classes. On the other hand RVM uses probabilistic measure to define this separation space. RVM uses Bayesian inference to obtain succinct solution, thus RVM uses significantly fewer basis functions. Experimental studies on three standard text classification datasets reveal that although RVM takes more training time, its classification is much better as compared to SVM.
△ Less
Submitted 13 January, 2013;
originally announced January 2013.
-
Content-based Text Categorization using Wikitology
Authors:
Muhammad Rafi,
Sundus Hassan,
Mohammad Shahid Shaikh
Abstract:
A major computational burden, while performing document clustering, is the calculation of similarity measure between a pair of documents. Similarity measure is a function that assign a real number between 0 and 1 to a pair of documents, depending upon the degree of similarity between them. A value of zero means that the documents are completely dissimilar whereas a value of one indicates that the…
▽ More
A major computational burden, while performing document clustering, is the calculation of similarity measure between a pair of documents. Similarity measure is a function that assign a real number between 0 and 1 to a pair of documents, depending upon the degree of similarity between them. A value of zero means that the documents are completely dissimilar whereas a value of one indicates that the documents are practically identical. Traditionally, vector-based models have been used for computing the document similarity. The vector-based models represent several features present in documents. These approaches to similarity measures, in general, cannot account for the semantics of the document. Documents written in human languages contain contexts and the words used to describe these contexts are generally semantically related. Motivated by this fact, many researchers have proposed semantic-based similarity measures by utilizing text annotation through external thesauruses like WordNet (a lexical database). In this paper, we define a semantic similarity measure based on documents represented in topic maps. Topic maps are rapidly becoming an industrial standard for knowledge representation with a focus for later search and extraction. The documents are transformed into a topic map based coded knowledge and the similarity between a pair of documents is represented as a correlation between the common patterns. The experimental studies on the text mining datasets reveal that this new similarity measure is more effective as compared to commonly used similarity measures in text clustering.
△ Less
Submitted 17 August, 2012;
originally announced August 2012.
-
Comparing SVM and Naive Bayes classifiers for text categorization with Wikitology as knowledge enrichment
Authors:
Sundus Hassan,
Muhammad Rafi,
Muhammad Shahid Shaikh
Abstract:
The activity of labeling of documents according to their content is known as text categorization. Many experiments have been carried out to enhance text categorization by adding background knowledge to the document using knowledge repositories like Word Net, Open Project Directory (OPD), Wikipedia and Wikitology. In our previous work, we have carried out intensive experiments by extracting knowled…
▽ More
The activity of labeling of documents according to their content is known as text categorization. Many experiments have been carried out to enhance text categorization by adding background knowledge to the document using knowledge repositories like Word Net, Open Project Directory (OPD), Wikipedia and Wikitology. In our previous work, we have carried out intensive experiments by extracting knowledge from Wikitology and evaluating the experiment on Support Vector Machine with 10- fold cross-validations. The results clearly indicate Wikitology is far better than other knowledge bases. In this paper we are comparing Support Vector Machine (SVM) and Naïve Bayes (NB) classifiers under text enrichment through Wikitology. We validated results with 10-fold cross validation and shown that NB gives an improvement of +28.78%, on the other hand SVM gives an improvement of +6.36% when compared with baseline results. Naïve Bayes classifier is better choice when external enriching is used through any external knowledge base.
△ Less
Submitted 18 February, 2012;
originally announced February 2012.
-
Document Clustering based on Topic Maps
Authors:
Muhammad Rafi,
M. Shahid Shaikh,
Amir Farooq
Abstract:
Importance of document clustering is now widely acknowledged by researchers for better management, smart navigation, efficient filtering, and concise summarization of large collection of documents like World Wide Web (WWW). The next challenge lies in semantically performing clustering based on the semantic contents of the document. The problem of document clustering has two main components: (1) to…
▽ More
Importance of document clustering is now widely acknowledged by researchers for better management, smart navigation, efficient filtering, and concise summarization of large collection of documents like World Wide Web (WWW). The next challenge lies in semantically performing clustering based on the semantic contents of the document. The problem of document clustering has two main components: (1) to represent the document in such a form that inherently captures semantics of the text. This may also help to reduce dimensionality of the document, and (2) to define a similarity measure based on the semantic representation such that it assigns higher numerical values to document pairs which have higher semantic relationship. Feature space of the documents can be very challenging for document clustering. A document may contain multiple topics, it may contain a large set of class-independent general-words, and a handful class-specific core-words. With these features in mind, traditional agglomerative clustering algorithms, which are based on either Document Vector model (DVM) or Suffix Tree model (STC), are less efficient in producing results with high cluster quality. This paper introduces a new approach for document clustering based on the Topic Map representation of the documents. The document is being transformed into a compact form. A similarity measure is proposed based upon the inferred information through topic maps data and structures. The suggested method is implemented using agglomerative hierarchal clustering and tested on standard Information retrieval (IR) datasets. The comparative experiment reveals that the proposed approach is effective in improving the cluster quality.
△ Less
Submitted 28 December, 2011;
originally announced December 2011.