-
LLM aided semi-supervision for Extractive Dialog Summarization
Authors:
Nishant Mishra,
Gaurav Sahu,
Iacer Calixto,
Ameen Abu-Hanna,
Issam H. Laradji
Abstract:
Generating high-quality summaries for chat dialogs often requires large labeled datasets. We propose a method to efficiently use unlabeled data for extractive summarization of customer-agent dialogs. In our method, we frame summarization as a question-answering problem and use state-of-the-art large language models (LLMs) to generate pseudo-labels for a dialog. We then use these pseudo-labels to f…
▽ More
Generating high-quality summaries for chat dialogs often requires large labeled datasets. We propose a method to efficiently use unlabeled data for extractive summarization of customer-agent dialogs. In our method, we frame summarization as a question-answering problem and use state-of-the-art large language models (LLMs) to generate pseudo-labels for a dialog. We then use these pseudo-labels to fine-tune a chat summarization model, effectively transferring knowledge from the large LLM into a smaller specialized model. We demonstrate our method on the \tweetsumm dataset, and show that using 10% of the original labelled data set we can achieve 65.9/57.0/61.0 ROUGE-1/-2/-L, whereas the current state-of-the-art trained on the entire training data set obtains 65.16/55.81/64.37 ROUGE-1/-2/-L. In other words, in the worst case (i.e., ROUGE-L) we still effectively retain 94.7% of the performance while using only 10% of the data.
△ Less
Submitted 23 November, 2023; v1 submitted 19 November, 2023;
originally announced November 2023.
-
Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization
Authors:
Gaurav Sahu,
Olga Vechtomova,
Issam H. Laradji
Abstract:
Semi-supervised learning (SSL) is a widely used technique in scenarios where labeled data is scarce and unlabeled data is abundant. While SSL is popular for image and text classification, it is relatively underexplored for the task of extractive text summarization. Standard SSL methods follow a teacher-student paradigm to first train a classification model and then use the classifier's confidence…
▽ More
Semi-supervised learning (SSL) is a widely used technique in scenarios where labeled data is scarce and unlabeled data is abundant. While SSL is popular for image and text classification, it is relatively underexplored for the task of extractive text summarization. Standard SSL methods follow a teacher-student paradigm to first train a classification model and then use the classifier's confidence values to select pseudo-labels for the subsequent training cycle; however, such classifiers are not suitable to measure the accuracy of pseudo-labels as they lack specific tuning for evaluation, which leads to confidence values that fail to capture the semantics and correctness of the generated summary. To address this problem, we propose a prompt-based pseudo-labeling strategy with LLMs that picks unlabeled examples with more accurate pseudo-labels than using just the classifier's probability outputs. Our approach also includes a relabeling mechanism that improves the quality of pseudo-labels. We evaluate our method on three text summarization datasets: TweetSumm, WikiHow, and ArXiv/PubMed. We empirically show that a prompting-based LLM that scores and generates pseudo-labels outperforms existing SSL methods on ROUGE-1, ROUGE-2, and ROUGE-L scores on all the datasets. Furthermore, our method achieves competitive L-Eval scores (evaluation with LLaMa-3) as a fully supervised method in a data-scarce setting and outperforms fully supervised method in a data-abundant setting.
△ Less
Submitted 1 July, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Authors:
Gaurav Sahu,
Olga Vechtomova,
Dzmitry Bahdanau,
Issam H. Laradji
Abstract:
Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training data. Recent work often tackles this problem using large language models (LLMs) like GPT3 that can generate new examples given already available ones. In this work, we propose a method to generate more helpful augmented data by utilizing the LLM's abilities to follo…
▽ More
Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training data. Recent work often tackles this problem using large language models (LLMs) like GPT3 that can generate new examples given already available ones. In this work, we propose a method to generate more helpful augmented data by utilizing the LLM's abilities to follow instructions and perform few-shot classifications. Our specific PromptMix method consists of two steps: 1) generate challenging text augmentations near class boundaries; however, generating borderline examples increases the risk of false positives in the dataset, so we 2) relabel the text augmentations using a prompting-based LLM classifier to enhance the correctness of labels in the generated data. We evaluate the proposed method in challenging 2-shot and zero-shot settings on four text classification datasets: Banking77, TREC6, Subjectivity (SUBJ), and Twitter Complaints. Our experiments show that generating and, crucially, relabeling borderline examples facilitates the transfer of knowledge of a massive LLM like GPT3.5-turbo into smaller and cheaper classifiers like DistilBERT$_{base}$ and BERT$_{base}$. Furthermore, 2-shot PromptMix outperforms multiple 5-shot data augmentation methods on the four datasets. Our code is available at https://github.com/ServiceNow/PromptMix-EMNLP-2023.
△ Less
Submitted 22 October, 2023;
originally announced October 2023.
-
Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media
Authors:
Liam Hebert,
Gaurav Sahu,
Yuxuan Guo,
Nanda Kishore Sreenivas,
Lukasz Golab,
Robin Cohen
Abstract:
We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the cont…
▽ More
We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the contextual relationships in the discussion surrounding a comment and grounding the interwoven fusion layers that combine text and image embeddings instead of processing modalities separately. To evaluate our work, we present a new dataset, HatefulDiscussions, comprising complete multi-modal discussions from multiple online communities on Reddit. We compare the performance of our model to baselines that only process individual comments and conduct extensive ablation studies.
△ Less
Submitted 22 February, 2024; v1 submitted 18 July, 2023;
originally announced July 2023.
-
Future Sight: Dynamic Story Generation with Large Pretrained Language Models
Authors:
Brian D. Zimmerman,
Gaurav Sahu,
Olga Vechtomova
Abstract:
Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new text with respect to previously generated text. The output distribution of candidate tokens at any position is conditioned on previously selected tokens using a s…
▽ More
Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new text with respect to previously generated text. The output distribution of candidate tokens at any position is conditioned on previously selected tokens using a self-attention mechanism to emulate the property of autoregression. This is inherently limiting for tasks such as controllable story generation where it may be necessary to condition on future plot events when writing a story. In this work, we propose Future Sight, a method for finetuning a pretrained generative transformer on the task of future conditioning. Transformer decoders are typically pretrained on the task of completing a context, one token at a time, by means of self-attention. Future Sight additionally enables a decoder to attend to an encoded future plot event. This motivates the decoder to expand on the context in a way that logically concludes with the provided future. During inference, the future plot event can be written by a human author to steer the narrative being generated in a certain direction. We evaluate the efficacy of our approach on a story generation task with human evaluators.
△ Less
Submitted 19 December, 2022;
originally announced December 2022.
-
LyricJam Sonic: A Generative System for Real-Time Composition and Musical Improvisation
Authors:
Olga Vechtomova,
Gaurav Sahu
Abstract:
Electronic music artists and sound designers have unique workflow practices that necessitate specialized approaches for develo** music information retrieval and creativity support tools. Furthermore, electronic music instruments, such as modular synthesizers, have near-infinite possibilities for sound creation and can be combined to create unique and complex audio paths. The process of discoveri…
▽ More
Electronic music artists and sound designers have unique workflow practices that necessitate specialized approaches for develo** music information retrieval and creativity support tools. Furthermore, electronic music instruments, such as modular synthesizers, have near-infinite possibilities for sound creation and can be combined to create unique and complex audio paths. The process of discovering interesting sounds is often serendipitous and impossible to replicate. For this reason, many musicians in electronic genres record audio output at all times while they work in the studio. Subsequently, it is difficult for artists to rediscover audio segments that might be suitable for use in their compositions from thousands of hours of recordings. In this paper, we describe LyricJam Sonic -- a novel creative tool for musicians to rediscover their previous recordings, re-contextualize them with other recordings, and create original live music compositions in real-time. A bi-modal AI-driven approach uses generated lyric lines to find matching audio clips from the artist's past studio recordings, and uses them to generate new lyric lines, which in turn are used to find other clips, thus creating a continuous and evolving stream of music and lyrics. The intent is to keep the artists in a state of creative flow conducive to music creation rather than taking them into an analytical/critical state of deliberately searching for past audio segments. The system can run in either a fully autonomous mode without user input, or in a live performance mode, where the artist plays live music, while the system "listens" and creates a continuous stream of music and lyrics in response.
△ Less
Submitted 27 October, 2022;
originally announced October 2022.
-
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
Authors:
Gaurav Sahu,
Pau Rodriguez,
Issam H. Laradji,
Parmida Atighehchian,
David Vazquez,
Dzmitry Bahdanau
Abstract:
Data augmentation is a widely employed technique to alleviate the problem of data scarcity. In this work, we propose a prompting-based approach to generate labelled training data for intent classification with off-the-shelf language models (LMs) such as GPT-3. An advantage of this method is that no task-specific LM-fine-tuning for data generation is required; hence the method requires no hyper-par…
▽ More
Data augmentation is a widely employed technique to alleviate the problem of data scarcity. In this work, we propose a prompting-based approach to generate labelled training data for intent classification with off-the-shelf language models (LMs) such as GPT-3. An advantage of this method is that no task-specific LM-fine-tuning for data generation is required; hence the method requires no hyper-parameter tuning and is applicable even when the available training data is very scarce. We evaluate the proposed method in a few-shot setting on four diverse intent classification tasks. We find that GPT-generated data significantly boosts the performance of intent classifiers when intents in consideration are sufficiently distinct from each other. In tasks with semantically close intents, we observe that the generated data is less helpful. Our analysis shows that this is because GPT often generates utterances that belong to a closely-related intent instead of the desired one. We present preliminary evidence that a prompting-based GPT classifier could be helpful in filtering the generated data to enhance its quality.
△ Less
Submitted 4 April, 2022;
originally announced April 2022.
-
Personalized multi-faceted trust modeling to determine trust links in social media and its potential for misinformation management
Authors:
Alexandre Parmentier,
Robin Cohen,
Xueguang Ma,
Gaurav Sahu,
Queenie Chen
Abstract:
In this paper, we present an approach for predicting trust links between peers in social media, one that is grounded in the artificial intelligence area of multiagent trust modeling. In particular, we propose a data-driven multi-faceted trust modeling which incorporates many distinct features for a comprehensive analysis. We focus on demonstrating how clustering of similar users enables a critical…
▽ More
In this paper, we present an approach for predicting trust links between peers in social media, one that is grounded in the artificial intelligence area of multiagent trust modeling. In particular, we propose a data-driven multi-faceted trust modeling which incorporates many distinct features for a comprehensive analysis. We focus on demonstrating how clustering of similar users enables a critical new functionality: supporting more personalized, and thus more accurate predictions for users. Illustrated in a trust-aware item recommendation task, we evaluate the proposed framework in the context of a large Yelp dataset. We then discuss how improving the detection of trusted relationships in social media can assist in supporting online users in their battle against the spread of misinformation and rumours, within a social networking environment which has recently exploded in popularity. We conclude with a reflection on a particularly vulnerable user base, older adults, in order to illustrate the value of reasoning about groups of users, looking to some future directions for integrating known preferences with insights gained through data analysis.
△ Less
Submitted 11 November, 2021;
originally announced November 2021.
-
LyricJam: A system for generating lyrics for live instrumental music
Authors:
Olga Vechtomova,
Gaurav Sahu,
Dhruv Kumar
Abstract:
We describe a real-time system that receives a live audio stream from a jam session and generates lyric lines that are congruent with the live music being played. Two novel approaches are proposed to align the learned latent spaces of audio and text representations that allow the system to generate novel lyric lines matching live instrumental music. One approach is based on adversarial alignment o…
▽ More
We describe a real-time system that receives a live audio stream from a jam session and generates lyric lines that are congruent with the live music being played. Two novel approaches are proposed to align the learned latent spaces of audio and text representations that allow the system to generate novel lyric lines matching live instrumental music. One approach is based on adversarial alignment of latent representations of audio and lyrics, while the other approach learns to transfer the topology from the music latent space to the lyric latent space. A user study with music artists using the system showed that the system was useful not only in lyric composition, but also encouraged the artists to improvise and find new musical expressions. Another user study demonstrated that users preferred the lines generated using the proposed methods to the lines generated by a baseline model.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
Towards A Multi-agent System for Online Hate Speech Detection
Authors:
Gaurav Sahu,
Robin Cohen,
Olga Vechtomova
Abstract:
This paper envisions a multi-agent system for detecting the presence of hate speech in online social media platforms such as Twitter and Facebook. We introduce a novel framework employing deep learning techniques to coordinate the channels of textual and im-age processing. Our experimental results aim to demonstrate the effectiveness of our methods for classifying online content, training the prop…
▽ More
This paper envisions a multi-agent system for detecting the presence of hate speech in online social media platforms such as Twitter and Facebook. We introduce a novel framework employing deep learning techniques to coordinate the channels of textual and im-age processing. Our experimental results aim to demonstrate the effectiveness of our methods for classifying online content, training the proposed neural network model to effectively detect hateful instances in the input. We conclude with a discussion of how our system may be of use to provide recommendations to users who are managing online social networks, showcasing the immense potential of intelligent multi-agent systems towards delivering social good.
△ Less
Submitted 3 May, 2021;
originally announced May 2021.
-
Generation of lyrics lines conditioned on music audio clips
Authors:
Olga Vechtomova,
Gaurav Sahu,
Dhruv Kumar
Abstract:
We present a system for generating novel lyrics lines conditioned on music audio. A bimodal neural network model learns to generate lines conditioned on any given short audio clip. The model consists of a spectrogram variational autoencoder (VAE) and a text VAE. Both automatic and human evaluations demonstrate effectiveness of our model in generating lines that have an emotional impact matching a…
▽ More
We present a system for generating novel lyrics lines conditioned on music audio. A bimodal neural network model learns to generate lines conditioned on any given short audio clip. The model consists of a spectrogram variational autoencoder (VAE) and a text VAE. Both automatic and human evaluations demonstrate effectiveness of our model in generating lines that have an emotional impact matching a given audio clip. The system is intended to serve as a creativity tool for songwriters.
△ Less
Submitted 29 September, 2020;
originally announced September 2020.
-
Adaptive Fusion Techniques for Multimodal Data
Authors:
Gaurav Sahu,
Olga Vechtomova
Abstract:
Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from different modalities effectively. Instead of defining a deterministic fusion operation, such as concatenation, for the network, we let the network decide "how" to co…
▽ More
Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from different modalities effectively. Instead of defining a deterministic fusion operation, such as concatenation, for the network, we let the network decide "how" to combine a given set of multimodal features more effectively. We propose two networks: 1) Auto-Fusion, which learns to compress information from different modalities while preserving the context, and 2) GAN-Fusion, which regularizes the learned latent space given context from complementing modalities. A quantitative evaluation on the tasks of multimodal machine translation and emotion recognition suggests that our lightweight, adaptive networks can better model context from other modalities than existing methods, many of which employ massive transformer-based networks.
△ Less
Submitted 26 January, 2021; v1 submitted 9 November, 2019;
originally announced November 2019.
-
Adversarial Learning on the Latent Space for Diverse Dialog Generation
Authors:
Kashif Khan,
Gaurav Sahu,
Vikash Balasubramanian,
Lili Mou,
Olga Vechtomova
Abstract:
Generating relevant responses in a dialog is challenging, and requires not only proper modeling of context in the conversation but also being able to generate fluent sentences during inference. In this paper, we propose a two-step framework based on generative adversarial nets for generating conditioned responses. Our model first learns a meaningful representation of sentences by autoencoding and…
▽ More
Generating relevant responses in a dialog is challenging, and requires not only proper modeling of context in the conversation but also being able to generate fluent sentences during inference. In this paper, we propose a two-step framework based on generative adversarial nets for generating conditioned responses. Our model first learns a meaningful representation of sentences by autoencoding and then learns to map an input query to the response representation, which is in turn decoded as a response sentence. Both quantitative and qualitative evaluations show that our model generates more fluent, relevant, and diverse responses than existing state-of-the-art methods.
△ Less
Submitted 3 November, 2020; v1 submitted 9 November, 2019;
originally announced November 2019.
-
Multimodal Speech Emotion Recognition and Ambiguity Resolution
Authors:
Gaurav Sahu
Abstract:
Identifying emotion from speech is a non-trivial task pertaining to the ambiguous definition of emotion itself. In this work, we adopt a feature-engineering based approach to tackle the task of speech emotion recognition. Formalizing our problem as a multi-class classification problem, we compare the performance of two categories of models. For both, we extract eight hand-crafted features from the…
▽ More
Identifying emotion from speech is a non-trivial task pertaining to the ambiguous definition of emotion itself. In this work, we adopt a feature-engineering based approach to tackle the task of speech emotion recognition. Formalizing our problem as a multi-class classification problem, we compare the performance of two categories of models. For both, we extract eight hand-crafted features from the audio signal. In the first approach, the extracted features are used to train six traditional machine learning classifiers, whereas the second approach is based on deep learning wherein a baseline feed-forward neural network and an LSTM-based classifier are trained over the same features. In order to resolve ambiguity in communication, we also include features from the text domain. We report accuracy, f-score, precision, and recall for the different experiment settings we evaluated our models in. Overall, we show that lighter machine learning based models trained over a few hand-crafted features are able to achieve performance comparable to the current deep learning based state-of-the-art method for emotion recognition.
△ Less
Submitted 11 April, 2019;
originally announced April 2019.
-
Non-Extensive Statistics in Free-Electron Metals and Thermal Effective Mass
Authors:
Arvind Khuntia,
Gayatri Sahu,
Raghunath Sahoo,
Durga P. Mahapatra,
Niranjan Barik
Abstract:
We have applied the non-extensive statistical mechanics to free electrons in several metals to calculate the electronic specific heat at low temperature. In this case, the Fermi-Dirac (FD) function is modified from its Boltzmann-Gibbs (BG) form, with the exponential part going to a $q$-exponential, in its non-extensive form. In most cases, the non-extensive parameter, $q$, is found to be greater t…
▽ More
We have applied the non-extensive statistical mechanics to free electrons in several metals to calculate the electronic specific heat at low temperature. In this case, the Fermi-Dirac (FD) function is modified from its Boltzmann-Gibbs (BG) form, with the exponential part going to a $q$-exponential, in its non-extensive form. In most cases, the non-extensive parameter, $q$, is found to be greater than unity to produce the correct thermal effective mass, $m^*$, of electrons. The ratio $m^*/m$ is found to show a nice systematic dependence on $q$. Results indicate, electrons in metals, in the presence of long range correlations are reasonably well described by Tsallis statistics.
△ Less
Submitted 8 April, 2019; v1 submitted 21 September, 2018;
originally announced September 2018.
-
Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit
Authors:
Amrith Krishna,
Bishal Santra,
Sasi Prasanth Bandaru,
Gaurav Sahu,
Vishnu Dutt Sharma,
Pavankumar Satuluri,
Pawan Goyal
Abstract:
The configurational information in sentences of a free word order language such as Sanskrit is of limited use. Thus, the context of the entire sentence will be desirable even for basic processing tasks such as word segmentation. We propose a structured prediction framework that jointly solves the word segmentation and morphological tagging tasks in Sanskrit. We build an energy based model where we…
▽ More
The configurational information in sentences of a free word order language such as Sanskrit is of limited use. Thus, the context of the entire sentence will be desirable even for basic processing tasks such as word segmentation. We propose a structured prediction framework that jointly solves the word segmentation and morphological tagging tasks in Sanskrit. We build an energy based model where we adopt approaches generally employed in graph based parsing techniques (McDonald et al., 2005a; Carreras, 2007). Our model outperforms the state of the art with an F-Score of 96.92 (percentage improvement of 7.06%) while using less than one-tenth of the task-specific training data. We find that the use of a graph based ap- proach instead of a traditional lattice-based sequential labelling approach leads to a percentage gain of 12.6% in F-Score for the segmentation task.
△ Less
Submitted 25 October, 2018; v1 submitted 5 September, 2018;
originally announced September 2018.
-
Effect of silicon resistivity on its porosification using metal induced chemical etching
Authors:
Shailendra K Saxena,
Gayatri Sahu,
P. K. Sahoo,
Pankaj R. Sagdeo,
Rajesh Kumar
Abstract:
A comparison of porous structures formed from silicon (Si) wafers with different resistivities has been reported here based on the morphological studies carried out using scanning electron microscope (SEM). The porous Si samples have been prepared using metal induced etching (MIE) technique from two different Si wafers having two different resistivities. It is observed that porous Si containing we…
▽ More
A comparison of porous structures formed from silicon (Si) wafers with different resistivities has been reported here based on the morphological studies carried out using scanning electron microscope (SEM). The porous Si samples have been prepared using metal induced etching (MIE) technique from two different Si wafers having two different resistivities. It is observed that porous Si containing well aligned Si nanowires are formed from high resistivity (1-20 $Ω$cm) Si wafer whereas interconnected pores or cheese like structures are formed from low resistivity (0.02 $Ω$cm ) Si wafers after MIE. An explanation for the different porosification processes has also been proposed based on the initial do** level where number of dopants seems to be playing an important role on the etching process. Visible photoluminescence have been observed from all the porous samples possibly due to quantum confinement effect.
△ Less
Submitted 19 August, 2014;
originally announced August 2014.
-
Comparison of porous silicon prepared using metal-induced etching (MIE) and laser-induced etching (LIE)
Authors:
Shailendra K. Saxena,
Vivek Kumar,
Hari M. Rai,
Gayatri Sahu,
Ravi K. Late,
Kapil Saxena,
A. K. Shukla,
Pankaj R. Sagdeo,
Rajesh Kumar
Abstract:
Porous silicon (p-Si), prepared by two routes (metal induced etching (MIE) and laser induced etching (LIE)) have been studied by comparing the observed surface morphologies using SEM. A uniformly distributed smaller (submicron sized) pores are formed when MIE technique is used because the pore formation is driven by uniformly distributed metal (silver in present case) nanoparticles, deposited prio…
▽ More
Porous silicon (p-Si), prepared by two routes (metal induced etching (MIE) and laser induced etching (LIE)) have been studied by comparing the observed surface morphologies using SEM. A uniformly distributed smaller (submicron sized) pores are formed when MIE technique is used because the pore formation is driven by uniformly distributed metal (silver in present case) nanoparticles, deposited prior to the porosification step. Whereas in p-Si, prepared by LIE technique, wider pores with some variation in pore size as compared to MIE technique is observed because a laser having gaussian profile of intensity is used for porosification. Uniformly distribute well-aligned Si nanowires are observed in samples prepared by MIE method as seen using cross-sectional SEM imaging. A single photoluminescence (PL) peak at 1.96 eV corresponding to red emission at room temperature is observed which reveals that the Si nanowires, present in p-Si prepared by MIE, show quantum confinement effect. The single PL peak confirms the presence of uniform sized nanowires in MIE samples. These vertically aligned Si nanowires can be used for field emission application.
△ Less
Submitted 25 March, 2014;
originally announced March 2014.
-
Evolution of Asymmetric Raman line-shape from nano-structures
Authors:
Rajesh Kumar,
Gayatri Sahu,
Shailendra K. Saxena,
Hari M. Rai,
Pankaj R. Sagdeo
Abstract:
A step-by-step evolution of an asymmetric Raman line-shape function from a Lorentzian line-shape is presented here for low dimensional semiconductors. The evolution reported here is based on the phonon confinement model which is successfully used in literature to explain the asymmetric Raman line-shape from semiconductor nano-structures. Physical significance of different terms in the theoretical…
▽ More
A step-by-step evolution of an asymmetric Raman line-shape function from a Lorentzian line-shape is presented here for low dimensional semiconductors. The evolution reported here is based on the phonon confinement model which is successfully used in literature to explain the asymmetric Raman line-shape from semiconductor nano-structures. Physical significance of different terms in the theoretical asymmetric Raman line-shape has been explained here. Better understanding of theoretical reasoning behind each term allows one to use the theoretical Raman line-shape without going into details of theory from first principle. This will enable one to empirically derive a theoretical Raman line-shape function for any material if information about its phonon dispersion, size dependence etc is known.
△ Less
Submitted 20 September, 2013;
originally announced September 2013.
-
Fabrication of silicon nanocrystals using sequential Au ion implantation
Authors:
Gayatri Sahu,
Rajesh Kumar,
D. P. Mahapatra
Abstract:
Silicon nanocrystals are produced using a two-stage gold ion implantation technique. First stage implantation using low energy ions leads to the formation of an amorphous Si (a-Si) layer. A subsequent high energy Au irradiation in the second stage is found to produce strained Si NCs. An annealing at a temperature as low as 500$^o$C is seen to result in strain free NCs showing quantum confinement e…
▽ More
Silicon nanocrystals are produced using a two-stage gold ion implantation technique. First stage implantation using low energy ions leads to the formation of an amorphous Si (a-Si) layer. A subsequent high energy Au irradiation in the second stage is found to produce strained Si NCs. An annealing at a temperature as low as 500$^o$C is seen to result in strain free NCs showing quantum confinement effects. Higher temperature annealing of the samples is found to result in growth in size from recrystallization of the a-Si matrix. Raman Scattering, X-ray diffraction (XRD) and Rutherford Backscattering spectrometry (RBS) have been used to study the effect of annealing on the samples and the size of Si NCs formed. The data could be well explained using a phonon confinement model with an extremely narrow size distribution. XRD results go in line with Raman analysis.
△ Less
Submitted 14 February, 2013;
originally announced February 2013.
-
Narrow band UV emission from direct band gap Si nanoclusters embedded in bulk Si
Authors:
G. Sahu,
H. P. Lenka,
D. P. Mahapatra,
Karol Grycginski,
A. K. Singh,
Jianyou Li,
B. Rout,
F. D. McDaniel,
Arup Neogi
Abstract:
This paper has been withdrawn kee** in view of publication elsewhere with some appropriate modifications.
This paper has been withdrawn kee** in view of publication elsewhere with some appropriate modifications.
△ Less
Submitted 24 December, 2009; v1 submitted 15 October, 2009;
originally announced October 2009.
-
The mechanism of ion induced amorphization in Si
Authors:
H. P. Lenka,
U. M. Bhatta,
P. K. Kuiri,
G. Sahu,
B. Joseph,
B. Satpati,
D. P. Mahapatra
Abstract:
Some results on damage build up in, and amorphization of, Si, induced by 25-30 keV Al$_5^-$, Si$_5^-$ and Cs$^-$ ions, at room temperature, are reported. We show that at low energy, amorphization is a nucleation and growth process, based on the direct impact mechanism. With an Avrami exponent $\sim 1.6$, the growth towards amorphization seems to be diffusion limited. A transition to a completely…
▽ More
Some results on damage build up in, and amorphization of, Si, induced by 25-30 keV Al$_5^-$, Si$_5^-$ and Cs$^-$ ions, at room temperature, are reported. We show that at low energy, amorphization is a nucleation and growth process, based on the direct impact mechanism. With an Avrami exponent $\sim 1.6$, the growth towards amorphization seems to be diffusion limited. A transition to a completely amorphized state is indicated at a dose exceeding 17 eV/atom, which is higher than 6-12 eV/atom as predicted by simulations. The observed higher threshold could be due to temperature effects although an underestimation of keV-energy recoils, in simulation, may not be ruled out.
△ Less
Submitted 16 February, 2009; v1 submitted 5 November, 2008;
originally announced November 2008.
-
Enhanced UV Light emission from Silicon nanoparticles induced by Au ion implantation
Authors:
Akhilesh Singh,
Karol G. Grycznski,
Bibhu Rout,
Jianyou Li,
Floyd McDaniel,
Arup Neogi,
Gayatri Sahu,
Durga P. Mahapatra
Abstract:
Study of light emitting silicon fabricated by ion implantation.
Study of light emitting silicon fabricated by ion implantation.
△ Less
Submitted 1 November, 2008;
originally announced November 2008.
-
Study of low energy Si$_5^-$ and Cs$^-$ implantation induced amorphization effects in Si(100)
Authors:
H. P. Lenka,
B. Joseph,
P. K. Kuiri,
G. Sahu,
P. Mishra,
D. Ghose,
D. P. Mahapatra
Abstract:
The damage growth and surface modifications in Si(100), induced by 25 keV Si$_5^-$ cluster ions, as a function of fluence, $φ$, has been studied using atomic force microscopy (AFM) and channeling Rutherford backscattering spectrometry (CRBS). CRBS results indicate a nonlinear growth in damage from which it has been possible to get a threshold fluence, $φ_0$, for amorphization as…
▽ More
The damage growth and surface modifications in Si(100), induced by 25 keV Si$_5^-$ cluster ions, as a function of fluence, $φ$, has been studied using atomic force microscopy (AFM) and channeling Rutherford backscattering spectrometry (CRBS). CRBS results indicate a nonlinear growth in damage from which it has been possible to get a threshold fluence, $φ_0$, for amorphization as $2.5\times 10^{13}$ ions-cm$^{-2}$. For $φ$ below $φ_0$, a growth in damage as well as surface roughness has been observed. At a $φ$ of $1\times 10^{14}$ ions-cm$^{-2}$, damage saturation coupled with a much reduced surface roughness has been found. In this case a power spectrum analysis of AFM data showed a significant drop, in spectral density, as compared to the same obtained for a fluence, $φ< φ_0$. This drop, together with damage saturation, can be correlated with a transition to a stress relaxed amorphous phase. Irradiation with similar mass Cs$^-$ ions, at the same energy and fluence, has been found to result in a reduced accumulation of defects in the near surface region leading to reduced surface features.
△ Less
Submitted 4 June, 2008; v1 submitted 1 May, 2008;
originally announced May 2008.
-
Observation of a Universal Aggregation Mechanism and a Possible Phase Transition in Au Sputtered by Swift Heavy Ions
Authors:
P. K. Kuiri,
B. Joseph,
H. P. Lenka,
G. Sahu,
J. Ghatak,
D. Kanjilal,
D. P. Mahapatra
Abstract:
Two exponents, $δ$, for size distribution of $n$-atom clusters, $Y(n)\sim n^{-δ}$, have been found in Au clusters sputtered from embedded Au nanoparticles under swift heavy ion irradiation. For small clusters, below 12.5 nm in size, $δ$ has been found to be 3/2, which can be rationalized as occurring from a steady state aggregation process with size independent aggregation. For larger clusters,…
▽ More
Two exponents, $δ$, for size distribution of $n$-atom clusters, $Y(n)\sim n^{-δ}$, have been found in Au clusters sputtered from embedded Au nanoparticles under swift heavy ion irradiation. For small clusters, below 12.5 nm in size, $δ$ has been found to be 3/2, which can be rationalized as occurring from a steady state aggregation process with size independent aggregation. For larger clusters, a $δ$ value of 7/2 is suggested, which might come from a dynamical transition to another steady state where aggregation and evaporation rates are size dependent. In the present case, the observed decay exponents do not support any possibility of a thermodynamic liquid-gas type phase transition taking place, resulting in cluster formation.
△ Less
Submitted 16 October, 2009; v1 submitted 17 February, 2008;
originally announced February 2008.