Search | arXiv e-print repository

LLM aided semi-supervision for Extractive Dialog Summarization

Authors: Nishant Mishra, Gaurav Sahu, Iacer Calixto, Ameen Abu-Hanna, Issam H. Laradji

Abstract: Generating high-quality summaries for chat dialogs often requires large labeled datasets. We propose a method to efficiently use unlabeled data for extractive summarization of customer-agent dialogs. In our method, we frame summarization as a question-answering problem and use state-of-the-art large language models (LLMs) to generate pseudo-labels for a dialog. We then use these pseudo-labels to f… ▽ More Generating high-quality summaries for chat dialogs often requires large labeled datasets. We propose a method to efficiently use unlabeled data for extractive summarization of customer-agent dialogs. In our method, we frame summarization as a question-answering problem and use state-of-the-art large language models (LLMs) to generate pseudo-labels for a dialog. We then use these pseudo-labels to fine-tune a chat summarization model, effectively transferring knowledge from the large LLM into a smaller specialized model. We demonstrate our method on the \tweetsumm dataset, and show that using 10% of the original labelled data set we can achieve 65.9/57.0/61.0 ROUGE-1/-2/-L, whereas the current state-of-the-art trained on the entire training data set obtains 65.16/55.81/64.37 ROUGE-1/-2/-L. In other words, in the worst case (i.e., ROUGE-L) we still effectively retain 94.7% of the performance while using only 10% of the data. △ Less

Submitted 23 November, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

Comments: to be published in EMNLP Findings

arXiv:2311.09559 [pdf, other]

Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization

Authors: Gaurav Sahu, Olga Vechtomova, Issam H. Laradji

Abstract: Semi-supervised learning (SSL) is a widely used technique in scenarios where labeled data is scarce and unlabeled data is abundant. While SSL is popular for image and text classification, it is relatively underexplored for the task of extractive text summarization. Standard SSL methods follow a teacher-student paradigm to first train a classification model and then use the classifier's confidence… ▽ More Semi-supervised learning (SSL) is a widely used technique in scenarios where labeled data is scarce and unlabeled data is abundant. While SSL is popular for image and text classification, it is relatively underexplored for the task of extractive text summarization. Standard SSL methods follow a teacher-student paradigm to first train a classification model and then use the classifier's confidence values to select pseudo-labels for the subsequent training cycle; however, such classifiers are not suitable to measure the accuracy of pseudo-labels as they lack specific tuning for evaluation, which leads to confidence values that fail to capture the semantics and correctness of the generated summary. To address this problem, we propose a prompt-based pseudo-labeling strategy with LLMs that picks unlabeled examples with more accurate pseudo-labels than using just the classifier's probability outputs. Our approach also includes a relabeling mechanism that improves the quality of pseudo-labels. We evaluate our method on three text summarization datasets: TweetSumm, WikiHow, and ArXiv/PubMed. We empirically show that a prompting-based LLM that scores and generates pseudo-labels outperforms existing SSL methods on ROUGE-1, ROUGE-2, and ROUGE-L scores on all the datasets. Furthermore, our method achieves competitive L-Eval scores (evaluation with LLaMa-3) as a fully supervised method in a data-scarce setting and outperforms fully supervised method in a data-abundant setting. △ Less

Submitted 1 July, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

Comments: 8 pages, 6 figures, 3 tables

arXiv:2310.14192 [pdf, other]

PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation

Authors: Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, Issam H. Laradji

Abstract: Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training data. Recent work often tackles this problem using large language models (LLMs) like GPT3 that can generate new examples given already available ones. In this work, we propose a method to generate more helpful augmented data by utilizing the LLM's abilities to follo… ▽ More Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training data. Recent work often tackles this problem using large language models (LLMs) like GPT3 that can generate new examples given already available ones. In this work, we propose a method to generate more helpful augmented data by utilizing the LLM's abilities to follow instructions and perform few-shot classifications. Our specific PromptMix method consists of two steps: 1) generate challenging text augmentations near class boundaries; however, generating borderline examples increases the risk of false positives in the dataset, so we 2) relabel the text augmentations using a prompting-based LLM classifier to enhance the correctness of labels in the generated data. We evaluate the proposed method in challenging 2-shot and zero-shot settings on four text classification datasets: Banking77, TREC6, Subjectivity (SUBJ), and Twitter Complaints. Our experiments show that generating and, crucially, relabeling borderline examples facilitates the transfer of knowledge of a massive LLM like GPT3.5-turbo into smaller and cheaper classifiers like DistilBERT$_{base}$ and BERT$_{base}$. Furthermore, 2-shot PromptMix outperforms multiple 5-shot data augmentation methods on the four datasets. Our code is available at https://github.com/ServiceNow/PromptMix-EMNLP-2023. △ Less

Submitted 22 October, 2023; originally announced October 2023.

Comments: Accepted to EMNLP 2023 (Long paper)

arXiv:2307.09312 [pdf, other]

Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

Authors: Liam Hebert, Gaurav Sahu, Yuxuan Guo, Nanda Kishore Sreenivas, Lukasz Golab, Robin Cohen

Abstract: We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the cont… ▽ More We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the contextual relationships in the discussion surrounding a comment and grounding the interwoven fusion layers that combine text and image embeddings instead of processing modalities separately. To evaluate our work, we present a new dataset, HatefulDiscussions, comprising complete multi-modal discussions from multiple online communities on Reddit. We compare the performance of our model to baselines that only process individual comments and conduct extensive ablation studies. △ Less

Submitted 22 February, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

Comments: Accepted to AAAI 2024 (AI for Social Impact Track)

arXiv:2212.09947 [pdf, other]

Future Sight: Dynamic Story Generation with Large Pretrained Language Models

Authors: Brian D. Zimmerman, Gaurav Sahu, Olga Vechtomova

Abstract: Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new text with respect to previously generated text. The output distribution of candidate tokens at any position is conditioned on previously selected tokens using a s… ▽ More Recent advances in deep learning research, such as transformers, have bolstered the ability for automated agents to generate creative texts similar to those that a human would write. By default, transformer decoders can only generate new text with respect to previously generated text. The output distribution of candidate tokens at any position is conditioned on previously selected tokens using a self-attention mechanism to emulate the property of autoregression. This is inherently limiting for tasks such as controllable story generation where it may be necessary to condition on future plot events when writing a story. In this work, we propose Future Sight, a method for finetuning a pretrained generative transformer on the task of future conditioning. Transformer decoders are typically pretrained on the task of completing a context, one token at a time, by means of self-attention. Future Sight additionally enables a decoder to attend to an encoded future plot event. This motivates the decoder to expand on the context in a way that logically concludes with the provided future. During inference, the future plot event can be written by a human author to steer the narrative being generated in a certain direction. We evaluate the efficacy of our approach on a story generation task with human evaluators. △ Less

Submitted 19 December, 2022; originally announced December 2022.

Comments: 9 pages, 1 figure, 4 tables

arXiv:2210.15638 [pdf, other]

LyricJam Sonic: A Generative System for Real-Time Composition and Musical Improvisation

Authors: Olga Vechtomova, Gaurav Sahu

Abstract: Electronic music artists and sound designers have unique workflow practices that necessitate specialized approaches for develo** music information retrieval and creativity support tools. Furthermore, electronic music instruments, such as modular synthesizers, have near-infinite possibilities for sound creation and can be combined to create unique and complex audio paths. The process of discoveri… ▽ More Electronic music artists and sound designers have unique workflow practices that necessitate specialized approaches for develo** music information retrieval and creativity support tools. Furthermore, electronic music instruments, such as modular synthesizers, have near-infinite possibilities for sound creation and can be combined to create unique and complex audio paths. The process of discovering interesting sounds is often serendipitous and impossible to replicate. For this reason, many musicians in electronic genres record audio output at all times while they work in the studio. Subsequently, it is difficult for artists to rediscover audio segments that might be suitable for use in their compositions from thousands of hours of recordings. In this paper, we describe LyricJam Sonic -- a novel creative tool for musicians to rediscover their previous recordings, re-contextualize them with other recordings, and create original live music compositions in real-time. A bi-modal AI-driven approach uses generated lyric lines to find matching audio clips from the artist's past studio recordings, and uses them to generate new lyric lines, which in turn are used to find other clips, thus creating a continuous and evolving stream of music and lyrics. The intent is to keep the artists in a state of creative flow conducive to music creation rather than taking them into an analytical/critical state of deliberately searching for past audio segments. The system can run in either a fully autonomous mode without user input, or in a live performance mode, where the artist plays live music, while the system "listens" and creates a continuous stream of music and lyrics in response. △ Less

Submitted 27 October, 2022; originally announced October 2022.

Comments: 15 pages, 9 figures, 2 tables

arXiv:2204.01959 [pdf, other]

Data Augmentation for Intent Classification with Off-the-shelf Large Language Models

Authors: Gaurav Sahu, Pau Rodriguez, Issam H. Laradji, Parmida Atighehchian, David Vazquez, Dzmitry Bahdanau

Abstract: Data augmentation is a widely employed technique to alleviate the problem of data scarcity. In this work, we propose a prompting-based approach to generate labelled training data for intent classification with off-the-shelf language models (LMs) such as GPT-3. An advantage of this method is that no task-specific LM-fine-tuning for data generation is required; hence the method requires no hyper-par… ▽ More Data augmentation is a widely employed technique to alleviate the problem of data scarcity. In this work, we propose a prompting-based approach to generate labelled training data for intent classification with off-the-shelf language models (LMs) such as GPT-3. An advantage of this method is that no task-specific LM-fine-tuning for data generation is required; hence the method requires no hyper-parameter tuning and is applicable even when the available training data is very scarce. We evaluate the proposed method in a few-shot setting on four diverse intent classification tasks. We find that GPT-generated data significantly boosts the performance of intent classifiers when intents in consideration are sufficiently distinct from each other. In tasks with semantically close intents, we observe that the generated data is less helpful. Our analysis shows that this is because GPT often generates utterances that belong to a closely-related intent instead of the desired one. We present preliminary evidence that a prompting-based GPT classifier could be helpful in filtering the generated data to enhance its quality. △ Less

Submitted 4 April, 2022; originally announced April 2022.

Comments: Accepted to 4th Workshop on NLP for Conversational AI, ACL 2022

arXiv:2111.06440 [pdf, other]

Personalized multi-faceted trust modeling to determine trust links in social media and its potential for misinformation management

Authors: Alexandre Parmentier, Robin Cohen, Xueguang Ma, Gaurav Sahu, Queenie Chen

Abstract: In this paper, we present an approach for predicting trust links between peers in social media, one that is grounded in the artificial intelligence area of multiagent trust modeling. In particular, we propose a data-driven multi-faceted trust modeling which incorporates many distinct features for a comprehensive analysis. We focus on demonstrating how clustering of similar users enables a critical… ▽ More In this paper, we present an approach for predicting trust links between peers in social media, one that is grounded in the artificial intelligence area of multiagent trust modeling. In particular, we propose a data-driven multi-faceted trust modeling which incorporates many distinct features for a comprehensive analysis. We focus on demonstrating how clustering of similar users enables a critical new functionality: supporting more personalized, and thus more accurate predictions for users. Illustrated in a trust-aware item recommendation task, we evaluate the proposed framework in the context of a large Yelp dataset. We then discuss how improving the detection of trusted relationships in social media can assist in supporting online users in their battle against the spread of misinformation and rumours, within a social networking environment which has recently exploded in popularity. We conclude with a reflection on a particularly vulnerable user base, older adults, in order to illustrate the value of reasoning about groups of users, looking to some future directions for integrating known preferences with insights gained through data analysis. △ Less

Submitted 11 November, 2021; originally announced November 2021.

Comments: 28 pages

arXiv:2106.01960 [pdf, other]

LyricJam: A system for generating lyrics for live instrumental music

Authors: Olga Vechtomova, Gaurav Sahu, Dhruv Kumar

Abstract: We describe a real-time system that receives a live audio stream from a jam session and generates lyric lines that are congruent with the live music being played. Two novel approaches are proposed to align the learned latent spaces of audio and text representations that allow the system to generate novel lyric lines matching live instrumental music. One approach is based on adversarial alignment o… ▽ More We describe a real-time system that receives a live audio stream from a jam session and generates lyric lines that are congruent with the live music being played. Two novel approaches are proposed to align the learned latent spaces of audio and text representations that allow the system to generate novel lyric lines matching live instrumental music. One approach is based on adversarial alignment of latent representations of audio and lyrics, while the other approach learns to transfer the topology from the music latent space to the lyric latent space. A user study with music artists using the system showed that the system was useful not only in lyric composition, but also encouraged the artists to improvise and find new musical expressions. Another user study demonstrated that users preferred the lines generated using the proposed methods to the lines generated by a baseline model. △ Less

Submitted 3 June, 2021; originally announced June 2021.

Comments: Accepted to International Conference on Computational Creativity (ICCC) 2021 [Oral]

arXiv:2105.01129 [pdf, other]

Towards A Multi-agent System for Online Hate Speech Detection

Authors: Gaurav Sahu, Robin Cohen, Olga Vechtomova

Abstract: This paper envisions a multi-agent system for detecting the presence of hate speech in online social media platforms such as Twitter and Facebook. We introduce a novel framework employing deep learning techniques to coordinate the channels of textual and im-age processing. Our experimental results aim to demonstrate the effectiveness of our methods for classifying online content, training the prop… ▽ More This paper envisions a multi-agent system for detecting the presence of hate speech in online social media platforms such as Twitter and Facebook. We introduce a novel framework employing deep learning techniques to coordinate the channels of textual and im-age processing. Our experimental results aim to demonstrate the effectiveness of our methods for classifying online content, training the proposed neural network model to effectively detect hateful instances in the input. We conclude with a discussion of how our system may be of use to provide recommendations to users who are managing online social networks, showcasing the immense potential of intelligent multi-agent systems towards delivering social good. △ Less

Submitted 3 May, 2021; originally announced May 2021.

Comments: Accepted to the 2nd International Workshop on Autonomous Agents for Social Good (AASG), AAMAS, 2021

arXiv:2009.14375 [pdf, other]

Generation of lyrics lines conditioned on music audio clips

Authors: Olga Vechtomova, Gaurav Sahu, Dhruv Kumar

Abstract: We present a system for generating novel lyrics lines conditioned on music audio. A bimodal neural network model learns to generate lines conditioned on any given short audio clip. The model consists of a spectrogram variational autoencoder (VAE) and a text VAE. Both automatic and human evaluations demonstrate effectiveness of our model in generating lines that have an emotional impact matching a… ▽ More We present a system for generating novel lyrics lines conditioned on music audio. A bimodal neural network model learns to generate lines conditioned on any given short audio clip. The model consists of a spectrogram variational autoencoder (VAE) and a text VAE. Both automatic and human evaluations demonstrate effectiveness of our model in generating lines that have an emotional impact matching a given audio clip. The system is intended to serve as a creativity tool for songwriters. △ Less

Submitted 29 September, 2020; originally announced September 2020.

Comments: Accepted to First Workshop on NLP for Music and Audio (NLP4MusA) at ISMIR 2020

arXiv:1911.03821 [pdf, other]

Adaptive Fusion Techniques for Multimodal Data

Authors: Gaurav Sahu, Olga Vechtomova

Abstract: Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from different modalities effectively. Instead of defining a deterministic fusion operation, such as concatenation, for the network, we let the network decide "how" to co… ▽ More Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from different modalities effectively. Instead of defining a deterministic fusion operation, such as concatenation, for the network, we let the network decide "how" to combine a given set of multimodal features more effectively. We propose two networks: 1) Auto-Fusion, which learns to compress information from different modalities while preserving the context, and 2) GAN-Fusion, which regularizes the learned latent space given context from complementing modalities. A quantitative evaluation on the tasks of multimodal machine translation and emotion recognition suggests that our lightweight, adaptive networks can better model context from other modalities than existing methods, many of which employ massive transformer-based networks. △ Less

Submitted 26 January, 2021; v1 submitted 9 November, 2019; originally announced November 2019.

Comments: Camera-ready version for EACL 2021

arXiv:1911.03817 [pdf, other]

Adversarial Learning on the Latent Space for Diverse Dialog Generation

Authors: Kashif Khan, Gaurav Sahu, Vikash Balasubramanian, Lili Mou, Olga Vechtomova

Abstract: Generating relevant responses in a dialog is challenging, and requires not only proper modeling of context in the conversation but also being able to generate fluent sentences during inference. In this paper, we propose a two-step framework based on generative adversarial nets for generating conditioned responses. Our model first learns a meaningful representation of sentences by autoencoding and… ▽ More Generating relevant responses in a dialog is challenging, and requires not only proper modeling of context in the conversation but also being able to generate fluent sentences during inference. In this paper, we propose a two-step framework based on generative adversarial nets for generating conditioned responses. Our model first learns a meaningful representation of sentences by autoencoding and then learns to map an input query to the response representation, which is in turn decoded as a response sentence. Both quantitative and qualitative evaluations show that our model generates more fluent, relevant, and diverse responses than existing state-of-the-art methods. △ Less

Submitted 3 November, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

Comments: Accepted to COLING 2020

arXiv:1904.06022 [pdf, other]

Multimodal Speech Emotion Recognition and Ambiguity Resolution

Authors: Gaurav Sahu

Abstract: Identifying emotion from speech is a non-trivial task pertaining to the ambiguous definition of emotion itself. In this work, we adopt a feature-engineering based approach to tackle the task of speech emotion recognition. Formalizing our problem as a multi-class classification problem, we compare the performance of two categories of models. For both, we extract eight hand-crafted features from the… ▽ More Identifying emotion from speech is a non-trivial task pertaining to the ambiguous definition of emotion itself. In this work, we adopt a feature-engineering based approach to tackle the task of speech emotion recognition. Formalizing our problem as a multi-class classification problem, we compare the performance of two categories of models. For both, we extract eight hand-crafted features from the audio signal. In the first approach, the extracted features are used to train six traditional machine learning classifiers, whereas the second approach is based on deep learning wherein a baseline feed-forward neural network and an LSTM-based classifier are trained over the same features. In order to resolve ambiguity in communication, we also include features from the text domain. We report accuracy, f-score, precision, and recall for the different experiment settings we evaluated our models in. Overall, we show that lighter machine learning based models trained over a few hand-crafted features are able to achieve performance comparable to the current deep learning based state-of-the-art method for emotion recognition. △ Less

Submitted 11 April, 2019; originally announced April 2019.

Comments: 9 pages

arXiv:1809.08096 [pdf, other]

doi 10.1016/j.physa.2019.02.043

Non-Extensive Statistics in Free-Electron Metals and Thermal Effective Mass

Authors: Arvind Khuntia, Gayatri Sahu, Raghunath Sahoo, Durga P. Mahapatra, Niranjan Barik

Abstract: We have applied the non-extensive statistical mechanics to free electrons in several metals to calculate the electronic specific heat at low temperature. In this case, the Fermi-Dirac (FD) function is modified from its Boltzmann-Gibbs (BG) form, with the exponential part going to a $q$-exponential, in its non-extensive form. In most cases, the non-extensive parameter, $q$, is found to be greater t… ▽ More We have applied the non-extensive statistical mechanics to free electrons in several metals to calculate the electronic specific heat at low temperature. In this case, the Fermi-Dirac (FD) function is modified from its Boltzmann-Gibbs (BG) form, with the exponential part going to a $q$-exponential, in its non-extensive form. In most cases, the non-extensive parameter, $q$, is found to be greater than unity to produce the correct thermal effective mass, $m^*$, of electrons. The ratio $m^*/m$ is found to show a nice systematic dependence on $q$. Results indicate, electrons in metals, in the presence of long range correlations are reasonably well described by Tsallis statistics. △ Less

Submitted 8 April, 2019; v1 submitted 21 September, 2018; originally announced September 2018.

Comments: Final Published version

Journal ref: Physica A 523 (2019) 852

arXiv:1809.01446 [pdf, other]

Free as in Free Word Order: An Energy Based Model for Word Segmentation and Morphological Tagging in Sanskrit

Authors: Amrith Krishna, Bishal Santra, Sasi Prasanth Bandaru, Gaurav Sahu, Vishnu Dutt Sharma, Pavankumar Satuluri, Pawan Goyal

Abstract: The configurational information in sentences of a free word order language such as Sanskrit is of limited use. Thus, the context of the entire sentence will be desirable even for basic processing tasks such as word segmentation. We propose a structured prediction framework that jointly solves the word segmentation and morphological tagging tasks in Sanskrit. We build an energy based model where we… ▽ More The configurational information in sentences of a free word order language such as Sanskrit is of limited use. Thus, the context of the entire sentence will be desirable even for basic processing tasks such as word segmentation. We propose a structured prediction framework that jointly solves the word segmentation and morphological tagging tasks in Sanskrit. We build an energy based model where we adopt approaches generally employed in graph based parsing techniques (McDonald et al., 2005a; Carreras, 2007). Our model outperforms the state of the art with an F-Score of 96.92 (percentage improvement of 7.06%) while using less than one-tenth of the task-specific training data. We find that the use of a graph based ap- proach instead of a traditional lattice-based sequential labelling approach leads to a percentage gain of 12.6% in F-Score for the segmentation task. △ Less

Submitted 25 October, 2018; v1 submitted 5 September, 2018; originally announced September 2018.

Comments: version 2: Corrected typo in Table1, page7 | Accepted in EMNLP 2018. Supplementary material can be found at - http://cse.iitkgp.ac.in/~amrithk/1080_supp.pdf

arXiv:1408.4314 [pdf, other]

doi 10.1088/2053-1591/2/3/036501

Effect of silicon resistivity on its porosification using metal induced chemical etching

Authors: Shailendra K Saxena, Gayatri Sahu, P. K. Sahoo, Pankaj R. Sagdeo, Rajesh Kumar

Abstract: A comparison of porous structures formed from silicon (Si) wafers with different resistivities has been reported here based on the morphological studies carried out using scanning electron microscope (SEM). The porous Si samples have been prepared using metal induced etching (MIE) technique from two different Si wafers having two different resistivities. It is observed that porous Si containing we… ▽ More A comparison of porous structures formed from silicon (Si) wafers with different resistivities has been reported here based on the morphological studies carried out using scanning electron microscope (SEM). The porous Si samples have been prepared using metal induced etching (MIE) technique from two different Si wafers having two different resistivities. It is observed that porous Si containing well aligned Si nanowires are formed from high resistivity (1-20 $Ω$cm) Si wafer whereas interconnected pores or cheese like structures are formed from low resistivity (0.02 $Ω$cm ) Si wafers after MIE. An explanation for the different porosification processes has also been proposed based on the initial do** level where number of dopants seems to be playing an important role on the etching process. Visible photoluminescence have been observed from all the porous samples possibly due to quantum confinement effect. △ Less

Submitted 19 August, 2014; originally announced August 2014.

Comments: 9 Pages, 5 Figures

Journal ref: Material Research Express, Vol. 2, 036501, 2015

arXiv:1403.6269 [pdf, other]

Comparison of porous silicon prepared using metal-induced etching (MIE) and laser-induced etching (LIE)

Authors: Shailendra K. Saxena, Vivek Kumar, Hari M. Rai, Gayatri Sahu, Ravi K. Late, Kapil Saxena, A. K. Shukla, Pankaj R. Sagdeo, Rajesh Kumar

Abstract: Porous silicon (p-Si), prepared by two routes (metal induced etching (MIE) and laser induced etching (LIE)) have been studied by comparing the observed surface morphologies using SEM. A uniformly distributed smaller (submicron sized) pores are formed when MIE technique is used because the pore formation is driven by uniformly distributed metal (silver in present case) nanoparticles, deposited prio… ▽ More Porous silicon (p-Si), prepared by two routes (metal induced etching (MIE) and laser induced etching (LIE)) have been studied by comparing the observed surface morphologies using SEM. A uniformly distributed smaller (submicron sized) pores are formed when MIE technique is used because the pore formation is driven by uniformly distributed metal (silver in present case) nanoparticles, deposited prior to the porosification step. Whereas in p-Si, prepared by LIE technique, wider pores with some variation in pore size as compared to MIE technique is observed because a laser having gaussian profile of intensity is used for porosification. Uniformly distribute well-aligned Si nanowires are observed in samples prepared by MIE method as seen using cross-sectional SEM imaging. A single photoluminescence (PL) peak at 1.96 eV corresponding to red emission at room temperature is observed which reveals that the Si nanowires, present in p-Si prepared by MIE, show quantum confinement effect. The single PL peak confirms the presence of uniform sized nanowires in MIE samples. These vertically aligned Si nanowires can be used for field emission application. △ Less

Submitted 25 March, 2014; originally announced March 2014.

arXiv:1309.5180 [pdf, other]

doi 10.1007/s12633-013-9176-9

Evolution of Asymmetric Raman line-shape from nano-structures

Authors: Rajesh Kumar, Gayatri Sahu, Shailendra K. Saxena, Hari M. Rai, Pankaj R. Sagdeo

Abstract: A step-by-step evolution of an asymmetric Raman line-shape function from a Lorentzian line-shape is presented here for low dimensional semiconductors. The evolution reported here is based on the phonon confinement model which is successfully used in literature to explain the asymmetric Raman line-shape from semiconductor nano-structures. Physical significance of different terms in the theoretical… ▽ More A step-by-step evolution of an asymmetric Raman line-shape function from a Lorentzian line-shape is presented here for low dimensional semiconductors. The evolution reported here is based on the phonon confinement model which is successfully used in literature to explain the asymmetric Raman line-shape from semiconductor nano-structures. Physical significance of different terms in the theoretical asymmetric Raman line-shape has been explained here. Better understanding of theoretical reasoning behind each term allows one to use the theoretical Raman line-shape without going into details of theory from first principle. This will enable one to empirically derive a theoretical Raman line-shape function for any material if information about its phonon dispersion, size dependence etc is known. △ Less

Submitted 20 September, 2013; originally announced September 2013.

Journal ref: Silicon, Vol. 6, Page 117, Year 2014

arXiv:1302.3402 [pdf, other]

doi 10.1007/s12633-013-9157-z

Fabrication of silicon nanocrystals using sequential Au ion implantation

Authors: Gayatri Sahu, Rajesh Kumar, D. P. Mahapatra

Abstract: Silicon nanocrystals are produced using a two-stage gold ion implantation technique. First stage implantation using low energy ions leads to the formation of an amorphous Si (a-Si) layer. A subsequent high energy Au irradiation in the second stage is found to produce strained Si NCs. An annealing at a temperature as low as 500$^o$C is seen to result in strain free NCs showing quantum confinement e… ▽ More Silicon nanocrystals are produced using a two-stage gold ion implantation technique. First stage implantation using low energy ions leads to the formation of an amorphous Si (a-Si) layer. A subsequent high energy Au irradiation in the second stage is found to produce strained Si NCs. An annealing at a temperature as low as 500$^o$C is seen to result in strain free NCs showing quantum confinement effects. Higher temperature annealing of the samples is found to result in growth in size from recrystallization of the a-Si matrix. Raman Scattering, X-ray diffraction (XRD) and Rutherford Backscattering spectrometry (RBS) have been used to study the effect of annealing on the samples and the size of Si NCs formed. The data could be well explained using a phonon confinement model with an extremely narrow size distribution. XRD results go in line with Raman analysis. △ Less

Submitted 14 February, 2013; originally announced February 2013.

Comments: PDFLATEX, 10 Pages, 6 Figures

Journal ref: Silicon, Vol. 6, Page 65-71, Year 2014

arXiv:0910.2839

doi 10.1088/0953-8984/22/7/072203

Narrow band UV emission from direct band gap Si nanoclusters embedded in bulk Si

Authors: G. Sahu, H. P. Lenka, D. P. Mahapatra, Karol Grycginski, A. K. Singh, Jianyou Li, B. Rout, F. D. McDaniel, Arup Neogi

Abstract: This paper has been withdrawn kee** in view of publication elsewhere with some appropriate modifications. This paper has been withdrawn kee** in view of publication elsewhere with some appropriate modifications. △ Less

Submitted 24 December, 2009; v1 submitted 15 October, 2009; originally announced October 2009.

Comments: This paper has been withdrawn

arXiv:0811.0806 [pdf, ps, other]

The mechanism of ion induced amorphization in Si

Authors: H. P. Lenka, U. M. Bhatta, P. K. Kuiri, G. Sahu, B. Joseph, B. Satpati, D. P. Mahapatra

Abstract: Some results on damage build up in, and amorphization of, Si, induced by 25-30 keV Al$_5^-$, Si$_5^-$ and Cs$^-$ ions, at room temperature, are reported. We show that at low energy, amorphization is a nucleation and growth process, based on the direct impact mechanism. With an Avrami exponent $\sim 1.6$, the growth towards amorphization seems to be diffusion limited. A transition to a completely… ▽ More Some results on damage build up in, and amorphization of, Si, induced by 25-30 keV Al$_5^-$, Si$_5^-$ and Cs$^-$ ions, at room temperature, are reported. We show that at low energy, amorphization is a nucleation and growth process, based on the direct impact mechanism. With an Avrami exponent $\sim 1.6$, the growth towards amorphization seems to be diffusion limited. A transition to a completely amorphized state is indicated at a dose exceeding 17 eV/atom, which is higher than 6-12 eV/atom as predicted by simulations. The observed higher threshold could be due to temperature effects although an underestimation of keV-energy recoils, in simulation, may not be ruled out. △ Less

Submitted 16 February, 2009; v1 submitted 5 November, 2008; originally announced November 2008.

Comments: 4 pages, 5 figures

arXiv:0811.0122 [pdf, other]

Enhanced UV Light emission from Silicon nanoparticles induced by Au ion implantation

Authors: Akhilesh Singh, Karol G. Grycznski, Bibhu Rout, Jianyou Li, Floyd McDaniel, Arup Neogi, Gayatri Sahu, Durga P. Mahapatra

Abstract: Study of light emitting silicon fabricated by ion implantation. Study of light emitting silicon fabricated by ion implantation. △ Less

Submitted 1 November, 2008; originally announced November 2008.

Comments: G-COE Conference - Kyoto 2008 abstract

arXiv:0805.0066 [pdf, ps, other]

doi 10.1088/0022-3727/41/21/215305

Study of low energy Si$_5^-$ and Cs$^-$ implantation induced amorphization effects in Si(100)

Authors: H. P. Lenka, B. Joseph, P. K. Kuiri, G. Sahu, P. Mishra, D. Ghose, D. P. Mahapatra

Abstract: The damage growth and surface modifications in Si(100), induced by 25 keV Si$_5^-$ cluster ions, as a function of fluence, $φ$, has been studied using atomic force microscopy (AFM) and channeling Rutherford backscattering spectrometry (CRBS). CRBS results indicate a nonlinear growth in damage from which it has been possible to get a threshold fluence, $φ_0$, for amorphization as… ▽ More The damage growth and surface modifications in Si(100), induced by 25 keV Si$_5^-$ cluster ions, as a function of fluence, $φ$, has been studied using atomic force microscopy (AFM) and channeling Rutherford backscattering spectrometry (CRBS). CRBS results indicate a nonlinear growth in damage from which it has been possible to get a threshold fluence, $φ_0$, for amorphization as $2.5\times 10^{13}$ ions-cm$^{-2}$. For $φ$ below $φ_0$, a growth in damage as well as surface roughness has been observed. At a $φ$ of $1\times 10^{14}$ ions-cm$^{-2}$, damage saturation coupled with a much reduced surface roughness has been found. In this case a power spectrum analysis of AFM data showed a significant drop, in spectral density, as compared to the same obtained for a fluence, $φ< φ_0$. This drop, together with damage saturation, can be correlated with a transition to a stress relaxed amorphous phase. Irradiation with similar mass Cs$^-$ ions, at the same energy and fluence, has been found to result in a reduced accumulation of defects in the near surface region leading to reduced surface features. △ Less

Submitted 4 June, 2008; v1 submitted 1 May, 2008; originally announced May 2008.

Comments: 7 pages, 4 figures

arXiv:0802.2494 [pdf, ps, other]

doi 10.1103/PhysRevLett.100.245501

Observation of a Universal Aggregation Mechanism and a Possible Phase Transition in Au Sputtered by Swift Heavy Ions

Authors: P. K. Kuiri, B. Joseph, H. P. Lenka, G. Sahu, J. Ghatak, D. Kanjilal, D. P. Mahapatra

Abstract: Two exponents, $δ$, for size distribution of $n$-atom clusters, $Y(n)\sim n^{-δ}$, have been found in Au clusters sputtered from embedded Au nanoparticles under swift heavy ion irradiation. For small clusters, below 12.5 nm in size, $δ$ has been found to be 3/2, which can be rationalized as occurring from a steady state aggregation process with size independent aggregation. For larger clusters,… ▽ More Two exponents, $δ$, for size distribution of $n$-atom clusters, $Y(n)\sim n^{-δ}$, have been found in Au clusters sputtered from embedded Au nanoparticles under swift heavy ion irradiation. For small clusters, below 12.5 nm in size, $δ$ has been found to be 3/2, which can be rationalized as occurring from a steady state aggregation process with size independent aggregation. For larger clusters, a $δ$ value of 7/2 is suggested, which might come from a dynamical transition to another steady state where aggregation and evaporation rates are size dependent. In the present case, the observed decay exponents do not support any possibility of a thermodynamic liquid-gas type phase transition taking place, resulting in cluster formation. △ Less

Submitted 16 October, 2009; v1 submitted 17 February, 2008; originally announced February 2008.

Comments: 4 pages, 3 figures

Journal ref: Phys Rev Lett 100, 245501 (2008)

Showing 1–25 of 25 results for author: Sahu, G