-
Debiasing Machine Unlearning with Counterfactual Examples
Authors:
Ziheng Chen,
Jia Wang,
Jun Zhuang,
Abbavaram Gowtham Reddy,
Fabrizio Silvestri,
** Huang,
Kaushiki Nag,
Kun Kuang,
Xin Ning,
Gabriele Tolomei
Abstract:
The right to be forgotten (RTBF) seeks to safeguard individuals from the enduring effects of their historical actions by implementing machine-learning techniques. These techniques facilitate the deletion of previously acquired knowledge without requiring extensive model retraining. However, they often overlook a critical issue: unlearning processes bias. This bias emerges from two main sources: (1…
▽ More
The right to be forgotten (RTBF) seeks to safeguard individuals from the enduring effects of their historical actions by implementing machine-learning techniques. These techniques facilitate the deletion of previously acquired knowledge without requiring extensive model retraining. However, they often overlook a critical issue: unlearning processes bias. This bias emerges from two main sources: (1) data-level bias, characterized by uneven data removal, and (2) algorithm-level bias, which leads to the contamination of the remaining dataset, thereby degrading model accuracy. In this work, we analyze the causal factors behind the unlearning process and mitigate biases at both data and algorithmic levels. Typically, we introduce an intervention-based approach, where knowledge to forget is erased with a debiased dataset. Besides, we guide the forgetting procedure by leveraging counterfactual examples, as they maintain semantic data consistency without hurting performance on the remaining dataset. Experimental results demonstrate that our method outperforms existing machine unlearning baselines on evaluation metrics.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding
Authors:
Zezhong Fan,
Xiaohan Li,
Chenhao Fang,
Topojoy Biswas,
Kaushiki Nag,
Jianpeng Xu,
Kannan Achan
Abstract:
The rapid evolution of text-to-image diffusion models has opened the door of generative AI, enabling the translation of textual descriptions into visually compelling images with remarkable quality. However, a persistent challenge within this domain is the optimization of prompts to effectively convey abstract concepts into concrete objects. For example, text encoders can hardly express "peace", wh…
▽ More
The rapid evolution of text-to-image diffusion models has opened the door of generative AI, enabling the translation of textual descriptions into visually compelling images with remarkable quality. However, a persistent challenge within this domain is the optimization of prompts to effectively convey abstract concepts into concrete objects. For example, text encoders can hardly express "peace", while can easily illustrate olive branches and white doves. This paper introduces a novel approach named Prompt Optimizer for Abstract Concepts (POAC) specifically designed to enhance the performance of text-to-image diffusion models in interpreting and generating images from abstract concepts. We propose a Prompt Language Model (PLM), which is initialized from a pre-trained language model, and then fine-tuned with a curated dataset of abstract concept prompts. The dataset is created with GPT-4 to extend the abstract concept to a scene and concrete objects. Our framework employs a Reinforcement Learning (RL)-based optimization strategy, focusing on the alignment between the generated images by a stable diffusion model and optimized prompts. Through extensive experiments, we demonstrate that our proposed POAC significantly improves the accuracy and aesthetic quality of generated images, particularly in the description of abstract concepts and alignment with optimized prompts. We also present a comprehensive analysis of our model's performance across diffusion models under different settings, showcasing its versatility and effectiveness in enhancing abstract concept representation.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Emotional Intelligence Through Artificial Intelligence : NLP and Deep Learning in the Analysis of Healthcare Texts
Authors:
Prashant Kumar Nag,
Amit Bhagat,
R. Vishnu Priya,
Deepak kumar Khare
Abstract:
This manuscript presents a methodical examination of the utilization of Artificial Intelligence in the assessment of emotions in texts related to healthcare, with a particular focus on the incorporation of Natural Language Processing and deep learning technologies. We scrutinize numerous research studies that employ AI to augment sentiment analysis, categorize emotions, and forecast patient outcom…
▽ More
This manuscript presents a methodical examination of the utilization of Artificial Intelligence in the assessment of emotions in texts related to healthcare, with a particular focus on the incorporation of Natural Language Processing and deep learning technologies. We scrutinize numerous research studies that employ AI to augment sentiment analysis, categorize emotions, and forecast patient outcomes based on textual information derived from clinical narratives, patient feedback on medications, and online health discussions. The review demonstrates noteworthy progress in the precision of algorithms used for sentiment classification, the prognostic capabilities of AI models for neurodegenerative diseases, and the creation of AI-powered systems that offer support in clinical decision-making. Remarkably, the utilization of AI applications has exhibited an enhancement in personalized therapy plans by integrating patient sentiment and contributing to the early identification of mental health disorders. There persist challenges, which encompass ensuring the ethical application of AI, safeguarding patient confidentiality, and addressing potential biases in algorithmic procedures. Nevertheless, the potential of AI to revolutionize healthcare practices is unmistakable, offering a future where healthcare is not only more knowledgeable and efficient but also more empathetic and centered around the needs of patients. This investigation underscores the transformative influence of AI on healthcare, delivering a comprehensive comprehension of its role in examining emotional content in healthcare texts and highlighting the trajectory towards a more compassionate approach to patient care. The findings advocate for a harmonious synergy between AI's analytical capabilities and the human aspects of healthcare.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners
Authors:
Shanu Vashishtha,
Abhinav Prakash,
Lalitesh Morishetti,
Kaushiki Nag,
Yokila Arora,
Sushant Kumar,
Kannan Achan
Abstract:
Text-to-image models such as stable diffusion have opened a plethora of opportunities for generating art. Recent literature has surveyed the use of text-to-image models for enhancing the work of many creative artists. Many e-commerce platforms employ a manual process to generate the banners, which is time-consuming and has limitations of scalability. In this work, we demonstrate the use of text-to…
▽ More
Text-to-image models such as stable diffusion have opened a plethora of opportunities for generating art. Recent literature has surveyed the use of text-to-image models for enhancing the work of many creative artists. Many e-commerce platforms employ a manual process to generate the banners, which is time-consuming and has limitations of scalability. In this work, we demonstrate the use of text-to-image models for generating personalized web banners with dynamic content for online shoppers based on their interactions. The novelty in this approach lies in converting users' interaction data to meaningful prompts without human intervention. To this end, we utilize a large language model (LLM) to systematically extract a tuple of attributes from item meta-information. The attributes are then passed to a text-to-image model via prompt engineering to generate images for the banner. Our results show that the proposed approach can create high-quality personalized banners for users.
△ Less
Submitted 28 February, 2024;
originally announced March 2024.
-
LLM-Ensemble: Optimal Large Language Model Ensemble Method for E-commerce Product Attribute Value Extraction
Authors:
Chenhao Fang,
Xiaohan Li,
Zezhong Fan,
Jianpeng Xu,
Kaushiki Nag,
Evren Korpeoglu,
Sushant Kumar,
Kannan Achan
Abstract:
Product attribute value extraction is a pivotal component in Natural Language Processing (NLP) and the contemporary e-commerce industry. The provision of precise product attribute values is fundamental in ensuring high-quality recommendations and enhancing customer satisfaction. The recently emerging Large Language Models (LLMs) have demonstrated state-of-the-art performance in numerous attribute…
▽ More
Product attribute value extraction is a pivotal component in Natural Language Processing (NLP) and the contemporary e-commerce industry. The provision of precise product attribute values is fundamental in ensuring high-quality recommendations and enhancing customer satisfaction. The recently emerging Large Language Models (LLMs) have demonstrated state-of-the-art performance in numerous attribute extraction tasks, without the need for domain-specific training data. Nevertheless, varying strengths and weaknesses are exhibited by different LLMs due to the diversity in data, architectures, and hyperparameters. This variation makes them complementary to each other, with no single LLM dominating all others. Considering the diverse strengths and weaknesses of LLMs, it becomes necessary to develop an ensemble method that leverages their complementary potentials. In this paper, we propose a novel algorithm called LLM-ensemble to ensemble different LLMs' outputs for attribute value extraction. We iteratively learn the weights for different LLMs to aggregate the labels with weights to predict the final attribute value. Not only can our proposed method be proven theoretically optimal, but it also ensures efficient computation, fast convergence, and safe deployment. We have also conducted extensive experiments with various state-of-the-art LLMs, including Llama2-13B, Llama2-70B, PaLM-2, GPT-3.5, and GPT-4, on Walmart's internal data. Our offline metrics demonstrate that the LLM-ensemble method outperforms all the state-of-the-art single LLMs on Walmart's internal dataset. This method has been launched in several production models, leading to improved Gross Merchandise Volume (GMV), Click-Through Rate (CTR), Conversion Rate (CVR), and Add-to-Cart Rate (ATC).
△ Less
Submitted 20 June, 2024; v1 submitted 29 February, 2024;
originally announced March 2024.
-
Superconductivity Mediated by Nematic Fluctuations in Tetragonal $\textrm{Fe}\textrm{Se}_{1-x}\textrm{S}_{x}$
Authors:
Pranab Kumar Nag,
Kirsty Scott,
Vanuildo S. de Carvalho,
Journey K. Byland,
Xinze Yang,
Morgan Walker,
Aaron G. Greenberg,
Peter Klavins,
Eduardo Miranda,
Adrian Gozar,
Valentin Taufour,
Rafael M. Fernandes,
Eduardo H. da Silva Neto
Abstract:
Nematic phases, where electrons in a solid spontaneously break rotational symmetry while preserving the translational symmetry, exist in several families of unconventional superconductors [1, 2]. Although superconductivity mediated by nematic fluctuations is well established theoretically [3-7], it has yet to be unambiguously identified experimentally [8, 9]. A major challenge is that nematicity i…
▽ More
Nematic phases, where electrons in a solid spontaneously break rotational symmetry while preserving the translational symmetry, exist in several families of unconventional superconductors [1, 2]. Although superconductivity mediated by nematic fluctuations is well established theoretically [3-7], it has yet to be unambiguously identified experimentally [8, 9]. A major challenge is that nematicity is often intertwined with other degrees of freedom, such as magnetism and charge order. The FeSe$_{1-x}$S$_x$ family of iron based superconductors provides a unique opportunity to explore this concept, as it features an isolated nematic phase that can be suppressed by sulfur substitution at a quantum critical point (QCP) near $x_c = 0.17$, where nematic fluctuations are the largest [10-12]. Here, we performed scanning tunneling spectroscopy measurements to visualize Boguliubov quasiparticle interference patterns, from which we determined the momentum structure of the superconducting gap near the Brillouin zone $Γ$ point of FeSe$_{0.81}$S$_{0.19}$. The results reveal an anisotropic, near nodal gap with minima that are $45^\circ$ rotated with respect to the Fe-Fe direction, characteristic of a nematic pairing interaction, contrary to the usual isotropic gaps due to spin mediated pairing in other tetragonal Fe-based superconductors. The results are also in contrast with pristine FeSe, where the pairing is mediated by spin fluctuations and the gap minima are aligned with the Fe-Fe direction. Therefore, the measured gap structure demonstrates not only a fundamental change of the pairing mechanism across the phase diagram of FeSe$_{1-x}$S$_x$, but it also indicates the existence of superconductivity mediated by nematic fluctuations in FeSe$_{0.81}$S$_{0.19}$.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Seller-side Outcome Fairness in Online Marketplaces
Authors:
Zikun Ye,
Reza Yousefi Maragheh,
Lalitesh Morishetti,
Shanu Vashishtha,
Jason Cho,
Kaushiki Nag,
Sushant Kumar,
Kannan Achan
Abstract:
This paper aims to investigate and achieve seller-side fairness within online marketplaces, where many sellers and their items are not sufficiently exposed to customers in an e-commerce platform. This phenomenon raises concerns regarding the potential loss of revenue associated with less exposed items as well as less marketplace diversity. We introduce the notion of seller-side outcome fairness an…
▽ More
This paper aims to investigate and achieve seller-side fairness within online marketplaces, where many sellers and their items are not sufficiently exposed to customers in an e-commerce platform. This phenomenon raises concerns regarding the potential loss of revenue associated with less exposed items as well as less marketplace diversity. We introduce the notion of seller-side outcome fairness and build an optimization model to balance collected recommendation rewards and the fairness metric. We then propose a gradient-based data-driven algorithm based on the duality and bandit theory. Our numerical experiments on real e-commerce data sets show that our algorithm can lift seller fairness measures while not hurting metrics like collected Gross Merchandise Value (GMV) and total purchases.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs
Authors:
Jiao Chen,
Luyi Ma,
Xiaohan Li,
Nikhil Thakurdesai,
Jianpeng Xu,
Jason H. D. Cho,
Kaushiki Nag,
Evren Korpeoglu,
Sushant Kumar,
Kannan Achan
Abstract:
Knowledge Graphs (KGs) play a crucial role in enhancing e-commerce system performance by providing structured information about entities and their relationships, such as complementary or substitutable relations between products or product types, which can be utilized in recommender systems. However, relation labeling in KGs remains a challenging task due to the dynamic nature of e-commerce domains…
▽ More
Knowledge Graphs (KGs) play a crucial role in enhancing e-commerce system performance by providing structured information about entities and their relationships, such as complementary or substitutable relations between products or product types, which can be utilized in recommender systems. However, relation labeling in KGs remains a challenging task due to the dynamic nature of e-commerce domains and the associated cost of human labor. Recently, breakthroughs in Large Language Models (LLMs) have shown surprising results in numerous natural language processing tasks. In this paper, we conduct an empirical study of LLMs for relation labeling in e-commerce KGs, investigating their powerful learning capabilities in natural language and effectiveness in predicting relations between product types with limited labeled data. We evaluate various LLMs, including PaLM and GPT-3.5, on benchmark datasets, demonstrating their ability to achieve competitive performance compared to humans on relation labeling tasks using just 1 to 5 labeled examples per relation. Additionally, we experiment with different prompt engineering techniques to examine their impact on model performance. Our results show that LLMs significantly outperform existing KG completion models in relation labeling for e-commerce KGs and exhibit performance strong enough to replace human labeling.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Mitigating Frequency Bias in Next-Basket Recommendation via Deconfounders
Authors:
Xiaohan Li,
Zheng Liu,
Luyi Ma,
Kaushiki Nag,
Stephen Guo,
Philip Yu,
Kannan Achan
Abstract:
Recent studies on Next-basket Recommendation (NBR) have achieved much progress by leveraging Personalized Item Frequency (PIF) as one of the main features, which measures the frequency of the user's interactions with the item. However, taking the PIF as an explicit feature incurs bias towards frequent items. Items that a user purchases frequently are assigned higher weights in the PIF-based recomm…
▽ More
Recent studies on Next-basket Recommendation (NBR) have achieved much progress by leveraging Personalized Item Frequency (PIF) as one of the main features, which measures the frequency of the user's interactions with the item. However, taking the PIF as an explicit feature incurs bias towards frequent items. Items that a user purchases frequently are assigned higher weights in the PIF-based recommender system and appear more frequently in the personalized recommendation list. As a result, the system will lose the fairness and balance between items that the user frequently purchases and items that the user never purchases. We refer to this systematic bias on personalized recommendation lists as frequency bias, which narrows users' browsing scope and reduces the system utility. We adopt causal inference theory to address this issue. Considering the influence of historical purchases on users' future interests, the user and item representations can be viewed as unobserved confounders in the causal diagram. In this paper, we propose a deconfounder model named FENDER (Frequency-aware Deconfounder for Next-basket Recommendation) to mitigate the frequency bias. With the deconfounder theory and the causal diagram we propose, FENDER decomposes PIF with a neural tensor layer to obtain substitute confounders for users and items. Then, FENDER performs unbiased recommendations considering the effect of these substitute confounders. Experimental results demonstrate that FENDER has derived diverse and fair results compared to ten baseline models on three datasets while achieving competitive performance. Further experiments illustrate how FENDER balances users' historical purchases and potential interests.
△ Less
Submitted 16 November, 2022;
originally announced November 2022.
-
Absence of hexagonal to square structural transition in LiFeAs vortex matter
Authors:
Sven Hoffmann,
Ronny Schlegel,
Christian Salazar,
Steffen Sykora,
Pranab Kumar Nag,
Pavlo Khanenko,
Robert Beck,
Saicharan Aswartham,
Sabine Wurmehl,
Bernd Büchner,
Yanina Fasano,
Christian Hess
Abstract:
We investigated magnetic vortices in two stoichiometric LiFeAs samples by means of scanning tunneling microscopy and spectroscopy. The vortices were revealed by measuring the local electronic density of states (LDOS) at zero bias conductance of samples in magnetic fields between 0.5 and 12 T. From single vortex spectroscopy we extract the Ginzburg-Landau coherence length of both samples as…
▽ More
We investigated magnetic vortices in two stoichiometric LiFeAs samples by means of scanning tunneling microscopy and spectroscopy. The vortices were revealed by measuring the local electronic density of states (LDOS) at zero bias conductance of samples in magnetic fields between 0.5 and 12 T. From single vortex spectroscopy we extract the Ginzburg-Landau coherence length of both samples as $4.4\pm0.5$ nm and $4.1\pm0.5$ nm, in accordance with previous findings. However, in contrast to previous reports, our study reveals that the reported hexagonal to square-like vortex lattice transition is absent up to 12 T both in field-cooling and zero-field-cooling processes. Remarkably, a highly ordered zero field cooled hexagonal vortex lattice is observed up to 8 T. We argue that several factors are likely to determine the structure of the vortex lattice in LiFeAs such as (i) details of the cooling procedure (ii) sample stoichiometry that alters the formation of nematic fluctuations, (iii) details of the order parameter and (iv) magnetoelastic coupling.
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
Fermi-arc diversity on surface terminations of the magnetic Weyl semimetal Co3Sn2S2
Authors:
Noam Morali,
Rajib Batabyal,
Pranab Kumar Nag,
Enke Liu,
Qiunan Xu,
Yan Sun,
Binghai Yan,
Claudia Felser,
Nurit Avraham,
Haim Beidenkopf
Abstract:
Bulk-surface correspondence in Weyl semimetals assures the formation of topological "Fermi-arc" surface bands whose existence is guaranteed by bulk Weyl nodes. By investigating three distinct surface terminations of the ferromagnetic semimetal Co3Sn2S2 we verify spectroscopically its classification as a time reversal symmetry broken Weyl semimetal. We show that the distinct surface potentials impo…
▽ More
Bulk-surface correspondence in Weyl semimetals assures the formation of topological "Fermi-arc" surface bands whose existence is guaranteed by bulk Weyl nodes. By investigating three distinct surface terminations of the ferromagnetic semimetal Co3Sn2S2 we verify spectroscopically its classification as a time reversal symmetry broken Weyl semimetal. We show that the distinct surface potentials imposed by three different terminations modify the Fermi-arc contour and Weyl node connectivity. On the Sn surface we identify intra-Brillouin zone Weyl node connectivity of Fermi-arcs, while on Co termination the connectivity is across adjacent Brillouin zones. On the S surface Fermi-arcs overlap with non-topological bulk and surface states that ambiguate their connectivity and obscure their exact identification. By these we resolve the topologically protected electronic properties of a Weyl semimetal and its unprotected ones that can be manipulated and engineered.
△ Less
Submitted 1 March, 2019;
originally announced March 2019.
-
Spectroscopic evidence of nematic fluctuations in LiFeAs
Authors:
Zhixiang Sun,
Pranab Kumar Nag,
Steffen Sykora,
Jose M. Guevara,
Sven Hoffmann,
Christian Salazar,
Torben Hänke,
Rhea Kappenberger,
Sabine Wurmehl,
Bernd Büchner,
Christian Hess
Abstract:
The role of nematic fluctuations in the pairing mechanism of iron-based superconductors is frequently debated. Here we present a novel method to reveal such fluctuations by identifying energy and momentum of the corresponding nematic boson through the detection of a boson-assisted resonant amplification of Friedel oscillations. Using Fourier-transform scanning tunneling spectroscopy, we observe fo…
▽ More
The role of nematic fluctuations in the pairing mechanism of iron-based superconductors is frequently debated. Here we present a novel method to reveal such fluctuations by identifying energy and momentum of the corresponding nematic boson through the detection of a boson-assisted resonant amplification of Friedel oscillations. Using Fourier-transform scanning tunneling spectroscopy, we observe for the unconventional superconductor LiFeAs strong signatures of bosonic states at momentum $q\sim 0$ and energy $Ω\approx8$~meV. We show that these bosonic states survive in the normal conducting state, and, moreover, that they are in perfect agreement with well-known strong above-gap anomalies in the tunneling spectra. Attributing these small-$q$ boson modes to nematic fluctuations we provide the first spectroscopic approach to the nematic boson in an unconventional superconductor.
△ Less
Submitted 8 November, 2018;
originally announced November 2018.
-
Defect states in LiFeAs as seen by low temperature scanning tunneling microscopy and spectroscopy
Authors:
R. Schlegel,
P. K. Nag,
D. Baumann,
R. Beck,
S. Wurmehl,
B. Büchner,
C. Hess
Abstract:
We present a microscopic investigation of frequently observed impurity-induced states in stoichiometric LiFeAs using low temperature scanning tunneling microscopy and spectroscopy (STM/STS). Our data reveal seven distinct well defined defects which are discernible in topographic measurements. Depending on their local topographic symmetry, we are able to assign five defect types to specific lattice…
▽ More
We present a microscopic investigation of frequently observed impurity-induced states in stoichiometric LiFeAs using low temperature scanning tunneling microscopy and spectroscopy (STM/STS). Our data reveal seven distinct well defined defects which are discernible in topographic measurements. Depending on their local topographic symmetry, we are able to assign five defect types to specific lattice sites at the Li, Fe and As positions. The most prominent result is that two different defect types have a remarkably different impact on the superconducting state. A specific and quite abundant Fe-defect with $D_2$-symmetry generates significant impurity-induced additional states primarily at positive bias voltage with pronounced peaks in the on-site local density of states (LDOS) at about 4~mV and 12~mV. On the other hand, a $D_4$-symmetric As-defect causes a significantly enhanced LDOS at both positive and negative bias voltages. We expect that these findings provide fresh input for further experimental and theoretical studies on elucidating the nature of superconductivity in LiFeAs.
△ Less
Submitted 24 March, 2016;
originally announced March 2016.
-
Unusual temperature evolution of superconductivity in LiFeAs
Authors:
P. K. Nag,
R. Schlegel,
D. Baumann,
H. -J. Grafe,
R. Beck,
S. Wurmehl,
B. Büchner,
C. Hess
Abstract:
We have performed temperature dependent tunneling spectroscopy on an impurity-free surface area of a LiFeAs single crystal. Our data reveal a highly unusual temperature evolution of superconductivity: at $T_c^*=18$~K a partial superconducting gap opens, as is evidenced by subtle, yet clear features in the tunneling spectra, i.e. particle-hole symmetric coherence peaks, and a dip-hump structure whi…
▽ More
We have performed temperature dependent tunneling spectroscopy on an impurity-free surface area of a LiFeAs single crystal. Our data reveal a highly unusual temperature evolution of superconductivity: at $T_c^*=18$~K a partial superconducting gap opens, as is evidenced by subtle, yet clear features in the tunneling spectra, i.e. particle-hole symmetric coherence peaks, and a dip-hump structure which signals strong-coupling superconductivity. At $T_c=16$~K, these features substantiate dramatically and become characteristic of full superconductivity. Remarkably, this is accompanied by an almost jump-like increase of the gap energy at $T_c$ to about 87\% of its low-temperature gap value. The energy of the bosonic mode as measured by the distance between the coherence peak and the higher-energy dip remains practically constant in the whole temperature regime $T\leq T_c^*$. The comparison of these findings with established experimental and theoretical results lead us to suggest that the bosonic mode is not directly related to incommensurate spin fluctuations that have previously been observed in inelastic neutron scattering.
△ Less
Submitted 11 September, 2015;
originally announced September 2015.