-
Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement
Authors:
Siddarth Ravichandran,
Ondřej Texler,
Dimitar Dinev,
Hyun Jae Kang
Abstract:
Over the last few decades, many aspects of human life have been enhanced with virtual domains, from the advent of digital assistants such as Amazon's Alexa and Apple's Siri to the latest metaverse efforts of the rebranded Meta. These trends underscore the importance of generating photorealistic visual depictions of humans. This has led to the rapid growth of so-called deepfake and talking-head gen…
▽ More
Over the last few decades, many aspects of human life have been enhanced with virtual domains, from the advent of digital assistants such as Amazon's Alexa and Apple's Siri to the latest metaverse efforts of the rebranded Meta. These trends underscore the importance of generating photorealistic visual depictions of humans. This has led to the rapid growth of so-called deepfake and talking-head generation methods in recent years. Despite their impressive results and popularity, they usually lack certain qualitative aspects such as texture quality, lips synchronization, or resolution, and practical aspects such as the ability to run in real-time. To allow for virtual human avatars to be used in practical scenarios, we propose an end-to-end framework for synthesizing high-quality virtual human faces capable of speaking with accurate lip motion with a special emphasis on performance. We introduce a novel network utilizing visemes as an intermediate audio representation and a novel data augmentation strategy employing a hierarchical image synthesis approach that allows disentanglement of the different modalities used to control the global head motion. Our method runs in real-time, and is able to deliver superior results compared to the current state-of-the-art.
△ Less
Submitted 23 March, 2023; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Statistical Guarantees for Fairness Aware Plug-In Algorithms
Authors:
Drona Khurana,
Srinivasan Ravichandran,
Sparsh Jain,
Narayanan Unny Edakunni
Abstract:
A plug-in algorithm to estimate Bayes Optimal Classifiers for fairness-aware binary classification has been proposed in (Menon & Williamson, 2018). However, the statistical efficacy of their approach has not been established. We prove that the plug-in algorithm is statistically consistent. We also derive finite sample guarantees associated with learning the Bayes Optimal Classifiers via the plug-i…
▽ More
A plug-in algorithm to estimate Bayes Optimal Classifiers for fairness-aware binary classification has been proposed in (Menon & Williamson, 2018). However, the statistical efficacy of their approach has not been established. We prove that the plug-in algorithm is statistically consistent. We also derive finite sample guarantees associated with learning the Bayes Optimal Classifiers via the plug-in algorithm. Finally, we propose a protocol that modifies the plug-in approach, so as to simultaneously guarantee fairness and differential privacy with respect to a binary feature deemed sensitive.
△ Less
Submitted 27 July, 2021;
originally announced July 2021.
-
Detection and Classification of mental illnesses on social media using RoBERTa
Authors:
Ankit Murarka,
Balaji Radhakrishnan,
Sushma Ravichandran
Abstract:
Given the current social distancing regulations across the world, social media has become the primary mode of communication for most people. This has resulted in the isolation of many people suffering from mental illnesses who are unable to receive assistance in person. They have increasingly turned to social media to express themselves and to look for guidance in dealing with their illnesses. Kee…
▽ More
Given the current social distancing regulations across the world, social media has become the primary mode of communication for most people. This has resulted in the isolation of many people suffering from mental illnesses who are unable to receive assistance in person. They have increasingly turned to social media to express themselves and to look for guidance in dealing with their illnesses. Kee** this in mind, we propose a solution to detect and classify mental illness posts on social media thereby enabling users to seek appropriate help. In this work, we detect and classify five prominent kinds of mental illnesses: depression, anxiety, bipolar disorder, ADHD and PTSD by analyzing unstructured user data on social media platforms. In addition, we are sharing a new high-quality dataset to drive research on this topic. We believe that our work is the first multi-class model that uses a Transformer-based architecture such as RoBERTa to analyze people's emotions and psychology. We also demonstrate how we stress-test our model using behavioral testing. With this research, we hope to be able to contribute to the public health system by automating some of the detection and classification process.
△ Less
Submitted 23 November, 2020;
originally announced November 2020.
-
FairXGBoost: Fairness-aware Classification in XGBoost
Authors:
Srinivasan Ravichandran,
Drona Khurana,
Bharath Venkatesh,
Narayanan Unny Edakunni
Abstract:
Highly regulated domains such as finance have long favoured the use of machine learning algorithms that are scalable, transparent, robust and yield better performance. One of the most prominent examples of such an algorithm is XGBoost. Meanwhile, there is also a growing interest in building fair and unbiased models in these regulated domains and numerous bias-mitigation algorithms have been propos…
▽ More
Highly regulated domains such as finance have long favoured the use of machine learning algorithms that are scalable, transparent, robust and yield better performance. One of the most prominent examples of such an algorithm is XGBoost. Meanwhile, there is also a growing interest in building fair and unbiased models in these regulated domains and numerous bias-mitigation algorithms have been proposed to this end. However, most of these bias-mitigation methods are restricted to specific model families such as logistic regression or support vector machine models, thus leaving modelers with a difficult decision of choosing between fairness from the bias-mitigation algorithms and scalability, transparency, performance from algorithms such as XGBoost. We aim to leverage the best of both worlds by proposing a fair variant of XGBoost that enjoys all the advantages of XGBoost, while also matching the levels of fairness from the state-of-the-art bias-mitigation algorithms. Furthermore, the proposed solution requires very little in terms of changes to the original XGBoost library, thus making it easy for adoption. We provide an empirical analysis of our proposed method on standard benchmark datasets used in the fairness community.
△ Less
Submitted 7 October, 2020; v1 submitted 3 September, 2020;
originally announced September 2020.
-
Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies
Authors:
Shivashankar Subramanian,
Ioana Baldini,
Sushma Ravichandran,
Dmitriy A. Katz-Rogozhnikov,
Karthikeyan Natesan Ramamurthy,
Prasanna Sattigeri,
Kush R. Varshney,
Annmarie Wang,
Pradeep Mangalath,
Laura B. Kleiman
Abstract:
More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the effica…
▽ More
More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the efficacy of non-cancer generic drugs being tested for cancer exists in scientific publications, but trying to manually identify and extract such evidence is intractable. In this paper, we introduce a system to automate this evidence extraction from PubMed abstracts. Our primary contribution is to define the natural language processing pipeline required to obtain such evidence, comprising the following modules: querying, filtering, cancer type entity extraction, therapeutic association classification, and study type classification. Using the subject matter expertise on our team, we create our own datasets for these specialized domain-specific tasks. We obtain promising performance in each of the modules by utilizing modern language modeling techniques and plan to treat them as baseline approaches for future improvement of individual components.
△ Less
Submitted 5 December, 2019; v1 submitted 18 November, 2019;
originally announced November 2019.
-
Bike Renting Data Analysis: The Case of Dublin City
Authors:
Thanh Thoa Pham Thi,
Joe Timoney,
Shyram Ravichandran,
Peter Mooney,
Adam Winstanley
Abstract:
Public bike renting is more and more popular in cities to incentivise a reduction in car journeys and to boost the use of green transportation alternatives. One of the challenges of this application is to effectively plan the resources usage. This paper presents some analysis of Dublin bike renting scheme based on statistics and data mining. It provides available bike patterns at the most interest…
▽ More
Public bike renting is more and more popular in cities to incentivise a reduction in car journeys and to boost the use of green transportation alternatives. One of the challenges of this application is to effectively plan the resources usage. This paper presents some analysis of Dublin bike renting scheme based on statistics and data mining. It provides available bike patterns at the most interesting bike stations, that is, the busiest and the quietest stations. Consistency checking with new data reinforces confidence in the patterns obtained. Identifying available bike patterns helps to better address user needs such as organising the rebalancing of the bike numbers between stations in advance of demand.
△ Less
Submitted 22 April, 2017;
originally announced April 2017.
-
Autogenic Training With Natural Language Processing Modules: A Recent Tool For Certain Neuro Cognitive Studies
Authors:
S. Ravichandran,
M. N. Karthik
Abstract:
Learning to respond to voice-text input involves the subject's ability in understanding the phonetic and text based contents and his/her ability to communicate based on his/her experience. The neuro-cognitive facility of the subject has to support two important domains in order to make the learning process complete. In many cases, though the understanding is complete, the response is partial. Th…
▽ More
Learning to respond to voice-text input involves the subject's ability in understanding the phonetic and text based contents and his/her ability to communicate based on his/her experience. The neuro-cognitive facility of the subject has to support two important domains in order to make the learning process complete. In many cases, though the understanding is complete, the response is partial. This is one valid reason why we need to support the information from the subject with scalable techniques such as Natural Language Processing (NLP) for abstraction of the contents from the output. This paper explores the feasibility of using NLP modules interlaced with Neural Networks to perform the required task in autogenic training related to medical applications.
△ Less
Submitted 2 July, 2004;
originally announced July 2004.