Search | arXiv e-print repository

Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement

Authors: Siddarth Ravichandran, Ondřej Texler, Dimitar Dinev, Hyun Jae Kang

Abstract: Over the last few decades, many aspects of human life have been enhanced with virtual domains, from the advent of digital assistants such as Amazon's Alexa and Apple's Siri to the latest metaverse efforts of the rebranded Meta. These trends underscore the importance of generating photorealistic visual depictions of humans. This has led to the rapid growth of so-called deepfake and talking-head gen… ▽ More Over the last few decades, many aspects of human life have been enhanced with virtual domains, from the advent of digital assistants such as Amazon's Alexa and Apple's Siri to the latest metaverse efforts of the rebranded Meta. These trends underscore the importance of generating photorealistic visual depictions of humans. This has led to the rapid growth of so-called deepfake and talking-head generation methods in recent years. Despite their impressive results and popularity, they usually lack certain qualitative aspects such as texture quality, lips synchronization, or resolution, and practical aspects such as the ability to run in real-time. To allow for virtual human avatars to be used in practical scenarios, we propose an end-to-end framework for synthesizing high-quality virtual human faces capable of speaking with accurate lip motion with a special emphasis on performance. We introduce a novel network utilizing visemes as an intermediate audio representation and a novel data augmentation strategy employing a hierarchical image synthesis approach that allows disentanglement of the different modalities used to control the global head motion. Our method runs in real-time, and is able to deliver superior results compared to the current state-of-the-art. △ Less

Submitted 23 March, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

arXiv:2107.12783 [pdf, other]

Statistical Guarantees for Fairness Aware Plug-In Algorithms

Authors: Drona Khurana, Srinivasan Ravichandran, Sparsh Jain, Narayanan Unny Edakunni

Abstract: A plug-in algorithm to estimate Bayes Optimal Classifiers for fairness-aware binary classification has been proposed in (Menon & Williamson, 2018). However, the statistical efficacy of their approach has not been established. We prove that the plug-in algorithm is statistically consistent. We also derive finite sample guarantees associated with learning the Bayes Optimal Classifiers via the plug-i… ▽ More A plug-in algorithm to estimate Bayes Optimal Classifiers for fairness-aware binary classification has been proposed in (Menon & Williamson, 2018). However, the statistical efficacy of their approach has not been established. We prove that the plug-in algorithm is statistically consistent. We also derive finite sample guarantees associated with learning the Bayes Optimal Classifiers via the plug-in algorithm. Finally, we propose a protocol that modifies the plug-in approach, so as to simultaneously guarantee fairness and differential privacy with respect to a binary feature deemed sensitive. △ Less

Submitted 27 July, 2021; originally announced July 2021.

Comments: This paper was accepted at the workshop on Socially Responsible Machine Learning, ICML 2021

arXiv:2011.11226 [pdf, other]

Detection and Classification of mental illnesses on social media using RoBERTa

Authors: Ankit Murarka, Balaji Radhakrishnan, Sushma Ravichandran

Abstract: Given the current social distancing regulations across the world, social media has become the primary mode of communication for most people. This has resulted in the isolation of many people suffering from mental illnesses who are unable to receive assistance in person. They have increasingly turned to social media to express themselves and to look for guidance in dealing with their illnesses. Kee… ▽ More Given the current social distancing regulations across the world, social media has become the primary mode of communication for most people. This has resulted in the isolation of many people suffering from mental illnesses who are unable to receive assistance in person. They have increasingly turned to social media to express themselves and to look for guidance in dealing with their illnesses. Kee** this in mind, we propose a solution to detect and classify mental illness posts on social media thereby enabling users to seek appropriate help. In this work, we detect and classify five prominent kinds of mental illnesses: depression, anxiety, bipolar disorder, ADHD and PTSD by analyzing unstructured user data on social media platforms. In addition, we are sharing a new high-quality dataset to drive research on this topic. We believe that our work is the first multi-class model that uses a Transformer-based architecture such as RoBERTa to analyze people's emotions and psychology. We also demonstrate how we stress-test our model using behavioral testing. With this research, we hope to be able to contribute to the public health system by automating some of the detection and classification process. △ Less

Submitted 23 November, 2020; originally announced November 2020.

Comments: 8 pages, 1 figure, 6 tables

arXiv:2009.01442 [pdf, other]

FairXGBoost: Fairness-aware Classification in XGBoost

Authors: Srinivasan Ravichandran, Drona Khurana, Bharath Venkatesh, Narayanan Unny Edakunni

Abstract: Highly regulated domains such as finance have long favoured the use of machine learning algorithms that are scalable, transparent, robust and yield better performance. One of the most prominent examples of such an algorithm is XGBoost. Meanwhile, there is also a growing interest in building fair and unbiased models in these regulated domains and numerous bias-mitigation algorithms have been propos… ▽ More Highly regulated domains such as finance have long favoured the use of machine learning algorithms that are scalable, transparent, robust and yield better performance. One of the most prominent examples of such an algorithm is XGBoost. Meanwhile, there is also a growing interest in building fair and unbiased models in these regulated domains and numerous bias-mitigation algorithms have been proposed to this end. However, most of these bias-mitigation methods are restricted to specific model families such as logistic regression or support vector machine models, thus leaving modelers with a difficult decision of choosing between fairness from the bias-mitigation algorithms and scalability, transparency, performance from algorithms such as XGBoost. We aim to leverage the best of both worlds by proposing a fair variant of XGBoost that enjoys all the advantages of XGBoost, while also matching the levels of fairness from the state-of-the-art bias-mitigation algorithms. Furthermore, the proposed solution requires very little in terms of changes to the original XGBoost library, thus making it easy for adoption. We provide an empirical analysis of our proposed method on standard benchmark datasets used in the fairness community. △ Less

Submitted 7 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

arXiv:1911.07819 [pdf, other]

Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies

Authors: Shivashankar Subramanian, Ioana Baldini, Sushma Ravichandran, Dmitriy A. Katz-Rogozhnikov, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Kush R. Varshney, Annmarie Wang, Pradeep Mangalath, Laura B. Kleiman

Abstract: More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the effica… ▽ More More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the efficacy of non-cancer generic drugs being tested for cancer exists in scientific publications, but trying to manually identify and extract such evidence is intractable. In this paper, we introduce a system to automate this evidence extraction from PubMed abstracts. Our primary contribution is to define the natural language processing pipeline required to obtain such evidence, comprising the following modules: querying, filtering, cancer type entity extraction, therapeutic association classification, and study type classification. Using the subject matter expertise on our team, we create our own datasets for these specialized domain-specific tasks. We obtain promising performance in each of the modules by utilizing modern language modeling techniques and plan to treat them as baseline approaches for future improvement of individual components. △ Less

Submitted 5 December, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

arXiv:1704.06802 [pdf]

Bike Renting Data Analysis: The Case of Dublin City

Authors: Thanh Thoa Pham Thi, Joe Timoney, Shyram Ravichandran, Peter Mooney, Adam Winstanley

Abstract: Public bike renting is more and more popular in cities to incentivise a reduction in car journeys and to boost the use of green transportation alternatives. One of the challenges of this application is to effectively plan the resources usage. This paper presents some analysis of Dublin bike renting scheme based on statistics and data mining. It provides available bike patterns at the most interest… ▽ More Public bike renting is more and more popular in cities to incentivise a reduction in car journeys and to boost the use of green transportation alternatives. One of the challenges of this application is to effectively plan the resources usage. This paper presents some analysis of Dublin bike renting scheme based on statistics and data mining. It provides available bike patterns at the most interesting bike stations, that is, the busiest and the quietest stations. Consistency checking with new data reinforces confidence in the patterns obtained. Identifying available bike patterns helps to better address user needs such as organising the rebalancing of the bike numbers between stations in advance of demand. △ Less

Submitted 22 April, 2017; originally announced April 2017.

Comments: GISRUK 2017

arXiv:cs/0407008 [pdf]

Autogenic Training With Natural Language Processing Modules: A Recent Tool For Certain Neuro Cognitive Studies

Authors: S. Ravichandran, M. N. Karthik

Abstract: Learning to respond to voice-text input involves the subject's ability in understanding the phonetic and text based contents and his/her ability to communicate based on his/her experience. The neuro-cognitive facility of the subject has to support two important domains in order to make the learning process complete. In many cases, though the understanding is complete, the response is partial. Th… ▽ More Learning to respond to voice-text input involves the subject's ability in understanding the phonetic and text based contents and his/her ability to communicate based on his/her experience. The neuro-cognitive facility of the subject has to support two important domains in order to make the learning process complete. In many cases, though the understanding is complete, the response is partial. This is one valid reason why we need to support the information from the subject with scalable techniques such as Natural Language Processing (NLP) for abstraction of the contents from the output. This paper explores the feasibility of using NLP modules interlaced with Neural Networks to perform the required task in autogenic training related to medical applications. △ Less

Submitted 2 July, 2004; originally announced July 2004.

Comments: 2 Pages. Proceedings of 11th International Congress on Biological & Medical Engineering, Singapore (IEEE-EMBS & IFMBE endorsed)

ACM Class: I.2.1; I.2.6; I.2.7

Showing 1–7 of 7 results for author: Ravichandran, S