Showing 1–12 of 12 results for author: Tanveer, M I

  1. arXiv:2207.12504

    cs.CL

    Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free

    Authors: M. Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren, Rosie Jones

    Abstract: Podcasts are conversational in nature and speaker changes are frequent -- requiring speaker diarization for content understanding. We propose an unsupervised technique for speaker diarization without relying on language-specific components. The algorithm is overlap-aware and does not require information about the number of speakers. Our approach shows 79% improvement on purity scores (34% on F-sco…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Published at Interspeech 2022

  2. arXiv:2103.14131

    cs.LG

    Persistence Homology of TEDtalk: Do Sentence Embeddings Have a Topological Shape?

    Authors: Shouman Das, Syed A. Haque, Md. Iftekhar Tanveer

    Abstract: Topological data analysis (TDA) has recently emerged as a new technique to extract meaningful discriminative features from high dimensional data. In this paper, we investigate the possibility of applying TDA to improve the classification accuracy of public speaking rating. We calculated persistence image vectors for the sentence embeddings of TEDtalk data and feed these vectors as addi…

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: 6 pages, 2 figures

  3. arXiv:2012.06157

    cs.AI

    Fairness in Rating Prediction by Awareness of Verbal and Gesture Quality of Public Speeches

    Authors: Ankani Chattoraj, Rupam Acharyya, Shouman Das, Md. Iftekhar Tanveer, Ehsan Hoque

    Abstract: The role of verbal and non-verbal cues towards great public speaking has been a topic of exploration for many decades. We identify a commonality across present theories, the element of "variety or heterogeneity" in channels or modes of communication (e.g. resorting to stories, scientific facts, emotional connections, facial expressions etc.) which is essential for effectively communicating informa…

    Submitted 15 November, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

  4. arXiv:2003.00683

    cs.AI

    Detection and Mitigation of Bias in Ted Talk Ratings

    Authors: Rupam Acharyya, Shouman Das, Ankani Chattoraj, Oishani Sengupta, Md Iftekar Tanveer

    Abstract: Unbiased data collection is essential to guaranteeing fairness in artificial intelligence models. Implicit bias, a form of behavioral conditioning that leads us to attribute predetermined characteristics to members of certain groups, informs the data collection process. This paper quantifies implicit bias in viewer ratings of TEDTalks, a diverse social platform assessing social and professional…

    Submitted 2 March, 2020; originally announced March 2020.

  5. arXiv:2002.12721

    stat.AP

    To be or not to be? A spatial predictive crime model for Rochester

    Authors: Ankani Chattoraj, Rupam Acharyya, Sabyasachi Shivkumar, Md Iftekar Tanveer, Mohammad Rafayet Ali

    Abstract: This project uses a spatial model (Geographically Weighted Regression) to relate various physical and social features to crime rates. Besides making interesting predictions from basic data statistics, the trained model can be used to predict on the test data. The high accuracy of this prediction on test data then allows us to make predictions of crime probabilities in different areas based on the…

    Submitted 27 February, 2020; originally announced February 2020.

  6. arXiv:1911.11558

    cs.LG cs.CL stat.ML

    FairyTED: A Fair Rating Predictor for TED Talk Data

    Authors: Rupam Acharyya, Shouman Das, Ankani Chattoraj, Md. Iftekhar Tanveer

    Abstract: With the recent trend of applying machine learning in every aspect of human life, it is important to incorporate fairness into the core of the predictive algorithms. We address the problem of predicting the quality of public speeches while being fair with respect to sensitive attributes of the speakers, e.g. gender and race. We use the TED talks as an input repository of public speeches because it…

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: 9 pages, 4 figures, 3 tables. Accepted as a conference paper to be presented at AAAI 2020

  7. arXiv:1906.03940

    cs.MM cs.CL

    Predicting TED Talk Ratings from Language and Prosody

    Authors: Md Iftekhar Tanveer, Md Kamrul Hassan, Daniel Gildea, M. Ehsan Hoque

    Abstract: We use the largest open repository of public speaking, TED Talks, to predict the ratings of the online viewers. Our dataset contains over 2200 TED Talk transcripts (including over 200 thousand sentences), audio features, and the associated meta information, including about 5.5 million ratings from spontaneous visitors of the website. We propose three neural network architectures and compare with st…

    Submitted 20 May, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1905.08392

  8. arXiv:1905.08392

    cs.LG cs.CL stat.ML

    A Causality-Guided Prediction of the TED Talk Ratings from the Speech-Transcripts using Neural Networks

    Authors: Md Iftekhar Tanveer, Md Kamrul Hasan, Daniel Gildea, M. Ehsan Hoque

    Abstract: Automated prediction of public speaking performance enables novel systems for tutoring public speaking skills. We use the largest open repository, TED Talks, to predict the ratings provided by the online viewers. The dataset contains over 2200 talk transcripts and the associated meta information, including over 5.5 million ratings from spontaneous visitors to the website. We carefully removed the…

    Submitted 20 May, 2019; originally announced May 2019.

  9. arXiv:1904.06618

    cs.LG cs.CL stat.ML

    UR-FUNNY: A Multimodal Language Dataset for Understanding Humor

    Authors: Md Kamrul Hasan, Wasifur Rahman, Amir Zadeh, Jianyuan Zhong, Md Iftekhar Tanveer, Louis-Philippe Morency, Mohammed Hoque

    Abstract: Humor is a unique and creative communicative behavior displayed during social interactions. It is produced in a multimodal manner, through the usage of words (text), gestures (vision) and prosodic cues (acoustic). Understanding humor from these three modalities falls within the boundaries of multimodal language, a recent research trend in natural language processing that models natural language as it…

    Submitted 13 April, 2019; originally announced April 2019.

    Journal ref: EMNLP-IJCNLP, 2019, 2046-2056

  10. arXiv:1707.04790

    cs.HC

    Automatic Identification of Non-Meaningful Body-Movements and What It Reveals About Humans

    Authors: Md Iftekhar Tanveer, RuJie Zhao, Mohammed Hoque

    Abstract: We present a framework to identify whether a public speaker's body movements are meaningful or non-meaningful ("Mannerisms") in the context of their speeches. In a dataset of 84 public speaking videos from 28 individuals, we extract 314 unique body movement patterns (e.g. pacing, gesturing, shifting body weights, etc.). Online workers and the speakers themselves annotated the meaningfulness of the…

    Submitted 15 July, 2017; originally announced July 2017.

  11. arXiv:1505.07310

    cs.HC

    Use of Laplacian Projection Technique for Summarizing Likert Scale Annotations

    Authors: M. Iftekhar Tanveer

    Abstract: Summarizing Likert scale ratings from human annotators is an important step for collecting human judgments. In this project we study a novel, graph theoretic method for this purpose. We also analyze a few interesting properties for this approach using real annotation datasets.

    Submitted 26 May, 2015; originally announced May 2015.

  12. arXiv:1504.03425

    cs.HC cs.AI cs.CL

    Automated Analysis and Prediction of Job Interview Performance

    Authors: Iftekhar Naim, M. Iftekhar Tanveer, Daniel Gildea, Mohammed Hoque

    Abstract: We present a computational framework for automatically quantifying verbal and nonverbal behaviors in the context of job interviews. The proposed framework is trained by analyzing the videos of 138 interview sessions with 69 internship-seeking undergraduates at the Massachusetts Institute of Technology (MIT). Our automated analysis includes facial expressions (e.g., smiles, head gestures, facial tr…

    Submitted 14 April, 2015; originally announced April 2015.

    Comments: 14 pages, 8 figures, 6 tables