Skip to main content

Showing 1–11 of 11 results for author: Nagaraj, A

.
  1. arXiv:2406.04138  [pdf, other

    cs.CV cs.HC

    The 3D-PC: a benchmark for visual perspective taking in humans and machines

    Authors: Drew Linsley, Peisen Zhou, Alekh Karkada Ashok, Akash Nagaraj, Gaurav Gaonkar, Francis E Lewis, Zygmunt Pizlo, Thomas Serre

    Abstract: Visual perspective taking (VPT) is the ability to perceive and reason about the perspectives of others. It is an essential feature of human intelligence, which develops over the first decade of life and requires an ability to process the 3D structure of visual scenes. A growing number of reports have indicated that deep neural networks (DNNs) become capable of analyzing 3D scenes after training on… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2306.11327  [pdf, other

    eess.AS cs.SD

    eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer

    Authors: Ammar Abbas, Sri Karlapati, Bastian Schnell, Penny Karanasou, Marcel Granero Moya, Amith Nagaraj, Ayman Boustati, Nicole Peinelt, Alexis Moinet, Thomas Drugman

    Abstract: We present eCat, a novel end-to-end multispeaker model capable of: a) generating long-context speech with expressive and contextually appropriate prosody, and b) performing fine-grained prosody transfer between any pair of seen speakers. eCat is trained using a two-stage training approach. In Stage I, the model learns speaker-independent word-level prosody representations in an end-to-end fashion… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted to be published in the Proceedings of InterSpeech 2023

  3. arXiv:2305.16872  [pdf

    econ.GN cs.HC

    The Economics of Augmented and Virtual Reality

    Authors: Joshua Gans, Abhishek Nagaraj

    Abstract: This paper explores the economics of Augmented Reality (AR) and Virtual Reality (VR) technologies within decision-making contexts. Two metrics are proposed: Context Entropy, the informational complexity of an environment, and Context Immersivity, the value from full immersion. The analysis suggests that AR technologies assist in understanding complex contexts, while VR technologies provide access… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 13 pages, 1 table

  4. arXiv:2301.11722  [pdf, other

    cs.AI cs.HC

    Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?

    Authors: Victor Boutin, Thomas Fel, Lakshya Singhal, Rishav Mukherji, Akash Nagaraj, Julien Colin, Thomas Serre

    Abstract: An important milestone for AI is the development of algorithms that can produce drawings that are indistinguishable from those of humans. Here, we adapt the 'diversity vs. recognizability' scoring framework from Boutin et al, 2022 and find that one-shot diffusion models have indeed started to close the gap between humans and machines. However, using a finer-grained measure of the originality of in… ▽ More

    Submitted 31 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  5. Cross-domain Variational Capsules for Information Extraction

    Authors: Akash Nagaraj, Akhil K, Akshay Venkatesh, Srikanth HR

    Abstract: In this paper, we present a characteristic extraction algorithm and the Multi-domain Image Characteristics Dataset of characteristic-tagged images to simulate the way a human brain classifies cross-domain information and generates insight. The intent was to identify prominent characteristics in data and use this identification mechanism to auto-generate insight from data in other unseen domains. A… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: This paper was originally written in 2020

    Journal ref: In Innovations in Computer Science and Engineering, pp. 63-72. Springer, Singapore, 2021

  6. arXiv:2210.09052  [pdf, other

    cs.CV cs.MM

    Digital Image Forensics using Deep Learning

    Authors: Akash Nagaraj, Mukund Sood, Vivek Kapoor, Yash Mathur, Bishesh Sinha

    Abstract: During the investigation of criminal activity when evidence is available, the issue at hand is determining the credibility of the video and ascertaining that the video is real. Today, one way to authenticate the footage is to identify the camera that was used to capture the image or video in question. While a very common way to do this is by using image meta-data, this data can easily be falsified… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: This paper was written in 2018 as a part of our submission to the 2018 IEEE Signal Processing Cup: Forensic Camera Model Identification Challenge

  7. arXiv:2210.09004  [pdf, other

    cs.CY cs.AI cs.CL

    Real-Time Automated Answer Scoring

    Authors: Akash Nagaraj, Mukund Sood, Gowri Srinivasa

    Abstract: In recent years, the role of big data analytics has exponentially grown and is now slowly making its way into the education industry. Several attempts are being made in this sphere in order to improve the quality of education being provided to students and while many collaborations have been carried out before, automated scoring of answers has been explored to a rather limited extent. One of the b… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: This paper was originally written in mid 2018

    Journal ref: In 2018 IEEE 18th International Conference on Advanced Learning Technologies (ICALT), pp.231-232. IEEE, 2018

  8. arXiv:2210.07465  [pdf, other

    cs.CR

    Learning Algorithms in Static Analysis of Web Applications

    Authors: Akash Nagaraj, Bishesh Sinha, Mukund Sood, Yash Mathur, Sanchika Gupta, Dinkar Sitaram

    Abstract: Web applications are distributed applications, they are programs that run on more than one computer and communicate through a network or server. This very distributed nature of web applications, combined with the scale and sheer complexity of modern software systems complicate manual security auditing, while also creating a huge attack surface of potential hackers. These factors are making automat… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: This paper was originally written in 2019

  9. arXiv:2210.07400  [pdf, other

    cs.CV cs.AI cs.CY

    Real-time Action Recognition for Fine-Grained Actions and The Hand Wash Dataset

    Authors: Akash Nagaraj, Mukund Sood, Chetna Sureka, Gowri Srinivasa

    Abstract: In this paper we present a three-stream algorithm for real-time action recognition and a new dataset of handwash videos, with the intent of aligning action recognition with real-world constraints to yield effective conclusions. A three-stream fusion algorithm is proposed, which runs both accurately and efficiently, in real-time even on low-powered systems such as a Raspberry Pi. The cornerstone of… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: This paper was originally written in early 2020

  10. arXiv:2210.07397  [pdf, other

    cs.RO cs.AI

    A Concise Introduction to Reinforcement Learning in Robotics

    Authors: Akash Nagaraj, Mukund Sood, Bhagya M Patil

    Abstract: One of the biggest hurdles robotics faces is the facet of sophisticated and hard-to-engineer behaviors. Reinforcement learning offers a set of tools, and a framework to address this problem. In parallel, the misgivings of robotics offer a solid testing ground and evaluation metric for advancements in reinforcement learning. The two disciplines go hand-in-hand, much like the fields of Mathematics a… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: This paper was originally written in 2019

    Journal ref: Proceedings of International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications 2021

  11. arXiv:2204.05872  [pdf, other

    cs.CY

    Robust Quantification of Gender Disparity in Pre-Modern English Literature using Natural Language Processing

    Authors: Akarsh Nagaraj, Mayank Kejriwal

    Abstract: Research has continued to shed light on the extent and significance of gender disparity in social, cultural and economic spheres. More recently, computational tools from the Natural Language Processing (NLP) literature have been proposed for measuring such disparity using relatively extensive datasets and empirically rigorous methodologies. In this paper, we contribute to this line of research by… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.