Showing 1–11 of 11 results for author: Dwivedi, S K

Searching in archive cs.
  1. arXiv:2404.16752  [pdf, other]

    cs.CV

    TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation

    Authors: Sai Kumar Dwivedi, Yu Sun, Priyanka Patel, Yao Feng, Michael J. Black

    Abstract: We address the problem of regressing 3D human pose and shape from a single image, with a focus on 3D accuracy. The current best methods leverage large datasets of 3D pseudo-ground-truth (p-GT) and 2D keypoints, leading to robust performance. With such methods, we observe a paradoxical decline in 3D pose accuracy with increasing 2D accuracy. This is caused by biases in the p-GT and the use of an ap…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  2. arXiv:2311.18836  [pdf, other]

    cs.CV

    ChatPose: Chatting about 3D Human Pose

    Authors: Yao Feng, Jing Lin, Sai Kumar Dwivedi, Yu Sun, Priyanka Patel, Michael J. Black

    Abstract: We introduce ChatPose, a framework employing Large Language Models (LLMs) to understand and reason about 3D human poses from images or textual descriptions. Our work is motivated by the human ability to intuitively understand postures from a single image or a brief description, a process that intertwines image interpretation, world knowledge, and an understanding of body language. Traditional huma…

    Submitted 23 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Home page: https://yfeng95.github.io/ChatPose/

  3. arXiv:2308.12965  [pdf, other]

    cs.CV

    POCO: 3D Pose and Shape Estimation with Confidence

    Authors: Sai Kumar Dwivedi, Cordelia Schmid, Hongwei Yi, Michael J. Black, Dimitrios Tzionas

    Abstract: The regression of 3D Human Pose and Shape (HPS) from an image is becoming increasingly accurate. This makes the results useful for downstream tasks like human action recognition or 3D graphics. Yet, no regressor is perfect, and accuracy can be affected by ambiguous image evidence or by poses and appearance that are unseen during training. Most current HPS regressors, however, do not report the con…

    Submitted 24 August, 2023; originally announced August 2023.

  4. arXiv:2303.03373  [pdf, other]

    cs.CV

    Detecting Human-Object Contact in Images

    Authors: Yixin Chen, Sai Kumar Dwivedi, Michael J. Black, Dimitrios Tzionas

    Abstract: Humans constantly contact objects to move and perform tasks. Thus, detecting human-object contact is important for building human-centered artificial intelligence. However, there exists no robust method to detect contact between the body and the scene from an image, and there exists no dataset to learn such a detector. We fill this gap with HOT ("Human-Object conTact"), a new dataset of human-obje…

    Submitted 4 April, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  5. arXiv:2110.03480  [pdf, other]

    cs.CV

    Learning to Regress Bodies from Images using Differentiable Semantic Rendering

    Authors: Sai Kumar Dwivedi, Nikos Athanasiou, Muhammed Kocabas, Michael J. Black

    Abstract: Learning to regress 3D human body shape and pose (e.g. SMPL parameters) from monocular images typically exploits losses on 2D keypoints, silhouettes, and/or part-segmentation when 3D training data is not available. Such losses, however, are limited because 2D keypoints do not supervise body shape and segmentations of people in clothing do not match projected minimally-clothed SMPL shapes. To explo…

    Submitted 23 February, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: ICCV2021

  6. arXiv:1909.07945  [pdf, other]

    cs.CV

    ProtoGAN: Towards Few Shot Learning for Action Recognition

    Authors: Sai Kumar Dwivedi, Vikram Gupta, Rahul Mitra, Shuaib Ahmed, Arjun Jain

    Abstract: Few-shot learning (FSL) for action recognition is a challenging task of recognizing novel action categories which are represented by few instances in the training data. In a more generalized FSL setting (G-FSL), both seen as well as novel action categories need to be recognized. Conventional classifiers suffer due to inadequate data in FSL setting and inherent bias towards seen action categories i…

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: 9 pages, 5 tables, 2 figures. To appear in the proceedings of ICCV Workshop 2019

  7. arXiv:1909.06672  [pdf, other]

    cs.CV

    Progression Modelling for Online and Early Gesture Detection

    Authors: Vikram Gupta, Sai Kumar Dwivedi, Rishabh Dabral, Arjun Jain

    Abstract: Online and Early detection of gestures is crucial for building touchless gesture based interfaces. These interfaces should operate on a stream of video frames instead of the complete video and detect the presence of gestures at an earlier stage than post-completion for providing real time user experience. To achieve this, it is important to recognize the progression of the gesture across different…

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: 3DV 2019 Oral paper

  8. arXiv:1407.2423  [pdf]

    cs.CR

    Desiging a logical security framework for e-commerce system based on soa

    Authors: Ashish Kr. Luhach, Sanjay K. Dwivedi, C. K. Jha

    Abstract: Rapid increases in information technology also changed the existing markets and transformed them into e-markets (e-commerce) from physical markets. Equally with the e-commerce evolution, enterprises have to recover a safer approach for implementing E-commerce and maintaining its logical security. SOA is one of the best techniques to fulfill these requirements. SOA holds the vantage of being easy…

    Submitted 9 July, 2014; originally announced July 2014.

  9. arXiv:1407.2421  [pdf]

    cs.CR cs.CY

    Designing and implementing the logical security framework for e-commerce based on service oriented architecture

    Authors: Ashish Kr. Luhach, Sanjay K Dwivedi, C K Jha

    Abstract: Rapid evolution of information technology has contributed to the evolution of more sophisticated E-commerce system with the better transaction time and protection. The currently used E-commerce models lack in quality properties such as logical security because of their poor designing and to face the highly equipped and trained intruders. This editorial proposed a security framework for small and…

    Submitted 9 July, 2014; originally announced July 2014.

  10. arXiv:1406.4607  [pdf, ps, other]

    physics.soc-ph cs.SI nlin.AO

    Uncovering Randomness and Success in Society

    Authors: Sarika Jalan, Camellia Sarkar, Anagha Madhusudanan, Sanjiv Kumar Dwivedi

    Abstract: An understanding of how individuals shape and impact the evolution of society is vastly limited due to the unavailability of large-scale reliable datasets that can simultaneously capture information regarding individual movements and social interactions. We believe that the popular Indian film industry, 'Bollywood', can provide a social network apt for such a study. Bollywood provides massive amou…

    Submitted 8 July, 2014; v1 submitted 18 June, 2014; originally announced June 2014.

    Comments: 39 pages, 12 figures, 14 tables

    Journal ref: PloS one, 9(2), e88249 (2014)

  11. arXiv:1111.5293  [pdf]

    cs.CL

    Rule based Part of speech Tagger for Homoeopathy Clinical realm

    Authors: Sanjay K. Dwivedi, Pramod P. Sukhadeve

    Abstract: A tagger is a mandatory segment of most text scrutiny systems, as it consigns a syntax class (e.g., noun, verb, adjective, and adverb) to every word in a sentence. In this paper, we present a simple part of speech tagger for homoeopathy clinical language. This paper reports about the anticipated part of speech tagger for homoeopathy clinical language. It exploits standard pattern for evaluating s…

    Submitted 13 November, 2011; originally announced November 2011.