Skip to main content

Showing 1–10 of 10 results for author: Debnath, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12330  [pdf, other

    cs.CV cs.MM

    A Perspective on Deep Vision Performance with Standard Image and Video Codecs

    Authors: Christoph Reich, Oliver Hahn, Daniel Cremers, Stefan Roth, Biplob Debnath

    Abstract: Resource-constrained hardware, such as edge devices or cell phones, often rely on cloud servers to provide the required computational resources for inference in deep vision models. However, transferring image and video data from an edge or mobile device to a cloud server requires coding to deal with network constraints. The use of standardized codecs, such as JPEG or H.264, is prevalent and requir… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024 Workshop on AI for Streaming (AIS)

  2. arXiv:2404.12309  [pdf, other

    cs.CV cs.IR cs.LG

    iRAG: An Incremental Retrieval Augmented Generation System for Videos

    Authors: Md Adnan Arefeen, Biplob Debnath, Md Yusuf Sarwar Uddin, Srimat Chakradhar

    Abstract: Retrieval augmented generation (RAG) systems combine the strengths of language generation and information retrieval to power many real-world applications like chatbots. Use of RAG for combined understanding of multimodal data such as text, images and videos is appealing but two critical limitations exist: one-time, upfront capture of all content in large multimodal data as text descriptions entail… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  3. Differentiable JPEG: The Devil is in the Details

    Authors: Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar

    Abstract: JPEG remains one of the most widespread lossy image coding methods. However, the non-differentiable nature of JPEG restricts the application in deep learning pipelines. Several differentiable approximations of JPEG have recently been proposed to address this issue. This paper conducts a comprehensive review of existing diff. JPEG approaches and identifies critical details that have been missed by… ▽ More

    Submitted 22 December, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted at WACV 2024. Project page: https://christophreich1996.github.io/differentiable_jpeg/ WACV paper: https://openaccess.thecvf.com/content/WACV2024/html/Reich_Differentiable_JPEG_The_Devil_Is_in_the_Details_WACV_2024_paper.html

  4. arXiv:2309.00841  [pdf, other

    cs.CL cs.AI cs.IR

    LeanContext: Cost-Efficient Domain-Specific Question Answering Using LLMs

    Authors: Md Adnan Arefeen, Biplob Debnath, Srimat Chakradhar

    Abstract: Question-answering (QA) is a significant application of Large Language Models (LLMs), sha** chatbot capabilities across healthcare, education, and customer service. However, widespread LLM integration presents a challenge for small businesses due to the high expenses of LLM API usage. Costs rise rapidly when domain-specific data (context) is used alongside queries for accurate domain-specific LL… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: The paper is under review

  5. arXiv:2308.16215  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Deep Video Codec Control for Vision Models

    Authors: Christoph Reich, Biplob Debnath, Deep Patel, Tim Prangemeier, Daniel Cremers, Srimat Chakradhar

    Abstract: Standardized lossy video coding is at the core of almost all real-world video processing pipelines. Rate control is used to enable standard codecs to adapt to different network bandwidth conditions or storage constraints. However, standard video codecs (e.g., H.264) and their rate control modules aim to minimize video distortion w.r.t. human quality assessment. We demonstrate empirically that stan… ▽ More

    Submitted 16 April, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted at CVPR 2024 Workshop on AI for Streaming (AIS)

  6. arXiv:2304.09617  [pdf, other

    cs.RO

    Towards Autonomous Selective Harvesting: A Review of Robot Perception, Robot Design, Motion Planning and Control

    Authors: Vishnu Rajendran S, Bappaditya Debnath, Bappaditya Debnath, Sariah Mghames, Willow Mandil, Soran Parsa, Simon Parsons, Amir Ghalamzan-E

    Abstract: This paper provides an overview of the current state-of-the-art in selective harvesting robots (SHRs) and their potential for addressing the challenges of global food production. SHRs have the potential to increase productivity, reduce labour costs, and minimise food waste by selectively harvesting only ripe fruits and vegetables. The paper discusses the main components of SHRs, including percepti… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Preprint: to be appeared in Journal of Field Robotics

  7. arXiv:2301.03947  [pdf, other

    cs.RO cs.AI cs.CV

    Autonomous Strawberry Picking Robotic System (Robofruit)

    Authors: Soran Parsa, Bappaditya Debnath, Muhammad Arshad Khan, Amir Ghalamzan E.

    Abstract: Challenges in strawberry picking made selective harvesting robotic technology demanding. However, selective harvesting of strawberries is complicated forming a few scientific research questions. Most available solutions only deal with a specific picking scenario, e.g., picking only a single variety of fruit in isolation. Nonetheless, most economically viable (e.g. high-yielding and/or disease-resi… ▽ More

    Submitted 10 January, 2023; originally announced January 2023.

    Comments: To appear in the Journal of Field Robotics (Accepted) Please watch the video at https://www.youtube.com/watch?v=v8gGAvsISXU

  8. arXiv:2208.09074  [pdf, other

    cs.RO

    dPMP-Deep Probabilistic Motion Planning: A use case in Strawberry Picking Robot

    Authors: Alessandra Tafuro, Bappaditya Debnath, Andrea M. Zanchettin, Amir Ghalamzan E

    Abstract: This paper presents a novel probabilistic approach to deep robot learning from demonstrations (LfD). Deep movement primitives (DMPs) are deterministic LfD model that maps visual information directly into a robot trajectory. This paper extends DMPs and presents a deep probabilistic model that maps the visual information into a distribution of effective robot trajectories. The architecture that lead… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: To appear In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

  9. F3S: Free Flow Fever Screening

    Authors: Kunal Rao, Giuseppe Coviello, Min Feng, Biplob Debnath, Wang-Pin Hsiung, Murugan Sankaradas, Yi Yang, Oliver Po, Utsav Drolia, Srimat Chakradhar

    Abstract: Identification of people with elevated body temperature can reduce or dramatically slow down the spread of infectious diseases like COVID-19. We present a novel fever-screening system, F3S, that uses edge machine learning techniques to accurately measure core body temperatures of multiple individuals in a free-flow setting. F3S performs real-time sensor fusion of visual camera with thermal camera… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

  10. arXiv:2009.14326  [pdf, other

    cs.CV cs.LG eess.IV

    Attention-Driven Body Pose Encoding for Human Activity Recognition

    Authors: B Debnath, M O'brien, S Kumar, A Behera

    Abstract: This article proposes a novel attention-based body pose encoding for human activity recognition that presents a enriched representation of body-pose that is learned. The enriched data complements the 3D body joint position data and improves model performance. In this paper, we propose a novel approach that learns enhanced feature representations from a given sequence of 3D body joints. To achieve… ▽ More

    Submitted 2 October, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: This paper has been accepted for publication at the IAPR IEEE/Computer Society International Conference on Pattern Recognition (ICPR), Milan, 2021

    Journal ref: IAPR IEEE/Computer Society International Conference on Pattern Recognition (ICPR), Milan, 2021