Skip to main content

Showing 1–18 of 18 results for author: Nguyen, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16828  [pdf, other

    cs.IR cs.AI cs.CL

    Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track

    Authors: Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam, Eric Zhang, Ryan Nguyen, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Did you try out the new Bing Search? Or maybe you fiddled around with Google AI~Overviews? These might sound familiar because the modern-day search stack has recently evolved to include retrieval-augmented generation (RAG) systems. They allow searching and incorporating real-time data into large language models (LLMs) to provide a well-informed, attributed, concise summary in contrast to the tradi… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2403.16205  [pdf, other

    cs.CV

    Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains

    Authors: Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoai

    Abstract: This paper presents an innovative framework designed to train an image deblurring algorithm tailored to a specific camera device. This algorithm works by transforming a blurry input image, which is challenging to deblur, into another blurry image that is more amenable to deblurring. The transformation process, from one blurry state to another, leverages unpaired data consisting of sharp and blurry… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  3. MARRS: Multimodal Reference Resolution System

    Authors: Halim Cagri Ates, Shruti Bhargava, Site Li, Jiarui Lu, Siddhardha Maddula, Joel Ruben Antony Moniz, Anil Kumar Nalamalapu, Roman Hoang Nguyen, Melis Ozyildirim, Alkesh Patel, Dhivya Piraviperumal, Vincent Renkens, Ankit Samal, Thy Tran, Bo-Hsiang Tseng, Hong Yu, Yuan Zhang, Rong Zou

    Abstract: Successfully handling context is essential for any dialog understanding task. This context maybe be conversational (relying on previous user queries or system responses), visual (relying on what the user sees, for example, on their screen), or background (based on signals such as a ringing alarm or playing music). In this work, we present an overview of MARRS, or Multimodal Reference Resolution Sy… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Sixth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2023)

  4. arXiv:2310.00184  [pdf, other

    cs.RO

    NASU -- Novel Actuating Screw Unit: Origami-inspired Screw-based Propulsion on Mobile Ground Robots

    Authors: Calvin Joyce, Jason Lim, Roger Nguyen, Michael Owens, Sara Wickenhiser, Elizabeth Peiros, Florian Richter, Michael C. Yip

    Abstract: Screw-based locomotion is a robust method of locomotion across a wide range of media including water, sand, and gravel. A challenge with screws is their significant number of impactful design parameters that affect locomotion performance. One crucial parameter is the angle of attack (also called the lead angle), which has been shown to significantly impact the performance of screw propellers in te… ▽ More

    Submitted 13 May, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 7 pages, 8 Figures, submitted to IROS 2024

  5. arXiv:2304.01686  [pdf, other

    cs.CV cs.AI

    HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering

    Authors: Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoai

    Abstract: We consider the challenging task of training models for image-to-video deblurring, which aims to recover a sequence of sharp images corresponding to a given blurry image input. A critical issue disturbing the training of an image-to-video model is the ambiguity of the frame ordering since both the forward and backward sequences are plausible solutions. This paper proposes an effective self-supervi… ▽ More

    Submitted 5 April, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023

  6. arXiv:2210.15897  [pdf, other

    eess.IV cs.CV cs.GR

    Single-Image HDR Reconstruction by Multi-Exposure Generation

    Authors: Phuoc-Hieu Le, Quynh Le, Rang Nguyen, Binh-Son Hua

    Abstract: High dynamic range (HDR) imaging is an indispensable technique in modern photography. Traditional methods focus on HDR reconstruction from multiple images, solving the core problems of image alignment, fusion, and tone map**, yet having a perfect solution due to ghosting and other visual artifacts in the reconstruction. Recent attempts at single-image HDR reconstruction show a promising alternat… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: WACV 2023 paper. 8 pages of content, 2 pages of references, 8 pages of supplementary material

  7. arXiv:2210.00712  [pdf, other

    cs.CV

    PSENet: Progressive Self-Enhancement Network for Unsupervised Extreme-Light Image Enhancement

    Authors: Hue Nguyen, Diep Tran, Khoi Nguyen, Rang Nguyen

    Abstract: The extremes of lighting (e.g. too much or too little light) usually cause many troubles for machine and human vision. Many recent works have mainly focused on under-exposure cases where images are often captured in low-light conditions (e.g. nighttime) and achieved promising results for enhancing the quality of images. However, they are inferior to handling images under over-exposure. To mitigate… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  8. arXiv:2207.10785  [pdf, other

    cs.CV

    Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments

    Authors: Khoi D. Nguyen, Quoc-Huy Tran, Khoi Nguyen, Binh-Son Hua, Rang Nguyen

    Abstract: We present a novel method for few-shot video classification, which performs appearance and temporal alignments. In particular, given a pair of query and support videos, we conduct appearance alignment via frame-level feature matching to achieve the appearance similarity score between the videos, while utilizing temporal order-preserving priors for obtaining the temporal similarity score between th… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  9. arXiv:2206.07772  [pdf, other

    cs.AI

    Deep Learning and Handheld Augmented Reality Based System for Optimal Data Collection in Fault Diagnostics Domain

    Authors: Ryan Nguyen, Rahul Rai

    Abstract: Compared to current AI or robotic systems, humans navigate their environment with ease, making tasks such as data collection trivial. However, humans find it harder to model complex relationships hidden in the data. AI systems, especially deep learning (DL) algorithms, impressively capture those complex relationships. Symbiotically coupling humans and computational machines' strengths can simultan… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  10. Physics-Infused Fuzzy Generative Adversarial Network for Robust Failure Prognosis

    Authors: Ryan Nguyen, Shubhendu Kumar Singh, Rahul Rai

    Abstract: Prognostics aid in the longevity of fielded systems or products. Quantifying the system's current health enable prognosis to enhance the operator's decision-making to preserve the system's health. Creating a prognosis for a system can be difficult due to (a) unknown physical relationships and/or (b) irregularities in data appearing well beyond the initiation of a problem. Traditionally, three diff… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  11. arXiv:2206.04679  [pdf, other

    cs.LG cs.CV

    POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples

    Authors: Duong H. Le, Khoi D. Nguyen, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua

    Abstract: In this work, we propose to use out-of-distribution samples, i.e., unlabeled samples coming from outside the target classes, to improve few-shot learning. Specifically, we exploit the easily available out-of-distribution samples to drive the classifier to avoid irrelevant features by maximizing the distance from prototypes to out-of-distribution samples while minimizing that of in-distribution sam… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted at NeurIPS 2021 (First two authors contribute equally)

  12. arXiv:2202.06226  [pdf, other

    eess.SY cs.LG

    Feature Construction and Selection for PV Solar Power Modeling

    Authors: Yu Yang, Jia Mao, Richard Nguyen, Annas Tohmeh, Hen-Geul Yeh

    Abstract: Using solar power in the process industry can reduce greenhouse gas emissions and make the production process more sustainable. However, the intermittent nature of solar power renders its usage challenging. Building a model to predict photovoltaic (PV) power generation allows decision-makers to hedge energy shortages and further design proper operations. The solar power output is time-series data… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: 6 pages, 8 figures

    Journal ref: Adconip 2022

  13. arXiv:2112.01398  [pdf, other

    cs.CV

    TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation

    Authors: Tan M. Dinh, Rang Nguyen, Binh-Son Hua

    Abstract: In this paper, we conduct a study on the state-of-the-art methods for text-to-image synthesis and propose a framework to evaluate these methods. We consider syntheses where an image contains a single or multiple objects. Our study outlines several issues in the current evaluation pipeline: (i) for image quality assessment, a commonly used metric, e.g., Inception Score (IS), is often either miscali… ▽ More

    Submitted 19 July, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to ECCV 2022; TISE toolbox is available at https://github.com/VinAIResearch/tise-toolbox

  14. arXiv:2112.00719  [pdf, other

    cs.CV

    HyperInverter: Improving StyleGAN Inversion via Hypernetwork

    Authors: Tan M. Dinh, Anh Tuan Tran, Rang Nguyen, Binh-Son Hua

    Abstract: Real-world image manipulation has achieved fantastic progress in recent years as a result of the exploration and utilization of GAN latent spaces. GAN inversion is the first step in this pipeline, which aims to map the real image to the latent code faithfully. Unfortunately, the majority of existing GAN inversion methods fail to meet at least one of the three requirements listed below: high recons… ▽ More

    Submitted 4 April, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022; Project page is located at https://di-mi-ta.github.io/HyperInverter/

  15. arXiv:2110.14588  [pdf, other

    cs.LG

    Fuzzy Generative Adversarial Networks

    Authors: Ryan Nguyen, Shubhendu Kumar Singh, Rahul Rai

    Abstract: Generative Adversarial Networks (GANs) are well-known tools for data generation and semi-supervised classification. GANs, with less labeled data, outperform Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs) in classification across various tasks, this shows promise for develo** GANs capable of trespassing into the domain of semi-supervised regression. However, develo** GANs… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  16. arXiv:2110.06416  [pdf, other

    cs.CV cs.LG

    MMIU: Dataset for Visual Intent Understanding in Multimodal Assistants

    Authors: Alkesh Patel, Joel Ruben Antony Moniz, Roman Nguyen, Nick Tzou, Hadas Kotek, Vincent Renkens

    Abstract: In multimodal assistant, where vision is also one of the input modalities, the identification of user intent becomes a challenging task as visual input can influence the outcome. Current digital assistants take spoken input and try to determine the user intent from conversational or device context. So, a dataset, which includes visual input (i.e. images or videos for the corresponding questions ta… ▽ More

    Submitted 30 October, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Extended abstract accepted for WeCNLP 2021

  17. arXiv:2101.02637  [pdf, other

    cs.CV

    A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset

    Authors: Domenick Poster, Matthew Thielke, Robert Nguyen, Srinivasan Rajaraman, Xing Di, Cedric Nimpa Fondje, Vishal M. Patel, Nathaniel J. Short, Benjamin S. Riggan, Nasser M. Nasrabadi, Shuowen Hu

    Abstract: Thermal face imagery, which captures the naturally emitted heat from the face, is limited in availability compared to face imagery in the visible spectrum. To help address this scarcity of thermal face imagery for research and algorithm development, we present the DEVCOM Army Research Laboratory Visible-Thermal Face Dataset (ARL-VTF). With over 500,000 images from 395 subjects, the ARL-VTF dataset… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

  18. arXiv:1809.05477  [pdf, other

    cs.RO

    Project AutoVision: Localization and 3D Scene Perception for an Autonomous Vehicle with a Multi-Camera System

    Authors: Lionel Heng, Benjamin Choi, Zhaopeng Cui, Marcel Geppert, Sixing Hu, Benson Kuan, Peidong Liu, Rang Nguyen, Ye Chuan Yeo, Andreas Geiger, Gim Hee Lee, Marc Pollefeys, Torsten Sattler

    Abstract: Project AutoVision aims to develop localization and 3D scene perception capabilities for a self-driving vehicle. Such capabilities will enable autonomous navigation in urban and rural environments, in day and night, and with cameras as the only exteroceptive sensors. The sensor suite employs many cameras for both 360-degree coverage and accurate multi-view stereo; the use of low-cost cameras keeps… ▽ More

    Submitted 4 March, 2019; v1 submitted 14 September, 2018; originally announced September 2018.

    Journal ref: 2019 IEEE International Conference on Robotics and Automation (ICRA)