Skip to main content

Showing 1–32 of 32 results for author: Chowdhury, P N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01810  [pdf, other

    cs.CV

    Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval

    Authors: Aneeshan Sain, Pinaki Nath Chowdhury, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: In this paper, we delve into the intricate dynamics of Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) by addressing a critical yet overlooked aspect -- the choice of viewpoint during sketch creation. Unlike photo systems that seamlessly handle diverse views through extensive datasets, sketch systems, with limited data collected from fixed perspectives, face challenges. Our pilot study, employ… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted in European Conference on Computer Vision (ECCV) 2024

  2. arXiv:2405.18716  [pdf, other

    cs.CV

    SketchDeco: Decorating B&W Sketches with Colour

    Authors: Chaitat Utintu, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: This paper introduces a novel approach to sketch colourisation, inspired by the universal childhood activity of colouring and its professional applications in design and story-boarding. Striking a balance between precision and convenience, our method utilises region masks and colour palettes to allow intuitive user control, steering clear of the meticulousness of manual colour assignments or the l… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2403.09480  [pdf, other

    cs.CV cs.AI

    What Sketch Explainability Really Means for Downstream Tasks

    Authors: Hmrishav Bandyopadhyay, Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we explore the unique modality of sketch for explainability, emphasising the profound impact of human strokes compared to conventional pixel-oriented studies. Beyond explanations of network behavior, we discern the genuine implications of explainability across diverse downstream sketch-related tasks. We propose a lightweight and portable explainability solution -- a seamless plugin… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  4. arXiv:2403.09344  [pdf, other

    cs.CV cs.AI

    SketchINR: A First Look into Sketches as Implicit Neural Representations

    Authors: Hmrishav Bandyopadhyay, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Tao Xiang, Timothy Hospedales, Yi-Zhe Song

    Abstract: We propose SketchINR, to advance the representation of vector sketches with implicit neural models. A variable length vector sketch is compressed into a latent space of fixed dimension that implicitly encodes the underlying shape as a function of time and strokes. The learned function predicts the $xy$ point coordinates in a sketch at each time and stroke. Despite its simplicity, SketchINR outperf… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  5. arXiv:2403.07234  [pdf, other

    cs.CV

    It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Deeptanshu Sekhri, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: This paper unravels the potential of sketches for diffusion models, addressing the deceptive promise of direct sketch control in generative AI. We importantly democratise the process, enabling amateur sketches to generate precise images, living up to the commitment of "what you sketch is what you get". A pilot study underscores the necessity, revealing that deformities in existing models stem from… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/StableSketching

  6. arXiv:2403.07222  [pdf, other

    cs.CV

    You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Two primary input modalities prevail in image retrieval: sketch and text. While text is widely used for inter-category retrieval tasks, sketches have been established as the sole preferred modality for fine-grained image retrieval due to their ability to capture intricate visual details. In this paper, we question the reliance on sketches alone for fine-grained image retrieval by simultaneously ex… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/Sketch2Word

  7. arXiv:2403.07214  [pdf, other

    cs.CV

    Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: This paper, for the first time, explores text-to-image diffusion models for Zero-Shot Sketch-based Image Retrieval (ZS-SBIR). We highlight a pivotal discovery: the capacity of text-to-image diffusion models to seamlessly bridge the gap between sketches and photos. This proficiency is underpinned by their robust cross-modal capabilities and shape bias, findings that are substantiated through our pi… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/DiffusionZSSBIR

  8. arXiv:2403.07203  [pdf, other

    cs.CV

    How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we propose a novel abstraction-aware sketch-based image retrieval framework capable of handling sketch abstraction at varied levels. Prior works had mainly focused on tackling sub-factors such as drawing style and order, we instead attempt to model abstraction as a whole, and propose feature-level and retrieval granularity-level designs so that the system builds into its DNA the nec… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024. Project page available at https://subhadeepkoley.github.io/AbstractAway

  9. arXiv:2312.04364  [pdf, other

    cs.CV

    DemoCaricature: Democratising Caricature Generation with a Rough Sketch

    Authors: Dar-Yen Chen, Ayan Kumar Bhunia, Subhadeep Koley, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song

    Abstract: In this paper, we democratise caricature generation, empowering individuals to effortlessly craft personalised caricatures with just a photo and a conceptual sketch. Our objective is to strike a delicate balance between abstraction and identity, while preserving the creativity and subjectivity inherent in a sketch. To achieve this, we present Explicit Rank-1 Model Editing alongside single-image pe… ▽ More

    Submitted 24 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

  10. arXiv:2312.04043  [pdf, other

    cs.CV cs.AI

    Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes

    Authors: Hmrishav Bandyopadhyay, Subhadeep Koley, Ayan Das, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we democratise 3D content creation, enabling precise generation of 3D shapes from abstract sketches while overcoming limitations tied to drawing skills. We introduce a novel part-level modelling and alignment framework that facilitates abstraction modelling and cross-modal correspondence. Leveraging the same part-level decoder, our approach seamlessly extends to sketch modelling by… ▽ More

    Submitted 7 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: CVPR 2024, Project Page: https://hmrishavbandy.github.io/doodle23d/

  11. arXiv:2306.10830  [pdf, other

    cs.CV

    3D VR Sketch Guided 3D Shape Prototy** and Exploration

    Authors: Ling Luo, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song, Yulia Gryaditskaya

    Abstract: 3D shape modeling is labor-intensive, time-consuming, and requires years of expertise. To facilitate 3D shape modeling, we propose a 3D shape generation network that takes a 3D VR sketch as a condition. We assume that sketches are created by novices without art training and aim to reconstruct geometrically realistic 3D shapes of a given category. To handle potential sketch ambiguity, our method cr… ▽ More

    Submitted 10 January, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted by ICCV 2023

  12. arXiv:2303.15149  [pdf, other

    cs.CV

    What Can Human Sketches Do for Object Detection?

    Authors: Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

    Abstract: Sketches are highly expressive, inherently capturing subjective and fine-grained visual cues. The exploration of such innate properties of human sketches has, however, been limited to that of image retrieval. In this paper, for the first time, we cultivate the expressiveness of sketches but for the fundamental vision task of object detection. The end result is a sketch-enabled object detection fra… ▽ More

    Submitted 28 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: Best Paper Finalist (Top 12 Best Papers). Presented in special single-track plenary sessions to all attendees in Computer Vision and Pattern Recognition (CVPR), 2023. Updated an error in Fig.3 (from Softmax to Cross Entropy). Thanks to the community for pointing it out

  13. arXiv:2303.13779  [pdf, other

    cs.CV

    Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR

    Authors: Aneeshan Sain, Ayan Kumar Bhunia, Subhadeep Koley, Pinaki Nath Chowdhury, Soumitri Chattopadhyay, Tao Xiang, Yi-Zhe Song

    Abstract: This paper advances the fine-grained sketch-based image retrieval (FG-SBIR) literature by putting forward a strong baseline that overshoots prior state-of-the-arts by ~11%. This is not via complicated design though, but by addressing two critical issues facing the community (i) the gold standard triplet loss does not enforce holistic latent space geometry, and (ii) there are never enough sketches… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR 2023. Project page available at https://aneeshan95.github.io/Sketch_PVT/

  14. arXiv:2303.13440  [pdf, other

    cs.CV

    CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not

    Authors: Aneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we leverage CLIP for zero-shot sketch based image retrieval (ZS-SBIR). We are largely inspired by recent advances on foundation models and the unparalleled generalisation ability they seem to offer, but for the first time tailor it to benefit the sketch community. We put forward novel designs on how best to achieve this synergy, for both the category setting and the fine-grained set… ▽ More

    Submitted 27 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR 2023. Project page available at https://aneeshan95.github.io/Sketch_LVM/

  15. arXiv:2303.11502  [pdf, other

    cs.CV

    Sketch2Saliency: Learning to Detect Salient Objects from Human Drawings

    Authors: Ayan Kumar Bhunia, Subhadeep Koley, Amandeep Kumar, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Human sketch has already proved its worth in various visual understanding tasks (e.g., retrieval, segmentation, image-captioning, etc). In this paper, we reveal a new trait of sketches - that they are also salient. This is intuitive as sketching is a natural attentive process at its core. More specifically, we aim to study how sketches can be used as a weak label to detect salient objects present… ▽ More

    Submitted 30 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Project page available at https://ayankumarbhunia.github.io/Sketch2Saliency/

  16. arXiv:2303.11162  [pdf, other

    cs.CV

    Picture that Sketch: Photorealistic Image Generation from Abstract Sketches

    Authors: Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Given an abstract, deformed, ordinary sketch from untrained amateurs like you and me, this paper turns it into a photorealistic image - just like those shown in Fig. 1(a), all non-cherry-picked. We differ significantly from prior art in that we do not dictate an edgemap-like sketch to start with, but aim to work with abstract free-hand human sketches. In doing so, we essentially democratise the sk… ▽ More

    Submitted 30 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted in CVPR 2023. Project page available at https://subhadeepkoley.github.io/PictureThatSketch

  17. arXiv:2207.01723  [pdf, other

    cs.CV

    Adaptive Fine-Grained Sketch-Based Image Retrieval

    Authors: Ayan Kumar Bhunia, Aneeshan Sain, Parth Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: The recent focus on Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) has shifted towards generalising a model to new categories without any training data from them. In real-world applications, however, a trained FG-SBIR model is often applied to both new categories and different human sketchers, i.e., different drawing styles. Although this complicates the generalisation problem, fortunately, a… ▽ More

    Submitted 19 August, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted in ECCV 2022. Minor typos and Eq.4 corrected

  18. arXiv:2204.11964  [pdf, other

    cs.CV

    SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text

    Authors: Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Yi-Zhe Song

    Abstract: In this paper, we extend scene understanding to include that of human sketch. The result is a complete trilogy of scene representation from three diverse and complementary modalities -- sketch, photo, and text. Instead of learning a rigid three-way embedding and be done with it, we focus on learning a flexible joint embedding that fully supports the ``optionality" that this complementarity brings.… ▽ More

    Submitted 26 March, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted in Computer Vision and Pattern Recognition (CVPR), 2023

  19. arXiv:2203.14817  [pdf, other

    cs.CV

    Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval

    Authors: Ayan Kumar Bhunia, Subhadeep Koley, Abdullah Faiz Ur Rahman Khilji, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Sketching enables many exciting applications, notably, image retrieval. The fear-to-sketch problem (i.e., "I can't sketch") has however proven to be fatal for its widespread adoption. This paper tackles this "fear" head on, and for the first time, proposes an auxiliary module for existing retrieval models that predominantly lets the users sketch without having to worry. We first conducted a pilot… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2022 Code: https://github.com/AyanKumarBhunia/Stroke_Subset_Selector-for-FGSBIR

  20. arXiv:2203.14804  [pdf, other

    cs.CV

    Partially Does It: Towards Scene-Level FG-SBIR with Partial Input

    Authors: Pinaki Nath Chowdhury, Ayan Kumar Bhunia, Viswanatha Reddy Gajjala, Aneeshan Sain, Tao Xiang, Yi-Zhe Song

    Abstract: We scrutinise an important observation plaguing scene-level sketch research -- that a significant portion of scene sketches are "partial". A quick pilot study reveals: (i) a scene sketch does not necessarily contain all objects in the corresponding photo, due to the subjective holistic interpretation of scenes, (ii) there exists significant empty (white) regions as a result of object-level abstrac… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted in CVPR 2022

  21. arXiv:2203.14691  [pdf, other

    cs.CV

    Sketch3T: Test-Time Training for Zero-Shot SBIR

    Authors: Aneeshan Sain, Ayan Kumar Bhunia, Vaishnav Potlapalli, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song

    Abstract: Zero-shot sketch-based image retrieval typically asks for a trained model to be applied as is to unseen categories. In this paper, we question to argue that this setup by definition is not compatible with the inherent abstract and subjective nature of sketches, i.e., the model might transfer well to new categories, but will not understand sketches existing in different test-time distribution as a… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 10 pages, 5 figures. Accepted in CVPR 2022

  22. arXiv:2203.02113  [pdf, other

    cs.CV

    FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context

    Authors: Pinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, Tao Xiang, Yulia Gryaditskaya, Yi-Zhe Song

    Abstract: We advance sketch research to scenes with the first dataset of freehand scene sketches, FS-COCO. With practical applications in mind, we collect sketches that convey scene content well but can be sketched within a few minutes by a person with any sketching skills. Our dataset comprises 10,000 freehand scene vector sketches with per point space-time information by 100 non-expert individuals, offeri… ▽ More

    Submitted 20 July, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted in ECCV 2022. Project Page: https://fscoco.github.io

  23. arXiv:2108.11636  [pdf, other

    cs.CV

    SketchLattice: Latticed Representation for Sketch Manipulation

    Authors: Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song

    Abstract: The key challenge in designing a sketch representation lies with handling the abstract and iconic nature of sketches. Existing work predominantly utilizes either, (i) a pixelative format that treats sketches as natural images employing off-the-shelf CNN-based networks, or (ii) an elaborately designed vector format that leverages the structural information of drawing orders using sequential RNN-bas… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: accepted to ICCV 2021

  24. arXiv:2107.12090  [pdf, other

    cs.CV

    Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition

    Authors: Ayan Kumar Bhunia, Aneeshan Sain, Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Yi-Zhe Song

    Abstract: Although text recognition has significantly evolved over the years, state-of-the-art (SOTA) models still struggle in the wild scenarios due to complex backgrounds, varying fonts, uncontrolled illuminations, distortions and other artefacts. This is because such models solely depend on visual information for text recognition, thus lacking semantic reasoning capabilities. In this paper, we argue that… ▽ More

    Submitted 26 July, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: IEEE International Conference on Computer Vision (ICCV), 2021

  25. arXiv:2107.12087  [pdf, other

    cs.CV

    Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation

    Authors: Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song

    Abstract: Text recognition remains a fundamental and extensively researched topic in computer vision, largely owing to its wide array of commercial applications. The challenging nature of the very problem however dictated a fragmentation of research efforts: Scene Text Recognition (STR) that deals with text in everyday scenes, and Handwriting Text Recognition (HTR) that tackles hand-written text. In this pa… ▽ More

    Submitted 27 July, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: IEEE International Conference on Computer Vision (ICCV), 2021

  26. arXiv:2107.12081  [pdf, other

    cs.CV

    Towards the Unseen: Iterative Text Recognition by Distilling from Errors

    Authors: Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song

    Abstract: Visual text recognition is undoubtedly one of the most extensively researched topics in computer vision. Great progress have been made to date, with the latest models starting to focus on the more practical "in-the-wild" setting. However, a salient problem still hinders practical deployment -- prior arts mostly struggle with recognising unseen (or rarely seen) character sequences. In this paper, w… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

    Comments: IEEE International Conference on Computer Vision (ICCV), 2021

  27. arXiv:2104.01876  [pdf, other

    cs.CV

    MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition

    Authors: Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song

    Abstract: Handwritten Text Recognition (HTR) remains a challenging problem to date, largely due to the varying writing styles that exist amongst us. Prior works however generally operate with the assumption that there is a limited number of styles, most of which have already been captured by existing datasets. In this paper, we take a completely different perspective -- we work on the assumption that there… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021

  28. arXiv:2103.13990  [pdf, other

    cs.CV

    More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

    Authors: Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song

    Abstract: A fundamental challenge faced by existing Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) models is the data scarcity -- model performances are largely bottlenecked by the lack of sketch-photo pairs. Whilst the number of photos can be easily scaled, each corresponding sketch still needs to be individually produced. In this paper, we aim to mitigate such an upper-bound on sketch data, and study… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021 Code : https://github.com/AyanKumarBhunia/semisupervised-FGSBIR

  29. arXiv:2103.13716  [pdf, other

    cs.CV

    Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting

    Authors: Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song

    Abstract: Self-supervised learning has gained prominence due to its efficacy at learning powerful representations from unlabelled data that achieve excellent performance on many challenging downstream tasks. However supervision-free pre-text tasks are challenging to design and usually modality specific. Although there is a rich literature of self-supervised methods for either spatial (such as images) or tem… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021 Code : https://github.com/AyanKumarBhunia/Self-Supervised-Learning-for-Sketch

  30. arXiv:2007.07075  [pdf, other

    cs.CV

    UDBNET: Unsupervised Document Binarization Network via Adversarial Game

    Authors: Amandeep Kumar, Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

    Abstract: Degraded document image binarization is one of the most challenging tasks in the domain of document image analysis. In this paper, we present a novel approach towards document image binarization by introducing three-player min-max adversarial game. We train the network in an unsupervised setup by assuming that we do not have any paired-training data. In our approach, an Adversarial Texture Augment… ▽ More

    Submitted 27 October, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: Accepted in ICPR 2020

  31. arXiv:2004.08141  [pdf, other

    cs.CV

    Modeling Extent-of-Texture Information for Ground Terrain Recognition

    Authors: Shuvozit Ghose, Pinaki Nath Chowdhury, Partha Pratim Roy, Umapada Pal

    Abstract: Ground Terrain Recognition is a difficult task as the context information varies significantly over the regions of a ground terrain image. In this paper, we propose a novel approach towards ground-terrain recognition via modeling the Extent-of-Texture information to establish a balance between the order-less texture component and ordered-spatial information locally. At first, the proposed method u… ▽ More

    Submitted 27 October, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: Accepted in ICPR 2020

  32. arXiv:1907.00945  [pdf, ps, other

    cs.CV

    ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019

    Authors: Nibal Nayef, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-lin Liu, Jean-Marc Ogier

    Abstract: With the growing cosmopolitan culture of modern cities, the need of robust Multi-Lingual scene Text (MLT) detection and recognition systems has never been more immense. With the goal to systematically benchmark and push the state-of-the-art forward, the proposed competition builds on top of the RRC-MLT-2017 with an additional end-to-end task, an additional language in the real images dataset, a la… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: ICDAR'19 camera-ready version. Competition available at https://rrc.cvc.uab.es/?ch=15. The first two authors contributed equally