
Showing 1–19 of 19 results for author: Kak, A

  1. arXiv:2404.06470

    cs.CV cs.IR cs.LG

    Learning State-Invariant Representations of Objects from Image Collections with State, Pose, and Viewpoint Changes

    Authors: Rohan Sarkar, Avinash Kak

    Abstract: We add one more invariance - state invariance - to the more commonly used other invariances for learning object representations for recognition and retrieval. By state invariance, we mean robust with respect to changes in the structural form of the object, such as when an umbrella is folded, or when an item of clothing is tossed on the floor. Since humans generally have no difficulty in recognizin…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. arXiv:2403.00272

    cs.CV cs.IR cs.LG

    Dual Pose-invariant Embeddings: Learning Category and Object-specific Discriminative Representations for Recognition and Retrieval

    Authors: Rohan Sarkar, Avinash Kak

    Abstract: In the context of pose-invariant object recognition and retrieval, we demonstrate that it is possible to achieve significant improvements in performance if both the category-based and the object-identity-based embeddings are learned simultaneously during training. In hindsight, that sounds intuitive because learning about the categories is more fundamental than learning about the individual object…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

  3. arXiv:2308.01262

    cs.CV eess.IV

    Incorporating Season and Solar Specificity into Renderings made by a NeRF Architecture using Satellite Images

    Authors: Michael Gableman, Avinash Kak

    Abstract: As a result of Shadow NeRF and Sat-NeRF, it is possible to take the solar angle into account in a NeRF-based framework for rendering a scene from a novel viewpoint using satellite images for training. Our work extends those contributions and shows how one can make the renderings season-specific. Our main challenge was creating a Neural Radiance Field (NeRF) that could render seasonal features inde…

    Submitted 15 December, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 18 pages, 17 figures, 10 tables

    ACM Class: I.4.8; I.3.3

  4. arXiv:2305.14301

    eess.IV cs.CV

    A Laplacian Pyramid Based Generative H&E Stain Augmentation Network

    Authors: Fangda Li, Zhiqiang Hu, Wen Chen, Avinash Kak

    Abstract: Hematoxylin and Eosin (H&E) staining is a widely used sample preparation procedure for enhancing the saturation of tissue sections and the contrast between nuclei and cytoplasm in histology images for medical diagnostics. However, various factors, such as the differences in the reagents used, result in high variability in the colors of the stains actually recorded. This variability poses a challen…

    Submitted 14 July, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  5. arXiv:2304.12560

    cs.NI

    HexRAN: A Programmable Approach to Open RAN Base Station System Design

    Authors: Ahan Kak, Van-Quan Pham, Huu-Trung Thieu, Nakjung Choi

    Abstract: In recent years, the radio access network (RAN) domain has witnessed a sea change with increasing levels of virtualization and softwarization driven by emerging paradigms such as the Open RAN (O-RAN) movement. However, the fundamental building block of the cellular network, i.e., the base station, remains unchanged and ill-equipped to handle this architectural evolution. In particular, with refere…

    Submitted 3 July, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

  6. arXiv:2303.06193

    cs.CV

    Adaptive Supervised PatchNCE Loss for Learning H&E-to-IHC Stain Translation with Inconsistent Groundtruth Image Pairs

    Authors: Fangda Li, Zhiqiang Hu, Wen Chen, Avinash Kak

    Abstract: Immunohistochemical (IHC) staining highlights the molecular information critical to diagnostics in tissue samples. However, compared to H&E staining, IHC staining can be much more expensive in terms of both labor and the laboratory equipment required. This motivates recent research that demonstrates that the correlations between the morphological information present in the H&E-stained slides and t…

    Submitted 10 March, 2023; originally announced March 2023.

  7. arXiv:2303.05639

    cs.CV cs.LG

    Self-Supervised One-Shot Learning for Automatic Segmentation of StyleGAN Images

    Authors: Ankit Manerikar, Avinash C. Kak

    Abstract: We propose a framework for the automatic one-shot segmentation of synthetic images generated by a StyleGAN. Our framework is based on the observation that the multi-scale hidden features in the GAN generator hold useful semantic information that can be utilized for automatic on-the-fly segmentation of the generated images. Using these features, our framework learns to segment synthetic images usin…

    Submitted 23 October, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  8. arXiv:2302.12301

    cs.CV

    An Aligned Multi-Temporal Multi-Resolution Satellite Image Dataset for Change Detection Research

    Authors: Rahul Deshmukh, Constantine J. Roros, Amith Kashyap, Avinash C. Kak

    Abstract: This paper presents an aligned multi-temporal and multi-resolution satellite image dataset for research in change detection. We expect our dataset to be useful to researchers who want to fuse information from multiple satellites for detecting changes on the surface of the earth that may not be fully visible in any single satellite. The dataset we present was created by augmenting the SpaceNet-7 da…

    Submitted 27 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 8 pages, 4 figures, 3 tables, satellite image dataset

  9. arXiv:2201.00467

    cs.CV cs.LG

    maskGRU: Tracking Small Objects in the Presence of Large Background Motions

    Authors: Constantine J. Roros, Avinash C. Kak

    Abstract: We propose a recurrent neural network-based spatio-temporal framework named maskGRU for the detection and tracking of small objects in videos. While there have been many developments in the area of object tracking in recent years, tracking a small moving object amid other moving objects and actors (such as a ball amid moving players in sports footage) continues to be a difficult task. Existing spa…

    Submitted 2 January, 2022; originally announced January 2022.

    Comments: 12 pages, 3 figures

  10. arXiv:2112.05335

    cs.CV cs.LG

    Uncertainty, Edge, and Reverse-Attention Guided Generative Adversarial Network for Automatic Building Detection in Remotely Sensed Images

    Authors: Somrita Chattopadhyay, Avinash C. Kak

    Abstract: Despite recent advances in deep-learning based semantic segmentation, automatic building detection from remotely sensed imagery is still a challenging problem owing to large variability in the appearance of buildings across the globe. The errors occur mostly around the boundaries of the building footprints, in shadow areas, and when detecting buildings whose exterior surfaces have reflectivity pro…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: 23 pages

  11. arXiv:2102.10513

    cs.SE cs.AI cs.DC eess.IV

    CheckSoft : A Scalable Event-Driven Software Architecture for Keeping Track of People and Things in People-Centric Spaces

    Authors: Rohan Sarkar, Avinash C. Kak

    Abstract: We present CheckSoft, a scalable event-driven software architecture for keeping track of people-object interactions in people-centric applications such as airport checkpoint security areas, automated retail stores, smart libraries, and so on. The architecture works off the video data generated in real time by a network of surveillance cameras. Although there are many different aspects to automatin…

    Submitted 21 February, 2021; originally announced February 2021.

    Comments: 33 pages, 25 figures, 6 Tables

  12. arXiv:2010.01041

    cs.CV

    Homography Estimation with Convolutional Neural Networks Under Conditions of Variance

    Authors: David Niblick, Avinash Kak

    Abstract: Planar homography estimation is foundational to many computer vision problems, such as Simultaneous Localization and Mapping (SLAM) and Augmented Reality (AR). However, conditions of high variance confound even the state-of-the-art algorithms. In this report, we analyze the performance of two recently published methods using Convolutional Neural Networks (CNNs) that are meant to replace the more t…

    Submitted 22 October, 2020; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: 9 pages, 16 figures, submitted to 2020 Computer Vision and Pattern Recognition Conference

  13. arXiv:2008.10271

    cs.CV cs.DC cs.LG eess.IV

    Semantic Labeling of Large-Area Geographic Regions Using Multi-View and Multi-Date Satellite Images and Noisy OSM Training Labels

    Authors: Bharath Comandur, Avinash C. Kak

    Abstract: We present a novel multi-view training framework and CNN architecture for combining information from multiple overlapping satellite images and noisy training labels derived from OpenStreetMap (OSM) to semantically label buildings and roads across large geographic regions (100 km$^2$). Our approach to multi-view semantic segmentation yields a 4-7% improvement in the per-class IoU scores compared to…

    Submitted 26 June, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: This work has been accepted by the IEEE for publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  14. arXiv:1911.09800

    cs.CV

    A Comparative Evaluation of SGM Variants (including a New Variant, tMGM) for Dense Stereo Matching

    Authors: Sonali Patil, Tanmay Prakash, Bharath Comandur, Avinash Kak

    Abstract: Our goal here is threefold: [1] To present a new dense-stereo matching algorithm, tMGM, that by combining the hierarchical logic of tSGM with the support structure of MGM achieves 6-8% performance improvement over the baseline SGM (these performance numbers are posted under tMGM-16 in the Middlebury Benchmark V3); and [2] Through an exhaustive quantitative and qualitative comparative study, to c…

    Submitted 21 November, 2019; originally announced November 2019.

  15. arXiv:1907.04404

    cs.CV

    A New Stereo Benchmarking Dataset for Satellite Images

    Authors: Sonali Patil, Bharath Comandur, Tanmay Prakash, Avinash C. Kak

    Abstract: In order to facilitate further research in stereo reconstruction with multi-date satellite images, the goal of this paper is to provide a set of stereo-rectified images and the associated groundtruthed disparities for 10 AOIs (Area of Interest) drawn from two sources: 8 AOIs from IARPA's MVS Challenge dataset and 2 AOIs from the CORE3D-Public dataset. The disparities were groundtruthed by first co…

    Submitted 9 July, 2019; originally announced July 2019.

  16. arXiv:1905.00934

    eess.IV cs.CV

    A Splitting-Based Iterative Algorithm for GPU-Accelerated Statistical Dual-Energy X-Ray CT Reconstruction

    Authors: Fangda Li, Ankit Manerikar, Tanmay Prakash, Avinash Kak

    Abstract: When dealing with material classification in baggage at airports, Dual-Energy Computed Tomography (DECT) allows characterization of any given material with coefficients based on two attenuative effects: Compton scattering and photoelectric absorption. However, straightforward projection-domain decomposition methods for this characterization often yield poor reconstructions due to the high dynamic…

    Submitted 2 May, 2019; originally announced May 2019.

  17. arXiv:1811.04772

    cs.CV eess.IV

    Adaptive Target Recognition: A Case Study Involving Airport Baggage Screening

    Authors: Ankit Manerikar, Tanmay Prakash, Avinash C. Kak

    Abstract: This work addresses the question whether it is possible to design a computer-vision based automatic threat recognition (ATR) system so that it can adapt to changing specifications of a threat without having to create a new ATR each time. The changes in threat specifications, which may be warranted by intelligence reports and world events, are typically regarding the physical characteristics of wha…

    Submitted 30 November, 2018; v1 submitted 12 November, 2018; originally announced November 2018.

  18. arXiv:1709.00488

    cs.RO

    RMPD - A Recursive Mid-Point Displacement Algorithm for Path Planning

    Authors: Fangda Li, Ankit V. Manerikar, Avinash C. Kak

    Abstract: Motivated by what is required for real-time path planning, the paper starts out by presenting sRMPD, a new recursive "local" planner founded on the key notion that, unless made necessary by an obstacle, there must be no deviation from the shortest path between any two points, which would normally be a straight line path in the configuration space. Subsequently, we increase the power of sRMPD by us…

    Submitted 25 February, 2018; v1 submitted 1 September, 2017; originally announced September 2017.

  19. arXiv:1304.1513

    cs.AI

    Hierarchical Evidence Accumulation in the Pseiki System and Experiments in Model-Driven Mobile Robot Navigation

    Authors: A. C. Kak, K. M. Andress, C. Lopez-Abadia, M. S. Carroll, J. R. Lewis

    Abstract: In this paper, we will review the process of evidence accumulation in the PSEIKI system for expectation-driven interpretation of images of 3-D scenes. Expectations are presented to PSEIKI as a geometrical hierarchy of abstractions. PSEIKI's job is then to construct abstraction hierarchies in the perceived image taking cues from the abstraction hierarchies in the expectations. The Dempster-Shafe…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Fifth Conference on Uncertainty in Artificial Intelligence (UAI1989)

    Report number: UAI-P-1989-PG-194-207