Skip to main content

Showing 1–28 of 28 results for author: Bala, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12336  [pdf, other

    cs.CL cs.LG

    A Compass for Navigating the World of Sentence Embeddings for the Telecom Domain

    Authors: Sujoy Roychowdhury, Sumit Soman, H. G. Ranjani, Vansh Chhabra, Neeraj Gunda, Subhadip Bandyopadhyay, Sai Krishna Bala

    Abstract: A plethora of sentence embedding models makes it challenging to choose one, especially for domains such as telecom, rich with specialized vocabulary. We evaluate multiple embeddings obtained from publicly available models and their domain-adapted variants, on both point retrieval accuracies as well as their (95\%) confidence intervals. We establish a systematic method to obtain thresholds for simi… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 3 figures, 4 tables

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2312.06960  [pdf, other

    cs.CV cs.LG

    Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment

    Authors: Utkarsh Mall, Cheng Perng Phoo, Meilin Kelsey Liu, Carl Vondrick, Bharath Hariharan, Kavita Bala

    Abstract: We introduce a method to train vision-language models for remote-sensing images without using any textual annotations. Our key insight is to use co-located internet imagery taken on the ground as an intermediary for connecting remote-sensing images and language. Specifically, we train an image encoder for remote sensing images to align with the image encoder of CLIP using a large amount of paired… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  3. arXiv:2204.07030  [pdf, other

    cs.CV cs.LG

    Activation Regression for Continuous Domain Generalization with Applications to Crop Classification

    Authors: Samar Khanna, Bram Wallace, Kavita Bala, Bharath Hariharan

    Abstract: Geographic variance in satellite imagery impacts the ability of machine learning models to generalise to new regions. In this paper, we model geographic generalisation in medium resolution Landsat-8 satellite imagery as a continuous domain adaptation problem, demonstrating how models generalise better with appropriate domain knowledge. We develop a dataset spatially distributed across the entire c… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

  4. arXiv:2109.09923  [pdf, other

    cs.CV cs.RO

    AutoPhoto: Aesthetic Photo Capture using Reinforcement Learning

    Authors: Hadi AlZayer, Hubert Lin, Kavita Bala

    Abstract: The process of capturing a well-composed photo is difficult and it takes years of experience to master. We propose a novel pipeline for an autonomous agent to automatically capture an aesthetic photograph by navigating within a local region in a scene. Instead of classical optimization over heuristics such as the rule-of-thirds, we adopt a data-driven aesthetics estimator to assess photo quality.… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: Accepted to IROS 2021

  5. arXiv:2108.10967  [pdf, other

    cs.CV

    Field-Guide-Inspired Zero-Shot Learning

    Authors: Utkarsh Mall, Bharath Hariharan, Kavita Bala

    Abstract: Modern recognition systems require large amounts of supervision to achieve accuracy. Adapting to new domains requires significant data from experts, which is onerous and can become too expensive. Zero-shot learning requires an annotated set of attributes for a novel category. Annotating the full set of attributes for a novel category proves to be a tedious and expensive task in deployment. This is… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  6. arXiv:2104.00674  [pdf, other

    cs.CV cs.GR

    PhySG: Inverse Rendering with Spherical Gaussians for Physics-based Material Editing and Relighting

    Authors: Kai Zhang, Fujun Luan, Qianqian Wang, Kavita Bala, Noah Snavely

    Abstract: We present PhySG, an end-to-end inverse rendering pipeline that includes a fully differentiable renderer and can reconstruct geometry, materials, and illumination from scratch from a set of RGB input images. Our framework represents specular BRDFs and environmental illumination using mixtures of spherical Gaussians, and represents geometry as a signed distance function parameterized as a Multi-Lay… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021; Project page: https://kai-46.github.io/PhySG-website/

  7. arXiv:2103.17070  [pdf, other

    cs.CV

    PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in Clustering

    Authors: Jang Hyun Cho, Utkarsh Mall, Kavita Bala, Bharath Hariharan

    Abstract: We present a new framework for semantic segmentation without annotations via clustering. Off-the-shelf clustering methods are limited to curated, single-label, and object-centric images yet real-world data are dominantly uncurated, multi-label, and scene-centric. We extend clustering from images to pixels and assign separate cluster membership to different instances within each image. However, sol… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  8. arXiv:2103.15208  [pdf, other

    cs.CV

    Unified Shape and SVBRDF Recovery using Differentiable Monte Carlo Rendering

    Authors: Fujun Luan, Shuang Zhao, Kavita Bala, Zhao Dong

    Abstract: Reconstructing the shape and appearance of real-world objects using measured 2D images has been a long-standing problem in computer vision. In this paper, we introduce a new analysis-by-synthesis technique capable of producing high-quality reconstructions through robust coarse-to-fine optimization and physics-based differentiable rendering. Unlike most previous methods that handle geometry and r… ▽ More

    Submitted 24 June, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

  9. Materials In Paintings (MIP): An interdisciplinary dataset for perception, art history, and computer vision

    Authors: Mitchell J. P. van Zuijlen, Hubert Lin, Kavita Bala, Sylvia C. Pont, Maarten W. A. Wijntjes

    Abstract: A painter is free to modify how components of a natural scene are depicted, which can lead to a perceptually convincing image of the distal world. This signals a major difference between photos and paintings: paintings are explicitly created for human perception. Studying these painterly depictions could be beneficial to a multidisciplinary audience. In this paper, we capture and explore the paint… ▽ More

    Submitted 10 December, 2020; v1 submitted 5 December, 2020; originally announced December 2020.

  10. arXiv:2012.02897  [pdf, other

    cs.CV

    Discovering Underground Maps from Fashion

    Authors: Utkarsh Mall, Kavita Bala, Tamara Berg, Kristen Grauman

    Abstract: The fashion sense -- meaning the clothing styles people wear -- in a geographical region can reveal information about that region. For example, it can reflect the kind of activities people do there, or the type of crowds that frequently visit the region (e.g., tourist hot spot, student neighborhood, business center). We propose a method to automatically create underground neighborhood maps of citi… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  11. arXiv:2011.14477  [pdf, other

    cs.CV

    What Can Style Transfer and Paintings Do For Model Robustness?

    Authors: Hubert Lin, Mitchell van Zuijlen, Sylvia C. Pont, Maarten W. A. Wijntjes, Kavita Bala

    Abstract: A common strategy for improving model robustness is through data augmentations. Data augmentations encourage models to learn desired invariances, such as invariance to horizontal flip** or small changes in color. Recent work has shown that arbitrary style transfer can be used as a form of data augmentation to encourage invariance to textures by creating painting-like images from photographs. How… ▽ More

    Submitted 27 May, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: CVPR 2021

  12. arXiv:2011.12276  [pdf, other

    cs.CV

    Insights From A Large-Scale Database of Material Depictions In Paintings

    Authors: Hubert Lin, Mitchell Van Zuijlen, Maarten W. A. Wijntjes, Sylvia C. Pont, Kavita Bala

    Abstract: Deep learning has paved the way for strong recognition systems which are often both trained on and applied to natural images. In this paper, we examine the give-and-take relationship between such visual recognition systems and the rich information available in the fine arts. First, we find that visual recognition systems designed for natural images can work surprisingly well on paintings. In parti… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: International Workshop on Fine Art Pattern Extraction and Recognition, ICPR 2020

  13. arXiv:2003.03464  [pdf, other

    cs.RO

    DeepSemanticHPPC: Hypothesis-based Planning over Uncertain Semantic Point Clouds

    Authors: Yutao Han, Hubert Lin, Jacopo Banfi, Kavita Bala, Mark Campbell

    Abstract: Planning in unstructured environments is challenging -- it relies on sensing, perception, scene reconstruction, and reasoning about various uncertainties. We propose DeepSemanticHPPC, a novel uncertainty-aware hypothesis-based planner for unstructured environments. Our algorithmic pipeline consists of: a deep Bayesian neural network which segments surfaces with uncertainty estimates; a flexible po… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: Accepted by the IEEE International Conference on Robotics and Automation (ICRA) 2020. Video Link: https://youtu.be/_SVEZx5vbiQ. The first three authors contributed equally to this work

  14. arXiv:2002.06626  [pdf, other

    cs.CV cs.LG eess.IV

    Block Annotation: Better Image Annotation for Semantic Segmentation with Sub-Image Decomposition

    Authors: Hubert Lin, Paul Upchurch, Kavita Bala

    Abstract: Image datasets with high-quality pixel-level annotations are valuable for semantic segmentation: labelling every pixel in an image ensures that rare classes and small objects are annotated. However, full-image annotations are expensive, with experts spending up to 90 minutes per image. We propose block sub-image annotation as a replacement for full-image annotation. Despite the attention cost of f… ▽ More

    Submitted 16 February, 2020; originally announced February 2020.

    Comments: ICCV 2019; http://www.cs.cornell.edu/~hubert/block_annotation/

  15. arXiv:1908.11412  [pdf, other

    cs.CV

    GeoStyle: Discovering Fashion Trends and Events

    Authors: Utkarsh Mall, Kevin Matzen, Bharath Hariharan, Noah Snavely, Kavita Bala

    Abstract: Understanding fashion styles and trends is of great potential interest to retailers and consumers alike. The photos people upload to social media are a historical and public data source of how people dress across the world and at different times. While we now have tools to automatically recognize the clothing and style attributes of what people are wearing in these photographs, we lack the ability… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

    Comments: Accepted in ICCV 2019

  16. Learning Material-Aware Local Descriptors for 3D Shapes

    Authors: Hubert Lin, Melinos Averkiou, Evangelos Kalogerakis, Balazs Kovacs, Siddhant Ranade, Vladimir G. Kim, Siddhartha Chaudhuri, Kavita Bala

    Abstract: Material understanding is critical for design, geometric modeling, and analysis of functional objects. We enable material-aware 3D shape analysis by employing a projective convolutional neural network architecture to learn material- aware descriptors from view-based representations of 3D points for point-wise material classification or material- aware retrieval. Unfortunately, only a small fractio… ▽ More

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: 3DV 2018

  17. arXiv:1809.10820  [pdf, other

    cs.CV

    Inverse Transport Networks

    Authors: Chengqian Che, Fujun Luan, Shuang Zhao, Kavita Bala, Ioannis Gkioulekas

    Abstract: We introduce inverse transport networks as a learning architecture for inverse rendering problems where, given input image measurements, we seek to infer physical scene parameters such as shape, material, and illumination. During training, these networks are evaluated not only in terms of how close they can predict groundtruth parameters, but also in terms of whether the parameters they produce ca… ▽ More

    Submitted 27 September, 2018; originally announced September 2018.

  18. arXiv:1804.03189  [pdf, other

    cs.GR

    Deep Painterly Harmonization

    Authors: Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala

    Abstract: Copying an element from a photo and pasting it into a painting is a challenging task. Applying photo compositing techniques in this context yields subpar results that look like a collage --- and existing painterly stylization algorithms, which are global, perform poorly when applied locally. We address these issues with a dedicated algorithm that carefully determines the local statistics to be tra… ▽ More

    Submitted 26 June, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

  19. arXiv:1706.01869  [pdf, other

    cs.CV

    StreetStyle: Exploring world-wide clothing styles from millions of photos

    Authors: Kevin Matzen, Kavita Bala, Noah Snavely

    Abstract: Each day billions of photographs are uploaded to photo-sharing services and social media platforms. These images are packed with information about how people live around the world. In this paper we exploit this rich trove of data to understand fashion and style trends worldwide. We present a framework for visual discovery at scale, analyzing clothing and fashion across millions of images of people… ▽ More

    Submitted 6 June, 2017; originally announced June 2017.

  20. arXiv:1705.01156  [pdf, other

    cs.CV cs.GR

    Shading Annotations in the Wild

    Authors: Balazs Kovacs, Sean Bell, Noah Snavely, Kavita Bala

    Abstract: Understanding shading effects in images is critical for a variety of vision and graphics problems, including intrinsic image decomposition, shadow removal, image relighting, and inverse rendering. As is the case with other vision tasks, machine learning is a promising approach to understanding shading - but there is little ground truth shading data available for real-world images. We introduce Sha… ▽ More

    Submitted 2 May, 2017; originally announced May 2017.

    Comments: CVPR 2017

  21. arXiv:1703.07511  [pdf, other

    cs.CV

    Deep Photo Style Transfer

    Authors: Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala

    Abstract: This paper introduces a deep-learning approach to photographic style transfer that handles a large variety of image content while faithfully transferring the reference style. Our approach builds upon the recent work on painterly transfer that separates style from the content of an image by considering different layers of a neural network. However, as is, this approach is not suitable for photoreal… ▽ More

    Submitted 10 April, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

  22. arXiv:1611.05507  [pdf, other

    cs.CV

    Deep Feature Interpolation for Image Content Changes

    Authors: Paul Upchurch, Jacob Gardner, Geoff Pleiss, Robert Pless, Noah Snavely, Kavita Bala, Kilian Weinberger

    Abstract: We propose Deep Feature Interpolation (DFI), a new data-driven baseline for automatic high-resolution image transformation. As the name suggests, it relies only on simple linear interpolation of deep convolutional features from pre-trained convnets. We show that despite its simplicity, DFI can perform high-level semantic transformations like "make older/younger", "make bespectacled", "add smile",… ▽ More

    Submitted 19 June, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

    Comments: First two authors contributed equally. Accepted by CVPR 2017. Code at https://github.com/paulu/deepfeatinterp

  23. arXiv:1603.02003  [pdf, other

    cs.CV

    From A to Z: Supervised Transfer of Style and Content Using Deep Neural Network Generators

    Authors: Paul Upchurch, Noah Snavely, Kavita Bala

    Abstract: We propose a new neural network architecture for solving single-image analogies - the generation of an entire set of stylistically similar images from just a single input image. Solving this problem requires separating image style from content. Our network is a modified variational autoencoder (VAE) that supports supervised training of single-image analogies and in-network evaluation of outputs wi… ▽ More

    Submitted 7 March, 2016; originally announced March 2016.

  24. arXiv:1512.04143  [pdf, other

    cs.CV

    Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

    Authors: Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick

    Abstract: It is well known that contextual and multi-scale representations are important for accurate visual recognition. In this paper we present the Inside-Outside Net (ION), an object detector that exploits information both inside and outside the region of interest. Contextual information outside the region of interest is integrated using spatial recurrent neural networks. Inside, we use skip pooling to… ▽ More

    Submitted 13 December, 2015; originally announced December 2015.

  25. arXiv:1511.06421  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Manifold Traversal: Changing Labels with Convolutional Features

    Authors: Jacob R. Gardner, Paul Upchurch, Matt J. Kusner, Yixuan Li, Kilian Q. Weinberger, Kavita Bala, John E. Hopcroft

    Abstract: Many tasks in computer vision can be cast as a "label changing" problem, where the goal is to make a semantic change to the appearance of an image or some subject in an image in order to alter the class membership. Although successful task-specific methods have been developed for some label changing applications, to date no general purpose method exists. Motivated by this we propose deep manifold… ▽ More

    Submitted 17 March, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

  26. arXiv:1509.07473  [pdf, other

    cs.CV

    Learning Visual Clothing Style with Heterogeneous Dyadic Co-occurrences

    Authors: Andreas Veit, Balazs Kovacs, Sean Bell, Julian McAuley, Kavita Bala, Serge Belongie

    Abstract: With the rapid proliferation of smart mobile devices, users now take millions of photos every day. These include large numbers of clothing and accessory images. We would like to answer questions like `What outfit goes well with this pair of shoes?' To answer these types of questions, one has to go beyond learning visual similarity and learn a visual notion of compatibility across categories. In th… ▽ More

    Submitted 24 September, 2015; originally announced September 2015.

    Comments: ICCV 2015

  27. arXiv:1412.0623  [pdf, other

    cs.CV

    Material Recognition in the Wild with the Materials in Context Database

    Authors: Sean Bell, Paul Upchurch, Noah Snavely, Kavita Bala

    Abstract: Recognizing materials in real-world images is a challenging task. Real-world materials have rich surface texture, geometry, lighting conditions, and clutter, which combine to make the problem particularly difficult. In this paper, we introduce a new, large-scale, open dataset of materials in the wild, the Materials in Context Database (MINC), and combine this dataset with deep learning to achieve… ▽ More

    Submitted 14 April, 2015; v1 submitted 1 December, 2014; originally announced December 2014.

    Comments: CVPR 2015. Sean Bell and Paul Upchurch contributed equally

  28. arXiv:1107.3671  [pdf

    cs.NI

    Impact of Mobility On QoS of Mobile WiMax Network With CBR Application

    Authors: Kranti Bala, Kiran Ahuja

    Abstract: The issue of mobility is important in wireless network because internet connectivity can only be effective if it's available during the movement of node. To enhance mobility, wireless access systems are designed such as IEEE 802.16e to operate on the move without any disruption of services. In this paper we are analyzing the impact of mobility on the QoS parameters (Throughput, Average Jitter and… ▽ More

    Submitted 19 July, 2011; originally announced July 2011.

    Comments: Total 7 Pages, 5 Figures and 1 table

    Journal ref: International Journal of Advancements in Technology , Vol. 2, No. 3, July 2011