Skip to main content

Showing 1–50 of 99 results for author: Brown, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.06672  [pdf, other

    cs.SE cs.CY

    Biomedical Open Source Software: Crucial Packages and Hidden Heroes

    Authors: Andrew Nesbitt, Boris Veytsman, Daniel Mietchen, Eva Maxfield Brown, James Howison, João Felipe Pimentel, Laurent Hèbert-Dufresne, Stephan Druskat

    Abstract: Despite the importance of scientific software for research, it is often not formally recognized and rewarded. This is especially true for foundation libraries, which are used by the software packages visible to the users, being ``hidden'' themselves. The funders and other organizations need to understand the complex network of computer programs that the modern research relies upon. In this work… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  2. arXiv:2403.19831  [pdf, other

    cs.GT

    TASR: A Novel Trust-Aware Stackelberg Routing Algorithm to Mitigate Traffic Congestion

    Authors: Doris E. M. Brown, Venkata Sriram Siddhardh Nadendla, Sajal K. Das

    Abstract: Stackelberg routing platforms (SRP) reduce congestion in one-shot traffic networks by proposing optimal route recommendations to selfish travelers. Traditionally, Stackelberg routing is cast as a partial control problem where a fraction of traveler flow complies with route recommendations, while the remaining respond as selfish travelers. In this paper, a novel Stackelberg routing framework is for… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  3. arXiv:2402.04446  [pdf, other

    eess.IV cs.CV cs.LG

    Pushing the limits of cell segmentation models for imaging mass cytometry

    Authors: Kimberley M. Bird, Xujiong Ye, Alan M. Race, James M. Brown

    Abstract: Imaging mass cytometry (IMC) is a relatively new technique for imaging biological tissue at subcellular resolution. In recent years, learning-based segmentation methods have enabled precise quantification of cell type and morphology, but typically rely on large datasets with fully annotated ground truth (GT) labels. This paper explores the effects of imperfect labels on learning-based segmentation… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: International Symposium on Biomedical Imaging (ISBI) 2024 Submission

    ACM Class: I.2; I.4; I.4.6

  4. arXiv:2312.13274  [pdf, other

    cs.SE cs.CR cs.PL

    A Broad Comparative Evaluation of Software Debloating Tools

    Authors: Michael D. Brown, Adam Meily, Brian Fairservice, Akshay Sood, Jonathan Dorn, Eric Kilmer, Ronald Eytchison

    Abstract: Software debloating tools seek to improve program security and performance by removing unnecessary code, called bloat. While many techniques have been proposed, several barriers to their adoption have emerged. Namely, debloating tools are highly specialized, making it difficult for adopters to find the right type of tool for their needs. This is further hindered by a lack of established metrics an… ▽ More

    Submitted 12 June, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 17 pages, 8 tables

  5. arXiv:2310.04550  [pdf, other

    cs.CV cs.CL cs.LG

    Module-wise Adaptive Distillation for Multimodality Foundation Models

    Authors: Chen Liang, Jiahui Yu, Ming-Hsuan Yang, Matthew Brown, Yin Cui, Tuo Zhao, Boqing Gong, Tianyi Zhou

    Abstract: Pre-trained multimodal foundation models have demonstrated remarkable generalizability but pose challenges for deployment due to their large sizes. One effective approach to reducing their sizes is layerwise distillation, wherein small student models are trained to match the hidden representations of large teacher models at each layer. Motivated by our observation that certain architecture compone… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  6. arXiv:2309.10187  [pdf, other

    cs.HC

    Automated Interviewer or Augmented Survey? Collecting Social Data with Large Language Models

    Authors: Alejandro Cuevas Villalba, Eva M. Brown, Jennifer V. Scurrell, Jason Entenmann, Madeleine I. G. Daepp

    Abstract: Qualitative methods like interviews produce richer data in comparison with quantitative surveys, but are difficult to scale. Switching from web-based questionnaires to interactive chatbots offers a compromise, improving user engagement and response quality. Uptake remains limited, however, because of differences in users' expectations versus the capabilities of natural language processing methods.… ▽ More

    Submitted 10 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  7. arXiv:2309.04542  [pdf, other

    cs.CV

    Examining Autoexposure for Challenging Scenes

    Authors: SaiKiran Tedla, Beixuan Yang, Michael S. Brown

    Abstract: Autoexposure (AE) is a critical step applied by camera systems to ensure properly exposed images. While current AE algorithms are effective in well-lit environments with constant illumination, these algorithms still struggle in environments with bright light sources or scenes with abrupt changes in lighting. A significant hurdle in develo** new AE algorithms for challenging environments, especia… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  8. arXiv:2306.11920  [pdf, other

    cs.CV

    NILUT: Conditional Neural Implicit 3D Lookup Tables for Image Enhancement

    Authors: Marcos V. Conde, Javier Vazquez-Corral, Michael S. Brown, Radu Timofte

    Abstract: 3D lookup tables (3D LUTs) are a key component for image enhancement. Modern image signal processors (ISPs) have dedicated support for these as part of the camera rendering pipeline. Cameras typically provide multiple options for picture styles, where each style is usually obtained by applying a unique handcrafted 3D LUT. Current approaches for learning and applying 3D LUTs are notably fast, yet n… ▽ More

    Submitted 24 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: AAAI 2024 - The 38th Annual AAAI Conference on Artificial Intelligence

  9. arXiv:2305.14494  [pdf, other

    cs.SI

    Unsupervised Image Classification by Ideological Affiliation from User-Content Interaction Patterns

    Authors: Xinyi Liu, **ning Li, Dachun Sun, Ruijie Wang, Tarek Abdelzaher, Matt Brown, Anthony Barricelli, Matthias Kirchner, Arslan Basharat

    Abstract: The proliferation of political memes in modern information campaigns calls for efficient solutions for image classification by ideological affiliation. While significant advances have recently been made on text classification in modern natural language processing literature, understanding the political insinuation in imagery is less developed due to the hard nature of the problem. Unlike text, whe… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: n Proc. PhoMemes (in conjunction with ICWSM), Limassol, Cyprus, June 2023

  10. arXiv:2305.03625  [pdf, ps, other

    cs.SD eess.AS physics.app-ph

    Physics-Based Acoustic Holograms

    Authors: Antonio Stanziola, Ben T. Cox, Bradley E. Treeby, Michael D. Brown

    Abstract: Advances in additive manufacturing have enabled the realisation of inexpensive, scalable, diffractive acoustic lenses that can be used to generate complex acoustic fields via phase and/or amplitude modulation. However, the design of these holograms relies on a thin-element approximation adapted from optics which can severely limit the fidelity of the realised acoustic field. Here, we introduce phy… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  11. arXiv:2304.11743  [pdf, other

    cs.CV

    GamutMLP: A Lightweight MLP for Color Loss Recovery

    Authors: Hoang M. Le, Brian Price, Scott Cohen, Michael S. Brown

    Abstract: Cameras and image-editing software often process images in the wide-gamut ProPhoto color space, encompassing 90% of all visible colors. However, when images are encoded for sharing, this color-rich representation is transformed and clipped to fit within the small-gamut standard RGB (sRGB) color space, representing only 30% of visible colors. Recovering the lost color information is challenging due… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  12. arXiv:2302.14177  [pdf, other

    cs.DL cs.SE

    Soft-Search: Two Datasets to Study the Identification and Production of Research Software

    Authors: Eva Maxfield Brown, Lindsey Schwartz, Richard Lewei Huang, Nicholas Weber

    Abstract: Software is an important tool for scholarly work, but software produced for research is in many cases not easily identifiable or discoverable. A potential first step in linking research and software is software identification. In this paper we present two datasets to study the identification and production of research software. The first dataset contains almost 1000 human labeled annotations of so… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  13. arXiv:2212.00951  [pdf, other

    cs.AI

    SimpleMind adds thinking to deep neural networks

    Authors: Youngwon Choi, M. Wasil Wahi-Anwar, Matthew S. Brown

    Abstract: Deep neural networks (DNNs) detect patterns in data and have shown versatility and strong performance in many computer vision applications. However, DNNs alone are susceptible to obvious mistakes that violate simple, common sense concepts and are limited in their ability to use explicit knowledge to guide their search and decision making. While overall DNN performance metrics may be good, these ob… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  14. arXiv:2211.08772  [pdf, other

    cs.CV

    MIMT: Multi-Illuminant Color Constancy via Multi-Task Local Surface and Light Color Learning

    Authors: Shuwei Li, Jikai Wang, Michael S. Brown, Robby T. Tan

    Abstract: The assumption of a uniform light color distribution is no longer applicable in scenes that have multiple light colors. Most color constancy methods are designed to deal with a single light color, and thus are erroneous when applied to multiple light colors. The spatial variability in multiple light colors causes the color constancy problem to be more challenging and requires the extraction of loc… ▽ More

    Submitted 22 August, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: 8 pages, 6 figures

  15. arXiv:2211.04656  [pdf, other

    cs.CV

    MEVID: Multi-view Extended Videos with Identities for Video Person Re-Identification

    Authors: Daniel Davila, Dawei Du, Bryon Lewis, Christopher Funk, Joseph Van Pelt, Roderick Collins, Kellie Corona, Matt Brown, Scott McCloskey, Anthony Hoogs, Brian Clipp

    Abstract: In this paper, we present the Multi-view Extended Videos with Identities (MEVID) dataset for large-scale, video person re-identification (ReID) in the wild. To our knowledge, MEVID represents the most-varied video person ReID dataset, spanning an extensive indoor and outdoor environment across nine unique dates in a 73-day window, various camera viewpoints, and entity clothing changes. Specificall… ▽ More

    Submitted 10 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: This paper was accepted to WACV 2023

  16. arXiv:2209.06375  [pdf, other

    cs.CV astro-ph.IM

    Self-Supervised Clustering on Image-Subtracted Data with Deep-Embedded Self-Organizing Map

    Authors: Y. -L. Mong, K. Ackley, T. L. Killestein, D. K. Galloway, M. Dyer, R. Cutter, M. J. I. Brown, J. Lyman, K. Ulaczyk, D. Steeghs, V. Dhillon, P. O'Brien, G. Ramsay, K. Noysena, R. Kotak, R. Breton, L. Nuttall, E. Palle, D. Pollacco, E. Thrane, S. Awiphan, U. Burhanudin, P. Chote, A. Chrimes, E. Daw , et al. (23 additional authors not shown)

    Abstract: Develo** an effective automatic classifier to separate genuine sources from artifacts is essential for transient follow-ups in wide-field optical surveys. The identification of transient detections from the subtraction artifacts after the image differencing process is a key step in such classifiers, known as real-bogus classification problem. We apply a self-supervised machine learning model, th… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

  17. arXiv:2206.04271  [pdf, other

    cs.CV

    DeepVerge: Classification of Roadside Verge Biodiversity and Conservation Potential

    Authors: Andrew Perrett, Charlie Barnes, Mark Schofield, Lan Qie, Petra Bosilj, James M. Brown

    Abstract: Open space grassland is being increasingly farmed or built upon, leading to a ram** up of conservation efforts targeting roadside verges. Approximately half of all UK grassland species can be found along the country's 500,000 km of roads, with some 91 species either threatened or near threatened. Careful management of these "wildlife corridors" is therefore essential to preventing species extinc… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    ACM Class: I.4

  18. arXiv:2206.02715  [pdf, other

    cs.CV eess.IV

    Day-to-Night Image Synthesis for Training Nighttime Neural ISPs

    Authors: Abhijith Punnappurath, Abdullah Abuolaim, Abdelrahman Abdelhamed, Alex Levinshtein, Michael S. Brown

    Abstract: Many flagship smartphone cameras now use a dedicated neural image signal processor (ISP) to render noisy raw sensor images to the final processed output. Training nightmode ISP networks relies on large-scale datasets of image pairs with: (1) a noisy raw image captured with a short exposure and a high ISO gain; and (2) a ground truth low-noise raw image captured with a long exposure and low ISO tha… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  19. arXiv:2206.01813  [pdf, other

    cs.CV eess.IV

    Learning sRGB-to-Raw-RGB De-rendering with Content-Aware Metadata

    Authors: Seonghyeon Nam, Abhijith Punnappurath, Marcus A. Brubaker, Michael S. Brown

    Abstract: Most camera images are rendered and saved in the standard RGB (sRGB) format by the camera's hardware. Due to the in-camera photo-finishing routines, nonlinear sRGB images are undesirable for computer vision tasks that assume a direct relationship between pixel values and scene radiance. For such applications, linear raw-RGB sensor images are preferred. Saving images in their raw-RGB format is stil… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: CVPR 2022 (GitHub: https://github.com/SamsungLabs/content-aware-metadata)

  20. arXiv:2206.01103  [pdf, other

    eess.IV cs.CV

    Noise2NoiseFlow: Realistic Camera Noise Modeling without Clean Images

    Authors: Ali Maleky, Shayan Kousha, Michael S. Brown, Marcus A. Brubaker

    Abstract: Image noise modeling is a long-standing problem with many applications in computer vision. Early attempts that propose simple models, such as signal-independent additive white Gaussian noise or the heteroscedastic Gaussian noise model (a.k.a., camera noise level function) are not sufficient to learn the complex behavior of the camera sensor noise. Recently, more complex learning-based models have… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: CVPR 2022

  21. arXiv:2206.00812  [pdf, other

    cs.CV eess.IV

    Modeling sRGB Camera Noise with Normalizing Flows

    Authors: Shayan Kousha, Ali Maleky, Michael S. Brown, Marcus A. Brubaker

    Abstract: Noise modeling and reduction are fundamental tasks in low-level computer vision. They are particularly important for smartphone cameras relying on small sensors that exhibit visually noticeable noise. There has recently been renewed interest in using data-driven approaches to improve camera noise models via neural networks. These data-driven approaches target noise present in the raw-sensor image… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: CVPR 2022

  22. arXiv:2206.00614  [pdf, other

    cs.CV

    Dual-stream spatiotemporal networks with feature sharing for monitoring animals in the home cage

    Authors: Ezechukwu I. Nwokedi, Rasneer S. Bains, Luc Bidaut, Xujiong Ye, Sara Wells, James M. Brown

    Abstract: This paper presents a spatiotemporal deep learning approach for mouse behavioural classification in the home-cage. Using a series of dual-stream architectures with assorted modifications to increase performance, we introduce a novel feature sharing approach that jointly processes the streams at regular intervals throughout the network. To investigate the efficacy of this approach, models were eval… ▽ More

    Submitted 3 November, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

  23. The Forgotten Margins of AI Ethics

    Authors: Abeba Birhane, Elayne Ruane, Thomas Laurent, Matthew S. Brown, Johnathan Flowers, Anthony Ventresque, Christopher L. Dancy

    Abstract: How has recent AI Ethics literature addressed topics such as fairness and justice in the context of continued social and structural power asymmetries? We trace both the historical roots and current landmark work that have been sha** the field and categorize these works under three broad umbrellas: (i) those grounded in Western canonical philosophy, (ii) mathematical and statistical methods, and… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: To appear in the FAccT 2022 proceedings

  24. arXiv:2204.09110  [pdf

    cs.DL

    Councils in Action: Automating the Curation of Municipal Governance Data for Research

    Authors: Eva Maxfield Brown, Nicholas Weber

    Abstract: Large scale comparative research into municipal governance is often prohibitively difficult due to a lack of high-quality data. But, recent advances in speech-to-text algorithms and natural language processing has made it possible to more easily collect and analyze data about municipal governments. In this paper, we introduce an open-source platform, the Council Data Project (CDP), to curate novel… ▽ More

    Submitted 31 August, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Keywords: public interest technology; municipal governance; data curation; computational data access; natural language processing To Be Published with 2022 ASIS&T Annual Meeting (https://www.asist.org/am22/)

  25. A Broad Comparative Evaluation of x86-64 Binary Rewriters

    Authors: Eric Schulte, Michael D. Brown, Vlad Folts

    Abstract: Binary rewriting is a rapidly-maturing technique for modifying software for instrumentation, customization, optimization, and hardening without access to source code. Unfortunately, the practical applications of binary rewriting tools are often unclear to users because their limitations are glossed over in the literature. This, among other challenges, has prohibited the widespread adoption of thes… ▽ More

    Submitted 7 September, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: 16 pages, 14 tables, 5 figures

    Journal ref: In Cyber Security Experimentation and Test Workshop (CSET 2022), August 8, 2022, Virtual, CA, USA. ACM, New York, NY, USA

  26. arXiv:2201.10366  [pdf, other

    cs.CV

    ADAPT: An Open-Source sUAS Payload for Real-Time Disaster Prediction and Response with AI

    Authors: Daniel Davila, Joseph VanPelt, Alexander Lynch, Adam Romlein, Peter Webley, Matthew S. Brown

    Abstract: Small unmanned aircraft systems (sUAS) are becoming prominent components of many humanitarian assistance and disaster response (HADR) operations. Pairing sUAS with onboard artificial intelligence (AI) substantially extends their utility in covering larger areas with fewer support personnel. A variety of missions, such as search and rescue, assessing structural damage, and monitoring forest fires,… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: To be published in Workshop on Practical Deep Learning in the Wild at AAAI Conference on Artificial Intelligence 2022, 9 pages, 5 figures

  27. arXiv:2201.07711  [pdf, other

    cs.CR cs.HC cs.LG cs.OS

    Enhancing the Security & Privacy of Wearable Brain-Computer Interfaces

    Authors: Zahra Tarkhani, Lorena Qendro, Malachy O'Connor Brown, Oscar Hill, Cecilia Mascolo, Anil Madhavapeddy

    Abstract: Brain computing interfaces (BCI) are used in a plethora of safety/privacy-critical applications, ranging from healthcare to smart communication and control. Wearable BCI setups typically involve a head-mounted sensor connected to a mobile device, combined with ML-based data processing. Consequently, they are susceptible to a multiplicity of attacks across the hardware, software, and networking sta… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

  28. arXiv:2112.07074  [pdf, other

    cs.CV cs.LG

    Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text

    Authors: Qing Li, Boqing Gong, Yin Cui, Dan Kondratyuk, Xianzhi Du, Ming-Hsuan Yang, Matthew Brown

    Abstract: In this paper, we explore the possibility of building a unified foundation model that can be adapted to both vision-only and text-only tasks. Starting from BERT and ViT, we design a unified transformer consisting of modality-specific tokenizers, a shared transformer encoder, and task-specific output heads. To efficiently pre-train the proposed model jointly on unpaired images and text, we propose… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: preliminary work

  29. arXiv:2112.04480  [pdf, other

    cs.CV cs.LG

    Exploring Temporal Granularity in Self-Supervised Video Representation Learning

    Authors: Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui

    Abstract: This work presents a self-supervised learning framework named TeG to explore Temporal Granularity in learning video representations. In TeG, we sample a long clip from a video and a short clip that lies inside the long clip. We then extract their dense temporal embeddings. The training objective consists of two parts: a fine-grained temporal learning objective to maximize the similarity between co… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  30. arXiv:2112.01723  [pdf, other

    cs.CV eess.IV

    Adversarial Attacks against a Satellite-borne Multispectral Cloud Detector

    Authors: Andrew Du, Yee Wei Law, Michele Sasdelli, Bo Chen, Ken Clarke, Michael Brown, Tat-Jun Chin

    Abstract: Data collected by Earth-observing (EO) satellites are often afflicted by cloud cover. Detecting the presence of clouds -- which is increasingly done using deep learning -- is crucial preprocessing in EO applications. In fact, advanced EO satellites perform deep learning-based cloud detection on board the satellites and downlink only clear-sky data to save precious bandwidth. In this paper, we high… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

  31. arXiv:2111.07837  [pdf, other

    cs.CV

    Multi-View Motion Synthesis via Applying Rotated Dual-Pixel Blur Kernels

    Authors: Abdullah Abuolaim, Mahmoud Afifi, Michael S. Brown

    Abstract: Portrait mode is widely available on smartphone cameras to provide an enhanced photographic experience. One of the primary effects applied to images captured in portrait mode is a synthetic shallow depth of field (DoF). The synthetic DoF (or bokeh effect) selectively blurs regions in the image to emulate the effect of using a large lens with a wide aperture. In addition, many applications now inco… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

  32. arXiv:2109.13845  [pdf

    cs.CV

    Not Color Blind: AI Predicts Racial Identity from Black and White Retinal Vessel Segmentations

    Authors: Aaron S. Coyner, Praveer Singh, James M. Brown, Susan Ostmo, R. V. Paul Chan, Michael F. Chiang, Jayashree Kalpathy-Cramer, J. Peter Campbell

    Abstract: Background: Artificial intelligence (AI) may demonstrate racial bias when skin or choroidal pigmentation is present in medical images. Recent studies have shown that convolutional neural networks (CNNs) can predict race from images that were not previously thought to contain race-specific features. We evaluate whether grayscale retinal vessel maps (RVMs) of patients screened for retinopathy of pre… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 31 pages, 6 figures

  33. arXiv:2109.08750  [pdf, other

    cs.CV

    Auto White-Balance Correction for Mixed-Illuminant Scenes

    Authors: Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown

    Abstract: Auto white balance (AWB) is applied by camera hardware at capture time to remove the color cast caused by the scene illumination. The vast majority of white-balance algorithms assume a single light source illuminates the scene; however, real scenes often have mixed lighting conditions. This paper presents an effective AWB method to deal with such mixed-illuminant scenes. A unique departure from co… ▽ More

    Submitted 7 October, 2021; v1 submitted 17 September, 2021; originally announced September 2021.

    Journal ref: WACV 2021

  34. arXiv:2108.05251  [pdf, other

    cs.CV

    Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning

    Authors: Abdullah Abuolaim, Mahmoud Afifi, Michael S. Brown

    Abstract: Many camera sensors use a dual-pixel (DP) design that operates as a rudimentary light field providing two sub-aperture views of a scene in a single capture. The DP sensor was developed to improve how cameras perform autofocus. Since the DP sensor's introduction, researchers have found additional uses for the DP data, such as depth estimation, reflection removal, and defocus deblurring. We are inte… ▽ More

    Submitted 9 February, 2022; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: Published in the Winter Conference on Applications of Computer Vision 2022 (WACV'22)

  35. arXiv:2108.01199  [pdf, other

    cs.CV

    Neural Image Representations for Multi-Image Fusion and Layer Separation

    Authors: Seonghyeon Nam, Marcus A. Brubaker, Michael S. Brown

    Abstract: We propose a framework for aligning and fusing multiple images into a single view using neural image representations (NIRs), also known as implicit or coordinate-based neural representations. Our framework targets burst images that exhibit camera ego motion and potential changes in the scene. We describe different strategies for alignment depending on the nature of the scene motion -- namely, pers… ▽ More

    Submitted 21 July, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Project page: https://shnnam.github.io/research/nir

  36. arXiv:2107.04622  [pdf, other

    cs.CV

    Cumulative Assessment for Urban 3D Modeling

    Authors: Shea Hagstrom, Hee Won Pak, Stephanie Ku, Sean Wang, Gregory Hager, Myron Brown

    Abstract: Urban 3D modeling from satellite images requires accurate semantic segmentation to delineate urban features, multiple view stereo for 3D reconstruction of surface heights, and 3D model fitting to produce compact models with accurate surface slopes. In this work, we present a cumulative assessment metric that succinctly captures error contributions from each of these components. We demonstrate our… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: Published in IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2021

  37. arXiv:2106.13920  [pdf, other

    cs.CV

    CAMS: Color-Aware Multi-Style Transfer

    Authors: Mahmoud Afifi, Abdullah Abuolaim, Mostafa Hussien, Marcus A. Brubaker, Michael S. Brown

    Abstract: Image style transfer aims to manipulate the appearance of a source image, or "content" image, to share similar texture and colors of a target "style" image. Ideally, the style transfer manipulation should also preserve the semantic content of the source image. A commonly used approach to assist in transferring styles is based on Gram matrix optimization. One problem of Gram matrix-based optimizati… ▽ More

    Submitted 4 September, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  38. arXiv:2106.00598  [pdf, other

    cs.CV

    Unsupervised detection of mouse behavioural anomalies using two-stream convolutional autoencoders

    Authors: Ezechukwu I Nwokedi, Rasneer S Bains, Luc Bidaut, Sara Wells, Xujiong Ye, James M Brown

    Abstract: This paper explores the application of unsupervised learning to detecting anomalies in mouse video data. The two models presented in this paper are a dual-stream, 3D convolutional autoencoder (with residual connections) and a dual-stream, 2D convolutional autoencoder. The publicly available dataset used here contains twelve videos of single home-caged mice alongside frame-level annotations. Under… ▽ More

    Submitted 28 May, 2021; originally announced June 2021.

  39. arXiv:2105.08229  [pdf, other

    cs.CV

    Single View Geocentric Pose in the Wild

    Authors: Gordon Christie, Kevin Foster, Shea Hagstrom, Gregory D. Hager, Myron Z. Brown

    Abstract: Current methods for Earth observation tasks such as semantic map**, map alignment, and change detection rely on near-nadir images; however, often the first available images in response to dynamic world events such as natural disasters are oblique. These tasks are much more difficult for oblique images due to observed object parallax. There has been recent success in learning to regress geocentri… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: To be published in the proceedings of the CVPR 2021 EarthVision Workshop

  40. arXiv:2104.12727  [pdf, other

    cs.CV

    2.5D Visual Relationship Detection

    Authors: Yu-Chuan Su, Soravit Changpinyo, Xiangning Chen, Sathish Thoppay, Cho-Jui Hsieh, Lior Shapira, Radu Soricut, Hartwig Adam, Matthew Brown, Ming-Hsuan Yang, Boqing Gong

    Abstract: Visual 2.5D perception involves understanding the semantics and geometry of a scene through reasoning about object relationships with respect to the viewer in an environment. However, existing works in visual recognition primarily focus on the semantics. To bridge this gap, we study 2.5D visual relationship detection (2.5VRD), in which the goal is to jointly detect objects and predict their relati… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

  41. arXiv:2104.08418  [pdf, other

    cs.CV

    FiG-NeRF: Figure-Ground Neural Radiance Fields for 3D Object Category Modelling

    Authors: Christopher Xie, Keunhong Park, Ricardo Martin-Brualla, Matthew Brown

    Abstract: We investigate the use of Neural Radiance Fields (NeRF) to learn high quality 3D object category models from collections of input images. In contrast to previous work, we are able to do this whilst simultaneously separating foreground objects from their varying backgrounds. We achieve this via a 2-component NeRF model, FiG-NeRF, that prefers explanation of the scene as a geometrically constant bac… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  42. arXiv:2103.11511  [pdf, other

    cs.CV cs.AI cs.LG

    MoViNets: Mobile Video Networks for Efficient Video Recognition

    Authors: Dan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong

    Abstract: We present Mobile Video Networks (MoViNets), a family of computation and memory efficient video networks that can operate on streaming video for online inference. 3D convolutional neural networks (CNNs) are accurate at video recognition but require large computation and memory budgets and do not support online inference, making them difficult to work on mobile devices. We propose a three-step appr… ▽ More

    Submitted 18 April, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  43. arXiv:2102.10349  [pdf, other

    cs.CY cs.LG

    Everything is Relative: Understanding Fairness with Optimal Transport

    Authors: Kweku Kwegyir-Aggrey, Rebecca Santorella, Sarah M. Brown

    Abstract: To study discrimination in automated decision-making systems, scholars have proposed several definitions of fairness, each expressing a different fair ideal. These definitions require practitioners to make complex decisions regarding which notion to employ and are often difficult to use in practice since they make a binary judgement a system is fair or unfair instead of explaining the structure of… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

  44. arXiv:2102.09000  [pdf, other

    cs.CV eess.IV

    Mobile Computational Photography: A Tour

    Authors: Mauricio Delbracio, Damien Kelly, Michael S. Brown, Peyman Milanfar

    Abstract: The first mobile camera phone was sold only 20 years ago, when taking pictures with one's phone was an oddity, and sharing pictures online was unheard of. Today, the smartphone is more camera than phone. How did this happen? This transformation was enabled by advances in computational photography -the science and engineering of making great images from small form factor, mobile cameras. Modern alg… ▽ More

    Submitted 10 March, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

  45. arXiv:2012.03255  [pdf, other

    eess.IV cs.CV

    Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data

    Authors: Abdullah Abuolaim, Mauricio Delbracio, Damien Kelly, Michael S. Brown, Peyman Milanfar

    Abstract: Recent work has shown impressive results on data-driven defocus deblurring using the two-image views available on modern dual-pixel (DP) sensors. One significant challenge in this line of research is access to DP data. Despite many cameras having DP sensors, only a limited number provide access to the low-level DP sensor images. In addition, capturing training data for defocus deblurring involves… ▽ More

    Submitted 17 August, 2021; v1 submitted 6 December, 2020; originally announced December 2020.

  46. arXiv:2011.11731  [pdf, other

    cs.CV

    HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms

    Authors: Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown

    Abstract: While generative adversarial networks (GANs) can successfully produce high-quality images, they can be challenging to control. Simplifying GAN-based image generation is critical for their adoption in graphic design and artistic work. This goal has led to significant interest in methods that can intuitively control the appearance of images generated by GANs. In this paper, we present HistoGAN, a co… ▽ More

    Submitted 26 March, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: CVPR 2021

  47. arXiv:2011.02580  [pdf, ps, other

    eess.IV cs.CV

    DeepReg: a deep learning toolkit for medical image registration

    Authors: Yunguan Fu, Nina Montaña Brown, Shaheer U. Saeed, Adrià Casamitjana, Zachary M. C. Baum, Rémi Delaunay, Qianye Yang, Alexander Grimwood, Zhe Min, Stefano B. Blumberg, Juan Eugenio Iglesias, Dean C. Barratt, Ester Bonmati, Daniel C. Alexander, Matthew J. Clarkson, Tom Vercauteren, Yipeng Hu

    Abstract: DeepReg (https://github.com/DeepRegNet/DeepReg) is a community-supported open-source toolkit for research and education in medical image registration using deep learning.

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted in The Journal of Open Source Software (JOSS)

  48. arXiv:2009.12798  [pdf, other

    cs.CV eess.IV

    AIM 2020: Scene Relighting and Illumination Estimation Challenge

    Authors: Majed El Helou, Ruofan Zhou, Sabine Süsstrunk, Radu Timofte, Mahmoud Afifi, Michael S. Brown, Kele Xu, Hengxing Cai, Yuzhong Liu, Li-Wen Wang, Zhi-Song Liu, Chu-Tak Li, Sourya Dipta Das, Nisarg A. Shah, Akashdeep Jassal, Tongtong Zhao, Shanshan Zhao, Sabari Nathan, M. Parisa Beham, R. Suganya, Qing Wang, Zhongyun Hu, Xin Huang, Yaning Li, Maitreya Suin , et al. (12 additional authors not shown)

    Abstract: We review the AIM 2020 challenge on virtual image relighting and illumination estimation. This paper presents the novel VIDIT dataset used in the challenge and the different proposed solutions and final evaluation results over the 3 challenge tracks. The first track considered one-to-one relighting; the objective was to relight an input photo of a scene with a different color temperature and illum… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: ECCVW 2020. Data and more information on https://github.com/majedelhelou/VIDIT

  49. arXiv:2009.12632  [pdf, other

    cs.CV

    Interactive White Balancing for Camera-Rendered Images

    Authors: Mahmoud Afifi, Michael S. Brown

    Abstract: White balance (WB) is one of the first photo-finishing steps used to render a captured image to its final output. WB is applied to remove the color cast caused by the scene's illumination. Interactive photo-editing software allows users to manually select different regions in a photo as examples of the illumination for WB correction (e.g., clicking on achromatic objects). Such interactive editing… ▽ More

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: To appear in Color and Imaging Conference (CIC28), 2020

  50. arXiv:2009.01924  [pdf, other

    eess.IV cs.CV cs.LG cs.MS

    Introduction to Medical Image Registration with DeepReg, Between Old and New

    Authors: N. Montana Brown, Y. Fu, S. U. Saeed, A. Casamitjana, Z. M. C. Baum, R. Delaunay, Q. Yang, A. Grimwood, Z. Min, E. Bonmati, T. Vercauteren, M. J. Clarkson, Y. Hu

    Abstract: This document outlines a tutorial to get started with medical image registration using the open-source package DeepReg. The basic concepts of medical image registration are discussed, linking classical methods to newer methods using deep learning. Two iterative, classical algorithms using optimisation and one learning-based algorithm using deep learning are coded step-by-step using DeepReg utiliti… ▽ More

    Submitted 7 September, 2020; v1 submitted 29 August, 2020; originally announced September 2020.

    Comments: Submitted to MICCAI Educational Challenge 2020