Showing 1–50 of 52 results for author: Howard, A

Searching in archive cs.
  1. arXiv:2407.01613  [pdf, other]

    cs.LG cs.AI stat.ML

    Self-adaptive weights based on balanced residual decay rate for physics-informed neural networks and deep operator networks

    Authors: Wenqian Chen, Amanda A. Howard, Panos Stinis

    Abstract: Physics-informed deep learning has emerged as a promising alternative for solving partial differential equations. However, for complex problems, training these networks can still be challenging, often resulting in unsatisfactory accuracy and efficiency. In this work, we demonstrate that the failure of plain physics-informed neural networks arises from the significant discrepancy in the convergence…

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: 13 figures, 4 tables

    Report number: PNNL-SA-199965

  2. arXiv:2406.19662  [pdf, other]

    cs.LG physics.comp-ph

    Finite basis Kolmogorov-Arnold networks: domain decomposition for data-driven and physics-informed problems

    Authors: Amanda A. Howard, Bruno Jacob, Sarah H. Murphy, Alexander Heinlein, Panos Stinis

    Abstract: Kolmogorov-Arnold networks (KANs) have attracted attention recently as an alternative to multilayer perceptrons (MLPs) for scientific machine learning. However, KANs can be expensive to train, even for relatively small networks. Inspired by finite basis physics-informed neural networks (FBPINNs), in this work, we develop a domain decomposition method for KANs that allows for several small KANs to…

    Submitted 28 June, 2024; originally announced June 2024.

  3. arXiv:2405.05171  [pdf, other]

    cs.LG

    Custom Gradient Estimators are Straight-Through Estimators in Disguise

    Authors: Matt Schoenbauer, Daniele Moro, Lukasz Lew, Andrew Howard

    Abstract: Quantization-aware training comes with a fundamental challenge: the derivatives of quantization functions such as rounding are zero almost everywhere and nonexistent elsewhere. Various differentiable approximations of quantization functions have been proposed to address this issue. In this paper, we prove that when the learning rate is sufficiently small, a large class of weight gradient estimators…

    Submitted 22 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  4. arXiv:2404.10518  [pdf, other]

    cs.CV

    MobileNetV4 -- Universal Models for the Mobile Ecosystem

    Authors: Danfeng Qin, Chas Leichner, Manolis Delakis, Marco Fornoni, Shixin Luo, Fan Yang, Weijun Wang, Colby Banbury, Chengxi Ye, Berkin Akin, Vaibhav Aggarwal, Tenghui Zhu, Daniele Moro, Andrew Howard

    Abstract: We present the latest generation of MobileNets, known as MobileNetV4 (MNv4), featuring universally efficient architecture designs for mobile devices. At its core, we introduce the Universal Inverted Bottleneck (UIB) search block, a unified and flexible structure that merges Inverted Bottleneck (IB), ConvNext, Feed Forward Network (FFN), and a novel Extra Depthwise (ExtraDW) variant. Alongside UIB,…

    Submitted 16 April, 2024; originally announced April 2024.

  5. arXiv:2404.00103  [pdf, other]

    cs.LG cs.CV

    PikeLPN: Mitigating Overlooked Inefficiencies of Low-Precision Neural Networks

    Authors: Marina Neseem, Conor McCullough, Randy Hsin, Chas Leichner, Shan Li, In Suk Chong, Andrew G. Howard, Lukasz Lew, Sherief Reda, Ville-Mikko Rautio, Daniele Moro

    Abstract: Low-precision quantization is recognized for its efficacy in neural network optimization. Our analysis reveals that non-quantized elementwise operations, which are prevalent in layers such as parameterized activation functions, batch normalization, and quantization scaling, dominate the inference cost of low-precision models. These non-quantized elementwise operations are commonly overlooked in SOTA…

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: Accepted in CVPR 2024. 10 Figures, 9 Tables

  6. Energy Flexibility Potential in the Brewery Sector: A Multi-agent Based Simulation of 239 Danish Breweries

    Authors: Daniel Anthony Howard, Zheng Grace Ma, Jacob Alstrup Engvang, Morten Hagenau, Kathrine Lau Jorgensen, Jonas Fausing Olesen, Bo Nørregaard Jørgensen

    Abstract: The beverage industry is a typical food processing industry, accounts for significant energy consumption, and has flexible demands. However, the deployment of energy flexibility in the beverage industry is complex and challenging. Furthermore, activation of energy flexibility from the whole brewery industry is necessary to ensure grid stability. Therefore, this paper assesses the energy flexibilit…

    Submitted 26 January, 2024; originally announced January 2024.

  7. arXiv:2401.13693  [pdf, other]

    cs.OH cs.AI cs.HC

    Challenge design roadmap

    Authors: Hugo Jair Escalante Balderas, Isabelle Guyon, Addison Howard, Walter Reade, Sebastien Treguer

    Abstract: Challenges can be seen as a type of game that motivates participants to solve serious tasks. As a result, competition organizers must develop effective game rules. However, these rules have multiple objectives beyond making the game enjoyable for participants. These objectives may include solving real-world problems, advancing scientific or technical areas, making scientific discoveries, and educa…

    Submitted 15 January, 2024; originally announced January 2024.

    Journal ref: AI Competitions and Benchmarks: The Science Behind the Contests, In press

  8. arXiv:2401.07888  [pdf, other]

    math.NA cs.LG

    Multifidelity domain decomposition-based physics-informed neural networks and operators for time-dependent problems

    Authors: Alexander Heinlein, Amanda A. Howard, Damien Beecroft, Panos Stinis

    Abstract: Multiscale problems are challenging for neural network-based discretizations of differential equations, such as physics-informed neural networks (PINNs). This can be (partly) attributed to the so-called spectral bias of neural networks. To improve the performance of PINNs for time-dependent problems, a combination of multifidelity stacking PINNs and domain decomposition-based finite basis PINNs is…

    Submitted 6 June, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    MSC Class: 65M22; 65M55; 68T07

  9. arXiv:2401.04751  [pdf]

    cs.LG cs.PF math.NA

    Identifying Best Practice Melting Patterns in Induction Furnaces: A Data-Driven Approach Using Time Series KMeans Clustering and Multi-Criteria Decision Making

    Authors: Daniel Anthony Howard, Bo Nørregaard Jørgensen, Zheng Ma

    Abstract: Improving energy efficiency in industrial production processes is crucial for competitiveness and compliance with climate policies. This paper introduces a data-driven approach to identify optimal melting patterns in induction furnaces. Through time-series K-means clustering, the melting patterns could be classified into distinct clusters based on temperature profiles. Using the elbow method, 12 c…

    Submitted 9 January, 2024; originally announced January 2024.

    Journal ref: Energy Informatics. EI.A 2023. Lecture Notes in Computer Science, vol 14467

  10. arXiv:2311.06483  [pdf, other]

    cs.LG math.NA

    Stacked networks improve physics-informed training: applications to neural networks and deep operator networks

    Authors: Amanda A Howard, Sarah H Murphy, Shady E Ahmed, Panos Stinis

    Abstract: Physics-informed neural networks and operator networks have shown promise for effectively solving equations modeling physical systems. However, these networks can be difficult or impossible to train accurately for some systems of equations. We present a novel multifidelity framework for stacking physics-informed neural networks and operator networks that facilitates training. We successively build…

    Submitted 20 November, 2023; v1 submitted 11 November, 2023; originally announced November 2023.

  11. arXiv:2310.18612  [pdf, other]

    cs.LG

    Efficient kernel surrogates for neural network-based regression

    Authors: Saad Qadeer, Andrew Engel, Amanda Howard, Adam Tsou, Max Vargas, Panos Stinis, Tony Chiang

    Abstract: Despite their immense promise in performing a variety of learning tasks, a theoretical understanding of the limitations of Deep Neural Networks (DNNs) has so far eluded practitioners. This is partly due to the inability to determine the closed forms of the learned functions, making it harder to study their generalization properties on unseen datasets. Recent work has shown that randomly initialize…

    Submitted 24 January, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 35 pages. Software used to reach results available upon request, approved for release by Pacific Northwest National Laboratory

    Report number: PNNL-SA-191858

    MSC Class: 68T07; 65M99

  12. arXiv:2306.17319  [pdf, other]

    cs.CV

    ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation

    Authors: Shuyang Sun, Weijun Wang, Qihang Yu, Andrew Howard, Philip Torr, Liang-Chieh Chen

    Abstract: This paper presents a new mechanism to facilitate the training of mask transformers for efficient panoptic segmentation, democratizing its deployment. We observe that due to its high complexity, the training objective of panoptic segmentation will inevitably lead to much higher false positive penalization. Such unbalanced loss makes the training process of the end-to-end mask-transformer based arc…

    Submitted 29 June, 2023; originally announced June 2023.

  13. arXiv:2305.14384  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV

    Adversarial Nibbler: A Data-Centric Challenge for Improving the Safety of Text-to-Image Models

    Authors: Alicia Parrish, Hannah Rose Kirk, Jessica Quaye, Charvi Rastogi, Max Bartolo, Oana Inel, Juan Ciro, Rafael Mosquera, Addison Howard, Will Cukierski, D. Sculley, Vijay Janapa Reddi, Lora Aroyo

    Abstract: The generative AI revolution in recent years has been spurred by an expansion in compute power and data quantity, which together enable extensive pre-training of powerful text-to-image (T2I) models. With their greater capabilities to generate realistic and creative content, these T2I models like DALL-E, MidJourney, Imagen or Stable Diffusion are reaching ever wider audiences. Any unsafe behaviors…

    Submitted 22 May, 2023; originally announced May 2023.

    MSC Class: 14J68 (Primary)

  14. A multifidelity approach to continual learning for physical systems

    Authors: Amanda Howard, Yucheng Fu, Panos Stinis

    Abstract: We introduce a novel continual learning method based on multifidelity deep neural networks. This method learns the correlation between the output of previously trained models and the desired output of the model on the current training dataset, limiting catastrophic forgetting. On its own the multifidelity continual learning method shows robust results that limit forgetting across several datasets.…

    Submitted 9 February, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

  15. arXiv:2302.08202  [pdf]

    eess.AS cs.SD

    DeepSpace: Dynamic Spatial and Source Cue Based Source Separation for Dialog Enhancement

    Authors: Aaron Master, Lie Lu, Jonas Samuelsson, Heidi-Maria Lehtonen, Scott Norcross, Nathan Swedlow, Audrey Howard

    Abstract: Dialog Enhancement (DE) is a feature which allows a user to increase the level of dialog in TV or movie content relative to non-dialog sounds. When only the original mix is available, DE is "unguided," and requires source separation. In this paper, we describe the DeepSpace system, which performs source separation using both dynamic spatial cues and source cues to support unguided DE. Its technolo…

    Submitted 22 February, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: 5 pages, 4 figures. To be published in ICASSP 2023

  16. arXiv:2301.11402  [pdf, other]

    physics.comp-ph cs.LG physics.ao-ph physics.geo-ph

    A Hybrid Deep Neural Operator/Finite Element Method for Ice-Sheet Modeling

    Authors: QiZhi He, Mauro Perego, Amanda A. Howard, George Em Karniadakis, Panos Stinis

    Abstract: One of the most challenging and consequential problems in climate modeling is to provide probabilistic projections of sea level rise. A large part of the uncertainty of sea level projections is due to uncertainty in ice sheet dynamics. At the moment, accurate quantification of the uncertainty is hindered by the cost of ice sheet computational models. In this work, we develop a hybrid approach to a…

    Submitted 26 January, 2023; originally announced January 2023.

  17. arXiv:2207.10225  [pdf, other]

    cs.CV cs.LG

    On Label Granularity and Object Localization

    Authors: Elijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge Belongie, Andrew Howard, Oisin Mac Aodha

    Abstract: Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels. However, many objects can be labeled at different levels of granularity. Is it an animal, a bird, or a great horned owl? Which image-level labels should we use? In this paper we study the role of label granularity in WSOL. To facilitate this investigation w…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  18. Multifidelity Deep Operator Networks For Data-Driven and Physics-Informed Problems

    Authors: Amanda A. Howard, Mauro Perego, George E. Karniadakis, Panos Stinis

    Abstract: Operator learning for complex nonlinear systems is increasingly common in modeling multi-physics and multi-scale systems. However, training such high-dimensional operators requires a large amount of expensive, high-fidelity data, either from experiments or simulations. In this work, we present a composite Deep Operator Network (DeepONet) for learning using two datasets with different levels of fid…

    Submitted 21 November, 2023; v1 submitted 19 April, 2022; originally announced April 2022.

  19. arXiv:2202.04137  [pdf, other]

    cs.LG cond-mat.mtrl-sci

    Machine Learning in Heterogeneous Porous Materials

    Authors: Marta D'Elia, Hang Deng, Cedric Fraces, Krishna Garikipati, Lori Graham-Brady, Amanda Howard, George Karniadakis, Vahid Keshavarzzadeh, Robert M. Kirby, Nathan Kutz, Chunhui Li, Xing Liu, Hannah Lu, Pania Newell, Daniel O'Malley, Masa Prodanovic, Gowri Srinivasan, Alexandre Tartakovsky, Daniel M. Tartakovsky, Hamdi Tchelepi, Bozo Vazic, Hari Viswanathan, Hongkyu Yoon, Piotr Zarzycki

    Abstract: The "Workshop on Machine learning in heterogeneous porous materials" brought together international scientific communities of applied mathematics, porous media, and material sciences with experts in the areas of heterogeneous materials, machine learning (ML) and applied mathematics to identify how ML can advance materials research. Within the scope of ML and materials research, the goal of the wor…

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: The workshop link is: https://amerimech.mech.utah.edu

  20. arXiv:2112.11623  [pdf, other]

    cs.CV cs.AI

    MOSAIC: Mobile Segmentation via decoding Aggregated Information and encoded Context

    Authors: Weijun Wang, Andrew Howard

    Abstract: We present a next-generation neural network architecture, MOSAIC, for efficient and accurate semantic image segmentation on mobile devices. MOSAIC is designed using neural operations commonly supported by diverse mobile hardware platforms for flexible deployment across various mobile platforms. With a simple asymmetric encoder-decoder structure which consists of an efficient multi-scale context en…

    Submitted 21 December, 2021; originally announced December 2021.

  21. arXiv:2111.06839  [pdf, other]

    cs.CV

    The self-supervised spectral-spatial attention-based transformer network for automated, accurate prediction of crop nitrogen status from UAV imagery

    Authors: Xin Zhang, Liangxiu Han, Tam Sobeih, Lewis Lappin, Mark Lee, Andrew Howard, Aron Kisdi

    Abstract: Nitrogen (N) fertilizer is routinely applied by farmers to increase crop yields. At present, farmers often over-apply N fertilizer in some locations or at certain times because they do not have high-resolution crop N status data. N-use efficiency can be low, with the remaining N lost to the environment, resulting in higher production costs and environmental pollution. Accurate and timely estimatio…

    Submitted 15 February, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

  22. Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework

    Authors: Matan Halevy, Camille Harris, Amy Bruckman, Diyi Yang, Ayanna Howard

    Abstract: Recent research has demonstrated how racial biases against users who write African American English exist in popular toxic language datasets. While previous work has focused on a single fairness criterion, we propose to use additional descriptive fairness metrics to better understand the source of these biases. We demonstrate that different benchmark classifiers, as well as two in-process bias-rem…

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: Accepted to ACM EAAMO '21: https://eaamo.org/accepted/ Code available: https://github.com/matanhalevy/DebiasingHateDetectionAAE

  23. arXiv:2106.10258  [pdf, other]

    cs.CV cs.AI cs.LG

    Bridging the Gap Between Object Detection and User Intent via Query-Modulation

    Authors: Marco Fornoni, Chaochao Yan, Liangchen Luo, Kimberly Wilber, Alex Stark, Yin Cui, Boqing Gong, Andrew Howard

    Abstract: When interacting with objects through cameras, or pictures, users often have a specific intent. For example, they may want to perform a visual search. With most object detection models relying on image pixels as their sole input, undesired results are not uncommon. Most typically: lack of a high-confidence detection on the object of interest, or detection with a wrong class label. The issue is esp…

    Submitted 3 August, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

  24. arXiv:2106.09188  [pdf, other]

    physics.chem-ph cs.LG

    Physics-informed CoKriging model of a redox flow battery

    Authors: Amanda A. Howard, Alexandre M. Tartakovsky

    Abstract: Redox flow batteries (RFBs) offer the capability to store large amounts of energy cheaply and efficiently; however, there is a need for fast and accurate models of the charge-discharge curve of an RFB to potentially improve the battery capacity and performance. We develop a multifidelity model for predicting the charge-discharge curve of an RFB. In the multifidelity model, we use the Physics-informe…

    Submitted 16 June, 2021; originally announced June 2021.

  25. arXiv:2105.03014  [pdf, other]

    cs.CV

    BasisNet: Two-stage Model Synthesis for Efficient Inference

    Authors: Mingda Zhang, Chun-Te Chu, Andrey Zhmoginov, Andrew Howard, Brendan Jou, Yukun Zhu, Li Zhang, Rebecca Hwa, Adriana Kovashka

    Abstract: In this work, we present BasisNet, which combines recent advancements in efficient neural network architectures, conditional computation, and early termination in a simple new form. Our approach incorporates a lightweight model to preview the input and generate input-dependent combination coefficients, which later control the synthesis of a more accurate specialist model to make the final prediction.…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: To appear, 4th Workshop on Efficient Deep Learning for Computer Vision (ECV2021), CVPR2021 Workshop

  26. arXiv:2101.01260  [pdf, other]

    cs.CV

    SpotPatch: Parameter-Efficient Transfer Learning for Mobile Object Detection

    Authors: Keren Ye, Adriana Kovashka, Mark Sandler, Menglong Zhu, Andrew Howard, Marco Fornoni

    Abstract: Deep learning based object detectors are commonly deployed on mobile devices to solve a variety of tasks. For maximum accuracy, each detector is usually trained to solve one single specific task, and comes with a completely independent set of parameters. While this guarantees high performance, it is also highly inefficient, as each model has to be separately downloaded and stored. In this paper we…

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: Accepted by the ACCV2020 (Oral)

  27. arXiv:2012.05578  [pdf, other]

    cs.LG cs.CV

    Large-Scale Generative Data-Free Distillation

    Authors: Liangchen Luo, Mark Sandler, Zi Lin, Andrey Zhmoginov, Andrew Howard

    Abstract: Knowledge distillation is one of the most popular and effective techniques for knowledge transfer, model compression and semi-supervised learning. Most existing distillation approaches require access to original or augmented training samples. But this can be problematic in practice due to privacy, proprietary and availability concerns. Recent work has put forward some methods to tackle this pr…

    Submitted 10 December, 2020; originally announced December 2020.

  28. arXiv:2010.04904  [pdf, other]

    cs.CV cs.AI cs.LG

    Multi-path Neural Networks for On-device Multi-domain Visual Classification

    Authors: Qifei Wang, Junjie Ke, Joshua Greaves, Grace Chu, Gabriel Bender, Luciano Sbaiz, Alec Go, Andrew Howard, Feng Yang, Ming-Hsuan Yang, Jeff Gilbert, Peyman Milanfar

    Abstract: Learning multiple domains/tasks with a single model is important for improving data efficiency and lowering inference cost for numerous vision tasks, especially on resource-constrained mobile devices. However, hand-crafting a multi-domain/task model can be both tedious and challenging. This paper proposes a novel approach to automatically learn a multi-path network for multi-domain visual classifi…

    Submitted 8 January, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: WACV 2021

  29. arXiv:2009.01658  [pdf, other]

    physics.comp-ph cs.LG physics.flu-dyn

    Learning Unknown Physics of non-Newtonian Fluids

    Authors: Brandon Reyes, Amanda A. Howard, Paris Perdikaris, Alexandre M. Tartakovsky

    Abstract: We extend the physics-informed neural network (PINN) method to learn viscosity models of two non-Newtonian systems (polymer melts and suspensions of particles) using only velocity measurements. The PINN-inferred viscosity models agree with the empirical models for shear rates with large absolute values but deviate for shear rates near zero where the analytical models have an unphysical singularity…

    Submitted 26 August, 2020; originally announced September 2020.

    Journal ref: Phys. Rev. Fluids 6, 073301 (2021)

  30. arXiv:2008.08178  [pdf, other]

    cs.CV

    Discovering Multi-Hardware Mobile Models via Architecture Search

    Authors: Grace Chu, Okan Arikan, Gabriel Bender, Weijun Wang, Achille Brighton, Pieter-Jan Kindermans, Hanxiao Liu, Berkin Akin, Suyog Gupta, Andrew Howard

    Abstract: Hardware-aware neural architecture designs have been predominantly focusing on optimizing model performance on single hardware and model development complexity, where another important factor, model deployment complexity, has been largely ignored. In this paper, we argue that, for applications that may be deployed on multiple hardware, having different single-hardware models across the deployed ha…

    Submitted 23 April, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: CVPR Workshop 2021

  31. arXiv:2008.05994  [pdf]

    physics.comp-ph cs.LG

    A community-powered search of machine learning strategy space to find NMR property prediction models

    Authors: Lars A. Bratholm, Will Gerrard, Brandon Anderson, Shaojie Bai, Sunghwan Choi, Lam Dang, Pavel Hanchar, Addison Howard, Guillaume Huard, Sanghoon Kim, Zico Kolter, Risi Kondor, Mordechai Kornbluth, Youhan Lee, Youngsoo Lee, Jonathan P. Mailoa, Thanh Tu Nguyen, Milos Popovic, Goran Rakocevic, Walter Reade, Wonho Song, Luka Stojanovic, Erik H. Thiede, Nebojsa Tijanic, Andres Torrubia, et al. (4 additional authors not shown)

    Abstract: The rise of machine learning (ML) has created an explosion in the potential strategies for using data to make scientific predictions. For physical scientists wishing to apply ML strategies to a particular domain, it can be difficult to assess in advance what strategy to adopt within a vast space of possibilities. Here we outline the results of an online community-powered effort to swarm search the…

    Submitted 13 August, 2020; originally announced August 2020.

  32. arXiv:2007.08476  [pdf]

    cs.HC

    Accessible Computer Science for K-12 Students with Hearing Impairments

    Authors: Meenakshi Das, Daniela Marghitu, Fatemeh Jamshidi, Mahender Mandala, Ayanna Howard

    Abstract: An inclusive science, technology, engineering and mathematics (STEM) workforce is needed to maintain America's leadership in the scientific enterprise. Increasing the participation of underrepresented groups in STEM, including persons with disabilities, requires national attention to fully engage the nation's citizens in transforming its STEM enterprise. To address this need, a number of initiativ…

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: 12 Pages, 5 figures, 22nd International Conference on Human Computer Interaction

  33. arXiv:2006.05729  [pdf, other]

    cs.AI cs.GT cs.RO

    A Bayesian Framework for Nash Equilibrium Inference in Human-Robot Parallel Play

    Authors: Shray Bansal, ** Xu, Ayanna Howard, Charles Isbell

    Abstract: We consider shared workspace scenarios with humans and robots acting to achieve independent goals, termed as parallel play. We model these as general-sum games and construct a framework that utilizes the Nash equilibrium solution concept to consider the interactive effect of both agents while planning. We find multiple Pareto-optimal equilibria in these tasks. We hypothesize that people act by cho…

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: Accepted at Robotics: Science and Systems (RSS) 2020

  34. arXiv:1909.03205  [pdf, other]

    cs.CV

    Non-discriminative data or weak model? On the relative importance of data and model resolution

    Authors: Mark Sandler, Jonathan Baccash, Andrey Zhmoginov, Andrew Howard

    Abstract: We explore the question of how the resolution of the input image ("input resolution") affects the performance of a neural network when compared to the resolution of the hidden layers ("internal resolution"). Adjusting these characteristics is frequently used as a hyperparameter providing a trade-off between model performance and accuracy. An intuitive interpretation is that the reduced information…

    Submitted 17 October, 2019; v1 submitted 7 September, 2019; originally announced September 2019.

    Comments: ICCV 2019 Workshop on Real-World Recognition from Low-Quality Images and Videos

  35. arXiv:1906.05721  [pdf, other]

    cs.CV eess.IV

    Visual Wake Words Dataset

    Authors: Aakanksha Chowdhery, Pete Warden, Jonathon Shlens, Andrew Howard, Rocky Rhodes

    Abstract: The emergence of Internet of Things (IoT) applications requires intelligence on the edge. Microcontrollers provide a low-cost compute platform to deploy intelligent IoT applications using machine learning at scale, but have extremely limited on-chip memory and compute capability. To deploy computer vision on such devices, we need tiny vision models that fit within a few hundred kilobytes of memory…

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: 10 pages, 4 figures

    ACM Class: I.2.10; B.7.1; I.5.2

  36. arXiv:1906.01737  [pdf, other]

    cs.CV

    Geo-Aware Networks for Fine-Grained Recognition

    Authors: Grace Chu, Brian Potetz, Weijun Wang, Andrew Howard, Yang Song, Fernando Brucher, Thomas Leung, Hartwig Adam

    Abstract: Fine-grained recognition distinguishes among categories with subtle visual differences. In order to differentiate between these challenging visual categories, it is helpful to leverage additional information. Geolocation is a rich source of additional information that can be used to improve fine-grained classification accuracy, but has been understudied. Our contributions to this field are twofold…

    Submitted 4 September, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: ICCVW 2019

  37. arXiv:1905.02244  [pdf, other]

    cs.CV

    Searching for MobileNetV3

    Authors: Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, Quoc V. Le, Hartwig Adam

    Abstract: We present the next generation of MobileNets based on a combination of complementary search techniques as well as a novel architecture design. MobileNetV3 is tuned to mobile phone CPUs through a combination of hardware-aware network architecture search (NAS) complemented by the NetAdapt algorithm and then subsequently improved through novel architecture advances. This paper starts the exploration…

    Submitted 20 November, 2019; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: ICCV 2019

  38. arXiv:1904.07714  [pdf, other]

    cs.CV cs.AI cs.PF

    Low-Power Computer Vision: Status, Challenges, Opportunities

    Authors: Sergei Alyamkin, Matthew Ardi, Alexander C. Berg, Achille Brighton, Bo Chen, Yiran Chen, Hsin-Pai Cheng, Zichen Fan, Chen Feng, Bo Fu, Kent Gauen, Abhinav Goel, Alexander Goncharenko, Xuyang Guo, Soonhoi Ha, Andrew Howard, Xiao Hu, Yuanjun Huang, Donghyun Kang, Jaeyoun Kim, Jong Gook Ko, Alexander Kondratyev, Junhyeok Lee, Seungjae Lee, Suwoong Lee, et al. (19 additional authors not shown)

    Abstract: Computer vision has achieved impressive progress in recent years. Meanwhile, mobile phones have become the primary computing platforms for millions of people. In addition to mobile phones, many autonomous systems rely on visual data for making decisions and some of these systems have limited energy (such as unmanned aerial vehicles also called drones and mobile robots). These systems rely on batte…

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Preprint, Accepted by IEEE Journal on Emerging and Selected Topics in Circuits and Systems. arXiv admin note: substantial text overlap with arXiv:1810.01732

  39. arXiv:1901.00885  [pdf]

    cs.RO cs.HC

    An Interactive Robotic Framework to Facilitate Sensory Experiences for Children with ASD

    Authors: Hifza Javed, Rachael Burns, Myounghoon Jeon, Ayanna M. Howard, Chung Hyuk Park

    Abstract: The diagnosis of Autism Spectrum Disorder (ASD) in children is commonly accompanied by a diagnosis of sensory processing disorders as well. Abnormalities are usually reported in multiple sensory processing domains, showing a higher prevalence of unusual responses, particularly to tactile, auditory and visual stimuli. This paper discusses a novel robot-based framework designed to target sensory dif…

    Submitted 3 January, 2019; originally announced January 2019.

    Comments: 18 pages, 12 figures

  40. arXiv:1810.10703  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    K for the Price of 1: Parameter-efficient Multi-task and Transfer Learning

    Authors: Pramod Kaushik Mudrakarta, Mark Sandler, Andrey Zhmoginov, Andrew Howard

    Abstract: We introduce a novel method that enables parameter-efficient transfer and multi-task learning with deep neural networks. The basic approach is to learn a model patch - a small set of parameters - that will specialize to each task, instead of fine-tuning the last layer or the entire network. For instance, we show that learning a set of scales and biases is sufficient to convert a pretrained network…

    Submitted 23 February, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: published at ICLR 2019

  41. arXiv:1810.01732  [pdf]

    cs.CV

    2018 Low-Power Image Recognition Challenge

    Authors: Sergei Alyamkin, Matthew Ardi, Achille Brighton, Alexander C. Berg, Yiran Chen, Hsin-Pai Cheng, Bo Chen, Zichen Fan, Chen Feng, Bo Fu, Kent Gauen, Jongkook Go, Alexander Goncharenko, Xuyang Guo, Hong Hanh Nguyen, Andrew Howard, Yuanjun Huang, Donghyun Kang, Jaeyoun Kim, Alexander Kondratyev, Seungjae Lee, Suwoong Lee, Junhyeok Lee, Zhiyu Liang, Xin Liu, et al. (16 additional authors not shown)

    Abstract: The Low-Power Image Recognition Challenge (LPIRC, https://rebootingcomputing.ieee.org/lpirc) is an annual competition started in 2015. The competition identifies the best technologies that can classify and detect objects in images efficiently (short execution time and low energy consumption) and accurately (high precision). Over the four years, the winners' scores have improved more than 24 times.…

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: 13 pages, workshop in 2018 CVPR, competition, low-power, image recognition

  42. arXiv:1807.11626  [pdf, other]

    cs.CV cs.LG

    MnasNet: Platform-Aware Neural Architecture Search for Mobile

    Authors: Mingxing Tan, Bo Chen, Ruoming Pang, Vijay Vasudevan, Mark Sandler, Andrew Howard, Quoc V. Le

    Abstract: Designing convolutional neural networks (CNN) for mobile devices is challenging because mobile models need to be small and fast, yet still accurate. Although significant efforts have been dedicated to design and improve mobile CNNs on all dimensions, it is very difficult to manually balance these trade-offs when there are so many architectural possibilities to consider. In this paper, we propose a…

    Submitted 28 May, 2019; v1 submitted 30 July, 2018; originally announced July 2018.

    Comments: Published in CVPR 2019

    Journal ref: CVPR 2019

  43. arXiv:1807.00948  [pdf, other]

    cs.RO cs.HC

    Does Removing Stereotype Priming Remove Bias? A Pilot Human-Robot Interaction Study

    Authors: Tobi Ogunyale, De'Aira Bryant, Ayanna Howard

    Abstract: Robots capable of participating in complex social interactions have shown great potential in a variety of applications. As these robots grow more popular, it is essential to continuously evaluate the dynamics of the human-robot relationship. One factor shown to have potential impacts on this critical relationship is the human projection of stereotypes onto social robots, a practice that is implici…

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: 5 pages, 9 figures, 1 table, to be presented at the 5th Workshop on Fairness, Accountability, and Transparency in Machine Learning (FAT/ML 2018), Stockholm, Sweden, July 15, 2018

  44. arXiv:1806.06193  [pdf, other]

    cs.CV cs.LG

    Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning

    Authors: Yin Cui, Yang Song, Chen Sun, Andrew Howard, Serge Belongie

    Abstract: Transferring the knowledge learned from large scale datasets (e.g., ImageNet) via fine-tuning offers an effective solution for domain-specific fine-grained visual categorization (FGVC) tasks (e.g., recognizing bird species or car make and model). In such scenarios, data annotation often calls for specialized domain knowledge and thus is difficult to scale. In this work, we first tackle a problem i…

    Submitted 16 June, 2018; originally announced June 2018.

    Comments: CVPR 2018

  45. arXiv:1804.03230  [pdf, other]

    cs.CV

    NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications

    Authors: Tien-Ju Yang, Andrew Howard, Bo Chen, Xiao Zhang, Alec Go, Mark Sandler, Vivienne Sze, Hartwig Adam

    Abstract: This work proposes an algorithm, called NetAdapt, that automatically adapts a pre-trained deep neural network to a mobile platform given a resource budget. While many existing algorithms simplify networks based on the number of MACs or weights, optimizing those indirect metrics may not necessarily reduce the direct metrics, such as latency and energy consumption. To solve this problem, NetAdapt in…

    Submitted 28 September, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: Accepted by ECCV 2018

  46. arXiv:1801.04381  [pdf, other]

    cs.CV

    MobileNetV2: Inverted Residuals and Linear Bottlenecks

    Authors: Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen

    Abstract: In this paper we describe a new mobile architecture, MobileNetV2, that improves the state of the art performance of mobile models on multiple tasks and benchmarks as well as across a spectrum of different model sizes. We also describe efficient ways of applying these mobile models to object detection in a novel framework we call SSDLite. Additionally, we demonstrate how to build mobile semantic se…

    Submitted 21 March, 2019; v1 submitted 12 January, 2018; originally announced January 2018.

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 4510-4520

  47. arXiv:1712.05877  [pdf, ps, other]

    cs.LG stat.ML

    Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

    Authors: Benoit Jacob, Skirmantas Kligys, Bo Chen, Menglong Zhu, Matthew Tang, Andrew Howard, Hartwig Adam, Dmitry Kalenichenko

    Abstract: The rising popularity of intelligent mobile devices and the daunting computational cost of deep learning-based models call for efficient and accurate on-device inference schemes. We propose a quantization scheme that allows inference to be carried out using integer-only arithmetic, which can be implemented more efficiently than floating point inference on commonly available integer-only hardware.…

    Submitted 15 December, 2017; originally announced December 2017.

    Comments: 14 pages, 12 figures

  48. arXiv:1704.04861  [pdf, other]

    cs.CV

    MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

    Authors: Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, Hartwig Adam

    Abstract: We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depth-wise separable convolutions to build light weight deep neural networks. We introduce two simple global hyper-parameters that efficiently trade off between latency and accuracy. These hyper-parameters allow the model builder to choo…

    Submitted 16 April, 2017; originally announced April 2017.

  49. arXiv:1511.06789  [pdf, other]

    cs.CV

    The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

    Authors: Jonathan Krause, Benjamin Sapp, Andrew Howard, Howard Zhou, Alexander Toshev, Tom Duerig, James Philbin, Li Fei-Fei

    Abstract: Current approaches for fine-grained recognition do the following: First, recruit experts to annotate a dataset of images, optionally also collecting more structured data in the form of part annotations and bounding boxes. Second, train a model utilizing this data. Toward the goal of solving fine-grained recognition, we introduce an alternative approach, leveraging free, noisy data from the web and…

    Submitted 18 October, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: ECCV 2016, data is released

  50. arXiv:1312.5402  [pdf]

    cs.CV

    Some Improvements on Deep Convolutional Neural Network Based Image Classification

    Authors: Andrew G. Howard

    Abstract: We investigate multiple techniques to improve upon the current state of the art deep convolutional neural network based image classification pipeline. The techniques include adding more image transformations to training data, adding more transformations to generate additional predictions at test time and using complementary models applied to higher resolution images. This paper summarizes our entry…

    Submitted 18 December, 2013; originally announced December 2013.