Search | arXiv e-print repository

EPTQ: Enhanced Post-Training Quantization via Label-Free Hessian

Authors: Ofir Gordon, Hai Victor Habi, Arnon Netzer

Abstract: Quantization of deep neural networks (DNN) has become a key element in the efforts of embedding such networks on end-user devices. However, current quantization methods usually suffer from costly accuracy degradation. In this paper, we propose a new method for Enhanced Post Training Quantization named EPTQ. The method is based on knowledge distillation with an adaptive weighting of layers. In addi… ▽ More Quantization of deep neural networks (DNN) has become a key element in the efforts of embedding such networks on end-user devices. However, current quantization methods usually suffer from costly accuracy degradation. In this paper, we propose a new method for Enhanced Post Training Quantization named EPTQ. The method is based on knowledge distillation with an adaptive weighting of layers. In addition, we introduce a new label-free technique for approximating the Hessian trace of the task loss, named Label-Free Hessian. This technique removes the requirement of a labeled dataset for computing the Hessian. The adaptive knowledge distillation uses the Label-Free Hessian technique to give greater attention to the sensitive parts of the model while performing the optimization. Empirically, by employing EPTQ we achieve state-of-the-art results on a wide variety of models, tasks, and datasets, including ImageNet classification, COCO object detection, and Pascal-VOC for semantic segmentation. We demonstrate the performance and compatibility of EPTQ on an extended set of architectures, including CNNs, Transformers, hybrid, and MLP-only models. △ Less

Submitted 20 September, 2023; originally announced September 2023.

arXiv:2306.12760 [pdf, other]

Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields

Authors: Ori Gordon, Omri Avrahami, Dani Lischinski

Abstract: Editing a local region or a specific object in a 3D scene represented by a NeRF or consistently blending a new realistic object into the scene is challenging, mainly due to the implicit nature of the scene representation. We present Blended-NeRF, a robust and flexible framework for editing a specific region of interest in an existing NeRF scene, based on text prompts, along with a 3D ROI box. Our… ▽ More Editing a local region or a specific object in a 3D scene represented by a NeRF or consistently blending a new realistic object into the scene is challenging, mainly due to the implicit nature of the scene representation. We present Blended-NeRF, a robust and flexible framework for editing a specific region of interest in an existing NeRF scene, based on text prompts, along with a 3D ROI box. Our method leverages a pretrained language-image model to steer the synthesis towards a user-provided text prompt, along with a 3D MLP model initialized on an existing NeRF scene to generate the object and blend it into a specified region in the original scene. We allow local editing by localizing a 3D ROI box in the input scene, and blend the content synthesized inside the ROI with the existing scene using a novel volumetric blending technique. To obtain natural looking and view-consistent results, we leverage existing and new geometric priors and 3D augmentations for improving the visual fidelity of the final result. We test our framework both qualitatively and quantitatively on a variety of real 3D scenes and text prompts, demonstrating realistic multi-view consistent results with much flexibility and diversity compared to the baselines. Finally, we show the applicability of our framework for several 3D editing applications, including adding new objects to a scene, removing/replacing/altering existing objects, and texture conversion. △ Less

Submitted 7 September, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

Comments: 16 pages, 14 figures. Project page: https://www.vision.huji.ac.il/blended-nerf/

arXiv:2105.10993 [pdf, other]

Cooperative Multi-Agent Path Finding: Beyond Path Planning and Collision Avoidance

Authors: Nir Greshler, Ofir Gordon, Oren Salzman, Nahum Shimkin

Abstract: We introduce the Cooperative Multi-Agent Path Finding (Co-MAPF) problem, an extension to the classical MAPF problem, where cooperative behavior is incorporated. In this setting, a group of autonomous agents operate in a shared environment and have to complete cooperative tasks while avoiding collisions with the other agents in the group. This extension naturally models many real-world applications… ▽ More We introduce the Cooperative Multi-Agent Path Finding (Co-MAPF) problem, an extension to the classical MAPF problem, where cooperative behavior is incorporated. In this setting, a group of autonomous agents operate in a shared environment and have to complete cooperative tasks while avoiding collisions with the other agents in the group. This extension naturally models many real-world applications, where groups of agents are required to collaborate in order to complete a given task. To this end, we formalize the Co-MAPF problem and introduce Cooperative Conflict-Based Search (Co-CBS), a CBS-based algorithm for solving the problem optimally for a wide set of Co-MAPF problems. Co-CBS uses a cooperation-planning module integrated into CBS such that cooperation planning is decoupled from path planning. Finally, we present empirical results on several MAPF benchmarks demonstrating our algorithm's properties. △ Less

Submitted 23 May, 2021; originally announced May 2021.

Comments: 9 pages, 5 figures

arXiv:2104.08759 [pdf, other]

Revisiting the Complexity Analysis of Conflict-Based Search: New Computational Techniques and Improved Bounds

Authors: Ofir Gordon, Yuval Filmus, Oren Salzman

Abstract: The problem of Multi-Agent Path Finding (MAPF) calls for finding a set of conflict-free paths for a fleet of agents operating in a given environment. Arguably, the state-of-the-art approach to computing optimal solutions is Conflict-Based Search (CBS). In this work we revisit the complexity analysis of CBS to provide tighter bounds on the algorithm's run-time in the worst-case. Our analysis paves… ▽ More The problem of Multi-Agent Path Finding (MAPF) calls for finding a set of conflict-free paths for a fleet of agents operating in a given environment. Arguably, the state-of-the-art approach to computing optimal solutions is Conflict-Based Search (CBS). In this work we revisit the complexity analysis of CBS to provide tighter bounds on the algorithm's run-time in the worst-case. Our analysis paves the way to better pinpoint the parameters that govern (in the worst case) the algorithm's computational complexity. Our analysis is based on two complementary approaches: In the first approach we bound the run-time using the size of a Multi-valued Decision Diagram (MDD) -- a layered graph which compactly contains all possible single-agent paths between two given vertices for a specific path length. In the second approach we express the running time by a novel recurrence relation which bounds the algorithm's complexity. We use generating functions-based analysis in order to tightly bound the recurrence. Using these technique we provide several new upper-bounds on CBS's complexity. The results allow us to improve the existing bound on the running time of CBS for many cases. For example, on a set of common benchmarks we improve the upper-bound by a factor of at least $2^{10^{7}}$. △ Less

Submitted 18 April, 2021; originally announced April 2021.

arXiv:2008.12371 [pdf, ps, other]

Improving the Segmentation of Scanning Probe Microscope Images using Convolutional Neural Networks

Authors: Steff Farley, Jo E. A. Hodgkinson, Oliver M. Gordon, Joanna Turner, Andrea Soltoggio, Philip J. Moriarty, Eugenie Hunsicker

Abstract: A wide range of techniques can be considered for segmentation of images of nanostructured surfaces. Manually segmenting these images is time-consuming and results in a user-dependent segmentation bias, while there is currently no consensus on the best automated segmentation methods for particular techniques, image classes, and samples. Any image segmentation approach must minimise the noise in the… ▽ More A wide range of techniques can be considered for segmentation of images of nanostructured surfaces. Manually segmenting these images is time-consuming and results in a user-dependent segmentation bias, while there is currently no consensus on the best automated segmentation methods for particular techniques, image classes, and samples. Any image segmentation approach must minimise the noise in the images to ensure accurate and meaningful statistical analysis can be carried out. Here we develop protocols for the segmentation of images of 2D assemblies of gold nanoparticles formed on silicon surfaces via deposition from an organic solvent. The evaporation of the solvent drives far-from-equilibrium self-organisation of the particles, producing a wide variety of nano- and micro-structured patterns. We show that a segmentation strategy using the U-Net convolutional neural network outperforms traditional automated approaches and has particular potential in the processing of images of nanostructured systems. △ Less

Submitted 27 August, 2020; originally announced August 2020.

Comments: 21 pages, 10 figures

arXiv:1907.13401 [pdf, ps, other]

doi 10.1088/2632-2153/ab42ec

Embedding Human Heuristics in Machine-Learning-Enabled Probe Microscopy

Authors: O. Gordon, F. Junqueira, P. Moriarty

Abstract: Scanning probe microscopists generally do not rely on complete images to assess the quality of data acquired during a scan. Instead, assessments of the state of the tip apex, which not only determines the resolution in any scanning probe technique but can also generate a wide array of frustrating artefacts, are carried out in real time on the basis of a few lines of an image (and, typically, their… ▽ More Scanning probe microscopists generally do not rely on complete images to assess the quality of data acquired during a scan. Instead, assessments of the state of the tip apex, which not only determines the resolution in any scanning probe technique but can also generate a wide array of frustrating artefacts, are carried out in real time on the basis of a few lines of an image (and, typically, their associated line profiles.) The very small number of machine learning approaches to probe microscopy published to date, however, involve classifications based on full images. Given that data acquisition is the most time-consuming task during routine tip conditioning, automated methods are thus currently extremely slow in comparison to the tried-and-trusted strategies and heuristics used routinely by probe microscopists. Here, we explore various strategies by which different STM image classes (arising from changes in the tip state) can be correctly identified from partial scans. By employing a secondary temporal network and a rolling window of a small group of individual scanlines, we find that tip assessment is possible with a small fraction of a complete image. We achieve this with little-to-no performance penalty -- or, indeed, markedly improved performance in some cases -- and introduce a protocol to detect the state of the tip apex in real time. △ Less

Submitted 31 July, 2019; originally announced July 2019.

arXiv:1904.11752 [pdf, other]

doi 10.17639/nott.344

Rushing or Dragging? An Analysis of the "Universality" of Correlated Fluctuations in Hi-Hat Timing and Dynamics

Authors: Oliver Gordon, Dominic Coy, Jack Matthews, Easel Kandola-McNicholas, Owain Llewellyn, Adeel Bokhari, Philip Moriarty

Abstract: A previous analysis of fluctuations in a virtuoso (Jeff Porcaro) drum performance [Räsänen et al., PLoS ONE 10(6): e0127902 (2015)] demonstrated that the rhythmic signal comprised both long range correlations and short range anti-correlations, with a characteristic timescale distinguishing the two regimes. We have extended Räsänen et al.'s approach to a much larger number of drum samples (N=132, p… ▽ More A previous analysis of fluctuations in a virtuoso (Jeff Porcaro) drum performance [Räsänen et al., PLoS ONE 10(6): e0127902 (2015)] demonstrated that the rhythmic signal comprised both long range correlations and short range anti-correlations, with a characteristic timescale distinguishing the two regimes. We have extended Räsänen et al.'s approach to a much larger number of drum samples (N=132, provided by a total of 58 participants) and to a different performance (viz., Rush's Tom Sawyer). A key focus of our study was to test whether the fluctuation dynamics discovered by Räsänen et al. are "universal" in the following sense: is the crossover from short-range to long-range correlated fluctuations a general phenomenon or is it restricted to particular drum patterns and/or specific drummers? We find no compelling evidence to suggest that the short-range to long-range correlation crossover that is characteristic of Porcaro's performance is a common feature of temporal fluctuations in drum patterns. Moreover, level of experience and/or playing technique surprisingly do not play a role in influencing a short-range to long-range correlation cross-over. Our study also highlights that a great deal of caution needs to be taken when using the detrended fluctuation analysis technique, particularly with regard to anti-correlated signals. △ Less

Submitted 26 April, 2019; originally announced April 2019.

arXiv:1903.09101 [pdf, ps, other]

doi 10.1063/1.5099590

Scanning Probe State Recognition With Multi-Class Neural Network Ensembles

Authors: O. Gordon, P. D'Hondt, L. Knijff, S. Freeney, F. Junqueira, P. Moriarty, I. Swart

Abstract: One of the largest obstacles facing scanning probe microscopy is the constant need to correct flaws in the scanning probe in situ. This is currently a manual, time-consuming process that would benefit greatly from automation. Here we introduce a convolutional neural network protocol that enables automated recognition of a variety of desirable and undesirable scanning probe tip states on both metal… ▽ More One of the largest obstacles facing scanning probe microscopy is the constant need to correct flaws in the scanning probe in situ. This is currently a manual, time-consuming process that would benefit greatly from automation. Here we introduce a convolutional neural network protocol that enables automated recognition of a variety of desirable and undesirable scanning probe tip states on both metal and non-metal surfaces. By combining the best performing models into majority voting ensembles, we find that the desirable states of H:Si(100) can be distinguished with a mean precision of 0.89 and an average receiver-operator-characteristic curve area of 0.95. More generally, high and low-quality tips can be distinguished with a mean precision of 0.96 and near perfect area-under-curve of 0.98. With trivial modifications, we also successfully automatically identify undesirable, non-surface-specific states on surfaces of Au(111) and Cu(111). In these cases we find mean precisions of 0.95 and 0.75 and area-under-curves of 0.98 and 0.94, respectively. △ Less

Submitted 21 March, 2019; originally announced March 2019.

Comments: Initial submission to APL

Showing 1–8 of 8 results for author: Gordon, O