Search | arXiv e-print repository

Reverse time-to-death as time-scale in time-to-event analysis for studies of advanced illness and palliative care

Authors: Yin Bun Cheung, Xiangmei Ma, Isha Chaudhry, Nan Liu, Qingyuan Zhuang, Grace Meijuan Yang, Chetna Malhotra, Eric Andrew Finkelstein

Abstract: Background: Incidence of adverse outcome events rises as patients with advanced illness approach end-of-life. Exposures that tend to occur near end-of-life, e.g., use of wheelchair, oxygen therapy and palliative care, may therefore be found associated with the incidence of the adverse outcomes. We propose a strategy for time-to-event analysis to mitigate the time-varying confounding. Methods: We propose a concept of reverse time-to-death (rTTD) and its use as the time-scale in time-to-event analysis. We used data on community-based palliative care uptake (exposure) and emergency department visits (outcome) among patients with advanced cancer in Singapore to illustrate. We compare the results against those of the common practice of using time-on-study (TOS) as the time-scale. Results: Graphical analysis demonstrated that cancer patients receiving palliative care had a higher rate of emergency department visits than non-recipients mainly because they were closer to end-of-life, and that rTTD analysis made the comparison between patients at the same time-to-death. Analysis of emergency department visits in relation to palliative care using the TOS time-scale showed a significant increase in the hazard ratio estimate when observed time-varying covariates were omitted from statistical adjustment (change-in-estimate=0.38; 95% CI 0.15 to 0.60). There was no such change in the otherwise identical analysis using rTTD (change-in-estimate=0.04; 95% CI -0.02 to 0.11), demonstrating the ability of the rTTD time-scale to mitigate confounding that intensifies in relation to time-to-death. Conclusion: Use of rTTD as the time-scale in time-to-event analysis provides a simple and robust approach to control time-varying confounding in studies of advanced illness, even if the confounders are unmeasured.

Submitted 2 July, 2024; originally announced July 2024.

Comments: 22 pages (including 2 tables and 2 figures)
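
The change of time-scale described in the abstract above can be illustrated with a minimal sketch. The abstract does not give the formal definition of rTTD, so this sketch assumes it is time measured relative to death (death at 0, earlier times negative); the function name `to_rttd`, the interval layout and the patient data are hypothetical and are not taken from the paper.

```python
def to_rttd(intervals, death_time):
    """Shift (start, stop, event) intervals from time-on-study to a
    death-anchored scale (assumed definition: time minus death time)."""
    return [
        (start - death_time, stop - death_time, event)
        for start, stop, event in intervals
    ]


# Hypothetical patient: enrolled on day 0, died on day 300, with
# emergency department visits on days 120 and 290 (event flag = 1).
tos_intervals = [(0, 120, 1), (120, 290, 1), (290, 300, 0)]

# On the shifted scale, patients are compared at equal distance from death.
rttd_intervals = to_rttd(tos_intervals, death_time=300)
print(rttd_intervals)
```

Intervals in this counting-process layout could then be passed to any time-to-event model that accepts start and stop times, so that risk sets contain patients at the same time-to-death rather than the same time since enrollment.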

arXiv:2310.09388 [pdf, other]

CORN: Co-Trained Full- And No-Reference Speech Quality Assessment

Authors: Pranay Manocha, Donald Williamson, Adam Finkelstein

Abstract: Perceptual evaluation constitutes a crucial aspect of various audio-processing tasks. Full reference (FR) or similarity-based metrics rely on high-quality reference recordings, to which lower-quality or corrupted versions of the recording may be compared for evaluation. In contrast, no-reference (NR) metrics evaluate a recording without relying on a reference. Both the FR and NR approaches exhibit advantages and drawbacks relative to each other. In this paper, we present a novel framework called CORN that amalgamates these dual approaches, concurrently training both FR and NR models together. After training, the models can be applied independently. We evaluate CORN by predicting several common objective metrics and across two different architectures. The NR model trained using CORN has access to a reference recording during training, and thus, as one would expect, it consistently outperforms baseline NR models trained independently. Perhaps even more remarkable is that the CORN FR model also outperforms its baseline counterpart, even though it relies on the same training data and the same model architecture. Thus, a single training regime produces two independently useful models, each outperforming independently trained models.

Submitted 8 January, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

arXiv:2212.02634 [pdf, other]

doi 10.1007/978-3-031-25082-8_8

QFT: Post-training quantization via fast joint finetuning of all degrees of freedom

Authors: Alex Finkelstein, Ella Fuchs, Idan Tal, Mark Grobman, Niv Vosco, Eldad Meller

Abstract: The post-training quantization (PTQ) challenge of bringing quantized neural net accuracy close to the original has drawn much attention, driven by industry demand. Many of the methods emphasize optimization of a specific degree-of-freedom (DoF), such as quantization step size, preconditioning factors, or bias fixing, often chained to others in multi-step solutions. Here we rethink quantized network parameterization in a HW-aware fashion, towards a unified analysis of all quantization DoF, permitting for the first time their joint end-to-end finetuning. Our single-step, simple and extendable method, dubbed quantization-aware finetuning (QFT), achieves 4-bit weight quantization results on par with SoTA within PTQ constraints of speed and resource.

Submitted 5 December, 2022; originally announced December 2022.

Comments: Presented at CADL2022 workshop at ECCV2022

Journal ref: Computer Vision - ECCV 2022 Workshops - Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part VII

arXiv:2206.13411 [pdf, other]

Audio Similarity is Unreliable as a Proxy for Audio Quality

Authors: Pranay Manocha, Zeyu Jin, Adam Finkelstein

Abstract: Many audio processing tasks require perceptual assessment. However, the time and expense of obtaining "gold standard" human judgments limit the availability of such data. Most applications incorporate full reference or other similarity-based metrics (e.g. PESQ) that depend on a clean reference. Researchers have relied on such metrics to evaluate and compare various proposed methods, often concluding that small, measured differences imply one is more effective than another. This paper demonstrates several practical scenarios where similarity metrics fail to agree with human perception, because they: (1) vary with clean references; (2) rely on attributes that humans factor out when considering quality; and (3) are sensitive to imperceptible signal level differences. In those scenarios, we show that no-reference metrics do not suffer from such shortcomings and correlate better with human perception. We conclude therefore that similarity serves as an unreliable proxy for audio quality.

Submitted 27 June, 2022; originally announced June 2022.

Comments: To Appear, Interspeech 2022

arXiv:2102.05109 [pdf, other]

CDPAM: Contrastive learning for perceptual audio similarity

Authors: Pranay Manocha, Zeyu Jin, Richard Zhang, Adam Finkelstein

Abstract: Many speech processing methods based on deep learning require an automatic and differentiable audio metric for the loss function. The DPAM approach of Manocha et al. learns a full-reference metric trained directly on human judgments, and thus correlates well with human perception. However, it requires a large number of human annotations and does not generalize well outside the range of perturbations on which it was trained. This paper introduces CDPAM, a metric that builds on and advances DPAM. The primary improvement is to combine contrastive learning and multi-dimensional representations to build robust models from limited data. In addition, we collect human judgments on triplet comparisons to improve generalization to a broader range of audio perturbations. CDPAM correlates well with human responses across nine varied datasets. We also show that adding this metric to existing speech synthesis and enhancement methods yields significant improvement, as measured by objective and subjective tests.

Submitted 9 February, 2021; originally announced February 2021.

Comments: Dataset, code and sound examples can be found at https://github.com/pranaymanocha/PerceptualAudio/tree/master/cdpam

arXiv:2102.04533 [pdf, other]

Learning from Shader Program Traces

Authors: Yuting Yang, Connelly Barnes, Adam Finkelstein

Abstract: Deep learning for image processing typically treats input imagery as pixels in some color space. This paper proposes instead to learn from program traces of procedural fragment shaders (programs that generate images). At each pixel, we collect the intermediate values computed at program execution, and these data form the input to the learned model. We investigate this learning task for a variety of applications: our model can learn to predict a low-noise output image from shader programs that exhibit sampling noise; this model can also learn from a simplified shader program that approximates the reference solution with less computation, as well as learn the output of postprocessing filters like defocus blur and edge-aware sharpening. Finally, we show that the idea of learning from program traces can even be applied to non-imagery simulations of flocks of boids. Our experiments on a variety of shaders show quantitatively and qualitatively that models learned from program traces outperform baseline models learned from RGB color augmented with hand-picked shader-specific features like normals, depth, and diffuse and specular color.

Submitted 24 April, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

arXiv:2008.13682 [pdf]

Some peculiarities of water freezing at small sub-zero temperatures

Authors: Alexei V. Finkelstein

Abstract: I consider the kinetics of water freezing and show that, at small sub-zero temperatures, (i) the time of ice nucleation within the bulk water environment is enormous, so such nucleation cannot take place either in lakes or in living cells; (ii) ice nucleation needs some ice-binding surfaces to occur; but (iii) even this kind of ice nucleation can take place, as a rule, only at temperatures that are a few degrees below 0 °C. Further, I discuss factors that can drastically reduce the ice nucleation time at nearly-zero temperatures both in open reservoirs, where water is in contact with air, and in cells, where there is no such contact.

Submitted 31 August, 2020; originally announced August 2020.

Comments: 15 pages, 5 figures, 2 tables

arXiv:2008.00141 [pdf, other]

Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report

Authors: Jing Shi, Zhiheng Li, Haitian Zheng, Yihang Xu, Tianyou Xiao, Weitao Tan, Xiaoning Guo, Sizhe Li, Bin Yang, Zhexin Xu, Ruitao Lin, Zhongkai Shangguan, Yue Zhao, Jingwen Wang, Rohan Sharma, Surya Iyer, Ajinkya Deshmukh, Raunak Mahalik, Srishti Singh, Jayant G Rohra, Yipeng Zhang, Tongyu Yang, Xuan Wen, Ethan Fahnestock, Bryce Ikeda, et al. (8 additional authors not shown)

Abstract: This technical report summarizes and compiles submissions from the Actor-Action video classification challenge, held as a final project in the CSC 249/449 Machine Vision course (Spring 2020) at the University of Rochester.

Submitted 18 August, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

arXiv:2006.05694 [pdf, other]

HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Authors: Jiaqi Su, Zeyu Jin, Adam Finkelstein

Abstract: Real-world audio recordings are often degraded by factors such as noise, reverberation, and equalization distortion. This paper introduces HiFi-GAN, a deep learning method to transform recorded speech to sound as though it had been recorded in a studio. We use an end-to-end feed-forward WaveNet architecture, trained with multi-scale adversarial discriminators in both the time domain and the time-frequency domain. It relies on the deep feature matching losses of the discriminators to improve the perceptual quality of enhanced speech. The proposed model generalizes well to new speakers, new speech content, and new environments. It significantly outperforms state-of-the-art baseline methods in both objective and subjective experiments.

Submitted 21 September, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

Comments: Accepted by INTERSPEECH 2020

arXiv:2005.11414 [pdf]

Data as Infrastructure for Smart Cities: Linking Data Platforms to Business Strategies

Authors: Larissa Romualdo-Suzuki, Anthony Finkelstein

Abstract: The systems that operate the infrastructure of cities have evolved in a fragmented fashion across several generations of technology, causing city utilities and services to operate sub-optimally, limiting the creation of new value-added services and restricting opportunities for cost-saving. The integration of cross-domain city data offers a new wave of opportunities to mitigate some of these impacts and enables city systems to draw effectively on interoperable data that will be used to deliver smarter cities. Despite the considerable potential of city data, current smart cities initiatives have mainly addressed the problem of data management from a technology perspective, and have disregarded stakeholders and data needs. As a consequence, such initiatives are susceptible to failure from inadequate stakeholder input, neglected requirements, and information fragmentation and overload. They are also likely to be limited in terms of both scalability and future proofing against technological, commercial and legislative change. This paper proposes a systematic business-model-driven framework to guide the design of large and highly interconnected data infrastructures which are provided and supported by multiple stakeholders. The framework is used to model, elicit and reason about the requirements of the service, technology, organization, value, and governance aspects of smart cities. The requirements serve as an input to a closed-loop supply chain model, which is designed and managed to explicitly consider the activities and processes that enable the stakeholders of smart cities to efficiently leverage their collective knowledge. We demonstrate how our approach can be used to design data infrastructures by examining a series of exemplary scenarios and by demonstrating how our approach handles the holistic design of a data infrastructure and informs the decision-making process.

Submitted 22 May, 2020; originally announced May 2020.

arXiv:2001.04460 [pdf, other]

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Authors: Pranay Manocha, Adam Finkelstein, Richard Zhang, Nicholas J. Bryan, Gautham J. Mysore, Zeyu Jin

Abstract: Many audio processing tasks require perceptual assessment. The "gold standard" of obtaining human judgments is time-consuming, expensive, and cannot be used as an optimization criterion. On the other hand, automated metrics are efficient to compute but often correlate poorly with human judgment, particularly for audio differences at the threshold of human detection. In this work, we construct a metric by fitting a deep neural network to a new large dataset of crowdsourced human judgments. Subjects are prompted to answer a straightforward, objective question: are two recordings identical or not? These pairs are algorithmically generated under a variety of perturbations, including noise, reverb, and compression artifacts; the perturbation space is probed with the goal of efficiently identifying the just-noticeable difference (JND) level of the subject. We show that the resulting learned metric is well-calibrated with human judgments, outperforming baseline methods. Since it is a deep network, the metric is differentiable, making it suitable as a loss function for other tasks. Thus, simply replacing an existing loss (e.g., deep feature loss) with our metric yields significant improvement in a denoising network, as measured by subjective pairwise comparison.

Submitted 18 May, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

Comments: Dataset, code and sound examples can be found at https://pixl.cs.princeton.edu/pubs/Manocha_2020_ADP/

arXiv:1906.03193 [pdf, other]

Fighting Quantization Bias With Bias

Authors: Alexander Finkelstein, Uri Almog, Mark Grobman

Abstract: Low-precision representation of deep neural networks (DNNs) is critical for efficient deployment of deep learning applications on embedded platforms; however, converting the network to low precision degrades its performance. Crucially, networks that are designed for embedded applications usually suffer from increased degradation since they have less redundancy. This is most evident for the ubiquitous MobileNet architecture, which requires a costly quantization-aware training cycle to achieve acceptable performance when quantized to 8 bits. In this paper, we trace the source of the degradation in MobileNets to a shift in the mean activation value. This shift is caused by an inherent bias in the quantization process which builds up across layers, shifting all network statistics away from the learned distribution. We show that this phenomenon happens in other architectures as well. We propose a simple remedy: compensating for the quantization-induced shift by adding a constant to the additive bias term of each channel. We develop two simple methods for estimating the correction constants: one using iterative evaluation of the quantized network and one where the constants are set using a short training phase. Both methods are fast and require only a small amount of unlabeled data, making them appealing for rapid deployment of neural networks. Using the above methods we are able to match the performance of training-based quantization of MobileNets at a fraction of the cost.

Submitted 7 June, 2019; originally announced June 2019.

Comments: Accepted to ECV workshop at CVPR2019
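
The remedy described in the abstract above (adding a constant to each channel's bias to cancel the quantization-induced shift in the mean activation) can be sketched for a single linear channel. This is a simplified one-shot estimate under stated assumptions, not either of the two estimation methods the authors develop: the floor-based quantizer, the names `quantize_floor` and `channel_out`, and the random calibration data are all hypothetical.

```python
import math
import random


def quantize_floor(w, step):
    """Truncating quantizer (assumed for illustration); it rounds toward
    minus infinity and therefore biases every weight downward."""
    return math.floor(w / step) * step


def channel_out(weights, bias, x):
    """Output of one linear channel: dot(weights, x) + bias."""
    return sum(w * xi for w, xi in zip(weights, x)) + bias


def mean(values):
    return sum(values) / len(values)


rng = random.Random(0)
weights = [rng.uniform(-1, 1) for _ in range(16)]
bias = 0.1
q_weights = [quantize_floor(w, step=0.1) for w in weights]

# Unlabeled calibration inputs (non-negative, e.g. post-ReLU activations).
calib = [[rng.uniform(0, 1) for _ in range(16)] for _ in range(200)]

mean_fp = mean([channel_out(weights, bias, x) for x in calib])
mean_q = mean([channel_out(q_weights, bias, x) for x in calib])

# The shift in the mean output is folded back into the bias term.
shift = mean_q - mean_fp
corrected_bias = bias - shift
mean_corrected = mean([channel_out(q_weights, corrected_bias, x) for x in calib])
print(shift, abs(mean_corrected - mean_fp))
```

Only inputs are needed to estimate the shift, which matches the abstract's point that a small amount of unlabeled data suffices; in a multi-layer network the correction would have to be estimated layer by layer, which this sketch does not attempt.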

arXiv:1906.01524 [pdf, other]

Text-based Editing of Talking-head Video

Authors: Ohad Fried, Ayush Tewari, Michael Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan B Goldman, Kyle Genova, Zeyu Jin, Christian Theobalt, Maneesh Agrawala

Abstract: Editing talking-head video to change the speech content or to remove filler words is challenging. We propose a novel method to edit talking-head video based on its transcript to produce a realistic output video in which the dialogue of the speaker has been modified, while maintaining a seamless audio-visual flow (i.e. no jump cuts). Our method automatically annotates an input talking-head video with phonemes, visemes, 3D face pose and geometry, reflectance, expression and scene illumination per frame. To edit a video, the user only has to edit the transcript, and an optimization strategy then chooses segments of the input corpus as base material. The annotated parameters corresponding to the selected segments are seamlessly stitched together and used to produce an intermediate video representation in which the lower half of the face is rendered with a parametric face model. Finally, a recurrent video generation network transforms this representation to a photorealistic video that matches the edited transcript. We demonstrate a large variety of edits, such as the addition, removal, and alteration of words, as well as convincing language translation and full sentence synthesis.

Submitted 4 June, 2019; originally announced June 2019.

Comments: A version with higher resolution images can be downloaded from the authors' website

arXiv:1902.01917 [pdf, other]

Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization

Authors: Eldad Meller, Alexander Finkelstein, Uri Almog, Mark Grobman

Abstract: Quantization of neural networks has become common practice, driven by the need for efficient implementations of deep neural networks on embedded devices. In this paper, we exploit an oft-overlooked degree of freedom in most networks: for a given layer, individual output channels can be scaled by any factor provided that the corresponding weights of the next layer are inversely scaled. Therefore, a given network has many factorizations which change the weights of the network without changing its function. We present a conceptually simple and easy-to-implement method that uses this property and show that proper factorizations significantly decrease the degradation caused by quantization. We show improvement on a wide variety of networks and achieve state-of-the-art degradation results for MobileNets. While our focus is on quantization, this type of factorization is applicable to other domains such as network pruning, neural net regularization and network interpretability.

Submitted 5 February, 2019; originally announced February 2019.
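
The degree of freedom named in the abstract above (scaling a layer's output channels while inversely scaling the next layer's weights) can be checked numerically on a toy network. This is a minimal sketch under stated assumptions: a two-layer network with a ReLU in between, positive scale factors (ReLU commutes only with positive scaling), and hypothetical helper names `forward` and `rescale`. It demonstrates the invariance only, not the authors' procedure for choosing factors.

```python
import random


def relu(v):
    return [max(0.0, x) for x in v]


def matvec(W, x):
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def forward(W1, b1, W2, x):
    """y = W2 @ relu(W1 @ x + b1)"""
    hidden = relu([h + b for h, b in zip(matvec(W1, x), b1)])
    return matvec(W2, hidden)


def rescale(W1, b1, W2, scales):
    """Scale output channel i of layer 1 by scales[i] and the matching
    input column of layer 2 by 1 / scales[i]."""
    W1s = [[s * w for w in row] for row, s in zip(W1, scales)]
    b1s = [s * b for b, s in zip(b1, scales)]
    W2s = [[w / s for w, s in zip(row, scales)] for row in W2]
    return W1s, b1s, W2s


rng = random.Random(0)
W1 = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
b1 = [rng.uniform(-1, 1) for _ in range(4)]
W2 = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(2)]
x = [rng.uniform(-1, 1) for _ in range(3)]
scales = [0.5, 2.0, 4.0, 0.25]  # arbitrary positive factors

y_ref = forward(W1, b1, W2, x)
y_new = forward(*rescale(W1, b1, W2, scales), x)
max_diff = max(abs(a - b) for a, b in zip(y_ref, y_new))
print(max_diff)
```

Because the function is unchanged, the scale factors are free to be chosen for other goals, such as evening out per-channel weight ranges before quantization.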

arXiv:1810.08012 [pdf]

Sound Absorption in Replicated Aluminum Foam

Authors: Arcady Finkelstein, Eugene Furman, Dmitry Husnullin, Borodianskiy Konstantin

Abstract: Sound absorption is an important technological task in machine-building and civil engineering. Porous materials are traditionally used for these purposes, as they are neither ignitable nor hygroscopic and are thus suitable for noise suppression, primarily in means of transportation. Absorption of acoustic oscillation energy in porous metals occurs mainly due to viscous friction. A theoretical description of viscous energy dissipation in porous media on the basis of Rayleigh's classical model is given in [1], whereas the modern level of theory is set forth in the Johnson-Champoux-Allard model [2]. Attempts to utilize aluminum foam, the cheapest porous metal, for sound absorption rely on forming an open porous structure by rolling [3] or by heat treatment [4]. However, the sound absorption ratio of metal foam presented in these papers does not rise above 80%, whereas it reaches 99.9% over a wide frequency range for conventional sound-absorption materials (e.g. glass wool). The problem with foamed metal is the considerable reflection of acoustic waves from its surface.

Submitted 18 October, 2018; originally announced October 2018.

arXiv:1805.09544 [pdf, other]

Testing the influence of acceleration on time dilation using a rotating Mössbauer absorber

Authors: Y. Friedman, J. M. Steiner, S. Livshitz, E. Perez, I. Nowik, I. Felner, H. -C. Wille, G. Wortmann, O. Efrati, A. Finkelstein, S. Petitgirard, A. I. Chumakov, D. Bessas

Abstract: The aim of the experiment series was to test the influence of acceleration on time dilation by measuring the relative spectral shift between the resonance spectra of a rotating Mössbauer absorber with acceleration anti-parallel and parallel to the direction of the incident beam. Based on the experience and know-how acquired in our previous experiments, we collected data for rotation frequencies up to 510 Hz in both directions of rotation and also used different slits. For each run with high rotation, we observed a stable, statistically significant relative shift between the spectra of the two states with opposite acceleration. This indicates the influence of acceleration on time dilation. However, we found that this shift also depends on the choice of the slit and on the direction of rotation. These new unexpected findings, resulting from the loss of symmetry in obtaining the resonant lines in the two states, could overshadow the relative shift due to acceleration. This loss of symmetry is caused by the deflection of the radiative decay due to the Nuclear Lighthouse effect from the rotating Mössbauer absorber. We also found that it is impossible to keep the alignment (between the optical and the dynamical rotor systems) with the accuracy needed for such an experiment for long runs, which reduced the accuracy of the observed relative shift. These issues prevent us from claiming with certainty the influence of acceleration on time dilation using the currently available technology. Improved KB optics with a focal spot of less than 1 micron, to avoid the use of a slit, and a more rigid mounting of the rotor system are necessary for the success of such an experiment. Hopefully, these findings, together with the indispensable plan for a conclusive experiment presented in the paper, will prove useful to future experimentalists wishing to pursue such an experiment.

Submitted 24 May, 2018; originally announced May 2018.

arXiv:1710.10687 [pdf, other]

High-Precision Localization Using Ground Texture

Authors: Linguang Zhang, Adam Finkelstein, Szymon Rusinkiewicz

Abstract: Location-aware applications play an increasingly critical role in everyday life. However, satellite-based localization (e.g., GPS) has limited accuracy and can be unusable in dense urban areas and indoors. We introduce an image-based global localization system that is accurate to a few millimeters and performs reliable localization both indoors and outside. The key idea is to capture and index distinctive local keypoints in ground textures. This is based on the observation that ground textures including wood, carpet, tile, concrete, and asphalt may look random and homogeneous, but all contain cracks, scratches, or unique arrangements of fibers. These imperfections are persistent, and can serve as local features. Our system incorporates a downward-facing camera to capture the fine texture of the ground, together with an image processing pipeline that locates the captured texture patch in a compact database constructed offline. We demonstrate the capability of our system to robustly, accurately, and quickly locate test images on various types of outdoor and indoor ground surfaces.

Submitted 26 June, 2019; v1 submitted 29 October, 2017; originally announced October 2017.

arXiv:1701.00220 [pdf]

Classification of Smartphone Users Using Internet Traffic

Authors: Andrey Finkelstein, Ron Biton, Rami Puzis, Asaf Shabtai

Abstract: Today, smartphone devices are owned by a large portion of the population and have become a very popular platform for accessing the Internet. Smartphones provide the user with immediate access to information and services. However, they can easily expose the user to many privacy risks. Applications that are installed on the device and entities with access to the device's Internet traffic can reveal private information about the smartphone user and steal sensitive content stored on the device or transmitted by the device over the Internet. In this paper, we present a method to reveal various demographics and technical computer skills of smartphone users by their Internet traffic records, using machine learning classification models. We implement and evaluate the method on real life data of smartphone users and show that smartphone users can be classified by their gender, smoking habits, software programming experience, and other characteristics.

Submitted 1 January, 2017; originally announced January 2017.

arXiv:1504.06755 [pdf, other]

TurkerGaze: Crowdsourcing Saliency with Webcam based Eye Tracking

Authors: Pingmei Xu, Krista A Ehinger, Yinda Zhang, Adam Finkelstein, Sanjeev R. Kulkarni, Jianxiong Xiao

Abstract: Traditional eye tracking requires specialized hardware, which means collecting gaze data from many observers is expensive, tedious and slow. Therefore, existing saliency prediction datasets are orders of magnitude smaller than typical datasets for other vision recognition tasks. The small size of these datasets limits the potential for training data-intensive algorithms, and causes overfitting in benchmark evaluation. To address this deficiency, this paper introduces a webcam-based gaze tracking system that supports large-scale, crowdsourced eye tracking deployed on Amazon Mechanical Turk (AMTurk). By a combination of careful algorithm and gaming protocol design, our system obtains eye tracking data for saliency prediction comparable to data gathered in a traditional lab setting, with relatively lower cost and less effort on the part of the researchers. Using this tool, we build a saliency dataset for a large number of natural images. We will open-source our tool and provide a web server where researchers can upload their images to get eye tracking results from AMTurk.

Submitted 20 May, 2015; v1 submitted 25 April, 2015; originally announced April 2015.

Comments: 9 pages, 14 figures

arXiv:1405.1621 [pdf]

Characteristic time of crossing a long free energy barrier

Authors: Alexei V. Finkelstein

Abstract: This short paper presents a simple approximate analytical estimate of the characteristic time of crossing a high, long and arbitrarily bumpy free energy barrier in the course of a chemical, biochemical or physical reaction.

Submitted 7 May, 2014; originally announced May 2014.

arXiv:1301.1925 [pdf, other]

doi 10.1103/PhysRevA.87.043636

Effective theory for the propagation of a wave-packet in a disordered and nonlinear medium

Authors: G. Schwiete, A. M. Finkelstein

Abstract: The propagation of a wave-packet in a nonlinear disordered medium exhibits interesting dynamics. Here, we present an analysis based on the nonlinear Schrödinger equation (Gross-Pitaevskii equation). This problem is directly connected to experiments on expanding Bose gases and to studies of transverse localization in nonlinear optical media. In a nonlinear medium the energy of the wave-packet is stored both in the kinetic and potential parts, and details of its propagation are to a large extent determined by the transfer from one form of energy to the other. A theory describing the evolution of the wave-packet has been formulated in [G. Schwiete and A. Finkelstein, Phys. Rev. Lett. 104, 103904 (2010)] in terms of a nonlinear kinetic equation. In this paper, we present details of the derivation of the kinetic equation and of its analysis. As an important new ingredient we study interparticle-collisions induced by the nonlinearity and derive the corresponding collision integral. We restrict ourselves to the weakly nonlinear limit, for which disorder scattering is the dominant scattering mechanism. We find that in the special case of a white noise impurity potential the mean squared radius in a two-dimensional system scales linearly with t. This result has previously been obtained in the collisionless limit, but it also holds in the presence of collisions. Finally, we mention different mechanisms through which the nonlinearity may influence localization of the expanding wave-packet.

Submitted 9 January, 2013; originally announced January 2013.

Comments: 21 pages, 10 figures

Journal ref: Phys. Rev. A 87, 043636 (2013)

arXiv:1012.5506 [pdf, other]

Ontology-based Queries over Cancer Data

Authors: Alejandra Gonzalez-Beltran, Ben Tagger, Anthony Finkelstein

Abstract: The ever-increasing amount of data in biomedical research, and in cancer research in particular, needs to be managed to support efficient data access, exchange and integration. Existing software infrastructures, such as caGrid, support access to distributed information annotated with a domain ontology. However, caGrid's current querying functionality depends on the structure of individual data resources without exploiting the semantic annotations. In this paper, we present the design and development of an ontology-based querying functionality that consists of: the generation of OWL2 ontologies from the underlying data resources' metadata and a query rewriting and translation process based on reasoning, which converts a query at the domain ontology level into queries at the software infrastructure level. We present a detailed analysis of our approach as well as an extensive performance evaluation. While the implementation and evaluation were performed for the caGrid infrastructure, the approach could be applicable to other model and metadata-driven environments for data sharing.

Submitted 26 December, 2010; originally announced December 2010.

Comments: in Adrian Paschke, Albert Burger, Andrea Splendiani, M. Scott Marshall, Paolo Romano: Proceedings of the 3rd International Workshop on Semantic Web Applications and Tools for the Life Sciences, Berlin,Germany, December 8-10, 2010

Report number: SWAT4LS 2010 ACM Class: J.3

arXiv:1010.0726 [pdf, other]

An effective theory of pulse propagation in a nonlinear and disordered medium in two dimensions

Authors: G. Schwiete, A. M. Finkelstein

Abstract: We develop an effective theory of pulse propagation in a nonlinear and disordered medium. The theory is formulated in terms of a nonlinear diffusion equation. Despite its apparent simplicity this equation describes novel phenomena which we refer to as "locked explosion" and "diffusive" collapse. The equation can be applied to such distinct physical systems as laser beams propagating in disordered photonic crystals or Bose-Einstein condensates expanding in a disordered environment.

Submitted 4 October, 2010; originally announced October 2010.

Comments: 15 pages, 11 figures. Prepared for "Perspectives of Mesoscopic Physics - Dedicated to Prof Yoseph Imry's 70th Birthday". This article is based on arXiv:0905.4722 [PRL 104, 103904 (2010)] and contains additional information and illustrations

arXiv:1001.4968

A Non-Coprehensive Survey Of Integration Methods In Discrete Geometry

Authors: Amir Finkelstein

Abstract: The paper offers a short survey of integration algorithms that have evolved since 1982. These theorems and algorithms form discrete versions of the calculus theorems.

Submitted 27 April, 2014; v1 submitted 27 January, 2010; originally announced January 2010.

Comments: This paper has been withdrawn by the author. This paper is a draft to the paper "Applying Semi-discrete Operators to Calculus", arXiv:1012.5751

arXiv:0912.4891

A Dual Approach To The Advanced Calculus Via Lebesgue's Integral

Authors: Amir Finkelstein

Abstract: The paper suggests a slightly more rigorous justification of Wang et al.'s work from 2007, and introduces the Slanted Line Integral.

Submitted 27 April, 2014; v1 submitted 24 December, 2009; originally announced December 2009.

Comments: This paper has been withdrawn by the author. This paper is a draft to the paper "Applying Semi-discrete Operators to Calculus", arXiv:1012.5751

arXiv:physics/0612139 [pdf]

Energy Barrier for an Ion Crossing an Intra-Membrane Channel

Authors: Alexei V. Finkelstein, Dmitry N. Ivankov, Alexander M. Dykhne

Abstract: We present a simple approximate analytical estimate for the self-energy of a charge in the middle of a cylindrical channel of high permittivity epsilon_1 in a medium of low permittivity epsilon_2 (for the cases of infinitely long and comparatively short channels) and show that this estimate is in good quantitative agreement with the exact solution of the Poisson equation. Further, using these estimates, we explain the observed lower conductivity, caused by an increased self-free-energy, for ions whose diameter is ~1 angstrom less than that of the channel (as compared to ions whose diameter is equal to that of the channel).

Submitted 14 December, 2006; originally announced December 2006.

Comments: 6 pages, 2 figures, 1 table, 6 references

Showing 1–26 of 26 results for author: Finkelstein, A