Search | arXiv e-print repository

Spectral CT Two-step and One-step Material Decomposition using Diffusion Posterior Sampling

Authors: Corentin Vazia, Alexandre Bousse, Jacques Froment, Béatrice Vedel, Franck Vermet, Zhihan Wang, Thore Dassow, Jean-Pierre Tasu, Dimitris Visvikis

Abstract: This paper proposes a novel approach to spectral computed tomography (CT) material decomposition that uses the recent advances in generative diffusion models (DMs) for inverse problems. Spectral CT and more particularly photon-counting CT (PCCT) can perform transmission measurements at different energy levels which can be used for material decomposition. It is an ill-posed inverse problem and ther… ▽ More This paper proposes a novel approach to spectral computed tomography (CT) material decomposition that uses the recent advances in generative diffusion models (DMs) for inverse problems. Spectral CT and more particularly photon-counting CT (PCCT) can perform transmission measurements at different energy levels which can be used for material decomposition. It is an ill-posed inverse problem and therefore requires regularization. DMs are a class of generative model that can be used to solve inverse problems via diffusion posterior sampling (DPS). In this paper we adapt DPS for material decomposition in a PCCT setting. We propose two approaches, namely Two-step Diffusion Posterior Sampling (TDPS) and One-step Diffusion Posterior Sampling (ODPS). Early results from an experiment with simulated low-dose PCCT suggest that DPSs have the potential to outperform state-of-the-art model-based iterative reconstruction (MBIR). Moreover, our results indicate that TDPS produces material images with better peak signal-to-noise ratio (PSNR) than images produced with ODPS with similar structural similarity (SSIM). △ Less

Submitted 23 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: 5 pages, 3 figures, submitted to EUSIPCO 2024

arXiv:2403.06308 [pdf, other]

Diffusion Posterior Sampling for Synergistic Reconstruction in Spectral Computed Tomography

Authors: Corentin Vazia, Alexandre Bousse, Béatrice Vedel, Franck Vermet, Zhihan Wang, Thore Dassow, Jean-Pierre Tasu, Dimitris Visvikis, Jacques Froment

Abstract: Using recent advances in generative artificial intelligence (AI) brought by diffusion models, this paper introduces a new synergistic method for spectral computed tomography (CT) reconstruction. Diffusion models define a neural network to approximate the gradient of the log-density of the training data, which is then used to generate new images similar to the training ones. Following the inverse p… ▽ More Using recent advances in generative artificial intelligence (AI) brought by diffusion models, this paper introduces a new synergistic method for spectral computed tomography (CT) reconstruction. Diffusion models define a neural network to approximate the gradient of the log-density of the training data, which is then used to generate new images similar to the training ones. Following the inverse problem paradigm, we propose to adapt this generative process to synergistically reconstruct multiple images at different energy bins from multiple measurements. The experiments suggest that using multiple energy bins simultaneously improves the reconstruction by inverse diffusion and outperforms state-of-the-art synergistic reconstruction techniques. △ Less

Submitted 15 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

Comments: 5 pages, 2 figures, IEEE ISBI 2024

arXiv:2312.14698 [pdf, other]

Time-changed normalizing flows for accurate SDE modeling

Authors: Naoufal El Bekri, Lucas Drumetz, Franck Vermet

Abstract: The generative paradigm has become increasingly important in machine learning and deep learning models. Among popular generative models are normalizing flows, which enable exact likelihood estimation by transforming a base distribution through diffeomorphic transformations. Extending the normalizing flow framework to handle time-indexed flows gave dynamic normalizing flows, a powerful tool to mode… ▽ More The generative paradigm has become increasingly important in machine learning and deep learning models. Among popular generative models are normalizing flows, which enable exact likelihood estimation by transforming a base distribution through diffeomorphic transformations. Extending the normalizing flow framework to handle time-indexed flows gave dynamic normalizing flows, a powerful tool to model time series, stochastic processes, and neural stochastic differential equations (SDEs). In this work, we propose a novel variant of dynamic normalizing flows, a Time Changed Normalizing Flow (TCNF), based on time deformation of a Brownian motion which constitutes a versatile and extensive family of Gaussian processes. This approach enables us to effectively model some SDEs, that cannot be modeled otherwise, including standard ones such as the well-known Ornstein-Uhlenbeck process, and generalizes prior methodologies, leading to improved results and better inference and prediction capability. △ Less

Submitted 15 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

arXiv:2311.11900 [pdf, other]

Measuring and Mitigating Biases in Motor Insurance Pricing

Authors: Mulah Moriah, Franck Vermet, Arthur Charpentier

Abstract: The non-life insurance sector operates within a highly competitive and tightly regulated framework, confronting a pivotal juncture in the formulation of pricing strategies. Insurers are compelled to harness a range of statistical methodologies and available data to construct optimal pricing structures that align with the overarching corporate strategy while accommodating the dynamics of market com… ▽ More The non-life insurance sector operates within a highly competitive and tightly regulated framework, confronting a pivotal juncture in the formulation of pricing strategies. Insurers are compelled to harness a range of statistical methodologies and available data to construct optimal pricing structures that align with the overarching corporate strategy while accommodating the dynamics of market competition. Given the fundamental societal role played by insurance, premium rates are subject to rigorous scrutiny by regulatory authorities. These rates must conform to principles of transparency, explainability, and ethical considerations. Consequently, the act of pricing transcends mere statistical calculations and carries the weight of strategic and societal factors. These multifaceted concerns may drive insurers to establish equitable premiums, taking into account various variables. For instance, regulations mandate the provision of equitable premiums, considering factors such as policyholder gender or mutualist group dynamics in accordance with respective corporate strategies. Age-based premium fairness is also mandated. In certain insurance domains, variables such as the presence of serious illnesses or disabilities are emerging as new dimensions for evaluating fairness. Regardless of the motivating factor prompting an insurer to adopt fairer pricing strategies for a specific variable, the insurer must possess the capability to define, measure, and ultimately mitigate any ethical biases inherent in its pricing practices while upholding standards of consistency and performance. This study seeks to provide a comprehensive set of tools for these endeavors and assess their effectiveness through practical application in the context of automobile insurance. △ Less

Submitted 20 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

arXiv:2311.00666 [pdf, other]

doi 10.1109/TRPMS.2023.3330045

Uconnect: Synergistic Spectral CT Reconstruction with U-Nets Connecting the Energy bins

Authors: Zhihan Wang, Alexandre Bousse, Franck Vermet, Jacques Froment, Béatrice Vedel, Alessandro Perelli, Jean-Pierre Tasu, Dimitris Visvikis

Abstract: Spectral computed tomography (CT) offers the possibility to reconstruct attenuation images at different energy levels, which can be then used for material decomposition. However, traditional methods reconstruct each energy bin individually and are vulnerable to noise. In this paper, we propose a novel synergistic method for spectral CT reconstruction, namely Uconnect. It utilizes trained convoluti… ▽ More Spectral computed tomography (CT) offers the possibility to reconstruct attenuation images at different energy levels, which can be then used for material decomposition. However, traditional methods reconstruct each energy bin individually and are vulnerable to noise. In this paper, we propose a novel synergistic method for spectral CT reconstruction, namely Uconnect. It utilizes trained convolutional neural networks (CNNs) to connect the energy bins to a latent image so that the full binned data is used synergistically. We experiment on two types of low-dose data: simulated and real patient data. Qualitative and quantitative analysis show that our proposed Uconnect outperforms state-of-art model-based iterative reconstruction (MBIR) techniques as well as CNN-based denoising. △ Less

Submitted 22 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

arXiv:2210.08842 [pdf, other]

Geometry-preserving Lie Group Integrators For Differential Equations On The Manifold Of Symmetric Positive Definite Matrices

Authors: Lucas Drumetz, Alexandre Reiffers-Masson, Naoufal El Bekri, Franck Vermet

Abstract: In many applications, one encounters signals that lie on manifolds rather than a Euclidean space. In particular, covariance matrices are examples of ubiquitous mathematical objects that have a non Euclidean structure. The application of Euclidean methods to integrate differential equations lying on such objects does not respect the geometry of the manifold, which can cause many numerical issues. I… ▽ More In many applications, one encounters signals that lie on manifolds rather than a Euclidean space. In particular, covariance matrices are examples of ubiquitous mathematical objects that have a non Euclidean structure. The application of Euclidean methods to integrate differential equations lying on such objects does not respect the geometry of the manifold, which can cause many numerical issues. In this paper, we propose to use Lie group methods to define geometry-preserving numerical integration schemes on the manifold of symmetric positive definite matrices. These can be applied to a number of differential equations on covariance matrices of practical interest. We show that they are more stable and robust than other classical or naive integration schemes on an example. △ Less

Submitted 15 May, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

arXiv:2209.15398 [pdf, other]

doi 10.1007/978-3-031-15565-9_1

Evaluation of importance estimators in deep learning classifiers for Computed Tomography

Authors: Lennart Brocki, Wistan Marchadour, Jonas Maison, Bogdan Badic, Panagiotis Papadimitroulas, Mathieu Hatt, Franck Vermet, Neo Christopher Chung

Abstract: Deep learning has shown superb performance in detecting objects and classifying images, ensuring a great promise for analyzing medical imaging. Translating the success of deep learning to medical imaging, in which doctors need to understand the underlying process, requires the capability to interpret and explain the prediction of neural networks. Interpretability of deep neural networks often reli… ▽ More Deep learning has shown superb performance in detecting objects and classifying images, ensuring a great promise for analyzing medical imaging. Translating the success of deep learning to medical imaging, in which doctors need to understand the underlying process, requires the capability to interpret and explain the prediction of neural networks. Interpretability of deep neural networks often relies on estimating the importance of input features (e.g., pixels) with respect to the outcome (e.g., class probability). However, a number of importance estimators (also known as saliency maps) have been developed and it is unclear which ones are more relevant for medical imaging applications. In the present work, we investigated the performance of several importance estimators in explaining the classification of computed tomography (CT) images by a convolutional deep network, using three distinct evaluation metrics. First, the model-centric fidelity measures a decrease in the model accuracy when certain inputs are perturbed. Second, concordance between importance scores and the expert-defined segmentation masks is measured on a pixel level by a receiver operating characteristic (ROC) curves. Third, we measure a region-wise overlap between a XRAI-based map and the segmentation mask by Dice Similarity Coefficients (DSC). Overall, two versions of SmoothGrad topped the fidelity and ROC rankings, whereas both Integrated Gradients and SmoothGrad excelled in DSC evaluation. Interestingly, there was a critical discrepancy between model-centric (fidelity) and human-centric (ROC and DSC) evaluation. Expert expectation and intuition embedded in segmentation maps does not necessarily align with how the model arrived at its prediction. Understanding this difference in interpretability would help harnessing the power of deep learning in medicine. △ Less

Submitted 30 September, 2022; originally announced September 2022.

Comments: 4th International Workshop on EXplainable and TRAnsparent AI and Multi-Agent Systems (EXTRAAMAS 2022) - International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS)

Journal ref: 2022 EXTRAAMAS 2022, Lecture Notes in Computer Science (LNAI, volume 13283)

arXiv:2209.00562 [pdf, other]

Model Transparency and Interpretability : Survey and Application to the Insurance Industry

Authors: Dimitri Delcaillau, Antoine Ly, Alize Papp, Franck Vermet

Abstract: The use of models, even if efficient, must be accompanied by an understanding at all levels of the process that transforms data (upstream and downstream). Thus, needs increase to define the relationships between individual data and the choice that an algorithm could make based on its analysis (e.g. the recommendation of one product or one promotional offer, or an insurance rate representative of t… ▽ More The use of models, even if efficient, must be accompanied by an understanding at all levels of the process that transforms data (upstream and downstream). Thus, needs increase to define the relationships between individual data and the choice that an algorithm could make based on its analysis (e.g. the recommendation of one product or one promotional offer, or an insurance rate representative of the risk). Model users must ensure that models do not discriminate and that it is also possible to explain their results. This paper introduces the importance of model interpretation and tackles the notion of model transparency. Within an insurance context, it specifically illustrates how some tools can be used to enforce the control of actuarial models that can nowadays leverage on machine learning. On a simple example of loss frequency estimation in car insurance, we show the interest of some interpretability methods to adapt explanation to the target audience. △ Less

Submitted 1 September, 2022; originally announced September 2022.

Comments: Accepted to European Actuarial Journal

arXiv:2107.02764 [pdf, other]

Collaborative Insurance Sustainability and Network Structure

Authors: Arthur Charpentier, Lariosse Kouakou, Matthias Löwe, Philipp Ratz, Franck Vermet

Abstract: The peer-to-peer (P2P) economy has been growing with the advent of the Internet, with well known brands such as Uber or Airbnb being examples thereof. In the insurance sector the approach is still in its infancy, but some companies have started to explore P2P-based collaborative insurance products (eg. Lemonade in the U.S. or Inspeer in France). The actuarial literature only recently started to co… ▽ More The peer-to-peer (P2P) economy has been growing with the advent of the Internet, with well known brands such as Uber or Airbnb being examples thereof. In the insurance sector the approach is still in its infancy, but some companies have started to explore P2P-based collaborative insurance products (eg. Lemonade in the U.S. or Inspeer in France). The actuarial literature only recently started to consider those risk sharing mechanisms, as in Denuit and Robert (2021) or Feng et al. (2021). In this paper, describe and analyse such a P2P product, with some reciprocal risk sharing contracts. Here, we consider the case where policyholders still have an insurance contract, but the first self-insurance layer, below the deductible, can be shared with friends. We study the impact of the shape of the network (through the distribution of degrees) on the risk reduction. We consider also some optimal setting of the reciprocal commitments, and discuss the introduction of contracts with friends of friends to mitigate some possible drawbacks of having people without enough connections to exchange risks. △ Less

Submitted 12 September, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

arXiv:2009.14702 [pdf, ps, other]

doi 10.1007/s10955-021-02727-z

Some Remarks on Replicated Simulated Annealing

Authors: Vincent Gripon, Matthias Löwe, Franck Vermet

Abstract: Recently authors have introduced the idea of training discrete weights neural networks using a mix between classical simulated annealing and a replica ansatz known from the statistical physics literature. Among other points, they claim their method is able to find robust configurations. In this paper, we analyze this so-called "replicated simulated annealing" algorithm. In particular, we explicit… ▽ More Recently authors have introduced the idea of training discrete weights neural networks using a mix between classical simulated annealing and a replica ansatz known from the statistical physics literature. Among other points, they claim their method is able to find robust configurations. In this paper, we analyze this so-called "replicated simulated annealing" algorithm. In particular, we explicit criteria to guarantee its convergence, and study when it successfully samples from configurations. We also perform experiments using synthetic and real data bases. △ Less

Submitted 2 December, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

arXiv:2007.12919 [pdf, other]

Interpretabilité des modèles : état des lieux des méthodes et application à l'assurance

Authors: Dimitri Delcaillau, Antoine Ly, Franck Vermet, Alizé Papp

Abstract: Since May 2018, the General Data Protection Regulation (GDPR) has introduced new obligations to industries. By setting a legal framework, it notably imposes strong transparency on the use of personal data. Thus, people must be informed of the use of their data and must consent the usage of it. Data is the raw material of many models which today make it possible to increase the quality and performa… ▽ More Since May 2018, the General Data Protection Regulation (GDPR) has introduced new obligations to industries. By setting a legal framework, it notably imposes strong transparency on the use of personal data. Thus, people must be informed of the use of their data and must consent the usage of it. Data is the raw material of many models which today make it possible to increase the quality and performance of digital services. Transparency on the use of data also requires a good understanding of its use through different models. The use of models, even if efficient, must be accompanied by an understanding at all levels of the process that transform data (upstream and downstream of a model), thus making it possible to define the relationships between the individual's data and the choice that an algorithm could make based on the analysis of the latter. (For example, the recommendation of one product or one promotional offer or an insurance rate representative of the risk.) Models users must ensure that models do not discriminate against and that it is also possible to explain its result. The widening of the panel of predictive algorithms - made possible by the evolution of computing capacities -- leads scientists to be vigilant about the use of models and to consider new tools to better understand the decisions deduced from them . Recently, the community has been particularly active on model transparency with a marked intensification of publications over the past three years. The increasingly frequent use of more complex algorithms (\textit{deep learning}, Xgboost, etc.) presenting attractive performances is undoubtedly one of the causes of this interest. This article thus presents an inventory of methods of interpreting models and their uses in an insurance context. △ Less

Submitted 25 July, 2020; originally announced July 2020.

Comments: 25 pages without appendix, submitted to BFA, French preprint before English paper

arXiv:2006.05095 [pdf, other]

Towards an Intrinsic Definition of Robustness for a Classifier

Authors: Théo Giraudon, Vincent Gripon, Matthias Löwe, Franck Vermet

Abstract: The robustness of classifiers has become a question of paramount importance in the past few years. Indeed, it has been shown that state-of-the-art deep learning architectures can easily be fooled with imperceptible changes to their inputs. Therefore, finding good measures of robustness of a trained classifier is a key issue in the field. In this paper, we point out that averaging the radius of rob… ▽ More The robustness of classifiers has become a question of paramount importance in the past few years. Indeed, it has been shown that state-of-the-art deep learning architectures can easily be fooled with imperceptible changes to their inputs. Therefore, finding good measures of robustness of a trained classifier is a key issue in the field. In this paper, we point out that averaging the radius of robustness of samples in a validation set is a statistically weak measure. We propose instead to weight the importance of samples depending on their difficulty. We motivate the proposed score by a theoretical case study using logistic regression, where we show that the proposed score is independent of the choice of the samples it is evaluated upon. We also empirically demonstrate the ability of the proposed score to measure robustness of classifiers with little dependence on the choice of samples in more complex settings, including deep convolutional neural networks and real datasets. △ Less

Submitted 11 June, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

Comments: 13 pages

arXiv:1904.11890 [pdf, ps, other]

doi 10.1016/j.physa.2020.124735

Multi-group Binary Choice with Social Interaction and a Random Communication Structure -- a Random Graph Approach

Authors: Matthias Löwe, Kristina Schubert, Franck Vermet

Abstract: We construct and analyze a random graph model for discrete choice with social interaction and several groups of equal size. We concentrate on the case of two groups of equal sizes and we allow the interaction strength within a group to differ from the interaction strength between the two groups. Given that the resulting graph is sufficiently dense we show that, with probability one, the average de… ▽ More We construct and analyze a random graph model for discrete choice with social interaction and several groups of equal size. We concentrate on the case of two groups of equal sizes and we allow the interaction strength within a group to differ from the interaction strength between the two groups. Given that the resulting graph is sufficiently dense we show that, with probability one, the average decision in each of the two groups is the same as in the fully connected model. In particular, we show that there is a phase transition: If the interaction among a group and between the groups is strong enough the average decision per group will either be positive or negative and the decision of the two groups will be correlated. We also compute the free energy per particle in our model. △ Less

Submitted 17 March, 2020; v1 submitted 26 April, 2019; originally announced April 2019.

arXiv:1710.08637 [pdf, ps, other]

Improving Accuracy of Nonparametric Transfer Learning via Vector Segmentation

Authors: Vincent Gripon, Ghouthi B. Hacene, Matthias Löwe, Franck Vermet

Abstract: Transfer learning using deep neural networks as feature extractors has become increasingly popular over the past few years. It allows to obtain state-of-the-art accuracy on datasets too small to train a deep neural network on its own, and it provides cutting edge descriptors that, combined with nonparametric learning methods, allow rapid and flexible deployment of performing solutions in computati… ▽ More Transfer learning using deep neural networks as feature extractors has become increasingly popular over the past few years. It allows to obtain state-of-the-art accuracy on datasets too small to train a deep neural network on its own, and it provides cutting edge descriptors that, combined with nonparametric learning methods, allow rapid and flexible deployment of performing solutions in computationally restricted settings. In this paper, we are interested in showing that the features extracted using deep neural networks have specific properties which can be used to improve accuracy of downstream nonparametric learning methods. Namely, we demonstrate that for some distributions where information is embedded in a few coordinates, segmenting feature vectors can lead to better accuracy. We show how this model can be applied to real datasets by performing experiments using three mainstream deep neural network feature extractors and four databases, in vision and audio. △ Less

Submitted 24 October, 2017; originally announced October 2017.

arXiv:1702.01929 [pdf, ps, other]

doi 10.1007/s10955-017-1806-y

On a model of associative memory with huge storage capacity

Authors: Mete Demircigil, Judith Heusel, Matthias Löwe, Sven Upgang, Franck Vermet

Abstract: In [7] Krotov and Hopfield suggest a generalized version of the well-known Hopfield model of associative memory. In their version they consider a polynomial interaction function and claim that this increases the storage capacity of the model. We prove this claim and take the "limit" as the degree of the polynomial becomes infinite, i.e. an exponential interaction function. With this interaction we… ▽ More In [7] Krotov and Hopfield suggest a generalized version of the well-known Hopfield model of associative memory. In their version they consider a polynomial interaction function and claim that this increases the storage capacity of the model. We prove this claim and take the "limit" as the degree of the polynomial becomes infinite, i.e. an exponential interaction function. With this interaction we prove that model has an exponential storage capacity in the number of neurons, yet the basins of attraction are almost as large as in the standard Hopfield model. △ Less

Submitted 30 June, 2017; v1 submitted 7 February, 2017; originally announced February 2017.

Comments: 13 pages

MSC Class: 82C32; 60K35; Secondary: 68T05; 92B20

Journal ref: J. Stat. Phys. 168 (2), 288-299 (2017)

arXiv:1611.05898 [pdf, other]

Associative Memories to Accelerate Approximate Nearest Neighbor Search

Authors: Vincent Gripon, Matthias Löwe, Franck Vermet

Abstract: Nearest neighbor search is a very active field in machine learning for it appears in many application cases, including classification and object retrieval. In its canonical version, the complexity of the search is linear with both the dimension and the cardinal of the collection of vectors the search is performed in. Recently many works have focused on reducing the dimension of vectors using quant… ▽ More Nearest neighbor search is a very active field in machine learning for it appears in many application cases, including classification and object retrieval. In its canonical version, the complexity of the search is linear with both the dimension and the cardinal of the collection of vectors the search is performed in. Recently many works have focused on reducing the dimension of vectors using quantization techniques or hashing, while providing an approximate result. In this paper we focus instead on tackling the cardinal of the collection of vectors. Namely, we introduce a technique that partitions the collection of vectors and stores each part in its own associative memory. When a query vector is given to the system, associative memories are polled to identify which one contain the closest match. Then an exhaustive search is conducted only on the part of vectors stored in the selected associative memory. We study the effectiveness of the system when messages to store are generated from i.i.d. uniform $\pm$1 random variables or 0-1 sparse i.i.d. random variables. We also conduct experiment on both synthetic data and real data and show it is possible to achieve interesting trade-offs between complexity and accuracy. △ Less

Submitted 5 July, 2017; v1 submitted 10 November, 2016; originally announced November 2016.

Comments: 21 pages, 12 figures

MSC Class: 82C32; 60K35 (Primary); 68T05; 92B20 (Secondary)

arXiv:1512.08892 [pdf, other]

doi 10.1007/s10955-016-1530-z

A Comparative Study of Sparse Associative Memories

Authors: Vincent Gripon, Judith Heusel, Matthias Löwe, Franck Vermet

Abstract: We study various models of associative memories with sparse information, i.e. a pattern to be stored is a random string of $0$s and $1$s with about $\log N$ $1$s, only. We compare different synaptic weights, architectures and retrieval mechanisms to shed light on the influence of the various parameters on the storage capacity. We study various models of associative memories with sparse information, i.e. a pattern to be stored is a random string of $0$s and $1$s with about $\log N$ $1$s, only. We compare different synaptic weights, architectures and retrieval mechanisms to shed light on the influence of the various parameters on the storage capacity. △ Less

Submitted 24 June, 2016; v1 submitted 30 December, 2015; originally announced December 2015.

Comments: 28 pages, 2 figures

MSC Class: 82C32; 60K35 (Primary); 68T05; 92B20 (Secondary)

arXiv:1411.1224 [pdf, ps, other]

On the capacity of a new model of associative memory based on neural cliques

Authors: Judith Heusel, Matthias Löwe, Franck Vermet

Abstract: Based on recent work by Gripon and Berrou, we introduce a new model of an associative memory. We show that this model has an efficiency bounded away from 0 and is therefore significantly more effective than the well known Hopfield model. We prove that the synchronous and asynchronous retrieval dynamics converge and give upper and lower bounds on the memory capacity of the model. Based on recent work by Gripon and Berrou, we introduce a new model of an associative memory. We show that this model has an efficiency bounded away from 0 and is therefore significantly more effective than the well known Hopfield model. We prove that the synchronous and asynchronous retrieval dynamics converge and give upper and lower bounds on the memory capacity of the model. △ Less

Submitted 5 November, 2014; originally announced November 2014.

Comments: 11 pages

MSC Class: Primary: 82C32; 60K35; Secondary: 68T05; 92B20

arXiv:1408.0294 [pdf, ps, other]

Large deviation upper bounds for sums of positively associated indicators

Authors: Matthias Löwe, Franck Vermet

Abstract: We give exponential upper bounds for $P(S \le k)$, in particular $P(S=0)$, where $S$ is a sum of indicator random variables that are positively associated. These bounds allow, in particular, a comparison with the independent case. We give examples in which we compare with a famous exponential inequality for sums of correlated indicators, the Janson inequality. Here our bound sometimes proves to be… ▽ More We give exponential upper bounds for $P(S \le k)$, in particular $P(S=0)$, where $S$ is a sum of indicator random variables that are positively associated. These bounds allow, in particular, a comparison with the independent case. We give examples in which we compare with a famous exponential inequality for sums of correlated indicators, the Janson inequality. Here our bound sometimes proves to be superior to Janson's bound. △ Less

Submitted 19 December, 2014; v1 submitted 1 August, 2014; originally announced August 2014.

Comments: 15 pages

MSC Class: Primary: 60F10; Secondary: 60C05

arXiv:1303.4542 [pdf, ps, other]

doi 10.3150/14-BEJ630

Capacity of an associative memory model on random graph architectures

Authors: Matthias Löwe, Franck Vermet

Abstract: We analyze the storage capacity of the Hopfield models on classes of random graphs. While such a setup has been analyzed for the case that the underlying random graph model is an Erdös-Renyi graph, other architectures, including those investigated in the recent neuroscience literature, have not been studied yet. We develop a notion of storage capacity that highlights the influence of the graph top… ▽ More We analyze the storage capacity of the Hopfield models on classes of random graphs. While such a setup has been analyzed for the case that the underlying random graph model is an Erdös-Renyi graph, other architectures, including those investigated in the recent neuroscience literature, have not been studied yet. We develop a notion of storage capacity that highlights the influence of the graph topology and give results on the storage capacity for not too irregular random graph models. The class of models investigated includes the popular power law graphs for some parameter values. △ Less

Submitted 28 July, 2015; v1 submitted 19 March, 2013; originally announced March 2013.

Comments: Published at http://dx.doi.org/10.3150/14-BEJ630 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

Report number: IMS-BEJ-BEJ630

Journal ref: Bernoulli 2015, Vol. 21, No. 3, 1884-1910

arXiv:1206.4162 [pdf, ps, other]

Mixing times for the Swap** Algorithm on the Blume-Emery-Griffiths Model

Authors: M. Ebbers, H. Knöpfel, M. Löwe, F. Vermet

Abstract: We analyze the so called Swap** Algorithm, a parallel version of the well-known Metropolis-Hastings algorithm, on the mean-field version of the Blume-Emery-Griffiths model in statistical mechanics. This model has two parameters and depending on their choice, the model exhibits either a first, or a second order phase transition. In agreement with a conjecture by Bhatnagar and Randall we find that… ▽ More We analyze the so called Swap** Algorithm, a parallel version of the well-known Metropolis-Hastings algorithm, on the mean-field version of the Blume-Emery-Griffiths model in statistical mechanics. This model has two parameters and depending on their choice, the model exhibits either a first, or a second order phase transition. In agreement with a conjecture by Bhatnagar and Randall we find that the Swap** Algorithm mixes rapidly in presence of a second order phase transition, while becoming slow when the phase transition is first order. △ Less

Submitted 19 June, 2012; originally announced June 2012.

Comments: 35 pages, to be published in Random Structures and Algorithms

MSC Class: 60J10 (Primary) 60K35 (Secondary)

Showing 1–21 of 21 results for author: Vermet, F