-
Do highly over-parameterized neural networks generalize since bad solutions are rare?
Authors:
Julius Martinetz,
Thomas Martinetz
Abstract:
We study over-parameterized classifiers where Empirical Risk Minimization (ERM) for learning leads to zero training error. In these over-parameterized settings there are many global minima with zero training error, some of which generalize better than others. We show that under certain conditions the fraction of "bad" global minima with a true error larger than ε decays to zero exponentially fast…
▽ More
We study over-parameterized classifiers where Empirical Risk Minimization (ERM) for learning leads to zero training error. In these over-parameterized settings there are many global minima with zero training error, some of which generalize better than others. We show that under certain conditions the fraction of "bad" global minima with a true error larger than ε decays to zero exponentially fast with the number of training data n. The bound depends on the distribution of the true error over the set of classifier functions used for the given classification problem, and does not necessarily depend on the size or complexity (e.g. the number of parameters) of the classifier function set. This insight may provide a novel perspective on the unexpectedly good generalization even of highly over-parameterized neural networks. We substantiate our theoretical findings through experiments on synthetic data and a subset of MNIST. Additionally, we assess our hypothesis using VGG19 and ResNet18 on a subset of Caltech101.
△ Less
Submitted 3 December, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Large Neural Networks Learning from Scratch with Very Few Data and without Explicit Regularization
Authors:
Christoph Linse,
Thomas Martinetz
Abstract:
Recent findings have shown that highly over-parameterized Neural Networks generalize without pretraining or explicit regularization. It is achieved with zero training error, i.e., complete over-fitting by memorizing the training data. This is surprising, since it is completely against traditional machine learning wisdom. In our empirical study we fortify these findings in the domain of fine-graine…
▽ More
Recent findings have shown that highly over-parameterized Neural Networks generalize without pretraining or explicit regularization. It is achieved with zero training error, i.e., complete over-fitting by memorizing the training data. This is surprising, since it is completely against traditional machine learning wisdom. In our empirical study we fortify these findings in the domain of fine-grained image classification. We show that very large Convolutional Neural Networks with millions of weights do learn with only a handful of training samples and without image augmentation, explicit regularization or pretraining. We train the architectures ResNet018, ResNet101 and VGG19 on subsets of the difficult benchmark datasets Caltech101, CUB_200_2011, FGVCAircraft, Flowers102 and StanfordCars with 100 classes and more, perform a comprehensive comparative study and draw implications for the practical application of CNNs. Finally, we show that a randomly initialized VGG19 with 140 million weights learns to distinguish airplanes and motorbikes with up to 95% accuracy using only 20 training samples per class.
△ Less
Submitted 21 October, 2022; v1 submitted 18 May, 2022;
originally announced May 2022.
-
Explainable COVID-19 Detection Using Chest CT Scans and Deep Learning
Authors:
Hammam Alshazly,
Christoph Linse,
Erhardt Barth,
Thomas Martinetz
Abstract:
This paper explores how well deep learning models trained on chest CT images can diagnose COVID-19 infected people in a fast and automated process. To this end, we adopt advanced deep network architectures and propose a transfer learning strategy using custom-sized input tailored for each deep architecture to achieve the best performance. We conduct extensive sets of experiments on two CT image da…
▽ More
This paper explores how well deep learning models trained on chest CT images can diagnose COVID-19 infected people in a fast and automated process. To this end, we adopt advanced deep network architectures and propose a transfer learning strategy using custom-sized input tailored for each deep architecture to achieve the best performance. We conduct extensive sets of experiments on two CT image datasets, namely the SARS-CoV-2 CT-scan and the COVID19-CT. The obtained results show superior performances for our models compared with previous studies, where our best models achieve average accuracy, precision, sensitivity, specificity and F1 score of 99.4%, 99.6%, 99.8%, 99.6% and 99.4% on the SARS-CoV-2 dataset; and 92.9%, 91.3%, 93.7%, 92.2% and 92.5% on the COVID19-CT dataset, respectively. Furthermore, we apply two visualization techniques to provide visual explanations for the models' predictions. The visualizations show well-separated clusters for CT images of COVID-19 from other lung diseases, and accurate localizations of the COVID-19 associated regions.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
Feature Products Yield Efficient Networks
Authors:
Philipp Grüning,
Thomas Martinetz,
Erhardt Barth
Abstract:
We introduce Feature-Product networks (FP-nets) as a novel deep-network architecture based on a new building block inspired by principles of biological vision. For each input feature map, a so-called FP-block learns two different filters, the outputs of which are then multiplied. Such FP-blocks are inspired by models of end-stopped neurons, which are common in cortical areas V1 and especially in V…
▽ More
We introduce Feature-Product networks (FP-nets) as a novel deep-network architecture based on a new building block inspired by principles of biological vision. For each input feature map, a so-called FP-block learns two different filters, the outputs of which are then multiplied. Such FP-blocks are inspired by models of end-stopped neurons, which are common in cortical areas V1 and especially in V2. Convolutional neural networks can be transformed into parameter-efficient FP-nets by substituting conventional blocks of regular convolutions with FP-blocks. In this way, we create several novel FP-nets based on state-of-the-art networks and evaluate them on the Cifar-10 and ImageNet challenges. We show that the use of FP-blocks reduces the number of parameters significantly without decreasing generalization capability. Since so far heuristics and search algorithms have been used to find more efficient networks, it seems remarkable that we can obtain even more efficient networks based on a novel bio-inspired design principle.
△ Less
Submitted 18 August, 2020;
originally announced August 2020.
-
Solving Raven's Progressive Matrices with Multi-Layer Relation Networks
Authors:
Marius Jahrens,
Thomas Martinetz
Abstract:
Raven's Progressive Matrices are a benchmark originally designed to test the cognitive abilities of humans. It has recently been adapted to test relational reasoning in machine learning systems. For this purpose the so-called Procedurally Generated Matrices dataset was set up, which is so far one of the most difficult relational reasoning benchmarks. Here we show that deep neural networks are capa…
▽ More
Raven's Progressive Matrices are a benchmark originally designed to test the cognitive abilities of humans. It has recently been adapted to test relational reasoning in machine learning systems. For this purpose the so-called Procedurally Generated Matrices dataset was set up, which is so far one of the most difficult relational reasoning benchmarks. Here we show that deep neural networks are capable of solving this benchmark, reaching an accuracy of 98.0 percent over the previous state-of-the-art of 62.6 percent by combining Wild Relation Networks with Multi-Layer Relation Networks and introducing Magnitude Encoding, an encoding scheme designed for late fusion architectures.
△ Less
Submitted 25 March, 2020;
originally announced March 2020.
-
Multi-layer Relation Networks
Authors:
Marius Jahrens,
Thomas Martinetz
Abstract:
Relational Networks (RN) as introduced by Santoro et al. (2017) have demonstrated strong relational reasoning capabilities with a rather shallow architecture. Its single-layer design, however, only considers pairs of information objects, making it unsuitable for problems requiring reasoning across a higher number of facts. To overcome this limitation, we propose a multi-layer relation network arch…
▽ More
Relational Networks (RN) as introduced by Santoro et al. (2017) have demonstrated strong relational reasoning capabilities with a rather shallow architecture. Its single-layer design, however, only considers pairs of information objects, making it unsuitable for problems requiring reasoning across a higher number of facts. To overcome this limitation, we propose a multi-layer relation network architecture which enables successive refinements of relational information through multiple layers. We show that the increased depth allows for more complex relational reasoning by applying it to the bAbI 20 QA dataset, solving all 20 tasks with joint training and surpassing the state-of-the-art results.
△ Less
Submitted 5 November, 2018;
originally announced November 2018.
-
Adaptive Hierarchical Sensing for the Efficient Sampling of Sparse and Compressible Signals
Authors:
Henry Schütze,
Erhardt Barth,
Thomas Martinetz
Abstract:
We present the novel adaptive hierarchical sensing algorithm K-AHS, which samples sparse or compressible signals with a measurement complexity equal to that of Compressed Sensing (CS). In contrast to CS, K-AHS is adaptive as sensing vectors are selected while sampling, depending on previous measurements. Prior to sampling, the user chooses a transform domain in which the signal of interest is spar…
▽ More
We present the novel adaptive hierarchical sensing algorithm K-AHS, which samples sparse or compressible signals with a measurement complexity equal to that of Compressed Sensing (CS). In contrast to CS, K-AHS is adaptive as sensing vectors are selected while sampling, depending on previous measurements. Prior to sampling, the user chooses a transform domain in which the signal of interest is sparse. The corresponding transform determines the collection of sensing vectors. K-AHS gradually refines initial coarse measurements to significant signal coefficients in the sparse transform domain based on a sensing tree which provides a natural hierarchy of sensing vectors. K-AHS directly provides significant signal coefficients in the sparse transform domain and does not require a reconstruction stage based on inverse optimization. Therefore, the K-AHS sensing vectors must not satisfy any incoherence or restricted isometry property. A mathematical analysis proves the sampling complexity of K-AHS as well as a general and sufficient condition for sampling the optimal k-term approximation, which is applied to particular signal models. The analytical findings are supported by simulations with synthetic signals and real world images. On standard benchmark images, K-AHS achieves lower reconstruction errors than CS.
△ Less
Submitted 14 July, 2018;
originally announced July 2018.
-
Deep Convolutional Neural Networks as Generic Feature Extractors
Authors:
Lars Hertel,
Erhardt Barth,
Thomas Käster,
Thomas Martinetz
Abstract:
Recognizing objects in natural images is an intricate problem involving multiple conflicting objectives. Deep convolutional neural networks, trained on large datasets, achieve convincing results and are currently the state-of-the-art approach for this task. However, the long time needed to train such deep networks is a major drawback. We tackled this problem by reusing a previously trained network…
▽ More
Recognizing objects in natural images is an intricate problem involving multiple conflicting objectives. Deep convolutional neural networks, trained on large datasets, achieve convincing results and are currently the state-of-the-art approach for this task. However, the long time needed to train such deep networks is a major drawback. We tackled this problem by reusing a previously trained network. For this purpose, we first trained a deep convolutional network on the ILSVRC2012 dataset. We then maintained the learned convolution kernels and only retrained the classification part on different datasets. Using this approach, we achieved an accuracy of 67.68 % on CIFAR-100, compared to the previous state-of-the-art result of 65.43 %. Furthermore, our findings indicate that convolutional networks are able to learn generic feature extractors that can be used for different tasks.
△ Less
Submitted 6 October, 2017;
originally announced October 2017.
-
Recursive Autoconvolution for Unsupervised Learning of Convolutional Neural Networks
Authors:
Boris Knyazev,
Erhardt Barth,
Thomas Martinetz
Abstract:
In visual recognition tasks, such as image classification, unsupervised learning exploits cheap unlabeled data and can help to solve these tasks more efficiently. We show that the recursive autoconvolution operator, adopted from physics, boosts existing unsupervised methods by learning more discriminative filters. We take well established convolutional neural networks and train their filters layer…
▽ More
In visual recognition tasks, such as image classification, unsupervised learning exploits cheap unlabeled data and can help to solve these tasks more efficiently. We show that the recursive autoconvolution operator, adopted from physics, boosts existing unsupervised methods by learning more discriminative filters. We take well established convolutional neural networks and train their filters layer-wise. In addition, based on previous works we design a network which extracts more than 600k features per sample, but with the total number of trainable parameters greatly reduced by introducing shared filters in higher layers. We evaluate our networks on the MNIST, CIFAR-10, CIFAR-100 and STL-10 image classification benchmarks and report several state of the art results among other unsupervised methods.
△ Less
Submitted 26 March, 2017; v1 submitted 2 June, 2016;
originally announced June 2016.
-
Committees of deep feedforward networks trained with few data
Authors:
Bogdan Miclut,
Thomas Kaester,
Thomas Martinetz,
Erhardt Barth
Abstract:
Deep convolutional neural networks are known to give good results on image classification tasks. In this paper we present a method to improve the classification result by combining multiple such networks in a committee. We adopt the STL-10 dataset which has very few training examples and show that our method can achieve results that are better than the state of the art. The networks are trained la…
▽ More
Deep convolutional neural networks are known to give good results on image classification tasks. In this paper we present a method to improve the classification result by combining multiple such networks in a committee. We adopt the STL-10 dataset which has very few training examples and show that our method can achieve results that are better than the state of the art. The networks are trained layer-wise and no backpropagation is used. We also explore the effects of dataset augmentation by mirroring, rotation, and scaling.
△ Less
Submitted 23 June, 2014;
originally announced June 2014.
-
The phase response of the cortical slow oscillation
Authors:
Arne Weigenand,
Thomas Martinetz,
Jens Christian Claussen
Abstract:
Cortical slow oscillations occur in the mammalian brain during deep sleep and have been shown to contribute to memory consolidation, an effect that can be enhanced by electrical stimulation. As the precise underlying working mechanisms are not known it is desired to develop and analyze computational models of slow oscillations and to study the response to electrical stimuli. In this paper we emplo…
▽ More
Cortical slow oscillations occur in the mammalian brain during deep sleep and have been shown to contribute to memory consolidation, an effect that can be enhanced by electrical stimulation. As the precise underlying working mechanisms are not known it is desired to develop and analyze computational models of slow oscillations and to study the response to electrical stimuli. In this paper we employ the conductance based model of Compte et al. [J Neurophysiol 89, 2707] to study the effect of electrical stimulation. The population response to electrical stimulation depends on the timing of the stimulus with respect to the state of the slow oscillation. First, we reproduce the experimental results of electrical stimulation in ferret brain slices by Shu et al. [Nature 423, 288] from the conductance based model. We then numerically obtain the phase response curve for the conductance based network model to quantify the network's response to weak stimuli. Our results agree with experiments in vivo and in vitro that show that sensitivity to stimulation is weaker in the up than in the down state. However, we also find that within the up state stimulation leads to a shortening of the up state, or phase advance, whereas during the up-down transition a prolongation of up states is possible, resulting in a phase delay. Finally, we compute the phase response curve for the simple mean-field model by Ngo et al. [Europhys Lett 89, 68002] and find that the qualitative shape of the PRC is preserved, despite its different mechanism for the generation of slow oscillations.
△ Less
Submitted 30 December, 2013;
originally announced December 2013.
-
Predicting economic growth with classical physics and human biology
Authors:
Hans G. Danielmeyer,
Thomas Martinetz
Abstract:
We collect and analyze the data for working time, life expectancy, and the pair output and infrastructure of industrializing nations. During S-functional recovery from disaster the pair's time shifts yield 25 years for the infrastructure's physical lifetime. At G7 level the per capita outputs converge and the time shifts identify a heritable quantity with a reaction time of 62 years. It seems to c…
▽ More
We collect and analyze the data for working time, life expectancy, and the pair output and infrastructure of industrializing nations. During S-functional recovery from disaster the pair's time shifts yield 25 years for the infrastructure's physical lifetime. At G7 level the per capita outputs converge and the time shifts identify a heritable quantity with a reaction time of 62 years. It seems to control demand and the spare time required for enjoying G7 affluence. The sum of spare and working time is fixed by the universal flow of time. This yields analytic solutions for equilibrium, recovery, and long-term evolution for all six variables with biologically stabilized parameters.
△ Less
Submitted 6 December, 2012;
originally announced December 2012.
-
The physics of business cycles and inflation
Authors:
Hans G. Danielmeyer,
Thomas Martinetz
Abstract:
We analyse four consecutive cycles observed in the USA for employment and inflation. They are driven by three oil price shocks and an intended interest rate shock. Non-linear coupling between the rate equations for consumer products as prey and consumers as predators provides the required instability, but its natural dam** is too high for spontaneous cycles. Extending the Lotka-Volterra equation…
▽ More
We analyse four consecutive cycles observed in the USA for employment and inflation. They are driven by three oil price shocks and an intended interest rate shock. Non-linear coupling between the rate equations for consumer products as prey and consumers as predators provides the required instability, but its natural dam** is too high for spontaneous cycles. Extending the Lotka-Volterra equations with a small term for collective anticipation yields a second analytic solution without dam**. It predicts the base period, phase shifts, and the sensitivity to shocks for all six cyclic variables correctly.
△ Less
Submitted 6 December, 2012;
originally announced December 2012.
-
A physical theory of economic growth
Authors:
Hans G. Danielmeyer,
Thomas Martinetz
Abstract:
Economic growth is unpredictable unless demand is quantified. We solve this problem by introducing the demand for unpaid spare time and a user quantity named human capacity. It organizes and amplifies spare time required for enjoying affluence like physical capital, the technical infrastructure for production, organizes and amplifies working time for supply. The sum of annual spare and working tim…
▽ More
Economic growth is unpredictable unless demand is quantified. We solve this problem by introducing the demand for unpaid spare time and a user quantity named human capacity. It organizes and amplifies spare time required for enjoying affluence like physical capital, the technical infrastructure for production, organizes and amplifies working time for supply. The sum of annual spare and working time is fixed by the universal flow of time. This yields the first macroeconomic equilibrium condition. Both storable quantities form stabilizing feedback loops. They are driven with the general and technical knowledge embodied with parts of the supply by education and construction. Linear amplification yields S-functions as only analytic solutions. Destructible physical capital controls medium-term recoveries from disaster. Indestructible human capacity controls the collective long-term industrial evolution. It is immune even to world wars and runs from 1800 to date parallel to the unisex life expectancy in the pioneering nations. This is the first quantitative information on long-term demand. The theory is self-consistent. It reproduces all peaceful data from 1800 to date without adjustable parameter. It has full forecasting power since the decisive parameters are constants of the human species. They predict an asymptotic maximum for the economic level per capita. Long-term economic growth appears as a part of natural science.
△ Less
Submitted 12 June, 2012;
originally announced June 2012.
-
On the boundedness of an iteration involving points on the hypersphere
Authors:
Thomas Binder,
Thomas Martinetz
Abstract:
For a finite set of points $X$ on the unit hypersphere in $\mathbb{R}^d$ we consider the iteration $u_{i+1}=u_i+χ_i$, where $χ_i$ is the point of $X$ farthest from $u_i$. Restricting to the case where the origin is contained in the convex hull of $X$ we study the maximal length of $u_i$. We give sharp upper bounds for the length of $u_i$ independently of $X$. Precisely, this upper bound is infinit…
▽ More
For a finite set of points $X$ on the unit hypersphere in $\mathbb{R}^d$ we consider the iteration $u_{i+1}=u_i+χ_i$, where $χ_i$ is the point of $X$ farthest from $u_i$. Restricting to the case where the origin is contained in the convex hull of $X$ we study the maximal length of $u_i$. We give sharp upper bounds for the length of $u_i$ independently of $X$. Precisely, this upper bound is infinity for $d\ge 3$ and $\sqrt2$ for $d=2$.
△ Less
Submitted 5 October, 2010; v1 submitted 11 January, 2010;
originally announced January 2010.
-
Dynamic Fitness Landscapes in Molecular Evolution
Authors:
Claus O. Wilke,
Christopher Ronnewinkel,
Thomas Martinetz
Abstract:
We study self-replicating molecules under externally varying conditions. Changing conditions such as temperature variations and/or alterations in the environment's resource composition lead to both non-constant replication and decay rates of the molecules. In general, therefore, molecular evolution takes place in a dynamic rather than a static fitness landscape. We incorporate dynamic replicatio…
▽ More
We study self-replicating molecules under externally varying conditions. Changing conditions such as temperature variations and/or alterations in the environment's resource composition lead to both non-constant replication and decay rates of the molecules. In general, therefore, molecular evolution takes place in a dynamic rather than a static fitness landscape. We incorporate dynamic replication and decay rates into the standard quasispecies theory of molecular evolution, and show that for periodic time-dependencies, a system of evolving molecules enters a limit cycle for $t\to\infty$. For fast periodic changes, we show that molecules adapt to the time-averaged fitness landscape, whereas for slow changes they track the variations in the landscape arbitrarily closely. We derive a general approximation method that allows us to calculate the attractor of time-periodic landscapes, and demonstrate using several examples that the results of the approximation and the limiting cases of very slow and very fast changes are in perfect agreement. We also discuss landscapes with arbitrary time dependencies, and show that very fast changes again lead to a system that adapts to the time-averaged landscape. Finally, we analyze the dynamics of a finite population of molecules in a dynamic landscape, and discuss its relation to the infinite population limit.
△ Less
Submitted 12 May, 2000; v1 submitted 4 December, 1999;
originally announced December 1999.
-
Genetic Algorithms in Time-Dependent Environments
Authors:
Christopher Ronnewinkel,
Claus O. Wilke,
Thomas Martinetz
Abstract:
The influence of time-dependent fitnesses on the infinite population dynamics of simple genetic algorithms (without crossover) is analyzed. Based on general arguments, a schematic phase diagram is constructed that allows one to characterize the asymptotic states in dependence on the mutation rate and the time scale of changes. Furthermore, the notion of regular changes is raised for which the po…
▽ More
The influence of time-dependent fitnesses on the infinite population dynamics of simple genetic algorithms (without crossover) is analyzed. Based on general arguments, a schematic phase diagram is constructed that allows one to characterize the asymptotic states in dependence on the mutation rate and the time scale of changes. Furthermore, the notion of regular changes is raised for which the population can be shown to converge towards a generalized quasispecies. Based on this, error thresholds and an optimal mutation rate are approximately calculated for a generational genetic algorithm with a moving needle-in-the-haystack landscape. The so found phase diagram is fully consistent with our general considerations.
△ Less
Submitted 4 November, 1999;
originally announced November 1999.
-
Molecular Evolution in Time Dependent Environments
Authors:
Claus O. Wilke,
Christopher Ronnewinkel,
Thomas Martinetz
Abstract:
The quasispecies theory is studied for dynamic replication landscapes. A meaningful asymptotic quasispecies is defined for periodic time dependencies. The quasispecies' composition is constantly changing over the oscillation period. The error threshold moves towards the position of the time averaged landscape for high oscillation frequencies and follows the landscape closely for low oscillation…
▽ More
The quasispecies theory is studied for dynamic replication landscapes. A meaningful asymptotic quasispecies is defined for periodic time dependencies. The quasispecies' composition is constantly changing over the oscillation period. The error threshold moves towards the position of the time averaged landscape for high oscillation frequencies and follows the landscape closely for low oscillation frequencies.
△ Less
Submitted 17 November, 1999; v1 submitted 14 April, 1999;
originally announced April 1999.
-
Adaptive walks on time-dependent fitness landscapes
Authors:
Claus O. Wilke,
Thomas Martinetz
Abstract:
The idea of adaptive walks on fitness landscapes as a means of studying evolutionary processes on large time scales is extended to fitness landscapes that are slowly changing over time. The influence of ruggedness and of the amount of static fitness contributions are investigated for model landscapes derived from Kauffman's $NK$ landscapes. Depending on the amount of static fitness contributions…
▽ More
The idea of adaptive walks on fitness landscapes as a means of studying evolutionary processes on large time scales is extended to fitness landscapes that are slowly changing over time. The influence of ruggedness and of the amount of static fitness contributions are investigated for model landscapes derived from Kauffman's $NK$ landscapes. Depending on the amount of static fitness contributions in the landscape, the evolutionary dynamics can be divided into a percolating and a non-percolating phase. In the percolating phase, the walker performs a random walk over the regions of the landscape with high fitness.
△ Less
Submitted 16 March, 1999;
originally announced March 1999.
-
Lifetimes of agents under external stress
Authors:
Claus O. Wilke,
Thomas Martinetz
Abstract:
An exact formula for the distribution of lifetimes in coherent-noise models and related models is derived. For certain stress distributions, this formula can be analytically evaluated and yields simple closed expressions. For those types of stress for which a closed expression is not available, a numerical evaluation can be done in a straightforward way. All results obtained are in perfect agree…
▽ More
An exact formula for the distribution of lifetimes in coherent-noise models and related models is derived. For certain stress distributions, this formula can be analytically evaluated and yields simple closed expressions. For those types of stress for which a closed expression is not available, a numerical evaluation can be done in a straightforward way. All results obtained are in perfect agreement with numerical experiments. The implications for the coherent-noise models' application to macroevolution are discussed.
△ Less
Submitted 9 December, 1998;
originally announced December 1998.
-
How fast do structures emerge in hypercycle-systems?
Authors:
S. Altmeyer,
C. Wilke,
T. Martinetz
Abstract:
A general framework for the simulation of reaction-diffusion systems with probabilistic cellular automata is presented. The basic reaction probabilities of the chemical model translate directly into the transition rules of the automaton, thus allowing a clear comparison between simulation results and analytic calculations. This framework is then applied to simulations of hypercycle-systems in up…
▽ More
A general framework for the simulation of reaction-diffusion systems with probabilistic cellular automata is presented. The basic reaction probabilities of the chemical model translate directly into the transition rules of the automaton, thus allowing a clear comparison between simulation results and analytic calculations. This framework is then applied to simulations of hypercycle-systems in up to three dimensions. Furthermore, a new measurement quantity is introduced and applied to the hypercycle-systems in two and three dimensions. It can be shown that this quantity can be interpreted as a measure for the macroscopic order of the hypercycle systems.
△ Less
Submitted 5 June, 1998;
originally announced June 1998.
-
Hierarchical noise in large systems of independent agents
Authors:
Claus Wilke,
Thomas Martinetz
Abstract:
A generalization of the coherent-noise models [M. E. J. Newman and K. Sneppen, Phys. Rev. E{\bf54}, 6226 (1996)] is presented where the agents in the model are subjected to a multitude of stresses, generated in a hierarchy of different contexts. The hierarchy is realized as a Cayley-tree. Two different ways of stress propagation in the tree are considered. In both cases, coherence arises in larg…
▽ More
A generalization of the coherent-noise models [M. E. J. Newman and K. Sneppen, Phys. Rev. E{\bf54}, 6226 (1996)] is presented where the agents in the model are subjected to a multitude of stresses, generated in a hierarchy of different contexts. The hierarchy is realized as a Cayley-tree. Two different ways of stress propagation in the tree are considered. In both cases, coherence arises in large subsystems of the tree. Clear similarities between the behavior of the tree model and of the coherent-noise model can be observed. For one of the two methods of stress propagation, the behavior of the tree model can be approximated very well by an ensemble of coherent-noise models, where the sizes $k$ of the systems in the ensemble scale as $k^{-2}$. The results are found to be independent of the tree's structure for a large class of reasonable choices. Additionally, it is found that power-law distributed lifetimes of agents arise even under the complete absence of correlations between the stresses the agents feel.
△ Less
Submitted 6 October, 1998; v1 submitted 22 May, 1998;
originally announced May 1998.
-
Large-scale evolution and extinction in a hierarchically structured environment
Authors:
C. Wilke,
S. Altmeyer,
T. Martinetz
Abstract:
A class of models for large-scale evolution and mass extinctions is presented. These models incorporate environmental changes on all scales, from influences on a single species to global effects. This is a step towards a unified picture of mass extinctions, which enables one to study coevolutionary effects and external abiotic influences with the same means. The generic features of such models a…
▽ More
A class of models for large-scale evolution and mass extinctions is presented. These models incorporate environmental changes on all scales, from influences on a single species to global effects. This is a step towards a unified picture of mass extinctions, which enables one to study coevolutionary effects and external abiotic influences with the same means. The generic features of such models are studied in a simple version, in which all environmental changes are generated at random and without feedback from other parts of the system.
△ Less
Submitted 10 March, 1998;
originally announced March 1998.
-
Aftershocks in Coherent-Noise Models
Authors:
C. Wilke,
S. Altmeyer,
T. Martinetz
Abstract:
The decay pattern of aftershocks in the so-called 'coherent-noise' models [M. E. J. Newman and K. Sneppen, Phys. Rev. E54, 6226 (1996)] is studied in detail. Analytical and numerical results show that the probability to find a large event at time $t$ after an initial major event decreases as $t^{-τ}$ for small $t$, with the exponent $τ$ ranging from 0 to values well above 1. This is in contrast…
▽ More
The decay pattern of aftershocks in the so-called 'coherent-noise' models [M. E. J. Newman and K. Sneppen, Phys. Rev. E54, 6226 (1996)] is studied in detail. Analytical and numerical results show that the probability to find a large event at time $t$ after an initial major event decreases as $t^{-τ}$ for small $t$, with the exponent $τ$ ranging from 0 to values well above 1. This is in contrast to Sneppen und Newman, who stated that the exponent is about 1, independent of the microscopic details of the simulation. Numerical simulations of an extended model [C. Wilke, T. Martinetz, Phys. Rev. E56, 7128 (1997)] show that the power-law is only a generic feature of the original dynamics and does not necessarily appear in a more general context. Finally, the implications of the results to the modeling of earthquakes are discussed.
△ Less
Submitted 12 March, 1998; v1 submitted 20 October, 1997;
originally announced October 1997.
-
A Simple Model of Evolution with Variable System Size
Authors:
C. Wilke,
T. Martinetz
Abstract:
A simple model of biological extinction with variable system size is presented that exhibits a power-law distribution of extinction event sizes. The model is a generalization of a model recently introduced by Newman (Proc. R. Soc. Lond. B265, 1605 (1996). Both analytical and numerical analysis show that the exponent of the power-law distribution depends only marginally on the growth rate $g$ at…
▽ More
A simple model of biological extinction with variable system size is presented that exhibits a power-law distribution of extinction event sizes. The model is a generalization of a model recently introduced by Newman (Proc. R. Soc. Lond. B265, 1605 (1996). Both analytical and numerical analysis show that the exponent of the power-law distribution depends only marginally on the growth rate $g$ at which new species enter the system and is equal to the one of the original model in the limit $g\to\infty$. A critical growth rate $g_c$ can be found below which the system dies out. Under these model assumptions stable ecosystems can only exist if the regrowth of species is sufficiently fast.
△ Less
Submitted 20 October, 1997; v1 submitted 7 May, 1997;
originally announced May 1997.