-
Pyramidal Clustering Algorithms in ISO-3D Project
Authors:
Oldemar Rodriguez,
Edwin Diday
Abstract:
Pyramidal clustering method generalizes hierarchies by allowing non-disjoint classes at a given level instead of a partition. Moreover, the clusters of the pyramid are intervals of a total order on the set being clustered. [Diday 1984], [Bertrand, Diday 1990] and [Mfoumoune 1998] proposed algorithms to build a pyramid starting with an arbitrary order of the individual. In this paper we present two…
▽ More
Pyramidal clustering method generalizes hierarchies by allowing non-disjoint classes at a given level instead of a partition. Moreover, the clusters of the pyramid are intervals of a total order on the set being clustered. [Diday 1984], [Bertrand, Diday 1990] and [Mfoumoune 1998] proposed algorithms to build a pyramid starting with an arbitrary order of the individual. In this paper we present two new algorithms name {\tt CAPS} and {\tt CAPSO}. {\tt CAPSO} builds a pyramid starting with an order given on the set of the individuals (or symbolic objects) while {\tt CAPS} finds this order. These two algorithms allows moreover to cluster more complex data than the tabular model allows to process, by considering variation on the values taken by the variables, in this way, our method produces a symbolic pyramid. Each cluster thus formed is defined not only by the set of its elements (i.e. its extent) but also by a symbolic object, which describes its properties (i.e. its intent). These two algorithms were implemented in C++ and Java to the ISO-3D project.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Multidimensional Scaling for Interval Data: INTERSCAL
Authors:
Susanne Winsberg,
Oldemar Rodriguez,
Edwin Diday
Abstract:
Standard multidimensional scaling takes as input a dissimilarity matrix of general term $δ_{ij}$ which is a numerical value. In this paper we input $δ_{ij}=[\underline{δ_{ij}},\overline{δ_{ij}}]$ where $\underline{δ_{ij}}$ and $\overline{δ_{ij}}$ are the lower bound and the upper bound of the ``dissimilarity'' between the stimulus/object $S_i$ and the stimulus/object $S_j$ respectively. As output…
▽ More
Standard multidimensional scaling takes as input a dissimilarity matrix of general term $δ_{ij}$ which is a numerical value. In this paper we input $δ_{ij}=[\underline{δ_{ij}},\overline{δ_{ij}}]$ where $\underline{δ_{ij}}$ and $\overline{δ_{ij}}$ are the lower bound and the upper bound of the ``dissimilarity'' between the stimulus/object $S_i$ and the stimulus/object $S_j$ respectively. As output instead of representing each stimulus/object on a factorial plane by a point, as in other multidimensional scaling methods, in the proposed method each stimulus/object is visualized by a rectangle, in order to represent dissimilarity variation. We generalize the classical scaling method looking for a method that produces results similar to those obtained by Tops Principal Components Analysis. Two examples are presented to illustrate the effectiveness of the proposed method.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Stock data analysis using sympbolic data analysis
Authors:
Philippe Caillou,
Edwin Diday
Abstract:
In this paper we present a model of the stock exchange domain using symbolic dataanalysis and we use the SODAS software to analyze this domain. After a short presentationof the software, we present the analysis in three steps: choice of the symbolic objects, theirdefinition and their analysis with SODAS. We give details for each of these steps and thereimportance is underlined. Two examples of res…
▽ More
In this paper we present a model of the stock exchange domain using symbolic dataanalysis and we use the SODAS software to analyze this domain. After a short presentationof the software, we present the analysis in three steps: choice of the symbolic objects, theirdefinition and their analysis with SODAS. We give details for each of these steps and thereimportance is underlined. Two examples of results are described to show the analysis interestand pertinence. The conclusion describes perspectives after the improvement of SODAS forits application in the stock exchange domain.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.