Search | arXiv e-print repository

Totally smooth renormings

Authors: Eve Oja, Tauri Viil, Dirk Werner

Abstract: We study the problem of totally smooth renormings of Banach spaces and provide such renormings for spaces which are weakly compactly generated. We also consider renormings for $(a,B,c)$-ideals. We study the problem of totally smooth renormings of Banach spaces and provide such renormings for spaces which are weakly compactly generated. We also consider renormings for $(a,B,c)$-ideals. △ Less

Submitted 19 July, 2018; originally announced July 2018.

MSC Class: 46B03; 46B04; 46B20

arXiv:1612.07312 [pdf, ps, other]

The Bartle-Dunford-Schwartz and the Dinculeanu-Singer theorems revisited

Authors: Fernando Muñoz, Eve Oja, Cándido Piñeiro

Abstract: Let $X$ and $Y$ be Banach spaces and let $Ω$ be a compact Hausdorff space. Denote by $\mathcal{C}_{p}(Ω,X)$ the space of $p$-continous $X$-valued functions, $1\leq p\leq \infty$. For operators $S\in\mathcal{L}(\mathcal{C}(Ω),\mathcal{L}(X,Y))$ and $U\in\mathcal{L}(\mathcal{C}_{p}(Ω,X),Y)$, we establish integral representation theorems with respect to a vector measure… ▽ More Let $X$ and $Y$ be Banach spaces and let $Ω$ be a compact Hausdorff space. Denote by $\mathcal{C}_{p}(Ω,X)$ the space of $p$-continous $X$-valued functions, $1\leq p\leq \infty$. For operators $S\in\mathcal{L}(\mathcal{C}(Ω),\mathcal{L}(X,Y))$ and $U\in\mathcal{L}(\mathcal{C}_{p}(Ω,X),Y)$, we establish integral representation theorems with respect to a vector measure $m:Σ\rightarrow \mathcal{L}(X,Y^{**})$, where $Σ$ denotes the $σ$-algebra of Borel subsets of $Ω$. The first theorem extends the classical Bartle-Dunford-Schwartz representation theorem. It is used to prove the second theorem, which extends the classical Dinculeanu-Singer representation theorem, also providing to it an alternative simpler proof. For the latter (and the main) result, we build the needed integration theory, relying on a new concept of the $q$-semivariation, $1\leq q\leq \infty$, of a vector measure $m:Σ\rightarrow \mathcal{L}(X,Y^{**})$. △ Less

Submitted 21 December, 2016; originally announced December 2016.

MSC Class: 47A67 (Primary); 28B05; 46B25; 46B28; 46G10; 47B38 (Secondary)

arXiv:1606.07202 [pdf, other]

Operators on the Banach space of $p$-continuous vector-valued functions

Authors: Fernando Muñoz, Eve Oja, Cándido Piñeiro

Abstract: Let $X$, $Y$, and $Z$ be Banach spaces, and let $α$ be a tensor norm. Let a bounded linear operator $S\in\mathcal{L}(Z,\mathcal{L}(X,Y))$ be given. We obtain (necessary and/or sufficient) conditions for the existence of an operator $U\in\mathcal{L}(Z\hat{\otimes}_αX,Y)$ such that $(Sz)x = U(z\otimes x)$, for all $z\in Z$ and $x\in X$, i.e., $S= U^{#}$, the associated operator to $U$. Let $Ω$ be a… ▽ More Let $X$, $Y$, and $Z$ be Banach spaces, and let $α$ be a tensor norm. Let a bounded linear operator $S\in\mathcal{L}(Z,\mathcal{L}(X,Y))$ be given. We obtain (necessary and/or sufficient) conditions for the existence of an operator $U\in\mathcal{L}(Z\hat{\otimes}_αX,Y)$ such that $(Sz)x = U(z\otimes x)$, for all $z\in Z$ and $x\in X$, i.e., $S= U^{#}$, the associated operator to $U$. Let $Ω$ be a compact Hausdorff space and denote by $\mathcal{C}(Ω)$ the space of continuous functions from $Ω$ into $\mathbb{K}$. We apply these results to $S\in\mathcal{L}(\mathcal{C}(Ω),\mathcal{L}(X, Y))$ for characterizing the existence of an operator $U\in\mathcal{L}(\mathcal{C}_{p}(Ω,X),Y)$ such that $U^{#}=S$, where $\mathcal{C}_{p}(Ω,X)$ is the space of $p$-continuous $X$-valued functions, $1\leq p \leq \infty$. △ Less

Submitted 23 June, 2016; originally announced June 2016.

arXiv:1505.05821 [pdf, other]

Optimizing the Information Retrieval Trade-off in Data Visualization Using $α$-Divergence

Authors: Ehsan Amid, Onur Dikmen, Erkki Oja

Abstract: Data visualization is one of the major applications of nonlinear dimensionality reduction. From the information retrieval perspective, the quality of a visualization can be evaluated by considering the extent that the neighborhood relation of each data point is maintained while the number of unrelated points that are retrieved is minimized. This property can be quantified as a trade-off between th… ▽ More Data visualization is one of the major applications of nonlinear dimensionality reduction. From the information retrieval perspective, the quality of a visualization can be evaluated by considering the extent that the neighborhood relation of each data point is maintained while the number of unrelated points that are retrieved is minimized. This property can be quantified as a trade-off between the mean precision and mean recall of the visualization. While there have been some approaches to formulate the visualization objective directly as a weighted sum of the precision and recall, there is no systematic way to determine the optimal trade-off between these two nor a clear interpretation of the optimal value. In this paper, we investigate the properties of $α$-divergence for information visualization, focusing our attention on a particular range of $α$ values. We show that the minimization of the new cost function corresponds to maximizing a geometric mean between precision and recall, parameterized by $α$. Contrary to some earlier methods, no hand-tuning is needed, but we can rigorously estimate the optimal value of $α$ for a given input data. For this, we provide a statistical framework using a novel distribution called Exponential Divergence with Augmentation (EDA). By the extensive set of experiments, we show that the optimal value of $α$, obtained by EDA corresponds to the optimal trade-off between the precision and recall for a given data distribution. △ Less

Submitted 29 March, 2016; v1 submitted 21 May, 2015; originally announced May 2015.

arXiv:1410.5670 [pdf, ps, other]

Weaker relatives of the bounded approximation property for a Banach operator ideal

Authors: Silvia Lassalle, Eve Oja, Pablo Turco

Abstract: Fixed a Banach operator ideal $\mathcal A$, we introduce and investigate two new approximation properties, which are strictly weaker than the bounded approximation property (BAP) for $\mathcal A$ of Lima, Lima and Oja (2010). We call them the weak BAP for $\mathcal A$ and the local BAP for $\mathcal A$, showing that the latter is in turn strictly weaker than the former. Under this framework, we ad… ▽ More Fixed a Banach operator ideal $\mathcal A$, we introduce and investigate two new approximation properties, which are strictly weaker than the bounded approximation property (BAP) for $\mathcal A$ of Lima, Lima and Oja (2010). We call them the weak BAP for $\mathcal A$ and the local BAP for $\mathcal A$, showing that the latter is in turn strictly weaker than the former. Under this framework, we address the question of approximation properties passing from dual spaces to underlying spaces. We relate the weak and local BAPs for $\mathcal A$ with approximation properties given by tensor norms and show that the Saphar BAP of order $p$ is the weak BAP for the ideal of absolutely $p^*$-summing operators, $1\leq p\leq\infty$, $1/p + 1/{p^*}=1$. △ Less

Submitted 7 August, 2015; v1 submitted 21 October, 2014; originally announced October 2014.

Comments: 22 Pages

MSC Class: 46B28; 46B20; 47L05; 47L20

arXiv:1409.6476 [pdf, ps, other]

On $(p,r)$-null sequences and their relatives

Authors: Kati Ain, Eve Oja

Abstract: Let $1\leq p < \infty$ and $1\leq r \leq p^\ast$, where $p^\ast$ is the conjugate index of $p$. We prove an omnibus theorem, which provides numerous equivalences for a sequence $(x_n)$ in a Banach space $X$ to be a $(p,r)$-null sequence. One of them is that $(x_n)$ is $(p,r)$-null if and only if $(x_n)$ is null and relatively $(p,r)$-compact. This equivalence is known in the "limit" case when… ▽ More Let $1\leq p < \infty$ and $1\leq r \leq p^\ast$, where $p^\ast$ is the conjugate index of $p$. We prove an omnibus theorem, which provides numerous equivalences for a sequence $(x_n)$ in a Banach space $X$ to be a $(p,r)$-null sequence. One of them is that $(x_n)$ is $(p,r)$-null if and only if $(x_n)$ is null and relatively $(p,r)$-compact. This equivalence is known in the "limit" case when $r=p^\ast$, the case of the $p$-null sequence and $p$-compactness. Our approach is more direct and easier than those applied for the proof of the latter result. We apply it also to characterize the unconditional and weak versions of $(p,r)$-null sequences. △ Less

Submitted 23 September, 2014; originally announced September 2014.

arXiv:1406.1385 [pdf, ps, other]

Learning the Information Divergence

Authors: Onur Dikmen, Zhirong Yang, Erkki Oja

Abstract: Information divergence that measures the difference between two nonnegative matrices or tensors has found its use in a variety of machine learning problems. Examples are Nonnegative Matrix/Tensor Factorization, Stochastic Neighbor Embedding, topic models, and Bayesian network optimization. The success of such a learning task depends heavily on a suitable divergence. A large variety of divergences… ▽ More Information divergence that measures the difference between two nonnegative matrices or tensors has found its use in a variety of machine learning problems. Examples are Nonnegative Matrix/Tensor Factorization, Stochastic Neighbor Embedding, topic models, and Bayesian network optimization. The success of such a learning task depends heavily on a suitable divergence. A large variety of divergences have been suggested and analyzed, but very few results are available for an objective choice of the optimal divergence for a given task. Here we present a framework that facilitates automatic selection of the best divergence among a given family, based on standard maximum likelihood estimation. We first propose an approximated Tweedie distribution for the beta-divergence family. Selecting the best beta then becomes a machine learning problem solved by maximum likelihood. Next, we reformulate alpha-divergence in terms of beta-divergence, which enables automatic selection of alpha by maximum likelihood with reuse of the learning principle for beta-divergence. Furthermore, we show the connections between gamma and beta-divergences as well as Rényi and alpha-divergences, such that our automatic selection framework is extended to non-separable divergences. Experiments on both synthetic and real-world data demonstrate that our method can quite accurately select information divergence across different learning problems and various divergence families. △ Less

Submitted 5 June, 2014; originally announced June 2014.

Comments: 12 pages, 7 figures

arXiv:1310.6232 [pdf, ps, other]

Principle of local reflexivity respecting subspaces

Authors: Eve Oja

Abstract: We obtain a strengthening of the principle of local reflexivity in a general form. The added strength makes local reflexivity operators respect given subspaces. Applications are given to bounded approximation properties of pairs, consisting of a Banach space and its subspace. We obtain a strengthening of the principle of local reflexivity in a general form. The added strength makes local reflexivity operators respect given subspaces. Applications are given to bounded approximation properties of pairs, consisting of a Banach space and its subspace. △ Less

Submitted 23 October, 2013; originally announced October 2013.

arXiv:1206.4676 [pdf]

Clustering by Low-Rank Doubly Stochastic Matrix Decomposition

Authors: Zhirong Yang, Erkki Oja

Abstract: Clustering analysis by nonnegative low-rank approximations has achieved remarkable progress in the past decade. However, most approximation approaches in this direction are still restricted to matrix factorization. We propose a new low-rank learning method to improve the clustering performance, which is beyond matrix factorization. The approximation is based on a two-step bipartite random walk thr… ▽ More Clustering analysis by nonnegative low-rank approximations has achieved remarkable progress in the past decade. However, most approximation approaches in this direction are still restricted to matrix factorization. We propose a new low-rank learning method to improve the clustering performance, which is beyond matrix factorization. The approximation is based on a two-step bipartite random walk through virtual cluster nodes, where the approximation is formed by only cluster assigning probabilities. Minimizing the approximation error measured by Kullback-Leibler divergence is equivalent to maximizing the likelihood of a discriminative model, which endows our method with a solid probabilistic interpretation. The optimization is implemented by a relaxed Majorization-Minimization algorithm that is advantageous in finding good local minima. Furthermore, we point out that the regularized algorithm with Dirichlet prior only serves as initialization. Experimental results show that the new method has strong performance in clustering purity for various datasets, especially for large-scale manifold data. △ Less

Submitted 18 June, 2012; originally announced June 2012.

Comments: ICML2012

Showing 1–9 of 9 results for author: Oja, E