Search | arXiv e-print repository

MOD-CL: Multi-label Object Detection with Constrained Loss

Authors: Sota Moriyama, Koji Watanabe, Katsumi Inoue, Akihiro Takemura

Abstract: We introduce MOD-CL, a multi-label object detection framework that utilizes constrained loss in the training process to produce outputs that better satisfy the given requirements. In this paper, we use $\mathrm{MOD_{YOLO}}$, a multi-label object detection model built upon the state-of-the-art object detection model YOLOv8, which has been published in recent years. In Task 1, we introduce the Corre… ▽ More We introduce MOD-CL, a multi-label object detection framework that utilizes constrained loss in the training process to produce outputs that better satisfy the given requirements. In this paper, we use $\mathrm{MOD_{YOLO}}$, a multi-label object detection model built upon the state-of-the-art object detection model YOLOv8, which has been published in recent years. In Task 1, we introduce the Corrector Model and Blender Model, two new models that follow after the object detection process, aiming to generate a more constrained output. For Task 2, constrained losses have been incorporated into the $\mathrm{MOD_{YOLO}}$ architecture using Product T-Norm. The results show that these implementations are instrumental to improving the scores for both Task 1 and Task 2. △ Less

Submitted 31 January, 2024; originally announced March 2024.

arXiv:2306.06821 [pdf, ps, other]

Towards end-to-end ASP computation

Authors: Taisuke Sato, Akihiro Takemura, Katsumi Inoue

Abstract: We propose an end-to-end approach for answer set programming (ASP) and linear algebraically compute stable models satisfying given constraints. The idea is to implement Lin-Zhao's theorem \cite{Lin04} together with constraints directly in vector spaces as numerical minimization of a cost function constructed from a matricized normal logic program, loop formulas in Lin-Zhao's theorem and constraint… ▽ More We propose an end-to-end approach for answer set programming (ASP) and linear algebraically compute stable models satisfying given constraints. The idea is to implement Lin-Zhao's theorem \cite{Lin04} together with constraints directly in vector spaces as numerical minimization of a cost function constructed from a matricized normal logic program, loop formulas in Lin-Zhao's theorem and constraints, thereby no use of symbolic ASP or SAT solvers involved in our approach. We also propose precomputation that shrinks the program size and heuristics for loop formulas to reduce computational difficulty. We empirically test our approach with programming examples including the 3-coloring and Hamiltonian cycle problems. As our approach is purely numerical and only contains vector/matrix operations, acceleration by parallel technologies such as many-cores and GPUs is expected. △ Less

Submitted 13 June, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

Comments: 29 pages, 9 figures

ACM Class: I.2.4

arXiv:2109.08290 [pdf, ps, other]

doi 10.4204/EPTCS.345.26

Generating Explainable Rule Sets from Tree-Ensemble Learning Methods by Answer Set Programming

Authors: Akihiro Takemura, Katsumi Inoue

Abstract: We propose a method for generating explainable rule sets from tree-ensemble learners using Answer Set Programming (ASP). To this end, we adopt a decompositional approach where the split structures of the base decision trees are exploited in the construction of rules, which in turn are assessed using pattern mining methods encoded in ASP to extract interesting rules. We show how user-defined constr… ▽ More We propose a method for generating explainable rule sets from tree-ensemble learners using Answer Set Programming (ASP). To this end, we adopt a decompositional approach where the split structures of the base decision trees are exploited in the construction of rules, which in turn are assessed using pattern mining methods encoded in ASP to extract interesting rules. We show how user-defined constraints and preferences can be represented declaratively in ASP to allow for transparent and flexible rule set generation, and how rules can be used as explanations to help the user better understand the models. Experimental evaluation with real-world datasets and popular tree-ensemble algorithms demonstrates that our approach is applicable to a wide range of classification tasks. △ Less

Submitted 16 September, 2021; originally announced September 2021.

Comments: In Proceedings ICLP 2021, arXiv:2109.07914

Journal ref: EPTCS 345, 2021, pp. 127-140

arXiv:1806.07626 [pdf, other]

doi 10.1007/s13160-019-00394-y

Game-theoretic derivation of upper hedging prices of multivariate contingent claims and submodularity

Authors: Takeru Matsuda, Akimichi Takemura

Abstract: We investigate upper and lower hedging prices of multivariate contingent claims from the viewpoint of game-theoretic probability and submodularity. By considering a game between "Market" and "Investor" in discrete time, the pricing problem is reduced to a backward induction of an optimization over simplexes. For European options with payoff functions satisfying a combinatorial property called subm… ▽ More We investigate upper and lower hedging prices of multivariate contingent claims from the viewpoint of game-theoretic probability and submodularity. By considering a game between "Market" and "Investor" in discrete time, the pricing problem is reduced to a backward induction of an optimization over simplexes. For European options with payoff functions satisfying a combinatorial property called submodularity or supermodularity, this optimization is solved in closed form by using the Lovász extension and the upper and lower hedging prices can be calculated efficiently. This class includes the options on the maximum or the minimum of several assets. We also study the asymptotic behavior as the number of game rounds goes to infinity. The upper and lower hedging prices of European options converge to the solutions of the Black-Scholes-Barenblatt equations. For European options with submodular or supermodular payoff functions, the Black-Scholes-Barenblatt equation is reduced to the linear Black-Scholes equation and it is solved in closed form. Numerical results show the validity of the theoretical results. △ Less

Submitted 20 June, 2018; originally announced June 2018.

Journal ref: Japan Journal of Industrial and Applied Mathematics, 37, 213--248, 2020

arXiv:1507.07056 [pdf, other]

doi 10.1109/TWC.2016.2555796

Exact ZF Analysis and Computer-Algebra-Aided Evaluation in Rank-1 LoS Rician Fading

Authors: Constantin Siriteanu, Akimichi Takemura, Christoph Koutschan, Satoshi Kuriki, Donald St. P. Richards, Hyundong Shin

Abstract: We study zero-forcing detection (ZF) for multiple-input/multiple-output (MIMO) spatial multiplexing under transmit-correlated Rician fading for an N_R X N_T channel matrix with rank-1 line-of-sight (LoS) component. By using matrix transformations and multivariate statistics, our exact analysis yields the signal-to-noise ratio moment generating function (m.g.f.) as an infinite series of gamma distr… ▽ More We study zero-forcing detection (ZF) for multiple-input/multiple-output (MIMO) spatial multiplexing under transmit-correlated Rician fading for an N_R X N_T channel matrix with rank-1 line-of-sight (LoS) component. By using matrix transformations and multivariate statistics, our exact analysis yields the signal-to-noise ratio moment generating function (m.g.f.) as an infinite series of gamma distribution m.g.f.'s and analogous series for ZF performance measures, e.g., outage probability and ergodic capacity. However, their numerical convergence is inherently problematic with increasing Rician K-factor, N_R , and N_T. We circumvent this limitation as follows. First, we derive differential equations satisfied by the performance measures with a novel automated approach employing a computer-algebra tool which implements Groebner basis computation and creative telesco**. These differential equations are then solved with the holonomic gradient method (HGM) from initial conditions computed with the infinite series. We demonstrate that HGM yields more reliable performance evaluation than by infinite series alone and more expeditious than by simulation, for realistic values of K , and even for N_R and N_T relevant to large MIMO systems. We envision extending the proposed approaches for exact analysis and reliable evaluation to more general Rician fading and other transceiver methods. △ Less

Submitted 19 May, 2016; v1 submitted 24 July, 2015; originally announced July 2015.

Comments: Accepted for publication by the IEEE Transactions on Wireless Communications, on April 7th, 2016; this is the final revision before publication

arXiv:1403.3788 [pdf, other]

doi 10.1109/TWC.2014.2385075

MIMO Zero-Forcing Performance Evaluation Using the Holonomic Gradient Method

Authors: Constantin Siriteanu, Akimichi Takemura, Satoshi Kuriki, Hyundong Shin, Christoph Koutschan

Abstract: For multiple-input multiple-output (MIMO) spatial-multiplexing transmission, zero-forcing detection (ZF) is appealing because of its low complexity. Our recent MIMO ZF performance analysis for Rician--Rayleigh fading, which is relevant in heterogeneous networks, has yielded for the ZF outage probability and ergodic capacity infinite-series expressions. Because they arose from expanding the conflue… ▽ More For multiple-input multiple-output (MIMO) spatial-multiplexing transmission, zero-forcing detection (ZF) is appealing because of its low complexity. Our recent MIMO ZF performance analysis for Rician--Rayleigh fading, which is relevant in heterogeneous networks, has yielded for the ZF outage probability and ergodic capacity infinite-series expressions. Because they arose from expanding the confluent hypergeometric function $ {_1\! F_1} (\cdot, \cdot, σ) $ around 0, they do not converge numerically at realistically-high Rician $ K $-factor values. Therefore, herein, we seek to take advantage of the fact that $ {_1\! F_1} (\cdot, \cdot, σ) $ satisfies a differential equation, i.e., it is a \textit{holonomic} function. Holonomic functions can be computed by the \textit{holonomic gradient method} (HGM), i.e., by numerically solving the satisfied differential equation. Thus, we first reveal that the moment generating function (m.g.f.) and probability density function (p.d.f.) of the ZF signal-to-noise ratio (SNR) are holonomic. Then, from the differential equation for $ {_1\! F_1} (\cdot, \cdot, σ) $, we deduce those satisfied by the SNR m.g.f. and p.d.f., and demonstrate that the HGM helps compute the p.d.f. accurately at practically-relevant values of $ K $. Finally, numerical integration of the SNR p.d.f. produced by HGM yields accurate ZF outage probability and ergodic capacity results. △ Less

Submitted 15 April, 2015; v1 submitted 15 March, 2014; originally announced March 2014.

Comments: This manuscript was accepted in December 2014

Journal ref: IEEE Transactions on Wireless Communications, vol. 14, no. 4, April 2015, pp. 2322-2335

arXiv:1401.0430 [pdf, other]

Schur Complement Based Analysis of MIMO Zero-Forcing for Rician Fading

Authors: Constantin Siriteanu, Akimichi Takemura, Satoshi Kuriki, Donald St. P. Richards, Hyundong Shin

Abstract: For multiple-input/multiple-output (MIMO) spatial multiplexing with zero-forcing detection (ZF), signal-to-noise ratio (SNR) analysis for Rician fading involves the cumbersome noncentral-Wishart distribution (NCWD) of the transmit sample-correlation (Gramian) matrix. An \textsl{approximation} with a \textsl{virtual} CWD previously yielded for the ZF SNR an approximate (virtual) Gamma distribution.… ▽ More For multiple-input/multiple-output (MIMO) spatial multiplexing with zero-forcing detection (ZF), signal-to-noise ratio (SNR) analysis for Rician fading involves the cumbersome noncentral-Wishart distribution (NCWD) of the transmit sample-correlation (Gramian) matrix. An \textsl{approximation} with a \textsl{virtual} CWD previously yielded for the ZF SNR an approximate (virtual) Gamma distribution. However, analytical conditions qualifying the accuracy of the SNR-distribution approximation were unknown. Therefore, we have been attempting to exactly characterize ZF SNR for Rician fading. Our previous attempts succeeded only for the sole Rician-fading stream under Rician--Rayleigh fading, by writing it as scalar Schur complement (SC) in the Gramian. Herein, we pursue a more general, matrix-SC-based analysis to characterize SNRs when several streams may undergo Rician fading. On one hand, for full-Rician fading, the SC distribution is found to be exactly a CWD if and only if a channel-mean--correlation \textsl{condition} holds. Interestingly, this CWD then coincides with the \textsl{virtual} CWD ensuing from the \textsl{approximation}. Thus, under the \textsl{condition}, the actual and virtual SNR-distributions coincide. On the other hand, for Rician--Rayleigh fading, the matrix-SC distribution is characterized in terms of determinant of matrix with elementary-function entries, which also yields a new characterization of the ZF SNR. Average error probability results validate our analysis vs.~simulation. △ Less

Submitted 26 September, 2014; v1 submitted 2 January, 2014; originally announced January 2014.

Comments: 32 pages, 4 figures, 1 table

arXiv:1307.2958 [pdf, other]

Exact MIMO Zero-Forcing Detection Analysis for Transmit-Correlated Rician Fading

Authors: Constantin Siriteanu, Steven Blostein, Akimichi Takemura, Hyundong Shin, Shahram Yousefi, Satoshi Kuriki

Abstract: We analyze the performance of multiple input/multiple output (MIMO) communications systems employing spatial multiplexing and zero-forcing detection (ZF). The distribution of the ZF signal-to-noise ratio (SNR) is characterized when either the intended stream or interfering streams experience Rician fading, and when the fading may be correlated on the transmit side. Previously, exact ZF analysis ba… ▽ More We analyze the performance of multiple input/multiple output (MIMO) communications systems employing spatial multiplexing and zero-forcing detection (ZF). The distribution of the ZF signal-to-noise ratio (SNR) is characterized when either the intended stream or interfering streams experience Rician fading, and when the fading may be correlated on the transmit side. Previously, exact ZF analysis based on a well-known SNR expression has been hindered by the noncentrality of the Wishart distribution involved. In addition, approximation with a central-Wishart distribution has not proved consistently accurate. In contrast, the following exact ZF study proceeds from a lesser-known SNR expression that separates the intended and interfering channel-gain vectors. By first conditioning on, and then averaging over the interference, the ZF SNR distribution for Rician-Rayleigh fading is shown to be an infinite linear combination of gamma distributions. On the other hand, for Rayleigh-Rician fading, the ZF SNR is shown to be gamma-distributed. Based on the SNR distribution, we derive new series expressions for the ZF average error probability, outage probability, and ergodic capacity. Numerical results confirm the accuracy of our new expressions, and reveal effects of interference and channel statistics on performance. △ Less

Submitted 2 January, 2014; v1 submitted 10 July, 2013; originally announced July 2013.

Comments: 14 pages, two-colum, 1 table, 10 figures

Report number: METR 2013-07

arXiv:1005.5273 [pdf, ps, other]

Holonomic Gradient Descent and its Application to Fisher-Bingham Integral

Authors: Tomonari Sei, Nobuki Takayama, Akimichi Takemura, Hiromasa Nakayama, Kenta Nishiyama, Masayuki Noro, Katsuyoshi Ohara

Abstract: We give a new algorithm to find local maximum and minimum of a holonomic function and apply it for the Fisher-Bingham integral on the sphere $S^n$, which is used in the directional statistics. The method utilizes the theory and algorithms of holonomic systems. We give a new algorithm to find local maximum and minimum of a holonomic function and apply it for the Fisher-Bingham integral on the sphere $S^n$, which is used in the directional statistics. The method utilizes the theory and algorithms of holonomic systems. △ Less

Submitted 6 September, 2010; v1 submitted 28 May, 2010; originally announced May 2010.

Comments: 23 pages, 1 figure

Journal ref: Advances in Applied Mathematics 47 (2011) 639-658

arXiv:cs/0607055 [pdf, ps, other]

Boundary cliques, clique trees and perfect sequences of maximal cliques of a chordal graph

Authors: Hisayuki Hara, Akimichi Takemura

Abstract: We characterize clique trees of a chordal graph in their relation to simplicial vertices and perfect sequences of maximal cliques. We investigate boundary cliques defined by Shibata and clarify their relation to endpoints of clique trees. Next we define a symmetric binary relation between the set of clique trees and the set of perfect sequences of maximal cliques. We describe the relation as a b… ▽ More We characterize clique trees of a chordal graph in their relation to simplicial vertices and perfect sequences of maximal cliques. We investigate boundary cliques defined by Shibata and clarify their relation to endpoints of clique trees. Next we define a symmetric binary relation between the set of clique trees and the set of perfect sequences of maximal cliques. We describe the relation as a bipartite graph and prove that the bipartite graph is always connected. Lastly we consider to characterize chordal graphs from the aspect of non-uniqueness of clique trees. △ Less

Submitted 11 July, 2006; originally announced July 2006.

ACM Class: G.2.2

arXiv:cs/0506007 [pdf, ps, other]

Defensive forecasting for linear protocols

Authors: Vladimir Vovk, Ilia Nouretdinov, Akimichi Takemura, Glenn Shafer

Abstract: We consider a general class of forecasting protocols, called "linear protocols", and discuss several important special cases, including multi-class forecasting. Forecasting is formalized as a game between three players: Reality, whose role is to generate observations; Forecaster, whose goal is to predict the observations; and Skeptic, who tries to make money on any lack of agreement between Fore… ▽ More We consider a general class of forecasting protocols, called "linear protocols", and discuss several important special cases, including multi-class forecasting. Forecasting is formalized as a game between three players: Reality, whose role is to generate observations; Forecaster, whose goal is to predict the observations; and Skeptic, who tries to make money on any lack of agreement between Forecaster's predictions and the actual observations. Our main mathematical result is that for any continuous strategy for Skeptic in a linear protocol there exists a strategy for Forecaster that does not allow Skeptic's capital to grow. This result is a meta-theorem that allows one to transform any continuous law of probability in a linear protocol into a forecasting strategy whose predictions are guaranteed to satisfy this law. We apply this meta-theorem to a weak law of large numbers in Hilbert spaces to obtain a version of the K29 prediction algorithm for linear protocols and show that this version also satisfies the attractive properties of proper calibration and resolution under a suitable choice of its kernel parameter, with no assumptions about the way the data is generated. △ Less

Submitted 24 September, 2005; v1 submitted 2 June, 2005; originally announced June 2005.

Comments: 16 pages

ACM Class: I.2.6; I.5.1

arXiv:cs/0505083 [pdf, ps, other]

Defensive forecasting

Authors: Vladimir Vovk, Akimichi Takemura, Glenn Shafer

Abstract: We consider how to make probability forecasts of binary labels. Our main mathematical result is that for any continuous gambling strategy used for detecting disagreement between the forecasts and the actual labels, there exists a forecasting strategy whose forecasts are ideal as far as this gambling strategy is concerned. A forecasting strategy obtained in this way from a gambling strategy demon… ▽ More We consider how to make probability forecasts of binary labels. Our main mathematical result is that for any continuous gambling strategy used for detecting disagreement between the forecasts and the actual labels, there exists a forecasting strategy whose forecasts are ideal as far as this gambling strategy is concerned. A forecasting strategy obtained in this way from a gambling strategy demonstrating a strong law of large numbers is simplified and studied empirically. △ Less

Submitted 30 May, 2005; originally announced May 2005.

Comments: 15 pages, 2 figures, to appear in the AIStats'2005 electronic proceedings

ACM Class: I.2.6; I.5.1

Journal ref: Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, 2005, pages 365--372.

Showing 1–12 of 12 results for author: Takemura, A