Search | arXiv e-print repository

Constructing Multiclass Classifiers using Binary Classifiers Under Log-Loss

Authors: Assaf Ben-Yishai, Or Ordentlich

Abstract: The construction of multiclass classifiers from binary elements is studied in this paper, and performance is quantified by the regret, defined with respect to the Bayes optimal log-loss. We discuss two known methods. The first is one vs. all (OVA), for which we prove that the multiclass regret is upper bounded by the sum of binary regrets of the constituent classifiers. The second is hierarchical… ▽ More The construction of multiclass classifiers from binary elements is studied in this paper, and performance is quantified by the regret, defined with respect to the Bayes optimal log-loss. We discuss two known methods. The first is one vs. all (OVA), for which we prove that the multiclass regret is upper bounded by the sum of binary regrets of the constituent classifiers. The second is hierarchical classification, based on a binary tree. For this method we prove that the multiclass regret is exactly a weighted sum of constituent binary regrets where the weighing is determined by the tree structure. We also introduce a leverage-hierarchical classification method, which potentially yields smaller log-loss and regret. The advantages of these classification methods are demonstrated by simulation on both synthetic and real-life datasets. △ Less

Submitted 12 August, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

Comments: A shorter version of this contribution was presented in ISIT 2021

arXiv:2008.01686 [pdf, other]

Simple Modulo can Significantly Outperform Deep Learning-based Deepcode

Authors: Assaf Ben-Yishai, Ofer Shayevitz

Abstract: Deepcode (H.Kim et al.2018) is a recently suggested Deep Learning-based scheme for communication over the AWGN channel with noisy feedback, claimed to be superior to all previous schemes in the literature. Deepcode's use of nonlinear coding (via Deep Learning) has been inspired by known shortcomings (Y.-H. Kim et al 2007) of linear feedback schemes. In 2014, we presented a nonlinear feedback codin… ▽ More Deepcode (H.Kim et al.2018) is a recently suggested Deep Learning-based scheme for communication over the AWGN channel with noisy feedback, claimed to be superior to all previous schemes in the literature. Deepcode's use of nonlinear coding (via Deep Learning) has been inspired by known shortcomings (Y.-H. Kim et al 2007) of linear feedback schemes. In 2014, we presented a nonlinear feedback coding scheme based on a combination of the classical SK scheme and modulo-arithmetic, using a small number of elementary operations without any type of neural network. This Modulo-SK scheme has been omitted from the performance comparisons made in the Deepcode paper, due to its use of common randomness (dither), and in a later version since it was incorrectly interpreted as a variable-length coding scheme. However, the dither in Modulo-SK was used only for the standard purpose of tractable performance analysis, and is not required in practice. In this short note, we show that a fully-deterministic Modulo-SK (without dithering) can outperform Deepcode. For example, to attain an error probability of 10^(-4) at rate 1/3 Modulo-SK requires 3dB less feedback SNR than Deepcode. To attain an error probability of 10^(-6) with noiseless feedback, Deepcode requires 150 rounds of communication, whereas Modulo-SK requires only 15 rounds, even if the feedback is noisy (with 27dB SNR). We further address the numerical stability issues of the original SK scheme reported in the Deepcode paper, and explain how they can be avoided. We augment this report with an online-available, fully-functional Matlab simulation for both the classical and Modulo-SK schemes. Finally, note that Modulo-SK is by no means claimed to be the best possible solution; in particular, using deep learning in conjunction with modulo-arithmetic might lead to better designs, and remains a fascinating direction for future research. △ Less

Submitted 5 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

Comments: Technical Report, 4 Pages, 3 Figures

arXiv:2001.00499 [pdf, other]

doi 10.3390/e23010017

On the Interactive Capacity of Finite-State Protocols

Authors: Assaf Ben-Yishai, Young-Han Kim, Rotem Oshman, Ofer Shayevitz

Abstract: The interactive capacity of a noisy channel is the highest possible rate at which arbitrary interactive protocols can be simulated reliably over the channel. Determining the interactive capacity is notoriously difficult, and the best known lower bounds are far below the associated Shannon capacity, which serves as a trivial (and also generally the best known) upper bound. This paper considers the… ▽ More The interactive capacity of a noisy channel is the highest possible rate at which arbitrary interactive protocols can be simulated reliably over the channel. Determining the interactive capacity is notoriously difficult, and the best known lower bounds are far below the associated Shannon capacity, which serves as a trivial (and also generally the best known) upper bound. This paper considers the more restricted setup of simulating finite-state protocols. It is shown that all two-state protocols, as well as rich families of arbitrary finite-state protocols, can be simulated at the Shannon capacity, establishing the interactive capacity for those families of protocols. △ Less

Submitted 31 December, 2019; originally announced January 2020.

Comments: arXiv admin note: text overlap with arXiv:1908.07367

arXiv:1908.07367 [pdf, ps, other]

A Lower Bound on the Essential Interactive Capacity of Binary Memoryless Symmetric Channels

Authors: Assaf Ben-Yishai, Young-Han Kim, Or Ordentlich, Ofer Shayevitz

Abstract: The essential interactive capacity of a discrete memoryless channel is defined in this paper as the maximal rate at which the transcript of any interactive protocol can be reliably simulated over the channel, using a deterministic coding scheme. In contrast to other interactive capacity definitions in the literature, this definition makes no assumptions on the order of speakers (which can be adapt… ▽ More The essential interactive capacity of a discrete memoryless channel is defined in this paper as the maximal rate at which the transcript of any interactive protocol can be reliably simulated over the channel, using a deterministic coding scheme. In contrast to other interactive capacity definitions in the literature, this definition makes no assumptions on the order of speakers (which can be adaptive) and does not allow any use of private / public randomness; hence, the essential interactive capacity is a function of the channel model only. It is shown that the essential interactive capacity of any binary memoryless symmetric (BMS) channel is at least $0.0302$ its Shannon capacity. To that end, we present a simple coding scheme, based on extended-Hamming codes combined with error detection, that achieves the lower bound in the special case of the binary symmetric channel (BSC). We then adapt the scheme to the entire family of BMS channels, and show that it achieves the same lower bound using extremes of the Bhattacharyya parameter. △ Less

Submitted 12 August, 2021; v1 submitted 20 August, 2019; originally announced August 2019.

arXiv:1801.01022 [pdf, other]

Shannon Capacity is Achievable for Binary Interactive First-Order Markovian Protocols

Authors: Assaf Ben-Yishai, Ofer Shayevitz, Young-Han Kim

Abstract: We address the problem of simulating an arbitrary binary interactive first-order Markovian protocol over a pair of binary symmetric channels with crossover probability $\varepsilon$. We are interested in the achievable rates of reliable simulation, i.e., in characterizing the smallest possible blowup in communications such that a vanishing error probability (in the protocol length) can be attained… ▽ More We address the problem of simulating an arbitrary binary interactive first-order Markovian protocol over a pair of binary symmetric channels with crossover probability $\varepsilon$. We are interested in the achievable rates of reliable simulation, i.e., in characterizing the smallest possible blowup in communications such that a vanishing error probability (in the protocol length) can be attained. Whereas for general interactive protocols the output of each party may depend on all previous outputs of its counterpart, in a (first-order) Markovian protocol this dependence is limited to the last observed output only. Previous works in the field discuss broader families of protocols but assess the achievable rates only at the limit where $\varepsilon\to0$. In this paper, we prove that the one-way Shannon capacity, $1-h(\varepsilon)$, can be achieved for any binary first-order Markovian protocol. This surprising result, is to the best of our knowledge, the first example in which non-trivial interactive protocol can be simulated in the Shannon capacity. We give two capacity achieving coding schemes, which both divide the protocol into independent blocks, and implement vertical block coding. The first scheme is based on a random separation into blocks with variable lengths. The second scheme is based on a deterministic separation into blocks, and efficiently predicting their last transmission. The prediction can be regarded as a binary pointer jum** game, for which we show that the final step can be calculated with $O(\log m)$ bits, where $m$ is the number of rounds in the game. We conclude the work by discussing possible extensions of the result to higher order models. △ Less

Submitted 6 August, 2018; v1 submitted 30 December, 2017; originally announced January 2018.

Comments: arXiv admin note: text overlap with arXiv:1709.09123

arXiv:1709.09123 [pdf, ps, other]

Interactive Coding for Markovian Protocols

Authors: Assaf Ben-Yishai, Ofer Shayevitz, Young-Han Kim

Abstract: We address the problem of simulating an arbitrary Markovian interactive protocol over binary symmetric channels with crossover probability $\varepsilon$. We are interested in the achievable rates of reliable simulation, i.e., in characterizing the smallest possible blowup in communications such that a vanishing error probability (in the protocol length) can be attained. Whereas for general interac… ▽ More We address the problem of simulating an arbitrary Markovian interactive protocol over binary symmetric channels with crossover probability $\varepsilon$. We are interested in the achievable rates of reliable simulation, i.e., in characterizing the smallest possible blowup in communications such that a vanishing error probability (in the protocol length) can be attained. Whereas for general interactive protocols the output of each party may depend on all previous outputs of its counterpart, in a (first order) Markovian protocol this dependence is limited to the last observed output only. In the special case where there is no dependence on previous outputs (no interaction), the maximal achievable rate is given by the (one-way) Shannon capacity $1-h(\varepsilon)$. For Markovian protocols, we first show that a rate of $\frac{2}{3}(1-h(\varepsilon))$ can be trivially achieved. We then describe a more involved coding scheme and provide a closed-form lower bound for its rate at any noise level $\varepsilon$. Specifically, we show that this scheme outperforms the trivial one for any $\varepsilon<0.044$, and achieves a rate higher than $\frac{1-h(\varepsilon)}{1+h(\varepsilon)+h\left(<\varepsilon(2-\varepsilon)>\right)}=1-Θ(h(\varepsilon))$ as $\varepsilon\to 0$, which is order-wise the best possible. This should be juxtaposed with a result of Kol and Raz that shows the capacity for interactive protocols with alternating rounds is lower bounded by $1-O(\sqrt{h(\varepsilon)})$. △ Less

Submitted 30 October, 2017; v1 submitted 26 September, 2017; originally announced September 2017.

Comments: Allerton 2017

arXiv:1509.03085 [pdf, ps, other]

Interactive Schemes for the AWGN Channel with Noisy Feedback

Authors: Assaf Ben-Yishai, Ofer Shayevitz

Abstract: We study the problem of communication over an additive white Gaussian noise (AWGN) channel with an AWGN feedback channel. When the feedback channel is noiseless, the classic Schalkwijk-Kailath (S-K) scheme is known to achieve capacity in a simple sequential fashion, while attaining reliability superior to non-feedback schemes. In this work, we show how simplicity and reliability can be attained ev… ▽ More We study the problem of communication over an additive white Gaussian noise (AWGN) channel with an AWGN feedback channel. When the feedback channel is noiseless, the classic Schalkwijk-Kailath (S-K) scheme is known to achieve capacity in a simple sequential fashion, while attaining reliability superior to non-feedback schemes. In this work, we show how simplicity and reliability can be attained even when the feedback is noisy, provided that the feedback channel is sufficiently better than the feedforward channel. Specifically, we introduce a low-complexity low-delay interactive scheme that operates close to capacity for a fixed bit error probability (e.g. $10^{-6}$). We then build on this scheme to provide two asymptotic constructions, one based on high dimensional lattices, and the other based on concatenated coding, that admit an error exponent significantly exceeding the best possible non-feedback exponent. Our approach is based on the interpretation of feedback transmission as a side-information problem, and employs an interactive modulo-lattice solution. △ Less

Submitted 3 January, 2017; v1 submitted 10 September, 2015; originally announced September 2015.

Comments: Accepted for publication in the IEEE Transactions on Information Theory

arXiv:1503.05297 [pdf, ps, other]

The AWGN BC with MAC Feedback: A Reduction to Noiseless Feedback via Interaction

Authors: Assaf Ben-Yishai, Ofer Shayevitz

Abstract: We consider the problem of communication over a two-user Additive White Gaussian Noise Broadcast Channel (AWGN-BC) with an AWGN Multiple Access (MAC) active feedback. We describe a constructive reduction from this setup to the well-studied setup of linear-feedback coding over the AWGN-BC with noiseless feedback (and different parameters). This reduction facilitates the design of linear-feedback co… ▽ More We consider the problem of communication over a two-user Additive White Gaussian Noise Broadcast Channel (AWGN-BC) with an AWGN Multiple Access (MAC) active feedback. We describe a constructive reduction from this setup to the well-studied setup of linear-feedback coding over the AWGN-BC with noiseless feedback (and different parameters). This reduction facilitates the design of linear-feedback coding schemes in the (passive) noiseless feedback regime, which can then be easily and constructively transformed into coding schemes in the MAC feedback regime that attain the exact same rates. Our construction introduces an element of interaction into the coding protocol, and is based on modulo-lattice operations. As an example, we apply our method to the Ozarow-Leung scheme, and demonstrate how MAC feedback can be used to increase the capacity region of the AWGN-BC. △ Less

Submitted 20 May, 2015; v1 submitted 18 March, 2015; originally announced March 2015.

Comments: Invited talk ITW 2015, Jerusalem

arXiv:1501.06671 [pdf, ps, other]

The Gaussian Channel with Noisy Feedback: Improving Reliability via Interaction

Authors: Assaf Ben-Yishai, Ofer Shayevitz

Abstract: Consider a pair of terminals connected by two independent (feedforward and feedback) Additive White Gaussian Noise (AWGN) channels, and limited by individual power constraints. The first terminal would like to reliably send information to the second terminal at a given rate. While the reliability in the cases of no feedback and of noiseless feedback is well studied, not much is known about the cas… ▽ More Consider a pair of terminals connected by two independent (feedforward and feedback) Additive White Gaussian Noise (AWGN) channels, and limited by individual power constraints. The first terminal would like to reliably send information to the second terminal at a given rate. While the reliability in the cases of no feedback and of noiseless feedback is well studied, not much is known about the case of noisy feedback. In this work, we present an interactive scheme that significantly improves the reliability relative to the no-feedback setting, whenever the feedback Signal to Noise Ratio (SNR) is sufficiently larger than the feedforward SNR. The scheme combines Schalkwijk-Kailath (S-K) coding and modulo--lattice analog transmission. △ Less

Submitted 15 April, 2015; v1 submitted 27 January, 2015; originally announced January 2015.

Comments: Accepted for ISIT 2015

arXiv:1407.8022 [pdf, ps, other]

The Gaussian Channel with Noisy Feedback: Near-Capacity Performance via Simple Interaction

Authors: Assaf Ben-Yishai, Ofer Shayevitz

Abstract: Consider a pair of terminals connected by two independent additive white Gaussian noise channels, and limited by individual power constraints. The first terminal would like to reliably send information to the second terminal, within a given error probability. We construct an explicit interactive scheme consisting of only (non-linear) scalar operations, by endowing the Schalkwijk-Kailath noiseless… ▽ More Consider a pair of terminals connected by two independent additive white Gaussian noise channels, and limited by individual power constraints. The first terminal would like to reliably send information to the second terminal, within a given error probability. We construct an explicit interactive scheme consisting of only (non-linear) scalar operations, by endowing the Schalkwijk-Kailath noiseless feedback scheme with modulo arithmetic. Our scheme achieves a communication rate close to the Shannon limit, in a small number of rounds. For example, for an error probability of $10^{-6}$, if the Signal to Noise Ratio ($\mathrm{SNR}$) of the feedback channel exceeds the $\mathrm{SNR}$ of the forward channel by $20\mathrm{dB}$, our scheme operates $0.8\mathrm{dB}$ from the Shannon limit with only $19$ rounds of interaction. In comparison, attaining the same performance using state of the art Forward Error Correction (FEC) codes requires two orders of magnitude increase in delay and complexity. On the other extreme, a minimal delay uncoded system with the same error probability is bounded away by $9\mathrm{dB}$ from the Shannon limit. △ Less

Submitted 15 December, 2014; v1 submitted 30 July, 2014; originally announced July 2014.

Comments: Allerton Conference on Communication, Control, and Computing, October 2014

Showing 1–10 of 10 results for author: Ben-Yishai, A