Skip to main content

Showing 1–50 of 105 results for author: Weissman, T

.
  1. arXiv:2406.16797  [pdf, other

    cs.CL cs.AI

    Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs

    Authors: Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal

    Abstract: Existing methods for adapting large language models (LLMs) to new tasks are not suited to multi-task adaptation because they modify all the model weights -- causing destructive interference between tasks. The resulting effects, such as catastrophic forgetting of earlier tasks, make it challenging to obtain good performance on multiple tasks at the same time. To mitigate this, we propose Lottery Ti… ▽ More

    Submitted 25 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2401.11088  [pdf, other

    quant-ph

    Lossy Compression for Schrödinger-style Quantum Simulations

    Authors: Noah Huffman, Dmitri Pavlichin, Tsachy Weissman

    Abstract: Simulating quantum circuits on classical hardware is a powerful and necessary tool for develo** and testing quantum algorithms and hardware as well as evaluating claims of quantum supremacy in the Noisy Intermediate-Scale Quantum (NISQ) regime. Schrödinger-style simulations are limited by the exponential growth of the number of state amplitudes which need to be stored. In this work, we apply sca… ▽ More

    Submitted 1 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

  3. arXiv:2306.12625  [pdf, other

    cs.LG cs.DC stat.ML

    Adaptive Compression in Federated Learning via Side Information

    Authors: Berivan Isik, Francesco Pase, Deniz Gunduz, Sanmi Koyejo, Tsachy Weissman, Michele Zorzi

    Abstract: The high communication cost of sending model updates from the clients to the server is a significant bottleneck for scalable federated learning (FL). Among existing approaches, state-of-the-art bitrate-accuracy tradeoffs have been achieved using stochastic compression methods -- in which the client $n$ sends a sample from a client-only probability distribution $q_{φ^{(n)}}$, and the server estimat… ▽ More

    Submitted 21 April, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Published at the International Conference on Artificial Intelligence and Statistics (AISTATS), 2024

  4. arXiv:2306.04924  [pdf, other

    cs.LG cs.CR cs.DC cs.IT stat.ML

    Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation

    Authors: Berivan Isik, Wei-Ning Chen, Ayfer Ozgur, Tsachy Weissman, Albert No

    Abstract: We study the mean estimation problem under communication and local differential privacy constraints. While previous work has proposed \emph{order}-optimal algorithms for the same problem (i.e., asymptotically optimal as we spend more bits), \emph{exact} optimality (in the non-asymptotic setting) still has not been achieved. In this work, we take a step towards characterizing the \emph{exact}-optim… ▽ More

    Submitted 28 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: Published at the Conference on Neural Information Processing Systems (NeurIPS), 2023

  5. arXiv:2305.01857  [pdf, other

    cs.IT

    Toward Textual Transform Coding

    Authors: Tsachy Weissman

    Abstract: Inspired by recent work on compression with and for young humans, the success of transform-based approaches to information processing, and the rise of powerful language-based AI, we propose \emph{textual transform coding}. It shares some of its key properties with traditional transform-based coding underlying much of our current multimedia compression technologies. It can form the basis for compre… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  6. arXiv:2212.10674  [pdf, other

    eess.IV

    PIM: Video Coding using Perceptual Importance Maps

    Authors: Evgenya Pergament, Pulkit Tandon, Oren Rippel, Lubomir Bourdev, Alexander G. Anderson, Bruno Olshausen, Tsachy Weissman, Sachin Katti, Kedar Tatwawadi

    Abstract: Human perception is at the core of lossy video compression, with numerous approaches developed for perceptual quality assessment and improvement over the past two decades. In the determination of perceptual quality, different spatio-temporal regions of the video differ in their relative importance to the human viewer. However, since it is challenging to infer or even collect such fine-grained info… ▽ More

    Submitted 9 April, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  7. arXiv:2211.06358  [pdf, other

    cs.GT cs.LG

    Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions

    Authors: Wei Zhang, Yanjun Han, Zhengyuan Zhou, Aaron Flores, Tsachy Weissman

    Abstract: With the advent and increasing consolidation of e-commerce, digital advertising has very recently replaced traditional advertising as the main marketing force in the economy. In the past four years, a particularly important development in the digital advertising industry is the shift from second-price auctions to first-price auctions for online display ads. This shift immediately motivated the int… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: 28 pages

  8. arXiv:2210.07437  [pdf, other

    cs.IT

    Upper bounds on the Rate of Uniformly-Random Codes for the Deletion Channel

    Authors: Berivan Isik, Francisco Pernice, Tsachy Weissman

    Abstract: We consider the maximum coding rate achievable by uniformly-random codes for the deletion channel. We prove an upper bound that's within 0.1 of the best known lower bounds for all values of the deletion probability $d,$ and much closer for small and large $d.$ We give simulation results which suggest that our upper bound is within 0.05 of the exact value for all $d$, and within $0.01$ for… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

  9. arXiv:2209.15328  [pdf, other

    cs.LG stat.AP stat.ML

    Sparse Random Networks for Communication-Efficient Federated Learning

    Authors: Berivan Isik, Francesco Pase, Deniz Gunduz, Tsachy Weissman, Michele Zorzi

    Abstract: One main challenge in federated learning is the large communication cost of exchanging weight updates from clients to the server at each round. While prior work has made great progress in compressing the weight updates through gradient compression methods, we propose a radically different approach that does not update the weights at all. Instead, our method freezes the weights at their initial \em… ▽ More

    Submitted 8 February, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Published at the International Conference on Learning Representations (ICLR) 2023

  10. arXiv:2205.03969  [pdf, other

    eess.IV

    An Interactive Annotation Tool for Perceptual Video Compression

    Authors: Evgenya Pergament, Pulkit Tandon, Kedar Tatwawadi, Oren Rippel, Lubomir Bourdev, Bruno Olshausen, Tsachy Weissman, Sachin Katti, Alexander G. Anderson

    Abstract: Human perception is at the core of lossy video compression and yet, it is challenging to collect data that is sufficiently dense to drive compression. In perceptual quality assessment, human feedback is typically collected as a single scalar quality score indicating preference of one distorted video over another. In reality, some videos may be better in some parts but not in others. We propose an… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

  11. arXiv:2202.02892  [pdf, other

    cs.IT cs.LG eess.SP

    Lossy Compression of Noisy Data for Private and Data-Efficient Learning

    Authors: Berivan Isik, Tsachy Weissman

    Abstract: Storage-efficient privacy-preserving learning is crucial due to increasing amounts of sensitive user data required for modern learning tasks. We propose a framework for reducing the storage cost of user data while at the same time providing privacy guarantees, without essential loss in the utility of the data for learning. Our method comprises noise injection followed by lossy compression. We show… ▽ More

    Submitted 22 March, 2023; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: Published at the IEEE Journal on Selected Areas in Information Theory (JSAIT). Preliminary version was presented at the IEEE International Symposium on Information Theory (ISIT), 2022, with a slightly different title, "Learning under Storage and Privacy Constraints."

  12. arXiv:2106.14014  [pdf, other

    eess.IV cs.MM

    Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text

    Authors: Pulkit Tandon, Shubham Chandak, Pat Pataranutaporn, Yimeng Liu, Anesu M. Mapuranga, Pattie Maes, Tsachy Weissman, Misha Sra

    Abstract: Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure. In addition, the recent COVID-19 pandemic fueled a surge in the use of video conferencing tools. Since videos take up considerable bandwidth (~100 Kbps to a few Mbps), improved video com… ▽ More

    Submitted 2 April, 2022; v1 submitted 26 June, 2021; originally announced June 2021.

    Comments: 11 pages, 8 figures, 2 table. Addition of statistical analysis of results. Reorganization and rewriting of text to make it clearer

  13. arXiv:2102.08329  [pdf, other

    cs.LG cs.IT eess.SP stat.ML

    An Information-Theoretic Justification for Model Pruning

    Authors: Berivan Isik, Tsachy Weissman, Albert No

    Abstract: We study the neural network (NN) compression problem, viewing the tension between the compression ratio and NN performance through the lens of rate-distortion theory. We choose a distortion metric that reflects the effect of NN compression on the model output and derive the tradeoff between rate (compression) and distortion. In addition to characterizing theoretical limits of NN compression, this… ▽ More

    Submitted 9 February, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: Published in the International Conference on Artificial Intelligence and Statistics (AISTATS) 2022. Previous titles: 1) Rate-Distortion Theoretic Model Compression: Successive Refinement for Pruning, 2) Successive pruning for model compression via rate distortion theory

  14. arXiv:2102.07725  [pdf, other

    cs.LG

    Neural Network Compression for Noisy Storage Devices

    Authors: Berivan Isik, Kristy Choi, Xin Zheng, Tsachy Weissman, Stefano Ermon, H. -S. Philip Wong, Armin Alaghi

    Abstract: Compression and efficient storage of neural network (NN) parameters is critical for applications that run on resource-constrained devices. Despite the significant progress in NN model compression, there has been considerably less investigation in the actual \textit{physical} storage of NN parameters. Conventionally, model compression and physical storage are decoupled, as digital storage media wit… ▽ More

    Submitted 13 March, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Published at the ACM Transactions on Embedded Computing Systems (TECS), 2023

  15. arXiv:2011.03800  [pdf, other

    eess.IV

    Reducing latency and bandwidth for video streaming using keypoint extraction and digital puppetry

    Authors: Roshan Prabhakar, Shubham Chandak, Carina Chiu, Renee Liang, Huong Nguyen, Kedar Tatwawadi, Tsachy Weissman

    Abstract: COVID-19 has made video communication one of the most important modes of information exchange. While extensive research has been conducted on the optimization of the video streaming pipeline, in particular the development of novel video codecs, further improvement in the video quality and latency is required, especially under poor network conditions. This paper proposes an alternative to the conve… ▽ More

    Submitted 8 January, 2021; v1 submitted 7 November, 2020; originally announced November 2020.

    Comments: 10 pages, 5 figures, 1-page summary to be published at DCC 2021. Revision: added references

  16. arXiv:2007.04568  [pdf, ps, other

    cs.LG cs.GT cs.IT stat.ML

    Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions

    Authors: Yanjun Han, Zhengyuan Zhou, Aaron Flores, Erik Ordentlich, Tsachy Weissman

    Abstract: First-price auctions have very recently swept the online advertising industry, replacing second-price auctions as the predominant auction mechanism on many platforms. This shift has brought forth important challenges for a bidder: how should one bid in a first-price auction, where unlike in second-price auctions, it is no longer optimal to bid one's private value truthfully and hard to know the ot… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

  17. arXiv:2003.09795  [pdf, other

    cs.LG cs.GT cs.IT stat.ME stat.ML

    Optimal No-regret Learning in Repeated First-price Auctions

    Authors: Yanjun Han, Zhengyuan Zhou, Tsachy Weissman

    Abstract: We study online learning in repeated first-price auctions where a bidder, only observing the winning bid at the end of each auction, learns to adaptively bid in order to maximize her cumulative payoff. To achieve this goal, the bidder faces censored feedback: if she wins the bid, then she is not able to observe the highest bid of the other bidders, which we assume is \textit{iid} drawn from an unk… ▽ More

    Submitted 4 March, 2024; v1 submitted 21 March, 2020; originally announced March 2020.

    Comments: To appear in Operations Research

  18. arXiv:1911.00208  [pdf, other

    eess.SP cs.LG

    LFZip: Lossy compression of multivariate floating-point time series data via improved prediction

    Authors: Shubham Chandak, Kedar Tatwawadi, Chengtao Wen, Lingyun Wang, Juan Aparicio, Tsachy Weissman

    Abstract: Time series data compression is emerging as an important problem with the growth in IoT devices and sensors. Due to the presence of noise in these datasets, lossy compression can often provide significant compression gains without impacting the performance of downstream applications. In this work, we propose an error-bounded lossy compressor, LFZip, for multivariate floating-point time series data… ▽ More

    Submitted 13 January, 2020; v1 submitted 1 November, 2019; originally announced November 2019.

  19. arXiv:1907.01582  [pdf, other

    cond-mat.stat-mech cs.IT physics.bio-ph

    Minimum Power to Maintain a Nonequilibrium Distribution of a Markov Chain

    Authors: Dmitri S. Pavlichin, Yihui Quek, Tsachy Weissman

    Abstract: Biological systems use energy to maintain non-equilibrium distributions for long times, e.g. of chemical concentrations or protein conformations. What are the fundamental limits of the power used to "hold" a stochastic system in a desired distribution over states? We study the setting of an uncontrolled Markov chain $Q$ altered into a controlled chain $P$ having a desired stationary distribution.… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: 9 pages, 5 figures

  20. arXiv:1904.03271  [pdf, ps, other

    cs.IT cs.DC math.CO

    Optimal Communication Rates and Combinatorial Properties for Common Randomness Generation

    Authors: Yanjun Han, Kedar Tatwawadi, Gowtham R. Kurri, Zhengqing Zhou, Vinod M. Prabhakaran, Tsachy Weissman

    Abstract: We study common randomness generation problems where $n$ players aim to generate same sequences of random coin flips where some subsets of the players share an independent common coin which can be tossed multiple times, and there is a publicly seen blackboard through which the players communicate with each other. We provide a tight representation of the optimal communication rates via linear progr… ▽ More

    Submitted 6 October, 2021; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: 17 pages, 10 figures

  21. arXiv:1811.07557  [pdf, other

    cs.LG cs.IT stat.ML

    Neural Joint Source-Channel Coding

    Authors: Kristy Choi, Kedar Tatwawadi, Aditya Grover, Tsachy Weissman, Stefano Ermon

    Abstract: For reliable transmission across a noisy communication channel, classical results from information theory show that it is asymptotically optimal to separate out the source and channel coding processes. However, this decomposition can fall short in the finite bit-length regime, as it requires non-trivial tuning of hand-crafted codes and assumes infinite computational power for decoding. In this wor… ▽ More

    Submitted 14 May, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  22. arXiv:1810.11137  [pdf, other

    eess.IV cs.CV cs.IT cs.MM

    Towards improved lossy image compression: Human image reconstruction with public-domain images

    Authors: Ashutosh Bhown, Soham Mukherjee, Sean Yang, Shubham Chandak, Irena Fischer-Hwang, Kedar Tatwawadi, Judith Fan, Tsachy Weissman

    Abstract: Lossy image compression has been studied extensively in the context of typical loss functions such as RMSE, MS-SSIM, etc. However, compression at low bitrates generally produces unsatisfying results. Furthermore, the availability of massive public image datasets appears to have hardly been exploited in image compression. Here, we present a paradigm for eliciting human image reconstruction in order… ▽ More

    Submitted 24 June, 2019; v1 submitted 25 October, 2018; originally announced October 2018.

  23. arXiv:1809.06522  [pdf, other

    cs.IT math.ST

    Concentration Inequalities for the Empirical Distribution

    Authors: Jay Mardia, Jiantao Jiao, Ervin Tánczos, Robert D. Nowak, Tsachy Weissman

    Abstract: We study concentration inequalities for the Kullback--Leibler (KL) divergence between the empirical distribution and the true distribution. Applying a recursion technique, we improve over the method of types bound uniformly in all regimes of sample size $n$ and alphabet size $k$, and the improvement becomes more significant when $k$ is large. We discuss the applications of our results in obtaining… ▽ More

    Submitted 18 October, 2019; v1 submitted 18 September, 2018; originally announced September 2018.

    Comments: Accepted for publication in Information and Inference

  24. arXiv:1805.01355  [pdf, ps, other

    cs.IT eess.SP

    Minimax redundancy for Markov chains with large state space

    Authors: Kedar Shriram Tatwawadi, Jiantao Jiao, Tsachy Weissman

    Abstract: For any Markov source, there exist universal codes whose normalized codelength approaches the Shannon limit asymptotically as the number of samples goes to infinity. This paper investigates how fast the gap between the normalized codelength of the "best" universal compressor and the Shannon limit (i.e. the compression redundancy) vanishes non-asymptotically in terms of the alphabet size and mixing… ▽ More

    Submitted 5 May, 2018; v1 submitted 1 May, 2018; originally announced May 2018.

    Comments: 22 pages, 1 figure

  25. arXiv:1802.08417  [pdf, ps, other

    cs.DC cs.IT stat.ME

    Geometric Lower Bounds for Distributed Parameter Estimation under Communication Constraints

    Authors: Yanjun Han, Ayfer Özgür, Tsachy Weissman

    Abstract: We consider parameter estimation in distributed networks, where each sensor in the network observes an independent sample from an underlying distribution and has $k$ bits to communicate its sample to a centralized processor which computes an estimate of a desired parameter. We develop lower bounds for the minimax risk of estimating the underlying parameter for a large class of losses and distribut… ▽ More

    Submitted 22 July, 2021; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: This version (v4) added a new corollary on logistic regression, as well as more discussions on sparse Gaussian mean estimation, compared to v3

    Journal ref: published in COLT 2018

  26. arXiv:1802.08405  [pdf, ps, other

    stat.ME cs.IT cs.LG

    Local moment matching: A unified methodology for symmetric functional estimation and distribution estimation under Wasserstein distance

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman

    Abstract: We present \emph{Local Moment Matching (LMM)}, a unified methodology for symmetric functional estimation and distribution estimation under Wasserstein distance. We construct an efficiently computable estimator that achieves the minimax rates in estimating the distribution up to permutation, and show that the plug-in approach of our unlabeled distribution estimator is "universal" in estimating symm… ▽ More

    Submitted 26 June, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

  27. arXiv:1802.07889  [pdf, ps, other

    cs.LG math.ST stat.ML

    Entropy Rate Estimation for Markov Chains with Large State Space

    Authors: Yanjun Han, Jiantao Jiao, Chuan-Zheng Lee, Tsachy Weissman, Yihong Wu, Tiancheng Yu

    Abstract: Estimating the entropy based on data is one of the prototypical problems in distribution property testing and estimation. For estimating the Shannon entropy of a distribution on $S$ elements with independent samples, [Paninski2004] showed that the sample complexity is sublinear in $S$, and [Valiant--Valiant2011] showed that consistent estimation of Shannon entropy is possible if and only if the sa… ▽ More

    Submitted 24 September, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper on NIPS 2018

  28. arXiv:1712.07177  [pdf, other

    cs.LG stat.ML

    Approximate Profile Maximum Likelihood

    Authors: Dmitri S. Pavlichin, Jiantao Jiao, Tsachy Weissman

    Abstract: We propose an efficient algorithm for approximate computation of the profile maximum likelihood (PML), a variant of maximum likelihood maximizing the probability of observing a sufficient statistic rather than the empirical sample. The PML has appealing theoretical properties, but is difficult to compute exactly. Inspired by observations gleaned from exactly solvable cases, we look for an approxim… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

  29. arXiv:1711.02141  [pdf, ps, other

    math.ST cs.IT stat.ME

    Optimal rates of entropy estimation over Lipschitz balls

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman, Yihong Wu

    Abstract: We consider the problem of minimax estimation of the entropy of a density over Lipschitz balls. Drop** the usual assumption that the density is bounded away from zero, we obtain the minimax rates $(n\ln n)^{-s/(s+d)} + n^{-1/2}$ for $0<s\leq 2$ for densities supported on $[0,1]^d$, where $s$ is the smoothness parameter and $n$ is the number of independent samples. We generalize the results to de… ▽ More

    Submitted 10 November, 2019; v1 submitted 6 November, 2017; originally announced November 2017.

  30. arXiv:1709.00134  [pdf, ps, other

    cs.IT

    Universality of Logarithmic Loss in Lossy Compression

    Authors: Albert No, Tsachy Weissman

    Abstract: We establish two strong senses of universality of logarithmic loss as a distortion criterion in lossy compression: For any fixed length lossy compression problem under an arbitrary distortion criterion, we show that there is an equivalent lossy compression problem under logarithmic loss. In the successive refinement problem, if the first decoder operates under logarithmic loss, we show that any di… ▽ More

    Submitted 31 August, 2017; originally announced September 2017.

  31. arXiv:1708.09041  [pdf, ps, other

    math.PR

    Generalizations of Maximal Inequalities to Arbitrary Selection Rules

    Authors: Jiantao Jiao, Yanjun Han, Tsachy Weissman

    Abstract: We present a generalization of the maximal inequalities that upper bound the expectation of the maximum of $n$ jointly distributed random variables. We control the expectation of a randomly selected random variable from $n$ jointly distributed random variables, and present bounds that are at least as tight as the classical maximal inequalities, and much tighter when the distribution of selection i… ▽ More

    Submitted 29 August, 2017; originally announced August 2017.

  32. arXiv:1707.01203  [pdf, ps, other

    cs.IT stat.ML

    Estimating the Fundamental Limits is Easier than Achieving the Fundamental Limits

    Authors: Jiantao Jiao, Yanjun Han, Irena Fischer-Hwang, Tsachy Weissman

    Abstract: We show through case studies that it is easier to estimate the fundamental limits of data processing than to construct explicit algorithms to achieve those limits. Focusing on binary classification, data compression, and prediction under logarithmic loss, we show that in the finite space setting, when it is possible to construct an estimator of the limits with vanishing error with $n$ samples, it… ▽ More

    Submitted 1 October, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

  33. Minimax Estimation of the $L_1$ Distance

    Authors: Jiantao Jiao, Yanjun Han, Tsachy Weissman

    Abstract: We consider the problem of estimating the $L_1$ distance between two discrete probability measures $P$ and $Q$ from empirical data in a nonasymptotic and large alphabet setting. When $Q$ is known and one obtains $n$ samples from $P$, we show that for every $Q$, the minimax rate-optimal estimator with $n$ samples achieves performance comparable to that of the maximum likelihood estimator (MLE) with… ▽ More

    Submitted 23 June, 2018; v1 submitted 2 May, 2017; originally announced May 2017.

    Comments: to appear on IEEE Transactions on Information Theory

  34. arXiv:1704.05199  [pdf, ps, other

    cs.IT

    Mutual Information, Relative Entropy and Estimation Error in Semi-martingale Channels

    Authors: Jiantao Jiao, Kartik Venkat, Tsachy Weissman

    Abstract: Fundamental relations between information and estimation have been established in the literature for the continuous-time Gaussian and Poisson channels, in a long line of work starting from the classical representation theorems by Duncan and Kabanov respectively. In this work, we demonstrate that such relations hold for a much larger family of continuous-time channels. We introduce the family of se… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.

  35. arXiv:1612.05845  [pdf, ps, other

    cs.IT

    Dependence Measures Bounding the Exploration Bias for General Measurements

    Authors: Jiantao Jiao, Yanjun Han, Tsachy Weissman

    Abstract: We propose a framework to analyze and quantify the bias in adaptive data analysis. It generalizes that proposed by Russo and Zou'15, applying to measurements whose moment generating function exists, measurements with a finite $p$-norm, and measurements in general Orlicz spaces. We introduce a new class of dependence measures which retain key properties of mutual information while more effectively… ▽ More

    Submitted 17 July, 2017; v1 submitted 17 December, 2016; originally announced December 2016.

  36. arXiv:1611.01186  [pdf, other

    cs.NE cs.LG stat.ML

    Demystifying ResNet

    Authors: Sihan Li, Jiantao Jiao, Yanjun Han, Tsachy Weissman

    Abstract: The Residual Network (ResNet), proposed in He et al. (2015), utilized shortcut connections to significantly reduce the difficulty of training, which resulted in great performance boosts in terms of both training and generalization error. It was empirically observed in He et al. (2015) that stacking more layers of residual blocks with shortcut 2 results in smaller training error, while it is not… ▽ More

    Submitted 20 May, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

  37. arXiv:1611.00270  [pdf, ps, other

    cs.IT

    When is Noisy State Information at the Encoder as Useless as No Information or as Good as Noise-Free State?

    Authors: Rui Xu, Jun Chen, Tsachy Weissman, Jian-Kang Zhang

    Abstract: For any binary-input channel with perfect state information at the decoder, if the mutual information between the noisy state observation at the encoder and the true channel state is below a positive threshold determined solely by the state distribution, then the capacity is the same as that with no encoder side information. A complementary phenomenon is revealed for the generalized probing capaci… ▽ More

    Submitted 1 November, 2016; originally announced November 2016.

    Comments: This paper was presented in part at the 2016 IEEE International Symposium on Information Theory. 16 pages, 8 figures

  38. Minimax Rate-Optimal Estimation of Divergences between Discrete Distributions

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman

    Abstract: We study the minimax estimation of $α$-divergences between discrete distributions for integer $α\ge 1$, which include the Kullback--Leibler divergence and the $χ^2$-divergences as special examples. Drop** the usual theoretical tricks to acquire independence, we construct the first minimax rate-optimal estimator which does not require any Poissonization, sample splitting, or explicit construction… ▽ More

    Submitted 3 March, 2021; v1 submitted 30 May, 2016; originally announced May 2016.

    Comments: This (v5) is a significantly revised version of (v2), and fixed some typos in (v4)

    Journal ref: Published in IEEE Journal on Selected Areas in Information Theory, vol. 1, no. 3, pp. 814-823, Nov. 2020

  39. arXiv:1511.04836  [pdf, ps, other

    q-bio.GN cs.IT

    DUDE-Seq: Fast, Flexible, and Robust Denoising for Targeted Amplicon Sequencing

    Authors: Byunghan Lee, Taesup Moon, Sungroh Yoon, Tsachy Weissman

    Abstract: We consider the correction of errors from nucleotide sequences produced by next-generation targeted amplicon sequencing. The next-generation sequencing (NGS) platforms can provide a great deal of sequencing data thanks to their high throughput, but the associated error rates often tend to be high. Denoising in high-throughput sequencing has thus become a crucial process for boosting the reliabilit… ▽ More

    Submitted 4 July, 2017; v1 submitted 16 November, 2015; originally announced November 2015.

  40. arXiv:1506.03407  [pdf, ps, other

    cs.IT

    Strong Successive Refinability and Rate-Distortion-Complexity Tradeoff

    Authors: Albert No, Amir Ingber, Tsachy Weissman

    Abstract: We investigate the second order asymptotics (source dispersion) of the successive refinement problem. Similarly to the classical definition of a successively refinable source, we say that a source is strongly successively refinable if successive refinement coding can achieve the second order optimum rate (including the dispersion terms) at both decoders. We establish a sufficient condition for str… ▽ More

    Submitted 15 March, 2016; v1 submitted 10 June, 2015; originally announced June 2015.

  41. Does Dirichlet Prior Smoothing Solve the Shannon Entropy Estimation Problem?

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman

    Abstract: The Dirichlet prior is widely used in estimating discrete distributions and functionals of discrete distributions. In terms of Shannon entropy estimation, one approach is to plug-in the Dirichlet prior smoothed distribution into the entropy functional, while the other one is to calculate the Bayes estimator for entropy under the Dirichlet prior for squared error, which is the conditional expectati… ▽ More

    Submitted 18 September, 2017; v1 submitted 1 February, 2015; originally announced February 2015.

    Comments: 27 pages, 1 figure, published on IEEE Transactions on Information Theory, merged with https://arxiv.longhoe.net/abs/1406.6959

    Journal ref: IEEE Transactions on Information Theory, vol. 63, no. 10, pp. 6774-6798, Oct. 2017

  42. arXiv:1502.00326  [pdf, other

    cs.IT

    Adaptive Estimation of Shannon Entropy

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman

    Abstract: We consider estimating the Shannon entropy of a discrete distribution $P$ from $n$ i.i.d. samples. Recently, Jiao, Venkat, Han, and Weissman, and Wu and Yang constructed approximation theoretic estimators that achieve the minimax $L_2$ rates in estimating entropy. Their estimators are consistent given $n \gg \frac{S}{\ln S}$ samples, where $S$ is the alphabet size, and it is the best possible samp… ▽ More

    Submitted 1 January, 2019; v1 submitted 1 February, 2015; originally announced February 2015.

  43. arXiv:1411.1467  [pdf, ps, other

    cs.IT

    Minimax Estimation of Discrete Distributions under $\ell_1$ Loss

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman

    Abstract: We analyze the problem of discrete distribution estimation under $\ell_1$ loss. We provide non-asymptotic upper and lower bounds on the maximum risk of the empirical distribution (the maximum likelihood estimator), and the minimax risk in regimes where the alphabet size $S$ may grow with the number of observations $n$. We show that among distributions with bounded entropy $H$, the asymptotic maxim… ▽ More

    Submitted 28 December, 2015; v1 submitted 5 November, 2014; originally announced November 2014.

    Journal ref: IEEE Transactions on Information Theory, Vol. 61, No. 11, pp 6343-6354, Nov. 2015

  44. arXiv:1409.7458  [pdf, ps, other

    stat.ME cs.DS cs.IT stat.ML

    Beyond Maximum Likelihood: from Theory to Practice

    Authors: Jiantao Jiao, Kartik Venkat, Yanjun Han, Tsachy Weissman

    Abstract: Maximum likelihood is the most widely used statistical estimation technique. Recent work by the authors introduced a general methodology for the construction of estimators for functionals in parametric models, and demonstrated improvements - both in theory and in practice - over the maximum likelihood estimator (MLE), particularly in high dimensional scenarios involving parameter dimension compara… ▽ More

    Submitted 25 September, 2014; originally announced September 2014.

  45. Maximum Likelihood Estimation of Functionals of Discrete Distributions

    Authors: Jiantao Jiao, Kartik Venkat, Yanjun Han, Tsachy Weissman

    Abstract: We consider the problem of estimating functionals of discrete distributions, and focus on tight nonasymptotic analysis of the worst case squared error risk of widely used estimators. We apply concentration inequalities to analyze the random fluctuation of these estimators around their expectations, and the theory of approximation using positive linear operators to analyze the deviation of their ex… ▽ More

    Submitted 9 August, 2017; v1 submitted 26 June, 2014; originally announced June 2014.

    Comments: 27 pages, 1 figure, published in IEEE Transactions on Information Theory

  46. arXiv:1406.6956  [pdf, other

    cs.IT math.ST

    Minimax Estimation of Functionals of Discrete Distributions

    Authors: Jiantao Jiao, Kartik Venkat, Yanjun Han, Tsachy Weissman

    Abstract: We propose a general methodology for the construction and analysis of minimax estimators for a wide class of functionals of finite dimensional parameters, and elaborate on the case of discrete distributions, where the alphabet size $S$ is unknown and may be comparable with the number of observations $n$. We treat the respective regions where the functional is "nonsmooth" and "smooth" separately. I… ▽ More

    Submitted 10 March, 2015; v1 submitted 26 June, 2014; originally announced June 2014.

    Comments: To appear in IEEE Transactions on Information Theory

  47. arXiv:1406.6730  [pdf, other

    cs.IT

    Rateless Lossy Compression via the Extremes

    Authors: Albert No, Tsachy Weissman

    Abstract: We begin by presenting a simple lossy compressor operating at near-zero rate: The encoder merely describes the indices of the few maximal source components, while the decoder's reconstruction is a natural estimate of the source components based on this information. This scheme turns out to be near-optimal for the memoryless Gaussian source in the sense of achieving the zero-rate slope of its disto… ▽ More

    Submitted 8 March, 2016; v1 submitted 25 June, 2014; originally announced June 2014.

  48. Distortion-Rate Function of Sub-Nyquist Sampled Gaussian Sources

    Authors: Alon Kipnis, Andrea J. Goldsmith, Yonina C. Eldar, Tsachy Weissman

    Abstract: The amount of information lost in sub-Nyquist sampling of a continuous-time Gaussian stationary process is quantified. We consider a combined source coding and sub-Nyquist reconstruction problem in which the input to the encoder is a noisy sub-Nyquist sampled version of the analog source. We first derive an expression for the mean squared error in the reconstruction of the process from a noisy and… ▽ More

    Submitted 6 November, 2015; v1 submitted 21 May, 2014; originally announced May 2014.

    Comments: Accepted for publication at the IEEE transactions on information theory

    Journal ref: Information Theory, IEEE Transactions on , vol.62, no.1, pp.401-429, Jan. 2016

  49. arXiv:1404.6812  [pdf, other

    cs.IT math.PR

    Relations between Information and Estimation in Discrete-Time Lévy Channels

    Authors: Jiantao Jiao, Kartik Venkat, Tsachy Weissman

    Abstract: Fundamental relations between information and estimation have been established in the literature for the discrete-time Gaussian and Poisson channels. In this work, we demonstrate that such relations hold for a much larger class of observation models. We introduce the natural family of discrete-time Lévy channels where the distribution of the output conditioned on the input is infinitely divisible.… ▽ More

    Submitted 1 February, 2017; v1 submitted 27 April, 2014; originally announced April 2014.

  50. Information Measures: the Curious Case of the Binary Alphabet

    Authors: Jiantao Jiao, Thomas Courtade, Albert No, Kartik Venkat, Tsachy Weissman

    Abstract: Four problems related to information divergence measures defined on finite alphabets are considered. In three of the cases we consider, we illustrate a contrast which arises between the binary-alphabet and larger-alphabet settings. This is surprising in some instances, since characterizations for the larger-alphabet settings do not generalize their binary-alphabet counterparts. Specifically, we sh… ▽ More

    Submitted 28 November, 2014; v1 submitted 27 April, 2014; originally announced April 2014.

    Comments: to appear in IEEE Transactions on Information Theory