Skip to main content

Showing 1–40 of 40 results for author: Ryabko, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14570  [pdf, ps, other

    cs.IT

    A general method for the development of constrained codes

    Authors: Boris Ryabko

    Abstract: Nowadays there are several classes of constrained codes intended for different applications. The following two large classes can be distinguished. The first class contains codes with local constraints; for example, the source data must be encoded by binary sequences containing no sub-words 00 and 111. The second class contains codes with global constraints; for example, the code-words must be bina… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    MSC Class: 94A29

  2. arXiv:2404.02708  [pdf, ps, other

    cs.IT

    Building test batteries based on analysing random number generator tests within the framework of algorithmic information theory

    Authors: Boris Ryabko

    Abstract: The problem of testing random number generators is considered and it is shown that an approach based on algorithmic information theory allows us to compare the power of different tests in some cases where the available methods of mathematical statistics do not distinguish between the tests. In particular, it is shown that tests based on data compression methods using dictionaries should be include… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    MSC Class: 03D32 68Q30 62A01 62G10

  3. arXiv:2307.09735  [pdf, ps, other

    cs.CR cs.IT

    Reduction of the secret key length in the perfect cipher by data compression and randomisation

    Authors: Boris Ryabko

    Abstract: Perfect ciphers have been a very attractive cryptographic tool ever since C. Shannon described them. Note that, by definition, if a perfect cipher is used, no one can get any information about the encrypted message without knowing the secret key. We consider the problem of reducing the key length of perfect ciphers, because in many applications the length of the secret key is a crucial parameter.… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  4. arXiv:2303.15258  [pdf, ps, other

    cs.CR cs.IT

    Unconditionally secure ciphers with a short key for a source with unknown statistics

    Authors: Boris Ryabko

    Abstract: We consider the problem of constructing an unconditionally secure cipher with a short key for the case where the probability distribution of encrypted messages is unknown. Note that unconditional security means that an adversary with no computational constraints can obtain only a negligible amount of information ("leakage") about an encrypted message (without knowing the key). Here we consider the… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  5. arXiv:2205.03890  [pdf, ps, other

    cs.CR

    Entropically secure cipher for messages generated by Markov chains with unknown statistics

    Authors: Boris Ryabko

    Abstract: In 2002, Russell and Wang proposed a definition of entropically security that was developed within the framework of secret key cryptography. An entropically-secure system is unconditionally secure, that is, unbreakable, regardless of the enemy's computing power. In 2004, Dodis and Smith developed the results of Russell and Wang and, in particular, stated that the concept of an entropy-protected sy… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

  6. arXiv:2112.10173  [pdf, ps, other

    cs.CR

    Using data compression and randomization to build an unconditionally secure short key cipher

    Authors: Boris Ryabko

    Abstract: We consider the problem of constructing an unconditionally secure cipher for the case when the key length is less than the length of the encrypted message. (Unconditional security means that a computationally unbounded adversary cannot obtain information about the encrypted message without the key.) In this article, we propose data compression and randomization techniques combined with entropicall… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    MSC Class: 94A60 Cryptography

  7. arXiv:2107.14577  [pdf, ps, other

    cs.IT cs.DS

    Fast direct access to variable length codes

    Authors: Boris Ryabko

    Abstract: We consider the issue of direct access to any letter of a sequence encoded with a variable length code and stored in the computer's memory, which is a special case of the random access problem to compressed memory. The characteristics according to which methods are evaluated are the access time to one letter and the memory used. The proposed methods, with various trade-offs between the characteris… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    ACM Class: E.2; E.4

  8. arXiv:2105.06638  [pdf, ps, other

    math.ST cs.CR cs.IT

    Calibrating random number generator tests

    Authors: Boris Ryabko

    Abstract: Currently, statistical tests for random number generators (RNGs) are widely used in practice, and some of them are even included in information security standards. But despite the popularity of RNGs, consistent tests are known only for stationary ergodic deviations of randomness (a test is consistent if it detects any deviations from a given class when the sample size goes to $ \infty $). However,… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    MSC Class: 68Q30; 62A01; 65C10 ACM Class: G.3

  9. arXiv:2008.08955  [pdf, ps, other

    cs.IT

    Linear hash-functions and their applications to error detection and correction

    Authors: Boris Ryabko

    Abstract: We describe and explore so-called linear hash functions and show how they can be used to build error detection and correction codes. The method can be applied for different types of errors (for example, burst errors). When the method is applied to a model where number of distorted letters is limited, the obtained estimate of its performance is slightly better than the known Varshamov-Gilbert bound… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    MSC Class: 94B60 ACM Class: E.4

  10. arXiv:2002.07271  [pdf, other

    cs.IT cs.AR

    Information Theory as a Means of Determining the Main Factors Affecting the Processors Architecture

    Authors: Anton Rakitskiy, Boris Ryabko

    Abstract: In this article we are investigating the computers development process in the past decades in order to identify the factors that influence it the most. We describe such factors and use them to predict the direction of further development. To solve these problems, we use the concept of the Computer Capacity, which allows us to estimate the performance of computers theoretically, relying only on the… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: 7 pages, 1 figure

  11. arXiv:2001.11838  [pdf, ps, other

    math.ST cs.CR

    The time-adaptive statistical testing for random number generators

    Authors: Boris Ryabko

    Abstract: The problem of constructing effective statistical tests for random number generators (RNG) is considered. Currently, there are hundreds of RNG statistical tests that are often combined into so-called batteries, each containing from a dozen to more than one hundred tests. When a battery test is used, it is applied to a sequence generated by the RNG, and the calculation time is determined by the l… ▽ More

    Submitted 7 February, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1912.06542

  12. arXiv:1912.06542  [pdf, other

    cs.IT cs.CR math.ST

    On asymptotically optimal tests for random number generators

    Authors: Boris Ryabko

    Abstract: The problem of constructing effective statistical tests for random number generators (RNG) is considered. Currently, statistical tests for RNGs are a mandatory part of cryptographic information protection systems, but their effectiveness is mainly estimated based on experiments with various RNGs. We find an asymptotic estimate for the p-value of an optimal test in the case where the alternative… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

    MSC Class: 62G10

  13. arXiv:1904.03825  [pdf, ps, other

    cs.IT stat.AP

    Application of data compression techniques to time series forecasting

    Authors: K. S. Chirikhin, B. Ya. Ryabko

    Abstract: In this study we show that standard well-known file compression programs (zlib, bzip2, etc.) are able to forecast real-world time series data well. The strength of our approach is its ability to use a set of data compression algorithms and "automatically" choose the best one of them during the process of forecasting. Besides, modern data-compressors are able to find many kinds of latent regulariti… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.

    Comments: Accepted. International Symposium on Forecasting, 2019

  14. arXiv:1809.02978  [pdf, other

    cs.IT

    Time-universal data compression and prediction

    Authors: Boris Ryabko

    Abstract: Suppose there is a large file which should be transmitted (or stored) and there are several (say, m) admissible data-compressors. It seems natural to try all the compressors and then choose the best, i.e. the one that gives the shortest compressed file. Then transfer (or store) the index number of the best compressor (it requires log m bits) and the compressed file.The only problem is the time, wh… ▽ More

    Submitted 9 September, 2018; originally announced September 2018.

  15. arXiv:1705.07730  [pdf, other

    cs.PF

    Application of the Computer Capacity to the Analysis of Processors Evolution

    Authors: Boris Ryabko, Anton Rakitskiy

    Abstract: The notion of computer capacity was proposed in 2012, and this quantity has been estimated for computers of different kinds. In this paper we show that, when designing new processors, the manufacturers change the parameters that affect the computer capacity. This allows us to predict the values of parameters of future processors. As the main example we use Intel processors, due to the accessibil… ▽ More

    Submitted 14 May, 2017; originally announced May 2017.

    ACM Class: B.0

  16. arXiv:1701.04028  [pdf, other

    cs.IT math.ST

    Using data-compressors for statistical analysis of problems on homogeneity testing and classification

    Authors: Boris Ryabko, Andrey Guskov, Irina Selivanova

    Abstract: Nowadays data compressors are applied to many problems of text analysis, but many such applications are developed outside of the framework of mathematical statistics. In this paper we overcome this obstacle and show how several methods of classical mathematical statistics can be developed based on applications of the data compressors.

    Submitted 15 January, 2017; originally announced January 2017.

  17. arXiv:1605.00214  [pdf, ps, other

    cs.CR cs.IT

    Information-Theoretical Analysis of Two Shannon's Ciphers

    Authors: Boris Ryabko

    Abstract: We describe generalized running key ciphers and apply them for analysis of two Shannon's methods. In particular, we suggest some estimation of the cipher equivocation and the probability of correct deciphering without key.

    Submitted 1 May, 2016; originally announced May 2016.

  18. arXiv:1512.06961  [pdf, ps, other

    cs.IT cs.CR math.PR

    Two-faced processes and random number generators

    Authors: Boris Ryabko

    Abstract: We describe random processes (with binary alphabet) whose entropy is less than 1 (per letter), but they mimic true random process, i.e., by definition, generated sequence can be interpreted as the result of the flips of a fair coin with sides that are labeled 0 and 1. It gives a possibility to construct Random Number Generators which possess theoretical guarantees. This, in turn, is important for… ▽ More

    Submitted 22 December, 2015; originally announced December 2015.

  19. arXiv:1509.07776  [pdf, ps, other

    cs.IT math.ST

    Predicting the outcomes of every process for which an asymptotically accurate stationary predictor exists is impossible

    Authors: Daniil Ryabko, Boris Ryabko

    Abstract: The problem of prediction consists in forecasting the conditional distribution of the next outcome given the past. Assume that the source generating the data is such that there is a stationary ergodic predictor whose error converges to zero (in a certain sense). The question is whether there is a universal predictor for all such sources, that is, a predictor whose error goes to zero if any of the… ▽ More

    Submitted 25 September, 2015; originally announced September 2015.

    Comments: appears in the proceedings of ISIT 2015, pp. 1204-1206, Hong Kong

  20. arXiv:1310.3482  [pdf, ps, other

    cs.IT cs.NI

    Using Information Theory to Study the Efficiency and Capacity of Caching in the Computer Networks

    Authors: Boris Ryabko

    Abstract: Nowadays computer networks use different kind of memory whose speeds and capacities vary widely. There exist methods of a so-called caching which are intended to use the different kinds of memory in such a way that the frequently used data are stored in the faster memory, wheres the infrequent ones are stored in the slower memory. We address the problems of estimating the caching efficiency and it… ▽ More

    Submitted 13 October, 2013; originally announced October 2013.

    MSC Class: 68M10 Network design and communication

  21. arXiv:1303.2219  [pdf, ps, other

    cs.CR cs.IT

    The Vernam cipher is robust to small deviations from randomness

    Authors: Boris Ryabko

    Abstract: The Vernam cipher (or one-time pad) has played an important rule in cryptography because it is a perfect secrecy system. For example, if an English text (presented in binary system) $X_1 X_2 ... $ is enciphered according to the formula $Z_i = (X_i + Y_i) \mod 2 $, where $Y_1 Y_2 ...$ is a key sequence generated by the Bernoulli source with equal probabilities of 0 and 1, anyone who knows… ▽ More

    Submitted 9 March, 2013; originally announced March 2013.

    MSC Class: 94-XX Information and communication; circuits

  22. arXiv:1104.2239  [pdf, other

    cs.IT physics.data-an

    Experimental Investigation of Forecasting Methods Based on Universal Measures

    Authors: Boris Ryabko, Pavel Pristavka

    Abstract: We describe and experimentally investigate a method to construct forecasting algorithms for stationary and ergodic processes based on universal measures (or so-called universal data compressors). Using some geophysical and economical time series as examples, we show that the precision of thus obtained predictions is higher than that of known methods.

    Submitted 12 April, 2011; originally announced April 2011.

    Comments: submitted

  23. arXiv:1012.3059  [pdf, ps, other

    cs.IT math.ST

    Confidence Sets in Time-Series Filtering

    Authors: Boris Ryabko, Daniil Ryabko

    Abstract: The problem of filtering of finite-alphabet stationary ergodic time series is considered. A method for constructing a confidence set for the (unknown) signal is proposed, such that the resulting set has the following properties: First, it includes the unknown signal with probability $γ$, where $γ$ is a parameter supplied to the filter. Second, the size of the confidence sets grows exponentially wi… ▽ More

    Submitted 9 July, 2012; v1 submitted 14 December, 2010; originally announced December 2010.

    Comments: some of the results were reported at ISIT2011, St. Petersburg, Russia, pp. 2436-2438

  24. arXiv:1003.3619  [pdf, ps, other

    cs.IT cs.CC

    Using Information Theory to Study the Efficiency and Capacity of Computers and Similar Devices

    Authors: Boris Ryabko

    Abstract: We address the problems of estimating the computer efficiency and the computer capacity. We define the computer efficiency and capacity and suggest a method for their estimation, based on the analysis of processor instructions and kinds of accessible memory. It is shown how the suggested method can be applied to estimate the computer capacity. In particular, this consideration gives a new… ▽ More

    Submitted 18 March, 2010; originally announced March 2010.

    MSC Class: 94A24; 68M01 ACM Class: B.1.2

  25. arXiv:0912.4649  [pdf, other

    cs.IT cs.AI nlin.AO

    The use of ideas of Information Theory for studying "language" and intelligence in ants

    Authors: Boris Ryabko, Zhanna Reznikova

    Abstract: In this review we integrate results of long term experimental study on ant "language" and intelligence which were fully based on fundamental ideas of Information Theory, such as the Shannon entropy, the Kolmogorov complexity, and the Shannon's equation connecting the length of a message ($l$) and its frequency $(p)$, i.e. $l = - \log p$ for rational communication systems. This approach, new for… ▽ More

    Submitted 23 December, 2009; originally announced December 2009.

    Journal ref: Entropy 2009, 11(4), 836-853

  26. arXiv:0901.4023  [pdf, ps, other

    cs.CC cs.CR

    Using Kolmogorov Complexity for Understanding Some Limitations on Steganography

    Authors: Boris Ryabko, Daniil Ryabko

    Abstract: Recently perfectly secure steganographic systems have been described for a wide class of sources of covertexts. The speed of transmission of secret information for these stegosystems is proportional to the length of the covertext. In this work we show that there are sources of covertexts for which such stegosystems do not exist. The key observation is that if the set of possible covertexts has a… ▽ More

    Submitted 26 January, 2009; originally announced January 2009.

  27. arXiv:0809.4743  [pdf, ps, other

    cs.IT cs.DS

    The Imaginary Sliding Window As a New Data Structure for Adaptive Algorithms

    Authors: Boris Ryabko

    Abstract: The scheme of the sliding window is known in Information Theory, Computer Science, the problem of predicting and in stastistics. Let a source with unknown statistics generate some word $... x_{-1}x_{0}x_{1}x_{2}...$ in some alphabet $A$. For every moment $t, t=... $ $-1, 0, 1, ...$, one stores the word ("window") $ x_{t-w} x_{t-w+1}... x_{t-1}$ where $w$,$w \geq 1$, is called "window length". In… ▽ More

    Submitted 27 September, 2008; originally announced September 2008.

    Comments: Published in: Problems of information transmission,1996,v.32,#2

  28. Constructing Perfect Steganographic Systems

    Authors: Boris Ryabko, Daniil Ryabko

    Abstract: We propose steganographic systems for the case when covertexts (containers) are generated by a finite-memory source with possibly unknown statistics. The probability distributions of covertexts with and without hidden information are the same; this means that the proposed stegosystems are perfectly secure, i.e. an observer cannot determine whether hidden information is being transmitted. The speed… ▽ More

    Submitted 11 July, 2011; v1 submitted 9 September, 2008; originally announced September 2008.

    Journal ref: Information and Computation, 2011, Vol. 209, No. 9, pp. 1223-1230

  29. arXiv:0809.1226  [pdf, ps, other

    cs.IT cs.AI math.ST

    Applications of Universal Source Coding to Statistical Analysis of Time Series

    Authors: Boris Ryabko

    Abstract: We show how universal codes can be used for solving some of the most important statistical problems for time series. By definition, a universal code (or a universal lossless data compressor) can compress any sequence generated by a stationary and ergodic source asymptotically to the Shannon entropy, which, in turn, is the best achievable ratio for lossless data compressors. We consider finite-… ▽ More

    Submitted 7 September, 2008; originally announced September 2008.

    Comments: accepted for publication

  30. arXiv:0804.0510  [pdf, ps, other

    cs.IT math.ST

    Nonparametric Statistical Inference for Ergodic Processes

    Authors: Daniil Ryabko, Boris Ryabko

    Abstract: In this work a method for statistical analysis of time series is proposed, which is used to obtain solutions to some classical problems of mathematical statistics under the only assumption that the process generating the data is stationary ergodic. Namely, three problems are considered: goodness-of-fit (or identity) testing, process classification, and the change point problem. For each of the pro… ▽ More

    Submitted 3 April, 2012; v1 submitted 3 April, 2008; originally announced April 2008.

    Comments: Conference version in: D. Ryabko, B. Ryabko, On hypotheses testing for ergodic processes, in Proceedgings of Information Theory Workshop, 2008, Porto, Portugal, pp. 281-283

    Journal ref: IEEE Transactions on Information Theory 56, 3 (2010) 1430-1435

  31. arXiv:0708.2893  [pdf

    cs.IT

    Fast Recursive Coding Based on Grou** of Symbols

    Authors: Nikolay Ponomarenko, Vladimir Lukin, Karen Egiazarian, Jaakko Astola, Boris Y Ryabko

    Abstract: A novel fast recursive coding technique is proposed. It operates with only integer values not longer 8 bits and is multiplication free. Recursion the algorithm is based on indirectly provides rather effective coding of symbols for very large alphabets. The code length for the proposed technique can be up to 20-30% less than for arithmetic coding and, in the worst case it is only by 1-3% larger.

    Submitted 21 August, 2007; originally announced August 2007.

    Comments: 3 pages, submitted to IEEE Transactions on Information Theory

    ACM Class: E.4

  32. arXiv:cs/0701036  [pdf, ps, other

    cs.IT

    Compression-based methods for nonparametric density estimation, on-line prediction, regression and classification for time series

    Authors: Boris Ryabko

    Abstract: We address the problem of nonparametric estimation of characteristics for stationary and ergodic time series. We consider finite-alphabet time series and real-valued ones and the following four problems: i) estimation of the (limiting) probability (or estimation of the density for real-valued time series), ii) on-line prediction, iii) regression and iv) classification (or so-called problems with… ▽ More

    Submitted 1 November, 2007; v1 submitted 7 January, 2007; originally announced January 2007.

  33. arXiv:cs/0606085  [pdf, ps, other

    cs.CR

    Provably Secure Universal Steganographic Systems

    Authors: Boris Ryabko, Daniil Ryabko

    Abstract: We propose a simple universal (that is, distribution--free) steganographic system in which covertexts with and without hidden texts are statistically indistinguishable. The stegosystem can be applied to any source generating i.i.d. covertexts with unknown distribution, and the hidden text is transmitted exactly, with zero probability of error. Moreover, the proposed steganographic system has two… ▽ More

    Submitted 20 June, 2006; originally announced June 2006.

    Comments: 11 pages

    Report number: Cryptology ePrint Archive, Report 2006/063

  34. arXiv:cs/0602084  [pdf, ps, other

    cs.IT

    Universal Codes as a Basis for Time Series Testing

    Authors: Boris Ryabko, Jaakko Astola

    Abstract: We suggest a new approach to hypothesis testing for ergodic and stationary processes. In contrast to standard methods, the suggested approach gives a possibility to make tests, based on any lossless data compression method even if the distribution law of the codeword lengths is not known. We apply this approach to the following four problems: goodness-of-fit testing (or identity testing), testin… ▽ More

    Submitted 25 February, 2006; originally announced February 2006.

    Comments: accepted for "Statistical Methodology" (Elsevier)

  35. arXiv:cs/0601069  [pdf, ps, other

    cs.CC cs.DM

    Fast Enumeration of Combinatorial Objects

    Authors: Boris Ryabko

    Abstract: The problem of ranking can be described as follows. We have a set of combinatorial objects $S$, such as, say, the k-subsets of n things, and we can imagine that they have been arranged in some list, say lexicographically, and we want to have a fast method for obtaining the rank of a given object in the list. This problem is widely known in Combinatorial Analysis, Computer Science and Information… ▽ More

    Submitted 15 January, 2006; originally announced January 2006.

    Journal ref: published in Discrete Math.and Applications, v.10, n2, 1998

  36. arXiv:cs/0506094  [pdf, ps, other

    cs.IT

    Universal Codes as a Basis for Nonparametric Testing of Serial Independence for Time Series

    Authors: Boris Ryabko, Jaakko Astola

    Abstract: We consider a stationary and ergodic source $p$ generated symbols $x_1 ... x_t$ from some finite set $A$ and a null hypothesis $H_0$ that $p$ is Markovian source with memory (or connectivity) not larger than $m, (m >= 0).$ The alternative hypothesis $H_1$ is that the sequence is generated by a stationary and ergodic source, which differs from the source under $H_0$. In particular, if $m= 0$ we h… ▽ More

    Submitted 26 June, 2005; originally announced June 2005.

    Comments: accepted for ISIT'05

  37. arXiv:cs/0505079  [pdf, ps, other

    cs.CC

    Application of Kolmogorov complexity and universal codes to identity testing and nonparametric testing of serial independence for time series

    Authors: Boris Ryabko, Jaakko Astola, Alex Gammerman

    Abstract: We show that Kolmogorov complexity and such its estimators as universal codes (or data compression methods) can be applied for hypotheses testing in a framework of classical mathematical statistics. The methods for identity testing and nonparametric testing of serial independence for time series are suggested.

    Submitted 29 May, 2005; originally announced May 2005.

    Comments: submitted

  38. arXiv:cs/0504079  [pdf, ps, other

    cs.IT

    Prediction of Large Alphabet Processes and Its Application to Adaptive Source Coding

    Authors: Boris Ryabko, Jaakko Astola

    Abstract: The problem of predicting a sequence $x_1,x_2,...$ generated by a discrete source with unknown statistics is considered. Each letter $x_{t+1}$ is predicted using information on the word $x_1x_2... x_t$ only. In fact, this problem is a classical problem which has received much attention. Its history can be traced back to Laplace. We address the problem where each $x_i$ belongs to some large (or e… ▽ More

    Submitted 21 April, 2005; v1 submitted 17 April, 2005; originally announced April 2005.

    Comments: submitted

  39. arXiv:cs/0504006  [pdf, ps, other

    cs.IT

    Using Information Theory Approach to Randomness Testing

    Authors: B. Ya. Ryabko, V. A. Monarev

    Abstract: We address the problem of detecting deviations of binary sequence from randomness,which is very important for random number (RNG) and pseudorandom number generators (PRNG). Namely, we consider a null hypothesis $H_0$ that a given bit sequence is generated by Bernoulli source with equal probabilities of 0 and 1 and the alternative hypothesis $H_1$ that the sequence is generated by a stationary an… ▽ More

    Submitted 3 April, 2005; originally announced April 2005.

    Comments: Journal of Statistical Planning and Inference,2005, (accepted)

  40. arXiv:cs/0504005  [pdf, ps, other

    cs.IT

    Fast Codes for Large Alphabets

    Authors: Boris Ryabko, Jaakko Astola, Karen Egiazarian

    Abstract: We address the problem of constructing a fast lossless code in the case when the source alphabet is large. The main idea of the new scheme may be described as follows. We group letters with small probabilities in subsets (acting as super letters) and use time consuming coding for these subsets only, whereas letters in the subsets have the same code length and therefore can be coded fast. The des… ▽ More

    Submitted 2 April, 2005; originally announced April 2005.

    Comments: published