Skip to main content

Showing 1–45 of 45 results for author: Duda, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05097  [pdf, other

    cs.LG stat.ML

    Biology-inspired joint distribution neurons based on Hierarchical Correlation Reconstruction allowing for multidirectional neural networks

    Authors: Jarek Duda

    Abstract: Biological neural networks seem qualitatively superior (e.g. in learning, flexibility, robustness) from current artificial like Multi-Layer Perceptron (MLP) or Kolmogorov-Arnold Network (KAN). Simultaneously, in contrast to them: have fundamentally multidirectional signal propagation~\cite{axon}, also of probability distributions e.g. for uncertainty estimation, and are believed not being able to… ▽ More

    Submitted 1 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures

  2. arXiv:2402.04916  [pdf, other

    cs.CC

    Simple inexpensive vertex and edge invariants distinguishing dataset strongly regular graphs

    Authors: Jarek Duda

    Abstract: While standard Weisfeiler-Leman vertex labels are not able to distinguish even vertices of regular graphs, there is proposed and tested family of inexpensive polynomial time vertex and edge invariants, distinguishing much more difficult SRGs (strongly regular graphs), also often their vertices. Among 43717 SRGs from dataset by Edward Spence, proposed vertex invariants alone were able to distinguis… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures

  3. arXiv:2311.13431  [pdf, other

    stat.ML cs.IT cs.LG

    Extracting individual variable information for their decoupling, direct mutual information and multi-feature Granger causality

    Authors: Jarek Duda

    Abstract: Working with multiple variables they usually contain difficult to control complex dependencies. This article proposes extraction of their individual information, e.g. $\overline{X|Y}$ as random variable containing information from $X$, but with removed information about $Y$, by using $(x,y) \leftrightarrow (\bar{x}=\textrm{CDF}_{X|Y=y}(x),y)$ reversible normalization. One application can be decoup… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 3 pages, 1 figure

  4. arXiv:2305.09478  [pdf, other

    eess.SP cs.LG q-bio.NC

    Time delay multi-feature correlation analysis to extract subtle dependencies from EEG signals

    Authors: Jarek Duda

    Abstract: Electroencephalography (EEG) signals are resultants of extremely complex brain activity. Some details of this hidden dynamics might be accessible through e.g. joint distributions $ρ_{Δt}$ of signals of pairs of electrodes shifted by various time delays (lag $Δt$). A standard approach is monitoring a single evaluation of such joint distributions, like Pearson correlation (or mutual information), wh… ▽ More

    Submitted 29 May, 2023; v1 submitted 24 April, 2023; originally announced May 2023.

    Comments: 7 pages, 7 figures

  5. arXiv:2304.03069  [pdf, other

    stat.ME cs.LG econ.EM stat.ML

    Adaptive Student's t-distribution with method of moments moving estimator for nonstationary time series

    Authors: Jarek Duda

    Abstract: The real life time series are usually nonstationary, bringing a difficult question of model adaptation. Classical approaches like ARMA-ARCH assume arbitrary type of dependence. To avoid such bias, we will focus on recently proposed agnostic philosophy of moving estimator: in time $t$ finding parameters optimizing e.g. $F_t=\sum_{τ<t} (1-η)^{t-τ} \ln(ρ_θ(x_τ))$ moving log-likelihood, evolving in ti… ▽ More

    Submitted 12 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 5 pages, 6 figures

  6. arXiv:2209.10043  [pdf, other

    cs.LG cs.AI eess.IV q-bio.QM

    SynthA1c: Towards Clinically Interpretable Patient Representations for Diabetes Risk Stratification

    Authors: Michael S. Yao, Allison Chae, Matthew T. MacLean, Anurag Verma, Jeffrey Duda, James Gee, Drew A. Torigian, Daniel Rader, Charles Kahn, Walter R. Witschey, Hersh Sagreiya

    Abstract: Early diagnosis of Type 2 Diabetes Mellitus (T2DM) is crucial to enable timely therapeutic interventions and lifestyle modifications. As the time available for clinical office visits shortens and medical imaging data become more widely available, patient image data could be used to opportunistically identify patients for additional T2DM diagnostic workup by physicians. We investigated whether imag… ▽ More

    Submitted 27 July, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: 12 pages. Accepted to PRIME MICCAI 2023

  7. arXiv:2209.06211  [pdf, other

    q-bio.QM cs.LG

    Predicting probability distributions for cancer therapy drug selection optimization

    Authors: Jarek Duda

    Abstract: Large variability between cell lines brings a difficult optimization problem of drug selection for cancer therapy. Standard approaches use prediction of value for this purpose, corresponding e.g. to expected value of their distribution. This article shows superiority of working on, predicting the entire probability distributions - proposing basic tools for this purpose. We are mostly interested in… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 4 pages, 4 figures

  8. Compression Optimality of Asymmetric Numeral Systems

    Authors: Josef Pieprzyk, Jarek Duda, Marcin Pawlowski, Seyit Camtepe, Arash Mahboubi, Pawel Morawiecki

    Abstract: Compression also known as entropy coding has a rich and long history. However, a recent explosion of multimedia Internet applications (such as teleconferencing and video streaming for instance) renews an interest in fast compression that also squeezes out as much redundancy as possible. In 2009 Jarek Duda invented his asymmetric numeral system (ANS). Apart from a beautiful mathematical structure,… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  9. arXiv:2207.11174  [pdf, other

    q-bio.BM cs.LG q-bio.QM

    Low cost prediction of probability distributions of molecular properties for early virtual screening

    Authors: Jarek Duda, Sabina Podlewska

    Abstract: While there is a general focus on predictions of values, mathematically more appropriate is prediction of probability distributions: with additional possibilities like prediction of uncertainty, higher moments and quantiles. For the purpose of the computer-aided drug design field, this article applies Hierarchical Correlation Reconstruction approach, previously applied in the analysis of demograph… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: 5 pages, 6 figures

  10. arXiv:2206.06194  [pdf, other

    cs.LG astro-ph.GA astro-ph.HE

    Predicting conditional probability distributions of redshifts of Active Galactic Nuclei using Hierarchical Correlation Reconstruction

    Authors: Jarek Duda

    Abstract: While there is a general focus on prediction of values, real data often only allows to predict conditional probability distributions, with capabilities bounded by conditional entropy $H(Y|X)$. If additionally estimating uncertainty, we can treat a predicted value as the center of Gaussian of Laplace distribution - idealization which can be far from complex conditional distributions of real data. T… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 5 pages, 6 figures

  11. arXiv:2204.08242  [pdf, other

    cs.LG

    Fast optimization of common basis for matrix set through Common Singular Value Decomposition

    Authors: Jarek Duda

    Abstract: SVD (singular value decomposition) is one of the basic tools of machine learning, allowing to optimize basis for a given matrix. However, sometimes we have a set of matrices $\{A_k\}_k$ instead, and would like to optimize a single common basis for them: find orthogonal matrices $U$, $V$, such that $\{U^T A_k V\}$ set of matrices is somehow simpler. For example DCT-II is orthonormal basis of functi… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 4 pages, 3 figures

  12. arXiv:2201.05028  [pdf, other

    cs.IT q-bio.GN

    Context binning, model clustering and adaptivity for data compression of genetic data

    Authors: Jarek Duda

    Abstract: Rapid growth of genetic databases means huge savings from improvements in their data compression, what requires better inexpensive statistical models. This article proposes automatized optimizations e.g. of Markov-like models, especially context binning and model clustering. While it is popular to just remove low bits of the context, proposed context binning automatically optimizes such reduction… ▽ More

    Submitted 3 May, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: 7 pages, 7 figures

  13. arXiv:2106.06438  [pdf, other

    cs.IT

    Encoding of probability distributions for Asymmetric Numeral Systems

    Authors: Jarek Duda

    Abstract: Many data compressors regularly encode probability distributions for entropy coding - requiring minimal description length type of optimizations. Canonical prefix/Huffman coding usually just writes lengths of bit sequences, this way approximating probabilities with powers-of-2. Operating on more accurate probabilities usually allows for better compression ratios, and is possible e.g. using arithme… ▽ More

    Submitted 4 July, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: 7 pages, 6 figures

  14. arXiv:2007.12055  [pdf, other

    cs.IT eess.IV

    Improving distribution and flexible quantization for DCT coefficients

    Authors: Jarek Duda

    Abstract: While it is a common knowledge that AC coefficients of Fourier-related transforms, like DCT-II of JPEG image compression, are from Laplace distribution, there was tested more general EPD (exponential power distribution) $ρ\sim \exp(-(|x-μ|/σ)^κ)$ family, leading to maximum likelihood estimated (MLE) $κ\approx 0.5$ instead of Laplace distribution $κ=1$ - such replacement gives $\approx 0.1$ bits/va… ▽ More

    Submitted 22 February, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 11 pages, 13 figures

  15. arXiv:2004.03391  [pdf, other

    eess.IV cs.LG cs.MM stat.ML

    Exploiting context dependence for image compression with upsampling

    Authors: Jarek Duda

    Abstract: Image compression with upsampling encodes information to succeedingly increase image resolution, for example by encoding differences in FUIF and JPEG XL. It is useful for progressive decoding, also often can improve compression ratio - both for lossless compression and e.g. DC coefficients of lossy. However, the currently used solutions rather do not exploit context dependence for encoding of such… ▽ More

    Submitted 13 July, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 6 pages, 4 figures

  16. arXiv:1912.13300  [pdf, other

    cs.IT cond-mat.stat-mech

    Nearly accurate solutions for Ising-like models using Maximal Entropy Random Walk

    Authors: Jarek Duda

    Abstract: While one-dimensional Markov processes are well understood, going to higher dimensions there are only a few analytically solved Ising-like models, in practice requiring to use relatively costly, uncontrollable and inaccurate Monte-Carlo methods. There is discussed analytical approach for e.g. $width\times \infty$ approximation of lattice, also exploiting Hammersley-Clifford theorem to generate ran… ▽ More

    Submitted 19 April, 2021; v1 submitted 31 December, 2019; originally announced December 2019.

    Comments: 7 pages, 4 figures

  17. arXiv:1911.02361  [pdf, other

    q-fin.TR cs.LG q-fin.ST stat.ML

    Modelling bid-ask spread conditional distributions using hierarchical correlation reconstruction

    Authors: Jarosław Duda, Robert Syrek, Henryk Gurgul

    Abstract: While we would like to predict exact values, available incomplete information is rarely sufficient - usually allowing only to predict conditional probability distributions. This article discusses hierarchical correlation reconstruction (HCR) methodology for such prediction on example of usually unavailable bid-ask spreads, predicted from more accessible data like closing price, volume, high/low pr… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

    Comments: 10 pages, 7 figures

  18. arXiv:1907.07063  [pdf, other

    cs.LG stat.ML

    SGD momentum optimizer with step estimation by online parabola model

    Authors: Jarek Duda

    Abstract: In stochastic gradient descent, especially for neural network training, there are currently dominating first order methods: not modeling local distance to minimum. This information required for optimal step size is provided by second order methods, however, they have many difficulties, starting with full Hessian having square of dimension number of coefficients. This article proposes a minimal s… ▽ More

    Submitted 9 December, 2019; v1 submitted 16 July, 2019; originally announced July 2019.

    Comments: 7 pages, 2 figures

  19. arXiv:1906.03238  [pdf, other

    eess.IV cs.MM

    Parametric context adaptive Laplace distribution for multimedia compression

    Authors: Jarek Duda

    Abstract: Data compression often subtracts prediction and encodes the difference (residue) e.g. assuming Laplace distribution, for example for images, videos, audio, or numerical data. Its performance is strongly dependent on the proper choice of width (scale parameter) of this parametric distribution, can be improved if optimizing it based on local situation like context. For example in popular LOCO-I \cit… ▽ More

    Submitted 14 October, 2019; v1 submitted 28 May, 2019; originally announced June 2019.

    Comments: 8 pages, 4 figures

  20. arXiv:1903.12286  [pdf, other

    stat.ML cs.LG

    Toroidal AutoEncoder

    Authors: Maciej Mikulski, Jaroslaw Duda

    Abstract: Enforcing distributions of latent variables in neural networks is an active subject. It is vital in all kinds of generative models, where we want to be able to interpolate between points in the latent space, or sample from it. Modern generative AutoEncoders (AE) like WAE, SWAE, CWAE add a regularizer to the standard (deterministic) AE, which allows to enforce Gaussian distribution in the latent sp… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: 5 pages, 5 figures

  21. arXiv:1901.11457  [pdf, other

    cs.LG stat.ML

    Improving SGD convergence by online linear regression of gradients in multiple statistically relevant directions

    Authors: Jarek Duda

    Abstract: Deep neural networks are usually trained with stochastic gradient descent (SGD), which minimizes objective function using very rough approximations of gradient, only averaging to the real gradient. Standard approaches like momentum or ADAM only consider a single direction, and do not try to model distance from extremum - neglecting valuable information from calculated sequence of gradients, often… ▽ More

    Submitted 13 March, 2023; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: 14 pages, 7 figures

  22. arXiv:1812.08040  [pdf, other

    cs.LG stat.ML

    Credibility evaluation of income data with hierarchical correlation reconstruction

    Authors: Jarek Duda, Adam Szulc

    Abstract: In situations like tax declarations or analyzes of household budgets we would like to automatically evaluate credibility of exogenous variable (declared income) based on some available (endogenous) variables - we want to build a model and train it on provided data sample to predict (conditional) probability distribution of exogenous variable based on values of endogenous variables. Using Polish ho… ▽ More

    Submitted 21 April, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 7 pages, 6 figures

  23. arXiv:1811.04751  [pdf, other

    cs.LG stat.ML

    Gaussian AutoEncoder

    Authors: Jarek Duda

    Abstract: Generative AutoEncoders require a chosen probability distribution in latent space, usually multivariate Gaussian. The original Variational AutoEncoder (VAE) uses randomness in encoder - causing problematic distortion, and overlaps in latent space for distinct inputs. It turned out unnecessary: we can instead use deterministic encoder with additional regularizer to ensure that sample distribution i… ▽ More

    Submitted 14 January, 2019; v1 submitted 12 November, 2018; originally announced November 2018.

    Comments: 6 pages, 2 figures

  24. arXiv:1807.04119  [pdf, other

    cs.LG stat.ML

    Exploiting statistical dependencies of time series with hierarchical correlation reconstruction

    Authors: Jarek Duda

    Abstract: While we are usually focused on forecasting future values of time series, it is often valuable to additionally predict their entire probability distributions, e.g. to evaluate risk, Monte Carlo simulations. On example of time series of $\approx$ 30000 Dow Jones Industrial Averages, there will be presented application of hierarchical correlation reconstruction for this purpose: MSE estimating polyn… ▽ More

    Submitted 23 January, 2019; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: 10 pages, 13 figures

  25. arXiv:1804.06218  [pdf, other

    cs.LG stat.ML

    Hierarchical correlation reconstruction with missing data, for example for biology-inspired neuron

    Authors: Jarek Duda

    Abstract: Machine learning often needs to model density from a multidimensional data sample, including correlations between coordinates. Additionally, we often have missing data case: that data points can miss values for some of coordinates. This article adapts rapid parametric density estimation approach for this purpose: modelling density as a linear combination of orthonormal functions, for which $L^2$ o… ▽ More

    Submitted 27 May, 2018; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: 7 pages, 5 figures

  26. arXiv:1801.01058  [pdf, other

    cs.LG

    Polynomial-based rotation invariant features

    Authors: Jarek Duda

    Abstract: One of basic difficulties of machine learning is handling unknown rotations of objects, for example in image recognition. A related problem is evaluation of similarity of shapes, for example of two chemical molecules, for which direct approach requires costly pairwise rotation alignment and comparison. Rotation invariants are useful tools for such purposes, allowing to extract features describing… ▽ More

    Submitted 3 January, 2018; originally announced January 2018.

    Comments: 6 pages, 3 figures

  27. arXiv:1705.05285  [pdf, other

    math.OC cs.IT

    Improving Pyramid Vector Quantizer with power projection

    Authors: Jarek Duda

    Abstract: Pyramid Vector Quantizer (PVQ) is a promising technique especially for multimedia data compression, already used in Opus audio codec and considered for AV1 video codec. It quantizes vectors from Euclidean unit sphere by first projecting them to $L^1$ norm unit sphere, then quantizing and encoding there. This paper shows that the used standard radial projection is suboptimal and proposes to tune it… ▽ More

    Submitted 3 May, 2017; originally announced May 2017.

    Comments: 3 pages, 4 figures

  28. arXiv:1703.04456  [pdf, other

    cs.CC

    P?=NP as minimization of degree 4 polynomial, integration or Grassmann number problem, and new graph isomorphism problem approaches

    Authors: Jarek Duda

    Abstract: While the P vs NP problem is mainly approached form the point of view of discrete mathematics, this paper proposes reformulations into the field of abstract algebra, geometry, fourier analysis and of continuous global optimization - which advanced tools might bring new perspectives and approaches for this question. The first one is equivalence of satisfaction of 3-SAT problem with the question of… ▽ More

    Submitted 24 October, 2022; v1 submitted 13 March, 2017; originally announced March 2017.

    Comments: 20 pages, 10 figures

  29. arXiv:1702.02144  [pdf, other

    cs.LG

    Rapid parametric density estimation

    Authors: Jarek Duda

    Abstract: Parametric density estimation, for example as Gaussian distribution, is the base of the field of statistics. Machine learning requires inexpensive estimation of much more complex densities, and the basic approach is relatively costly maximum likelihood estimation (MLE). There will be discussed inexpensive density estimation, for example literally fitting a polynomial (or Fourier series) to the sam… ▽ More

    Submitted 20 February, 2017; v1 submitted 7 February, 2017; originally announced February 2017.

    Comments: 8 pages, 4 figures

  30. Lightweight compression with encryption based on Asymmetric Numeral Systems

    Authors: Jarek Duda, Marcin Niemiec

    Abstract: Data compression combined with effective encryption is a common requirement of data storage and transmission. Low cost of these operations is often a high priority in order to increase transmission speed and reduce power usage. This requirement is crucial for battery-powered devices with limited resources, such as autonomous remote sensors or implants. Well-known and popular encryption techniques… ▽ More

    Submitted 14 December, 2016; originally announced December 2016.

    Comments: 10 pages, 6 figures

    Journal ref: AMCS, vol. 33 (2023)

  31. arXiv:1610.06023  [pdf, other

    cs.DS

    Practical estimation of rotation distance and induced partial order for binary trees

    Authors: Jarek Duda

    Abstract: Tree rotations (left and right) are basic local deformations allowing to transform between two unlabeled binary trees of the same size. Hence, there is a natural problem of practically finding such transformation path with low number of rotations, the optimal minimal number is called the rotation distance. Such distance could be used for instance to quantify similarity between two trees for variou… ▽ More

    Submitted 19 October, 2016; originally announced October 2016.

    Comments: 5 pages, 8 figures

  32. arXiv:1608.04271  [pdf, other

    cs.IT

    Nonuniform probability modulation for reducing energy consumption of remote sensors

    Authors: Jarek Duda

    Abstract: One of the main goals of 5G wireless telecommunication technology is improving energy efficiency, especially of remote sensors which should be able for example to transmit on average 1bit/s for 10 years from a single AAA battery. There will be discussed using modulation with nonuniform probability distribution of symbols for improving energy efficiency of transmission at cost of reduced throughput… ▽ More

    Submitted 15 August, 2016; originally announced August 2016.

    Comments: 7 pages, 4 figures

  33. arXiv:1602.05889  [pdf, other

    cs.DS cs.IT

    Distortion-Resistant Hashing for rapid search of similar DNA subsequence

    Authors: Jarek Duda

    Abstract: One of the basic tasks in bioinformatics is localizing a short subsequence $S$, read while sequencing, in a long reference sequence $R$, like the human geneome. A natural rapid approach would be finding a hash value for $S$ and compare it with a prepared database of hash values for each of length $|S|$ subsequences of $R$. The problem with such approach is that it would only spot a perfect match,… ▽ More

    Submitted 18 February, 2016; originally announced February 2016.

    Comments: 5 pages, 4 figures

  34. arXiv:1601.02420  [pdf, other

    cs.IT

    Fundamental Bounds and Approaches to Sequence Reconstruction from Nanopore Sequencers

    Authors: Jarek Duda, Wojciech Szpankowski, Ananth Grama

    Abstract: Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel erro… ▽ More

    Submitted 11 January, 2016; originally announced January 2016.

    Comments: 12 pages, 5 figures

  35. arXiv:1511.00856  [pdf, other

    cs.IT

    Designing dedicated data compression for physics experiments within FPGA already used for data acquisition

    Authors: Jarek Duda, Grzegorz Korcyl

    Abstract: Physics experiments produce enormous amount of raw data, counted in petabytes per day. Hence, there is large effort to reduce this amount, mainly by using some filters. The situation can be improved by additionally applying some data compression techniques: removing redundancy and optimally encoding the actual information. Preferably, both filtering and data compression should fit in FPGA already… ▽ More

    Submitted 3 November, 2015; originally announced November 2015.

    Comments: 7 pages, 5 figures

  36. arXiv:1509.09211  [pdf, other

    cs.CE

    Normalized rotation shape descriptors and lossy compression of molecular shape

    Authors: Jarek Duda

    Abstract: There is a common need to search of molecular databases for compounds resembling some shape, what suggests having similar biological activity while searching for new drugs. The large size of the databases requires fast methods for such initial screening, for example based on feature vectors constructed to fulfill the requirement that similar molecules should correspond to close vectors. Ultrafast… ▽ More

    Submitted 30 September, 2015; originally announced September 2015.

    Comments: 10 pages, 10 figures

  37. arXiv:1505.07056  [pdf, other

    cs.IT

    Joint error correction enhancement of the fountain codes concept

    Authors: Jarek Duda

    Abstract: Fountain codes like LT or Raptor codes, also known as rateless erasure codes, allow to encode a message as some number of packets, such that any large enough subset of these packets is sufficient to fully reconstruct the message. It requires undamaged packets, while the packets which were not lost are usually damaged in real scenarios. Hence, an additional error correction layer is often required:… ▽ More

    Submitted 18 August, 2015; v1 submitted 20 May, 2015; originally announced May 2015.

    Comments: 14 pages, 9 figures

  38. arXiv:1311.2540  [pdf, other

    cs.IT

    Asymmetric numeral systems: entropy coding combining speed of Huffman coding with compression rate of arithmetic coding

    Authors: Jarek Duda

    Abstract: The modern data compression is mainly based on two approaches to entropy coding: Huffman (HC) and arithmetic/range coding (AC). The former is much faster, but approximates probabilities with powers of 2, usually leading to relatively low compression rates. The latter uses nearly exact probabilities - easily approaching theoretical compression rate limit (Shannon entropy), but at cost of much large… ▽ More

    Submitted 6 January, 2014; v1 submitted 11 November, 2013; originally announced November 2013.

    Comments: 24 pages, 12 figures

  39. arXiv:1211.1572  [pdf, other

    cs.IT cs.CR cs.MM

    Embedding grayscale halftone pictures in QR Codes using Correction Trees

    Authors: Jarek Duda

    Abstract: Barcodes like QR Codes have made that encoded messages have entered our everyday life, what suggests to attach them a second layer of information: directly available to human receiver for informational or marketing purposes. We will discuss a general problem of using codes with chosen statistical constrains, for example reproducing given grayscale picture using halftone technique. If both sender a… ▽ More

    Submitted 2 December, 2012; v1 submitted 7 November, 2012; originally announced November 2012.

    Comments: 16 pages, 6 figures

  40. arXiv:1206.4555  [pdf, other

    cs.IT cs.DB cs.DS math.CO

    Optimal compression of hash-origin prefix trees

    Authors: Jarek Duda

    Abstract: There is a common problem of operating on hash values of elements of some database. In this paper there will be analyzed informational content of such general task and how to practically approach such found lower boundaries. Minimal prefix tree which distinguish elements turns out to require asymptotically only about 2.77544 bits per element, while standard approaches use a few times more. While b… ▽ More

    Submitted 8 July, 2012; v1 submitted 20 June, 2012; originally announced June 2012.

    Comments: 13 pages, 3 figures, 1 table

  41. arXiv:1204.5317  [pdf, other

    cs.IT

    Correction Trees as an Alternative to Turbo Codes and Low Density Parity Check Codes

    Authors: Jarosław Duda, Paweł Korus

    Abstract: The rapidly improving performance of modern hardware renders convolutional codes obsolete, and allows for the practical implementation of more sophisticated correction codes such as low density parity check (LDPC) and turbo codes (TC). Both are decoded by iterative algorithms, which require a disproportional computational effort for low channel noise. They are also unable to correct higher noise l… ▽ More

    Submitted 24 May, 2012; v1 submitted 24 April, 2012; originally announced April 2012.

    Comments: 14 pages, 7 figures, submitted to IEEE Transactions on Information Theory

  42. arXiv:0902.0271  [pdf, other

    cs.IT cs.CR math.GM

    Asymmetric numeral systems

    Authors: Jarek Duda

    Abstract: In this paper will be presented new approach to entropy coding: family of generalizations of standard numeral systems which are optimal for encoding sequence of equiprobable symbols, into asymmetric numeral systems - optimal for freely chosen probability distributions of symbols. It has some similarities to Range Coding but instead of encoding symbol in choosing a range, we spread these ranges u… ▽ More

    Submitted 21 May, 2009; v1 submitted 2 February, 2009; originally announced February 2009.

    Comments: 47 pages, 6 figures

  43. arXiv:0804.3615  [pdf, ps, other

    cs.CC cs.DS

    Combinatorial invariants for graph isomorphism problem

    Authors: Jarek Duda

    Abstract: Presented approach in polynomial time calculates large number of invariants for each vertex, which won't change with graph isomorphism and should fully determine the graph. For example numbers of closed paths of length k for given starting vertex, what can be though as the diagonal terms of k-th power of the adjacency matrix. For k=2 we would get degree of verities invariant, higher describes lo… ▽ More

    Submitted 19 May, 2008; v1 submitted 22 April, 2008; originally announced April 2008.

  44. arXiv:0712.1309  [pdf, other

    math.DS cs.DM

    Complex base numeral systems

    Authors: Jarek Duda

    Abstract: In this paper will be introduced large, probably complete family of complex base systems, which are 'proper' - for each point of the space there is a representation which is unique for all but some zero measure set. The condition defining this family is the periodicity - we get periodic covering of the plane by fractals in hexagonal-type structure, what can be used for example in image compressi… ▽ More

    Submitted 24 February, 2008; v1 submitted 10 December, 2007; originally announced December 2007.

    Comments: 19 pages, 7 figures

  45. arXiv:0710.3861  [pdf, other

    cs.IT

    Optimal encoding on discrete lattice with translational invariant constrains using statistical algorithms

    Authors: Jarek Duda

    Abstract: In this paper will be presented methodology of encoding information in valuations of discrete lattice with some translational invariant constrains in asymptotically optimal way. The method is based on finding statistical description of such valuations and changing it into statistical algorithm, which allows to construct deterministically valuation with given statistics. Optimal statistics allow… ▽ More

    Submitted 2 November, 2008; v1 submitted 20 October, 2007; originally announced October 2007.

    Comments: 39 pages, 8 figures Submitted to IEEE Information Theory