-
Path continuity for multi-wheeled AGVs
Authors:
Mirko Kokot,
Damjan Miklić,
Tamara Petrović
Abstract:
Notwithstanding the growing presence of AGVs in the industry, there is a lack of research about multi-wheeled AGVs which offer higher maneuverability and space efficiency. In this paper, we present generalized path continuity conditions as a continuation of previous research done for vehicles with more constrained kinematic capabilities. We propose a novel approach for analytically defining variou…
▽ More
Notwithstanding the growing presence of AGVs in the industry, there is a lack of research about multi-wheeled AGVs which offer higher maneuverability and space efficiency. In this paper, we present generalized path continuity conditions as a continuation of previous research done for vehicles with more constrained kinematic capabilities. We propose a novel approach for analytically defining various kinematic modes (motion modes), that AGVs with multiple steer and drive wheels can utilize. This approach enables deriving vehicle kinematic equations based on the vehicle configuration and its constraints, path shape, and corresponding motion mode. Finally, we derive general continuity conditions for paths that multi-wheeled AGVs can follow, and show through examples how they can be utilized in layout design methods.
△ Less
Submitted 2 March, 2021;
originally announced March 2021.
-
Even better correction of genome sequencing data
Authors:
Maciej Dlugosz,
Sebastian Deorowicz,
Marek Kokot
Abstract:
We introduce an improved version of RECKONER, an error corrector for Illumina whole genome sequencing data. By modifying its workflow we reduce the computation time even 10 times. We also propose a new method of determination of $k$-mer length, the key parameter of $k$-spectrum-based family of correctors. The correction algorithms are examined on huge data sets, i.e., human and maize genomes for b…
▽ More
We introduce an improved version of RECKONER, an error corrector for Illumina whole genome sequencing data. By modifying its workflow we reduce the computation time even 10 times. We also propose a new method of determination of $k$-mer length, the key parameter of $k$-spectrum-based family of correctors. The correction algorithms are examined on huge data sets, i.e., human and maize genomes for both Illumina HiSeq and MiSeq instruments.
△ Less
Submitted 2 March, 2017;
originally announced March 2017.
-
Even faster sorting of (not only) integers
Authors:
Marek Kokot,
Sebastian Deorowicz,
Maciej Dlugosz
Abstract:
In this paper we introduce RADULS2, the fastest parallel sorter based on radix algorithm. It is optimized to process huge amounts of data making use of modern multicore CPUs. The main novelties include: extremely optimized algorithm for handling tiny arrays (up to about a hundred of records) that could appear even billions times as subproblems to handle and improved processing of larger subarrays…
▽ More
In this paper we introduce RADULS2, the fastest parallel sorter based on radix algorithm. It is optimized to process huge amounts of data making use of modern multicore CPUs. The main novelties include: extremely optimized algorithm for handling tiny arrays (up to about a hundred of records) that could appear even billions times as subproblems to handle and improved processing of larger subarrays with better use of non-temporal memory stores.
△ Less
Submitted 2 March, 2017;
originally announced March 2017.
-
KMC 3: counting and manipulating k-mer statistics
Authors:
Marek Kokot,
Maciej Długosz,
Sebastian Deorowicz
Abstract:
Summary: Counting all k-mers in a given dataset is a standard procedure in many bioinformatics applications. We introduce KMC3, a significant improvement of the former KMC2 algorithm together with KMC tools for manipulating k-mer databases. Usefulness of the tools is shown on a few real problems. Availability: Program is freely available at http://sun.aei.polsl.pl/REFRESH/kmc. Contact: sebastian.d…
▽ More
Summary: Counting all k-mers in a given dataset is a standard procedure in many bioinformatics applications. We introduce KMC3, a significant improvement of the former KMC2 algorithm together with KMC tools for manipulating k-mer databases. Usefulness of the tools is shown on a few real problems. Availability: Program is freely available at http://sun.aei.polsl.pl/REFRESH/kmc. Contact: [email protected]
△ Less
Submitted 27 January, 2017;
originally announced January 2017.
-
Sorting Data on Ultra-Large Scale with RADULS. New Incarnation of Radix Sort
Authors:
Marek Kokot,
Sebastian Deorowicz,
Agnieszka Debudaj-Grabysz
Abstract:
The paper introduces RADULS, a new parallel sorter based on radix sort algorithm, intended to organize ultra-large data sets efficiently. For example 4G 16-byte records can be sorted with 16 threads in less than 15 seconds on Intel Xeon-based workstation. The implementation of RADULS is not only highly optimized to gain such an excellent performance, but also parallelized in a cache friendly manne…
▽ More
The paper introduces RADULS, a new parallel sorter based on radix sort algorithm, intended to organize ultra-large data sets efficiently. For example 4G 16-byte records can be sorted with 16 threads in less than 15 seconds on Intel Xeon-based workstation. The implementation of RADULS is not only highly optimized to gain such an excellent performance, but also parallelized in a cache friendly manner to make the most of modern multicore architectures. Besides, our parallel scheduler launches a few different procedures at runtime, according to the current parameters of the execution, for proper workload management. All experiments show RADULS to be superior to competing algorithms.
△ Less
Submitted 8 December, 2016;
originally announced December 2016.
-
KMC 2: Fast and resource-frugal $k$-mer counting
Authors:
Sebastian Deorowicz,
Marek Kokot,
Szymon Grabowski,
Agnieszka Debudaj-Grabysz
Abstract:
Motivation: Building the histogram of occurrences of every $k$-symbol long substring of nucleotide data is a standard step in many bioinformatics applications, known under the name of $k$-mer counting. Its applications include develo** de Bruijn graph genome assemblers, fast multiple sequence alignment and repeat detection. The tremendous amounts of NGS data require fast algorithms for $k$-mer c…
▽ More
Motivation: Building the histogram of occurrences of every $k$-symbol long substring of nucleotide data is a standard step in many bioinformatics applications, known under the name of $k$-mer counting. Its applications include develo** de Bruijn graph genome assemblers, fast multiple sequence alignment and repeat detection. The tremendous amounts of NGS data require fast algorithms for $k$-mer counting, preferably using moderate amounts of memory.
Results: We present a novel method for $k$-mer counting, on large datasets at least twice faster than the strongest competitors (Jellyfish~2, KMC~1), using about 12\,GB (or less) of RAM memory. Our disk-based method bears some resemblance to MSPKmerCounter, yet replacing the original minimizers with signatures (a carefully selected subset of all minimizers) and using $(k, x)$-mers allows to significantly reduce the I/O, and a highly parallel overall architecture allows to achieve unprecedented processing speeds. For example, KMC~2 allows to count the 28-mers of a human reads collection with 44-fold coverage (106\,GB of compressed size) in about 20 minutes, on a 6-core Intel i7 PC with an SSD.
Availability: KMC~2 is freely available at http://sun.aei.polsl.pl/kmc.
Contact: [email protected]
△ Less
Submitted 6 July, 2014;
originally announced July 2014.