Search | arXiv e-print repository

Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing?

Authors: Guillermo Marco, Julio Gonzalo, Ramón del Castillo, María Teresa Mateo Girona

Abstract: It has become routine to report research results where Large Language Models (LLMs) outperform average humans in a wide range of language-related tasks, and creative text writing is no exception. It seems natural, then, to raise the bid: Are LLMs ready to compete in creative writing skills with a top (rather than average) novelist? To provide an initial answer to this question, we have carried out a contest between Patricio Pron (an award-winning novelist, considered one of the best of his generation) and GPT-4 (one of the top performing LLMs), in the spirit of AI-human duels such as Deep Blue vs Kasparov and AlphaGo vs Lee Sedol. We asked Pron and GPT-4 to provide thirty titles each, and then to write short stories for both their titles and their opponent's. Then, we prepared an evaluation rubric inspired by Boden's definition of creativity, and we collected 5,400 manual assessments provided by literature critics and scholars. The results of our experimentation indicate that LLMs are still far from challenging a top human creative writer, and that such a level of autonomous creative writing skill probably cannot be reached simply with larger language models.

Submitted 1 July, 2024; originally announced July 2024.

Comments: 9 pages, 6 figures

arXiv:2307.01756 [pdf, other]

Identifying Professional Photographers Through Image Quality and Aesthetics in Flickr

Authors: Sofia Strukova, Rubén Gaspar Marco, José A. Ruipérez-Valiente, Félix Gómez Mármol

Abstract: In our generation, there is an undoubted rise in the use of social media, and specifically of photo and video sharing platforms. These sites have proved their ability to yield rich data sets through users' interaction, which can be used to perform a data-driven evaluation of capabilities. Nevertheless, this study reveals the lack of suitable data sets in photo and video sharing platforms and of evaluation processes across them. Our first contribution is therefore the creation of one of the largest labelled data sets in Flickr with multimodal data, which has been open sourced as part of this contribution. Based on these data, we explored machine learning models and concluded that it is feasible to properly predict whether a user is a professional photographer or not, based on self-reported occupation labels and several feature representations drawn from the user, photo and crowdsourced sets. We also examined the relationship between the aesthetics and technical quality of a picture and the social activity of that picture. Finally, we described which characteristics differentiate professional photographers from non-professionals. As far as we know, the results presented in this work represent an important novelty for user expertise identification, which researchers from various domains can use for different applications.

Submitted 4 July, 2023; originally announced July 2023.

arXiv:2209.14140 [pdf, ps, other]

Time and Energy Efficient Contention Resolution in Asynchronous Shared Channels

Authors: Gianluca De Marco, Dariusz R. Kowalski, Grzegorz Stachowiak

Abstract: A number of stations, independently activated over time, are able to communicate by transmitting and listening to a shared channel in discrete time slots, and a message is successfully delivered to all stations if and only if its source station is the only transmitter at a time. Despite a vast amount of work in the last decades, many fundamental questions remain open in the realistic situation where stations do not start synchronously but are awakened at arbitrary times. In this work we present a broad picture of results for the fundamental problem of contention resolution, in which each of the contending stations needs to successfully broadcast its message. We show that adaptive algorithms or algorithms with the knowledge of the contention size $k$ achieve a linear $O(k)$ message latency even if the channel feedback is restricted to simple acknowledgements in case of successful transmissions and in the absence of synchronization. This asymptotically optimal performance cannot be extended to other settings: we prove that there is no non-adaptive algorithm without the knowledge of contention size $k$ admitting latency $o(k\log k/(\log\log k)^2)$. This means, in particular, that coding (even random) with acknowledgements is not very efficient on a shared channel without synchronization or an estimate of the contention size. We also present a non-adaptive algorithm with no knowledge of contention size that almost matches the lower bound on latency. Finally, despite the absence of a collision detection mechanism, we show that our algorithms are also efficient in terms of energy, understood as the total number of transmissions performed by the stations during the execution.

Submitted 28 September, 2022; originally announced September 2022.
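The channel model in the abstract above (a slot succeeds if and only if exactly one station transmits, stations wake at arbitrary times, and the only feedback is an acknowledgement) can be illustrated with a small simulation. This is a minimal sketch, not the paper's algorithms: it assumes the contention size k is known and uses a textbook baseline in which every pending station transmits with probability 1/k in each slot. The names `slot_outcome`, `simulate`, `wake_times` and `max_slots` are hypothetical.

```python
import random


def slot_outcome(transmitters):
    """Return the sole transmitter's id, or None on silence or collision."""
    return transmitters[0] if len(transmitters) == 1 else None


def simulate(wake_times, seed=0, max_slots=100_000):
    """Simulate a shared channel with asynchronous wake-up (assumes k >= 1).

    wake_times[i] is the slot in which station i is activated.
    Each awake, pending station transmits with probability 1/k per slot
    (an assumed baseline with known k, not the paper's method).
    Returns (latency, energy): the number of slots from the first activation
    until the last delivery, and the total number of transmissions.
    """
    rng = random.Random(seed)
    k = len(wake_times)
    pending = set(range(k))
    energy = 0
    start = min(wake_times)
    for t in range(start, start + max_slots):
        transmitters = [
            i for i in sorted(pending)
            if wake_times[i] <= t and rng.random() < 1.0 / k
        ]
        energy += len(transmitters)
        winner = slot_outcome(transmitters)
        if winner is not None:
            pending.discard(winner)  # acknowledged: the station switches off
            if not pending:
                return t - start + 1, energy
    raise RuntimeError("not all stations delivered within max_slots")
```

Calling `simulate([0, 0, 3, 5, 5, 9, 20, 20])` returns one latency and energy pair for eight stations; the values depend on the seed, so averaging over several seeds gives a rough feel for how both quantities grow with k.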

arXiv:2209.13423 [pdf, ps, other]

Deterministic non-adaptive contention resolution on a shared channel

Authors: Gianluca De Marco, Dariusz R. Kowalski, Grzegorz Stachowiak

Abstract: In a multiple access channel, autonomous stations are able to transmit and listen to a shared device. A fundamental problem, called contention resolution, is to allow any station to successfully deliver its message by resolving the conflicts that arise when several stations transmit simultaneously. Despite the long history of this problem, most of the results deal with the static setting in which all stations start simultaneously, while many fundamental questions remain open in the realistic scenario in which stations can join the channel at arbitrary times. In this paper, we explore the impact that three major channel features (asynchrony among stations, knowledge of the number of contenders and possibility of switching off stations after a successful transmission) can have on the time complexity of non-adaptive deterministic algorithms. We establish upper and lower bounds that clarify which parameters permit time-efficient contention resolution and which do not.

Submitted 3 October, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

arXiv:1608.04174 [pdf, ps, other]

doi 10.3233/FI-2017-1537

Naming a Channel with Beeps

Authors: Bogdan S. Chlebus, Gianluca De Marco, Muhammed Talo

Abstract: We consider a communication channel in which the only possible communication mode is transmitting beeps, which reach all the nodes instantaneously. Nodes are anonymous, in that they do not have any individual identifiers. The algorithmic goal is to randomly assign names to the nodes in such a manner that the names make a contiguous segment of positive integers starting from $1$. We give a Las Vegas naming algorithm for the case when the number of nodes $n$ is known, and a Monte Carlo algorithm for the case when the number of nodes $n$ is not known. The algorithms are provably optimal with respect to the expected time $O(n\log n)$, the number of used random bits $O(n\log n)$, and the probability of error.

Submitted 28 December, 2018; v1 submitted 14 August, 2016; originally announced August 2016.

Journal ref: Fundamenta Informaticae, 153(3): 199-219, 2017

arXiv:1507.02272 [pdf, ps, other]

Anonymous Processors with Synchronous Shared Memory

Authors: Bogdan S. Chlebus, Gianluca De Marco, Muhammed Talo

Abstract: We consider synchronous distributed systems in which anonymous processors communicate by shared read-write variables. The goal is to have all the processors assign unique names to themselves. We consider the instances of this problem determined by whether the number $n$ is known or not, and whether concurrently attempting to write distinct values into the same memory cell is allowed or not, and whether the number of shared variables is a constant independent of $n$ or it is unbounded. For known $n$, we give Las Vegas algorithms that operate in the optimum expected time, as determined by the amount of available shared memory, and use the optimum $O(n\log n)$ expected number of random bits. For unknown $n$, we give Monte Carlo algorithms that produce correct output upon termination with probabilities that are $1-n^{-Ω(1)}$, which is best possible when terminating almost surely and using $O(n\log n)$ random bits.

Submitted 31 August, 2016; v1 submitted 8 July, 2015; originally announced July 2015.

arXiv:1504.02549 [pdf, other]

A dynamical systems approach to the discrimination of the modes of operation of cryptographic systems

Authors: Jeaneth Machicao, Jan M. Baetens, Anderson G. Marco, Bernard De Baets, Odemir M. Bruno

Abstract: Evidence of signatures associated with cryptographic modes of operation is established. Motivated by some analogies between cryptographic and dynamical systems, in particular with chaos theory, we propose an algorithm based on Lyapunov exponents of discrete dynamical systems to estimate the divergence among ciphertexts as the encryption algorithm is applied iteratively. The results allow us to distinguish among six modes of operation, namely ECB, CBC, OFB, CFB, CTR and PCBC, using DES, IDEA, TEA and XTEA block ciphers of 64 bits, as well as AES, RC6, Twofish, Seed, Serpent and Camellia block ciphers of 128 bits. Furthermore, the proposed methodology enables a classification of modes of operation of cryptographic systems according to their strength.

Submitted 10 April, 2015; originally announced April 2015.

Comments: 14 pages, 10 figures

arXiv:1502.02427 [pdf, ps, other]

A Distributed Message-Optimal Assignment on Rings

Authors: Gianluca De Marco, Mauro Leoncini, Manuela Montangero

Abstract: Consider a set of items and a set of $m$ colors, where each item is associated with one color. Consider also $n$ computational agents connected by a ring. Each agent holds a subset of the items, and items of the same color can be held by different agents. We analyze the problem of distributively assigning colors to agents in such a way that (a) each color is assigned to one agent only and (b) the number of different colors assigned to each agent is minimum. Since any color assignment requires the items to be distributed according to it (e.g. all items of the same color are to be held by only one agent), we define the cost of a color assignment as the number of items that need to be moved, given an initial allocation. We first show that any distributed algorithm for this problem requires a message complexity of $Ω(n\cdot m)$ and then we exhibit an optimal message complexity algorithm for synchronous rings that in polynomial time determines a color assignment with cost at most three times the optimal. We also discuss solutions for the asynchronous setting. Finally, we show how to get a better cost solution at the expense of either the message or the time complexity.

Submitted 9 September, 2015; v1 submitted 9 February, 2015; originally announced February 2015.
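The cost definition in the abstract above (the number of items that must be moved, given an initial allocation) can be made concrete with a short centralized sketch. It only evaluates the cost of a given assignment and does not reproduce the paper's distributed algorithm; the data layout (`holdings` as one color-to-count dictionary per agent, `assignment` as a color-to-agent dictionary) and the function name `assignment_cost` are assumptions made for illustration.

```python
def assignment_cost(holdings, assignment):
    """Count the items that must move under a given color assignment.

    holdings[a] maps each color to the number of items of that color that
    agent a initially holds (assumed layout). assignment maps each color to
    the single agent that should end up holding all items of that color.
    An item moves exactly when its current holder is not the assigned agent.
    """
    cost = 0
    for agent, counts in enumerate(holdings):
        for color, count in counts.items():
            if assignment[color] != agent:
                cost += count
    return cost


# Three agents on a ring; only the initial allocation matters for the cost.
holdings = [
    {"red": 3, "blue": 1},
    {"red": 1, "blue": 4},
    {"green": 2},
]
good = {"red": 0, "blue": 1, "green": 2}
bad = {"red": 1, "blue": 0, "green": 2}
```

Here `assignment_cost(holdings, good)` counts the one blue item at agent 0 and the one red item at agent 1, while `bad` forces the three red items and four blue items to move.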

arXiv:1411.4498 [pdf, ps, other]

doi 10.1016/j.tcs.2015.11.046

Scalable Wake-up of Multi-Channel Single-Hop Radio Networks

Authors: Bogdan S. Chlebus, Gianluca De Marco, Dariusz R. Kowalski

Abstract: We consider single-hop radio networks with multiple channels as a model of wireless networks. There are $n$ stations connected to $b$ radio channels that do not provide collision detection. A station uses all the channels concurrently and independently. Some $k$ stations may become active spontaneously at arbitrary times. The goal is to wake up the network, which occurs when all the stations hear a successful transmission on some channel. Duration of a waking-up execution is measured starting from the first spontaneous activation. We present a deterministic algorithm for the general problem that wakes up the network in $O(k\log^{1/b} k\log n)$ time, where $k$ is unknown. We give a deterministic scalable algorithm for the special case when $b>d \log \log n$, for some constant $d>1$, which wakes up the network in $O(\frac{k}{b}\log n\log(b\log n))$ time, with $k$ unknown. This algorithm misses time optimality by at most a factor of $O(\log n(\log b +\log\log n))$, because any deterministic algorithm requires $Ω(\frac{k}{b}\log \frac{n}{k})$ time. We give a randomized algorithm that wakes up the network within $O(k^{1/b}\ln \frac{1}{ε})$ rounds with a probability that is at least $1-ε$, for any $0<ε<1$, where $k$ is known. We also consider a model of jamming, in which each channel in any round may be jammed to prevent a successful transmission, which happens with some known parameter probability $p$, independently across all channels and rounds. For this model, we give two deterministic algorithms for unknown $k$: one wakes up the network in time $O(\log^{-1}(\frac{1}{p})\, k\log n\log^{1/b} k)$, and the other in time $O(\log^{-1}(\frac{1}{p}) \, \frac{k}{b} \log n\log(b\log n))$ but assuming the inequality $b>\log(128b\log n)$, both with a probability that is at least $1-1/\mathrm{poly}(n)$.

Submitted 26 December, 2018; v1 submitted 17 November, 2014; originally announced November 2014.

Journal ref: Theoretical Computer Science, Volume 615, 15 February 2016, Pages 23-44

arXiv:1201.3114 [pdf, other]

doi 10.1142/S0129183110015166

Fast, parallel and secure cryptography algorithm using Lorenz's attractor

Authors: Anderson Gonçalves Marco, Alexandre Souto Martinez, Odemir Martinez Bruno

Abstract: A novel cryptography method based on the Lorenz attractor chaotic system is presented. The proposed algorithm is secure and fast, making it practical for general use. We introduce the chaotic operation mode, which provides an interaction among the password, message and a chaotic system. It ensures that the algorithm yields a secure codification, even if the nature of the chaotic system is known. The algorithm has been implemented in two versions: one sequential and slow, and the other parallel and fast. Our algorithm assures the integrity of the ciphertext (we know if it has been altered, which is not assured by traditional algorithms) and consequently its authenticity. Numerical experiments are presented and discussed, and show the behavior of the method in terms of security and performance. The fast version of the algorithm has a performance comparable to AES, a popular cryptography program used commercially nowadays, but it is more secure, which makes it immediately suitable for general purpose cryptography applications. An internet page has been set up, which enables the readers to test the algorithm and also to try to break the cipher.

Submitted 15 January, 2012; originally announced January 2012.

Journal ref: International Journal of Modern Physics C, Volume 21, Issue 3 (2010), pp. 365-382

arXiv:1112.6326 [pdf, other]

Chaotic Encryption Method Based on Life-Like Cellular Automata

Authors: Marina Jeaneth Machicao, Anderson G. Marco, Odemir M. Bruno

Abstract: We propose a chaotic encryption method based on Cellular Automata (CA), specifically on the family called the "Life-Like" type. The encryption process relies on the pseudo-random numbers generated by each CA's evolution, which takes the password as its initial condition to encrypt messages. Moreover, we explore the dynamical behavior of the CA to obtain a "good" quality pseudo-random number generator (PRNG), based on measures that quantify "how chaotic a dynamical system is" through the combination of entropy, Lyapunov exponent, and Hamming distance. Finally, we present a detailed security analysis based on experimental tests: the DIEHARD and ENT suites, as well as the Fourier power spectrum, used as security criteria.

Submitted 29 December, 2011; originally announced December 2011.
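The idea in the abstract above (a password sets the initial state of a Life-like cellular automaton, whose evolution supplies pseudo-random bits for encryption) can be sketched as a toy XOR stream cipher. This is an illustration under stated assumptions, not the paper's method and not secure: the default rule is Conway's B3/S23 on a toroidal grid, which tends to settle into low-activity patterns, and the password-to-grid mapping, grid size and bit packing are all hypothetical choices. The names `step`, `grid_from_password`, `keystream` and `xor_cipher` are invented for this sketch.

```python
def step(grid, birth=(3,), survive=(2, 3)):
    """Advance a Life-like CA one generation on a toroidal grid.

    birth and survive list the live-neighbor counts that create or keep a
    live cell; the defaults are Conway's rule B3/S23 (an assumed choice).
    """
    rows, cols = len(grid), len(grid[0])
    new = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            n = sum(
                grid[(r + dr) % rows][(c + dc) % cols]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
                if (dr, dc) != (0, 0)
            )
            rule = survive if grid[r][c] else birth
            new[r][c] = 1 if n in rule else 0
    return new


def grid_from_password(password, size=16):
    """Fill a size x size grid by repeating the password's bits (non-empty password assumed)."""
    bits = [(byte >> i) & 1 for byte in password.encode("utf-8") for i in range(8)]
    return [[bits[(r * size + c) % len(bits)] for c in range(size)] for r in range(size)]


def keystream(password, nbytes, size=16):
    """Pack the cells of successive generations into bytes (hypothetical packing)."""
    grid = grid_from_password(password, size)
    out = bytearray()
    while len(out) < nbytes:
        grid = step(grid)
        flat = [cell for row in grid for cell in row]
        for i in range(0, len(flat) - 7, 8):
            byte = 0
            for bit in flat[i:i + 8]:
                byte = (byte << 1) | bit
            out.append(byte)
    return bytes(out[:nbytes])


def xor_cipher(data, password):
    """XOR data with the keystream; applying it twice restores the input."""
    return bytes(a ^ b for a, b in zip(data, keystream(password, len(data))))
```

Because encryption and decryption are the same XOR, `xor_cipher(xor_cipher(b"attack at dawn", "hunter2"), "hunter2")` returns the original bytes. Passing other `birth` and `survive` tuples to `step` selects a different Life-like rule, which is the kind of variation whose chaotic quality the abstract proposes to measure.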

arXiv:1105.1622 [pdf, ps, other]

Computing Majority with Triple Queries

Authors: Gianluca De Marco, Evangelos Kranakis, Gabor Wiener

Abstract: Consider a bin containing $n$ balls colored with two colors. In a $k$-query, $k$ balls are selected by a questioner and the oracle's reply is related (depending on the computation model being considered) to the distribution of colors of the balls in this $k$-tuple; however, the oracle never reveals the colors of the individual balls. Following a number of queries the questioner is said to determine the majority color if it can output a ball of the majority color if it exists, and can prove that there is no majority if it does not exist. We investigate two computation models (depending on the type of replies being allowed). We give algorithms to compute the minimum number of 3-queries which are needed so that the questioner can determine the majority color and provide tight and almost tight upper and lower bounds on the number of queries needed in each case.

Submitted 9 May, 2011; originally announced May 2011.

Comments: 22 pages, 1 figure, conference version to appear in proceedings of the 17th Annual International Computing and Combinatorics Conference (COCOON 2011)

Showing 1–12 of 12 results for author: Marco, G