
The Impact of LoRA on the Emergence of Clusters in Transformers

Authors: Hugo Koubbi, Matthieu Boussard, Louis Hernandez

Abstract: In this paper, we employ the mathematical framework on Transformers developed by \citet{sander2022sinkformers,geshkovski2023emergence,geshkovski2023mathematical} to explore how variations in attention parameters and initial token values impact the structural dynamics of token clusters. Our analysis demonstrates that while the clusters within a modified attention matrix dynamics can exhibit significant divergence from the original over extended periods, they maintain close similarities over shorter intervals, depending on the parameter differences. This work contributes to the fine-tuning field through practical applications to the LoRA algorithm \cite{hu2021lora,peft}, enhancing our understanding of the behavior of LoRA-enhanced Transformer models.

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.01385 [pdf]

Del Visual al Auditivo: Sonorización de Escenas Guiada por Imagen

Authors: María Sánchez, Laura Fernández, Julián Arias, Mateo Cámara, Giulia Comini, Adam Gabrys, José Luis Blanco, Juan Ignacio Godino, Luis Alfonso Hernández

Abstract: Recent advances in image, video, text and audio generative techniques, and their use by the general public, are leading to new forms of content generation. Usually, each modality was approached separately, which poses limitations. The automatic sonorization of visual sequences is one of the greatest challenges for the automatic generation of multimodal content. We present a processing flow that, starting from images extracted from videos, is able to add sound to them. We work with pre-trained models that employ complex encoders, contrastive learning, and multiple modalities, allowing complex representations of the sequences for their sonorization. The proposed scheme offers different possibilities for audio mapping and text guidance. We evaluated the scheme on a dataset of frames extracted from a commercial video game and sounds extracted from the Freesound platform. Subjective tests have shown that the proposed scheme is able to generate and assign audio automatically and conveniently to images. Moreover, it adapts well to user preferences, and the proposed objective metrics show a high correlation with the subjective ratings.

Submitted 2 February, 2024; originally announced February 2024.

Comments: 10 pages, in Spanish, Tecniacústica

arXiv:2302.00737 [pdf, other]

A Universal Technique for Machine-Certified Proofs of Linearizable Algorithms

Authors: Prasad Jayanti, Siddhartha Jayanti, Ugur Y. Yavuz, Lizzie Hernandez

Abstract: Linearizability has been the long-standing gold standard for consistency in concurrent data structures. However, proofs of linearizability can be long and intricate, hard to produce, and extremely time-consuming even to verify. In this work, we address this issue by introducing simple $universal$, $sound$, and $complete$ proof methods for producing machine-verifiable proofs of linearizability and its close cousin, strong linearizability. Universality means that our method works for any object type; soundness means that an algorithm can be proved correct by our method only if it is linearizable (resp. strongly linearizable); and completeness means that any linearizable (resp. strongly linearizable) implementation can be proved so using our method. We demonstrate the simplicity and power of our method by producing proofs of linearizability for the Herlihy-Wing queue and Jayanti's single-scanner snapshot, as well as a proof of strong linearizability of the Jayanti-Tarjan union-find object. All three of these proofs are machine-verified by TLAPS (the Temporal Logic of Actions Proof System).

Submitted 13 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

Comments: 31 pages

arXiv:2208.14393 [pdf, other]

Who talks about what? Comparing the information treatment in traditional media with online discussions

Authors: Hendrik Schawe, Mariano Gastón Beiró, J. Ignacio Alvarez-Hamelin, Dimitris Kotzinos, Laura Hernández

Abstract: We study the dynamics of interactions between a traditional medium, the New York Times journal, and its followers on Twitter, using a massive dataset. It consists of the metadata of the articles published by the journal during the first year of the COVID-19 pandemic, and the posts published on Twitter by a large set of followers of the @nytimes account along with those published by a set of followers of several other media of different kinds. The dynamics of discussions held on Twitter by exclusive followers of a medium show a strong dependence on the medium they follow: the followers of @FoxNews show the highest similarity to each other and a strong differentiation of interests from the general group. Our results also reveal the difference in the attention paid to U.S. presidential elections by the journal and by its followers, and show that the topic related to the "Black Lives Matter" movement started on Twitter, and was addressed later by the journal.

Submitted 30 August, 2022; originally announced August 2022.

Comments: 14 pages, 7 figures

arXiv:2202.05199 [pdf, other]

doi 10.1109/TBME.2021.3130548

A Human-Centered Machine-Learning Approach for Muscle-Tendon Junction Tracking in Ultrasound Images

Authors: Christoph Leitner, Robert Jarolim, Bernhard Englmair, Annika Kruse, Karen Andrea Lara Hernandez, Andreas Konrad, Eric Su, Jörg Schröttner, Luke A. Kelly, Glen A. Lichtwark, Markus Tilp, Christian Baumgartner

Abstract: Biomechanical and clinical gait research observes muscles and tendons in limbs to study their functions and behaviour. Therefore, movements of distinct anatomical landmarks, such as muscle-tendon junctions, are frequently measured. We propose a reliable and time-efficient machine-learning approach to track these junctions in ultrasound videos and support clinical biomechanists in gait analysis. In order to facilitate this process, a method based on deep learning was introduced. We gathered an extensive dataset, covering 3 functional movements, 2 muscles, collected on 123 healthy and 38 impaired subjects with 3 different ultrasound systems, and providing a total of 66864 annotated ultrasound images in our network training. Furthermore, we used data collected across independent laboratories and curated by researchers with varying levels of experience. For the evaluation of our method, a diverse test set was selected that is independently verified by four specialists. We show that our model achieves similar performance scores to the four human specialists in identifying the muscle-tendon junction position. Our method provides time-efficient tracking of muscle-tendon junctions, with prediction times of up to 0.078 seconds per frame (approx. 100 times faster than manual labeling). All our code, trained models and test set were made publicly available and our model is provided as a free-to-use online service on https://deepmtj.org/.

Submitted 10 February, 2022; originally announced February 2022.

Comments: in IEEE Transactions on Biomedical Engineering

ACM Class: I.2.1

arXiv:2107.12565 [pdf]

A Biomedically oriented automatically annotated Twitter COVID-19 Dataset

Authors: Luis Alberto Robles Hernandez, Tiffany J. Callahan, Juan M. Banda

Abstract: The use of social media data, like Twitter, for biomedical research has been gradually increasing over the years. With the COVID-19 pandemic, researchers have turned to more nontraditional sources of clinical data to characterize the disease in near real-time, study the societal implications of interventions, as well as the sequelae that recovered COVID-19 cases present (Long-COVID). However, manually curated social media datasets are difficult to come by due to the expensive costs of manual annotation and the efforts needed to identify the correct texts. When datasets are available, they are usually very small and their annotations do not generalize well over time or to larger sets of documents. As part of the 2021 Biomedical Linked Annotation Hackathon, we release our dataset of over 120 million automatically annotated tweets for biomedical research purposes. Incorporating best practices, we identify tweets with potentially high clinical relevance. We evaluated our work by comparing several SpaCy-based annotation frameworks against a manually annotated gold-standard dataset. Selecting the best method to use for automatic annotation, we then annotated 120 million tweets and released them publicly for future downstream usage within the biomedical domain.

Submitted 26 July, 2021; originally announced July 2021.

Comments: 8 Pages, 3 tables

arXiv:2011.09538 [pdf, other]

Evolution of the political opinion landscape during electoral periods

Authors: Tomás Mussi Reyero, Mariano G. Beiró, J. Ignacio Alvarez-Hamelin, Laura Hernández, Dimitris Kotzinos

Abstract: We present a study of the evolution of the political landscape during the 2015 and 2019 presidential elections in Argentina, based on the data obtained from the micro-blogging platform Twitter. We build a semantic network based on the hashtags used by all the users following at least one of the main candidates. With this network we can detect the topics that are discussed in the society. Unlike most studies of opinion on social media, we do not choose the topics a priori; instead, they emerge naturally from the community structure of the semantic network. We assign to each user a dynamical topic vector which measures the evolution of her/his opinion in this space and allows us to monitor the similarities and differences among groups of supporters of different candidates. Our results show that the method is able to detect the dynamics of formation of opinion on different topics and, in particular, it can capture the reshaping of the political opinion landscape which led to the inversion of the result between the two rounds of the 2015 election.

Submitted 18 November, 2020; originally announced November 2020.

arXiv:1708.00352 [pdf]

Implementing an Edge-Fog-Cloud architecture for stream data management

Authors: Lilian Hernandez, Hung Cao, Monica Wachowicz

Abstract: The Internet of Moving Things (IoMT) requires support for a data life cycle process ranging from sorting, cleaning and monitoring data streams to more complex tasks such as querying, aggregation, and analytics. Current solutions for stream data management in IoMT have focused on partial aspects of a data life cycle process, with special emphasis on sensor networks. This paper aims to address this problem by developing a streaming data life cycle process that incorporates an edge/fog/cloud architecture, which is needed for handling heterogeneous, streaming and geographically dispersed IoMT devices. We propose a 3-tier architecture to support an instant intra-layer communication that establishes a stream data flow in real-time to respond to immediate data life cycle tasks in the system. Communication and process are thus the defining factors in the design of our stream data management solution for IoMT. We describe and evaluate our prototype implementation using real-time transit data feeds. Preliminary results show the advantages of running data life cycle tasks for reducing the volume of data streams that are redundant and should not be transported to the cloud.

Submitted 27 September, 2017; v1 submitted 1 August, 2017; originally announced August 2017.

Comments: stream data life cycle, edge computing, cloud computing, fog computing, Internet of Moving Things, will be published in OpenFog Congress 2017

arXiv:1706.06535 [pdf]

doi 10.1145/3132211.3132452

Combining edge and cloud computing for mobility analytics

Authors: Ikechukwu Maduako, Hung Cao, Lilian Hernandez, Monica Wachowicz

Abstract: Mobility analytics using data generated from the Internet of Mobile Things (IoMT) is facing many challenges which range from the ingestion of data streams coming from a vast number of fog nodes and IoMT devices to avoiding overflowing the cloud with useless massive data streams that can trigger bottlenecks [1]. Managing data flow is becoming an important part of the IoMT because it will dictate in which platform analytical tasks should run in the future. Data flows are usually a sequence of out-of-order tuples with a high data input rate, and mobility analytics requires a real-time flow of data in both directions, from the edge to the cloud, and vice-versa. Before pulling the data streams to the cloud, edge data stream processing is needed for detecting missing, broken, and duplicated tuples in addition to recognizing tuples whose arrival time is out of order. Analytical tasks such as data filtering, data cleaning and low-level data contextualization can be executed at the edge of a network. In contrast, more complex analytical tasks such as graph processing can be deployed in the cloud, and the results of ad-hoc queries and streaming graph analytics can be pushed to the edge as needed by a user application. Graphs are efficient representations used in mobility analytics because they unify knowledge about connectivity, proximity and interaction among moving things.
This poster describes the preliminary results from our experimental prototype developed for supporting transit systems, in which edge and cloud computing are combined to process transit data streams forwarded from fog nodes into a cloud. The motivation of this research is to understand how to perform meaningful mobility analytics on transit feeds by combining cloud and fog computing architectures in order to improve fleet management, mass transit and remote asset monitoring.

Submitted 20 June, 2017; originally announced June 2017.

Comments: Edge Computing, Cloud Computing, Mobility Analytics, Internet of Mobile Things, Edge Fog Fabric

arXiv:1608.07192 [pdf]

Design of two combined health recommender systems for tailoring messages in a smoking cessation app

Authors: Santiago Hors-Fraile, Francisco J Núñez Benjumea, Laura Carrasco Hernández, Francisco Ortega Ruiz, Luis Fernandez-Luque

Abstract: In this article, we describe the design of two recommender systems (RS) designed to support the smoking cessation process through a mobile application. We plan to use a hybrid RS (content-based, utility-based, and demographic filtering) to tailor health recommendation messages, and a content-based RS to schedule a timely delivery of the message. We also define metrics that we will use to assess their performance, helping people quit smoking when we run the pilot.

Submitted 19 December, 2019; v1 submitted 25 August, 2016; originally announced August 2016.

Comments: Please, cite as: Hors-Fraile, S., Núñez Benjumea, F.J., Carrasco Hernández, L., Ruiz, F.O., Fernandez-Luque, L. (2016) Design of two combined health recommender systems for tailoring messages in a smoking cessation app. International Workshop on Engendering Health with RecSys co-located with ACM RecSys 2016. Boston, MA, USA

arXiv:1405.7811 [pdf, ps, other]

doi 10.1103/PhysRevE.91.032808

Entropic determination of the phase transition in a coevolving opinion-formation model

Authors: Enrique Burgos, Laura Hernandez, Horacio Ceva, Roberto P. J. Perazzo

Abstract: We study an opinion formation model by means of a co-evolving complex network where the vertices represent the individuals, characterised by their evolving opinions, and the edges represent the interactions among them. The network adapts to the spreading of opinions in two ways: not only do connected agents interact and eventually change their thinking, but an agent may also rewire one of its links to a neighborhood holding the same opinion as its own. The dynamics depends on an external parameter Φ, which controls the plasticity of the network. We show how the information entropy associated with the distribution of group sizes allows us to locate the phase transition between full consensus and a society where different opinions coexist. We also determine the minimum size of the most informative sampling. At the transition the distribution of the sizes of groups holding the same opinion is scale free.

Submitted 30 May, 2014; originally announced May 2014.

Comments: 7 pages, 6 figures

MSC Class: 46N55

arXiv:1302.1546 [pdf]

Inference with Idempotent Valuations

Authors: Luis D. Hernandez, Serafin Moral

Abstract: Valuation based systems verifying an idempotent property are studied. A partial order is defined between the valuations giving them a lattice structure. Then, two different strategies are introduced to represent valuations: as infimum of the most informative valuations or as supremum of the least informative ones. It is studied how to carry out computations with both representations in an efficient way. The particular cases of finite sets and convex polytopes are considered.

Submitted 6 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

Report number: UAI-P-1997-PG-229-237

arXiv:1110.1591 [pdf, ps, other]

doi 10.1103/PhysRevE.84.067101

Co-evolutionnary network approach to cultural dynamics controlled by intolerance

Authors: Carlos Gracia-Lázaro, Fernando Quijandría, Laura Hernández, Luis Mario Floría, Yamir Moreno

Abstract: Starting from Axelrod's model of cultural dissemination, we introduce a rewiring probability, enabling agents to cut the links with their unfriendly neighbors if their cultural similarity is below a tolerance parameter. For low values of tolerance, rewiring promotes the convergence to a frozen monocultural state. However, intermediate tolerance values prevent rewiring once the network is fragmented, resulting in a multicultural society even for values of initial cultural diversity in which the original Axelrod model reaches globalization.

Submitted 7 October, 2011; originally announced October 2011.

arXiv:cs/0312034 [pdf, ps, other]

Sharing secret color images using cellular automata with memory

Authors: Gonzalo Alvarez, Luis Hernandez, Angel Martin

Abstract: A {k,n}-threshold scheme based on two-dimensional memory cellular automata is proposed to share images in a secret way. This method encodes an image into n shared images so that only qualified subsets of k or more shares can recover the secret image, while any k-1 or fewer of them gain no information about the original image. The main characteristics of this new scheme are: each shared image has the same size as the original one, and the recovered image is exactly the same as the secret image; i.e., there is no loss of resolution.

Submitted 17 December, 2003; originally announced December 2003.

Comments: 17 pages, 6 figures, LaTeX format

ACM Class: I.4.9

Showing 1–14 of 14 results for author: Hernández, L