Skip to main content

Showing 1–31 of 31 results for author: Vega, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.11408  [pdf, other

    cs.RO eess.SY

    Indirect Swarm Control: Characterization and Analysis of Emergent Swarm Behaviors

    Authors: Ricardo Vega, Connor Mattson, Daniel S. Brown, Cameron Nowzari

    Abstract: Emergence and emergent behaviors are often defined as cases where changes in local interactions between agents at a lower level effectively changes what occurs in the higher level of the system (i.e., the whole swarm) and its properties. However, the manner in which these collective emergent behaviors self-organize is less understood. The focus of this paper is in presenting a new framework for ch… ▽ More

    Submitted 28 March, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 8 pages, 13 figures, submitted to IROS 2024 conference

  2. arXiv:2306.17744  [pdf, other

    cs.RO eess.SY

    Zespol: A Lightweight Environment for Training Swarming Agents

    Authors: Shay Snyder, Kevin Zhu, Ricardo Vega, Cameron Nowzari, Maryam Parsa

    Abstract: Agent-based modeling (ABM) and simulation have emerged as important tools for studying emergent behaviors, especially in the context of swarming algorithms for robotic systems. Despite significant research in this area, there is a lack of standardized simulation environments, which hinders the development and deployment of real-world robotic swarms. To address this issue, we present Zespol, a modu… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 5 pages, 4 figures, 1 table

  3. Cross-domain Sentiment Classification in Spanish

    Authors: Lautaro Estienne, Matias Vera, Leonardo Rey Vega

    Abstract: Sentiment Classification is a fundamental task in the field of Natural Language Processing, and has very important academic and commercial applications. It aims to automatically predict the degree of sentiment present in a text that contains opinions and subjectivity at some level, like product and movie reviews, or tweets. This can be really difficult to accomplish, in part, because different dom… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  4. arXiv:2302.04829  [pdf, other

    cs.LG cs.AI

    Modeling and Forecasting COVID-19 Cases using Latent Subpopulations

    Authors: Roberto Vega, Zehra Shah, Pouria Ramazi, Russell Greiner

    Abstract: Classical epidemiological models assume homogeneous populations. There have been important extensions to model heterogeneous populations, when the identity of the sub-populations is known, such as age group or geographical location. Here, we propose two new methods to model the number of people infected with COVID-19 over time, each as a linear combination of latent sub-populations -- i.e., when w… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 14 pages, 8 figures, submitted to Frontiers in Big Data

  5. arXiv:2301.09018  [pdf, ps, other

    cs.RO eess.SY

    Simulate Less, Expect More: Bringing Robot Swarms to Life via Low-Fidelity Simulations

    Authors: Ricardo Vega, Kevin Zhu, Sean Luke, Maryam Parsa, Cameron Nowzari

    Abstract: This paper proposes a novel methodology for addressing the simulation-reality gap for multi-robot swarm systems. Rather than immediately try to shrink or `bridge the gap' anytime a real-world experiment failed that worked in simulation, we characterize conditions under which this is actually necessary. When these conditions are not satisfied, we show how very simple simulators can still be used to… ▽ More

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: 9 pages, 9 figures

  6. arXiv:2209.07148  [pdf, ps, other

    cs.LG cs.AI cs.IT

    Semi-supervised Batch Learning From Logged Data

    Authors: Gholamali Aminian, Armin Behnamnia, Roberto Vega, Laura Toni, Chengchun Shi, Hamid R. Rabiee, Omar Rivasplata, Miguel R. D. Rodrigues

    Abstract: Off-policy learning methods are intended to learn a policy from logged data, which includes context, action, and feedback (cost or reward) for each sample point. In this work, we build on the counterfactual risk minimization framework, which also assumes access to propensity scores. We propose learning methods for problems where feedback is missing for some samples, so there are samples with feedb… ▽ More

    Submitted 18 February, 2024; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: 46 pages,

  7. arXiv:2203.16463  [pdf

    cs.LG cs.AI cs.CR

    Perfectly Accurate Membership Inference by a Dishonest Central Server in Federated Learning

    Authors: Georg Pichler, Marco Romanelli, Leonardo Rey Vega, Pablo Piantanida

    Abstract: Federated Learning is expected to provide strong privacy guarantees, as only gradients or model parameters but no plain text training data is ever exchanged either between the clients or between the clients and the central server. In this paper, we challenge this claim by introducing a simple but still very effective membership inference attack algorithm, which relies only on a single training ste… ▽ More

    Submitted 9 November, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: accepted for publication in IEEE Transactions on Dependable and Secure Computing

  8. arXiv:2201.05282  [pdf, other

    cs.LG

    Domain-shift adaptation via linear transformations

    Authors: Roberto Vega, Russell Greiner

    Abstract: A predictor, $f_A : X \to Y$, learned with data from a source domain (A) might not be accurate on a target domain (B) when their distributions are different. Domain adaptation aims to reduce the negative effects of this distribution mismatch. Here, we analyze the case where $P_A(Y\ |\ X) \neq P_B(Y\ |\ X)$, $P_A(X) \neq P_B(X)$ but $P_A(Y) = P_B(Y)$; where there are affine transformations of $X$ t… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  9. arXiv:2112.05547  [pdf, other

    cs.LG stat.ML

    PACMAN: PAC-style bounds accounting for the Mismatch between Accuracy and Negative log-loss

    Authors: Matias Vera, Leonardo Rey Vega, Pablo Piantanida

    Abstract: The ultimate performance of machine learning algorithms for classification tasks is usually measured in terms of the empirical error probability (or accuracy) based on a testing dataset. Whereas, these algorithms are optimized through the minimization of a typically different--more convenient--loss function based on a training set. For classification tasks, this loss function is often the negative… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: Submitted to be considered for publication in Information and Inference: a Journal of the IMA

  10. arXiv:2106.01590  [pdf, other

    cs.LG stat.AP

    SIMLR: Machine Learning inside the SIR model for COVID-19 Forecasting

    Authors: Roberto Vega, Leonardo Flores, Russell Greiner

    Abstract: Accurate forecasts of the number of newly infected people during an epidemic are critical for making effective timely decisions. This paper addresses this challenge using the SIMLR model, which incorporates machine learning (ML) into the epidemiological SIR model. For each region, SIMLR tracks the changes in the policies implemented at the government level, which it uses to estimate the time-varyi… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

  11. arXiv:2102.06164  [pdf, other

    cs.CV cs.LG

    Sample Efficient Learning of Image-Based Diagnostic Classifiers Using Probabilistic Labels

    Authors: Roberto Vega, Pouneh Gorji, Zichen Zhang, Xuebin Qin, Abhilash Rakkunedeth Hareendranathan, Jeevesh Kapur, Jacob L. Jaremko, Russell Greiner

    Abstract: Deep learning approaches often require huge datasets to achieve good generalization. This complicates its use in tasks like image-based medical diagnosis, where the small training datasets are usually insufficient to learn appropriate data representations. For such sensitive tasks it is also important to provide the confidence in the predictions. Here, we propose a way to learn and use probabilist… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: To appear in the Proceedings of the 24 th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021, San Diego,California, USA. PMLR: Volume 130

  12. arXiv:2010.11642  [pdf, other

    stat.ML cs.LG

    The Role of Mutual Information in Variational Classifiers

    Authors: Matias Vera, Leonardo Rey Vega, Pablo Piantanida

    Abstract: Overfitting data is a well-known phenomenon related with the generation of a model that mimics too closely (or exactly) a particular instance of data, and may therefore fail to predict future observations reliably. In practice, this behaviour is controlled by various--sometimes heuristics--regularization techniques, which are motivated by develo** upper bounds to the generalization error. In thi… ▽ More

    Submitted 13 April, 2023; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted for publication to Machine Learning Springer

  13. arXiv:2004.13475  [pdf, other

    cs.DC

    Efficient GPU Thread Map** on Embedded 2D Fractals

    Authors: Cristóbal A. Navarro, Felipe A. Quezada, Nancy Hitschfeld, Raimundo Vega, Benjamin Bustos

    Abstract: This work proposes a new approach for map** GPU threads onto a family of discrete embedded 2D fractals. A block-space map $λ: \mathbb{Z}_{\mathbb{E}}^{2} \mapsto \mathbb{Z}_{\mathbb{F}}^{2}$ is proposed, from Euclidean parallel space $\mathbb{E}$ to embedded fractal space $\mathbb{F}$, that maps in $\mathcal{O}(\log_2 \log_2(n))$ time and uses no more than $\mathcal{O}(n^\mathbb{H})$ threads wit… ▽ More

    Submitted 25 April, 2020; originally announced April 2020.

    Comments: 20 Pages. arXiv admin note: text overlap with arXiv:1706.04552

    ACM Class: C.1.4; G.2.0

  14. arXiv:2001.05585  [pdf, ps, other

    cs.DC

    GPU Tensor Cores for fast Arithmetic Reductions

    Authors: Cristóbal A. Navarro, Roberto Carrasco, Ricardo J. Barrientos, Javier A. Riquelme, Raimundo Vega

    Abstract: This work proposes a GPU tensor core approach that encodes the arithmetic reduction of $n$ numbers as a set of chained $m \times m$ matrix multiply accumulate (MMA) operations executed in parallel by GPU tensor cores. The asymptotic running time of the proposed chained tensor core approach is $T(n)=5 log_{m^2}{n}$ and its speedup is $S=\dfrac{4}{5} log_{2}{m^2}$ over the classic $O(n \log n)$ para… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: 14 pages, 11 figures

  15. arXiv:1912.01772  [pdf, ps, other

    cs.CL

    A Resource for Computational Experiments on Mapudungun

    Authors: Mingjun Duan, Carlos Fasola, Sai Krishna Rallabandi, Rodolfo M. Vega, Antonios Anastasopoulos, Lori Levin, Alan W Black

    Abstract: We present a resource for computational experiments on Mapudungun, a polysynthetic indigenous language spoken in Chile with upwards of 200 thousand speakers. We provide 142 hours of culturally significant conversations in the domain of medical treatment. The conversations are fully transcribed and translated into Spanish. The transcriptions also include annotations for code-switching and non-stand… ▽ More

    Submitted 4 April, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: accepted at LREC 2020

  16. Extreme coverage in 5G Narrowband IoT: a LUT-based strategy to optimize shared channels

    Authors: Emmanuel Luján, Juan A. Zuloaga Mellino, Alejandro D. Otero, Leonardo Rey Vega, Cecilia G. Galarza, Esteban E. Mocskos

    Abstract: One of the main challenges in IoT is providing communication support to an increasing number of connected devices. In recent years, narrowband radio technology has emerged to address this situation: Narrowband Internet of Things (NB-IoT), which is now part of 5G. Supporting massive connectivity becomes particularly demanding in extreme coverage scenarios such as underground or deep inside building… ▽ More

    Submitted 24 December, 2019; v1 submitted 7 August, 2019; originally announced August 2019.

    Comments: Paper accepted at IEEE IoT Journal

  17. arXiv:1905.11972  [pdf, other

    stat.ML cs.IT cs.LG

    Understanding the Behaviour of the Empirical Cross-Entropy Beyond the Training Distribution

    Authors: Matias Vera, Pablo Piantanida, Leonardo Rey Vega

    Abstract: Machine learning theory has mostly focused on generalization to samples from the same distribution as the training data. Whereas a better understanding of generalization beyond the training distribution where the observed distribution changes is also fundamentally important to achieve a more powerful form of generalization. In this paper, we attempt to study through the lens of information measure… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: 18 pages, 6 Figures

  18. Analyzing GPU Tensor Core Potential for Fast Reductions

    Authors: Roberto Carrasco, Raimundo Vega, Cristóbal A. Navarro

    Abstract: The Nvidia GPU architecture has introduced new computing elements such as the \textit{tensor cores}, which are special processing units dedicated to perform fast matrix-multiply-accumulate (MMA) operations and accelerate \textit{Deep Learning} applications. In this work we present the idea of using tensor cores for a different purpose such as the parallel arithmetic reduction problem, and propose… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

    Comments: This paper was presented in the SCCC 2018 Conference, November 5

    Journal ref: 37th Internatioinal Conference of the Chilean Computer Science Society, SCCC 2018, November 5-9, Santiago, Chile, 2018

  19. arXiv:1802.05355  [pdf, other

    stat.ML cs.LG

    The Role of Information Complexity and Randomization in Representation Learning

    Authors: Matías Vera, Pablo Piantanida, Leonardo Rey Vega

    Abstract: A grand challenge in representation learning is to learn the different explanatory factors of variation behind the high dimen- sional data. Encoder models are often determined to optimize performance on training data when the real objective is to generalize well to unseen data. Although there is enough numerical evidence suggesting that noise injection (during training) at the representation level… ▽ More

    Submitted 14 February, 2018; originally announced February 2018.

    Comments: 35 pages, 3 figures. Submitted for publication

  20. Compression-Based Regularization with an Application to Multi-Task Learning

    Authors: Matías Vera, Leonardo Rey Vega, Pablo Piantanida

    Abstract: This paper investigates, from information theoretic grounds, a learning problem based on the principle that any regularity in a given dataset can be exploited to extract compact features from data, i.e., using fewer bits than needed to fully describe the data itself, in order to build meaningful representations of a relevant content (multiple labels). We begin by introducing the noisy lossy source… ▽ More

    Submitted 19 November, 2017; originally announced November 2017.

    Comments: 13 pages, 7 figures. Submitted for publication

  21. arXiv:1706.04552  [pdf, ps, other

    cs.DC

    Block-space GPU Map** for Embedded Sierpiński Gasket Fractals

    Authors: Cristóbal A. Navarro, Benjamín Bustos, Raimundo Vega, Nancy Hitschfeld

    Abstract: This work studies the problem of GPU thread map** for a Sierpiński gasket fractal embedded in a discrete Euclidean space of $n \times n$. A block-space map $λ: \mathbb{Z}_{\mathbb{E}}^{2} \mapsto \mathbb{Z}_{\mathbb{F}}^{2}$ is proposed, from Euclidean parallel space $\mathbb{E}$ to embedded fractal space $\mathbb{F}$, that maps in $\mathcal{O}(\log_2 \log_2(n))$ time and uses no more than… ▽ More

    Submitted 14 June, 2017; originally announced June 2017.

    Comments: 7 pages, 8 Figures

  22. Collaborative Information Bottleneck

    Authors: Matías Vera, Leonardo Rey Vega, Pablo Piantanida

    Abstract: This paper investigates a multi-terminal source coding problem under a logarithmic loss fidelity which does not necessarily lead to an additive distortion measure. The problem is motivated by an extension of the Information Bottleneck method to a multi-source scenario where several encoders have to build cooperatively rate-limited descriptions of their sources in order to maximize information with… ▽ More

    Submitted 24 November, 2021; v1 submitted 5 April, 2016; originally announced April 2016.

    Comments: Submitted to IEEE Transactions on Information Theory (revised, 29, 7 figures)

    Journal ref: IEEE Transactions on Information Theory ( Volume: 65, Issue: 2, Feb. 2019)

  23. arXiv:1510.01363  [pdf, other

    cs.IT

    Cooperative spectrum sensing schemes with partial statistics knowledge

    Authors: Juan Augusto Maya, Leonardo Rey Vega, Cecilia G. Galarza

    Abstract: In this letter, we analyze the problem of detecting spectrum holes in cognitive radio systems. We consider that a group of unlicensed users can sense the radio signal energy, perform some simple processing and transmit the result to a central entity, where the decision about the presence or not of licensed users is made. We show that the proposed cooperative schemes present good performances even… ▽ More

    Submitted 2 November, 2015; v1 submitted 5 October, 2015; originally announced October 2015.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  24. arXiv:1509.04119  [pdf, other

    cs.IT

    Exploiting Spatial Correlation in Energy Constrained Distributed Detection

    Authors: Juan Augusto Maya, Cecilia G. Galarza, Leonardo Rey Vega

    Abstract: We consider the detection of a correlated random process immersed in noise in a wireless sensor network. Each node has an individual energy constraint and the communication with the processing central units are affected by the path loss propagation effect. Guided by energy efficiency concerns, we consider the partition of the whole network into clusters, each one with a coordination node or \emph{… ▽ More

    Submitted 14 September, 2015; originally announced September 2015.

    Comments: This paper was submitted to IEEE Transactions on Signal Processing. Ancillary files are available for this paper to show some symbolic expressions using the MATLAB script "SymExprForPfaPm.m"

  25. arXiv:1502.01359  [pdf, other

    cs.IT

    The Three-Terminal Interactive Lossy Source Coding Problem

    Authors: Leonardo Rey Vega, Pablo Piantanida, Alfred Hero III

    Abstract: The three-node multiterminal lossy source coding problem is investigated. We derive an inner bound to the general rate-distortion region of this problem which is a natural extension of the seminal work by Kaspi'85 on the interactive two-terminal source coding problem. It is shown that this (rather involved) inner bound contains several rate-distortion regions of some relevant source coding setting… ▽ More

    Submitted 18 January, 2016; v1 submitted 4 February, 2015; originally announced February 2015.

    Comments: New version with changes suggested by reviewers.Revised and resubmitted to IEEE Transactions on Information Theory. 92 pages, 11 figures, 1 table

  26. Computer-assisted polyp matching between optical colonoscopy and CT colonography: a phantom study

    Authors: Holger R. Roth, Thomas E. Hampshire, Emma Helbren, Mingxing Hu, Roser Vega, Steve Halligan, David J. Hawkes

    Abstract: Potentially precancerous polyps detected with CT colonography (CTC) need to be removed subsequently, using an optical colonoscope (OC). Due to large colonic deformations induced by the colonoscope, even very experienced colonoscopists find it difficult to pinpoint the exact location of the colonoscope tip in relation to polyps reported on CTC. This can cause unduly prolonged OC examinations that a… ▽ More

    Submitted 15 January, 2015; originally announced January 2015.

    Comments: This paper was presented at the SPIE Medical Imaging 2014 conference

    Journal ref: Proc. SPIE 9036, Medical Imaging 2014: Image-Guided Procedures, Robotic Interventions, and Modeling, 903609 (March 12, 2014)

  27. arXiv:1410.3929  [pdf, other

    stat.OT cs.IT stat.AP

    Distributed Detection of a Random Process over a Multiple Access Channel under Energy and Bandwidth Constraints

    Authors: Juan Augusto Maya, Leonardo Rey Vega, Cecilia G. Galarza

    Abstract: We analyze a binary hypothesis testing problem built on a wireless sensor network (WSN) for detecting a stationary random process distributed both in space and time with circularly-symmetric complex Gaussian distribution under the Neyman-Pearson framework. Using an analog scheme, the sensors transmit different linear combinations of their measurements through a multiple access channel (MAC) to rea… ▽ More

    Submitted 15 October, 2014; originally announced October 2014.

    Comments: This paper was submitted to IEEE Transactions on Signal Processing

  28. On Fundamental Trade-offs of Device-to-Device Communications in Large Wireless Networks

    Authors: Andrés Altieri, Pablo Piantanida, Leonardo Rey Vega, Cecilia G. Galarza

    Abstract: This paper studies the gains, in terms of served requests, attainable through out-of-band device-to-device (D2D) video exchanges in large cellular networks. A stochastic framework, in which users are clustered to exchange videos, is introduced, considering several aspects of this problem: the video-caching policy, user matching for exchanges, aspects regarding scheduling and transmissions. A famil… ▽ More

    Submitted 4 May, 2015; v1 submitted 9 May, 2014; originally announced May 2014.

    Comments: 33 pages, 9 figures. Updated version, to appear in IEEE Transactions on Wireless Communications

  29. arXiv:1403.7317  [pdf, other

    cs.IT

    On the Outage Probability of the Full-Duplex Interference-Limited Relay Channel

    Authors: Andres Altieri, Leonardo Rey Vega, Pablo Piantanida, Cecilia G. Galarza

    Abstract: In this paper, we study the performance, in terms of the asymptotic error probability, of a user which communicates with a destination with the aid of a full-duplex in-band relay. We consider that the network is interference-limited, and interfering users are distributed as a Poisson point process. In this case, the asymptotic error probability is upper bounded by the outage probability (OP). We i… ▽ More

    Submitted 15 May, 2014; v1 submitted 28 March, 2014; originally announced March 2014.

    Comments: 30 pages, 4 figures. Final version. To appear in IEEE JSAC Special Issue on Full-duplex Wireless Communications and Networks, 2014

  30. Analysis of a Cooperative Strategy for a Large Decentralized Wireless Network

    Authors: Andrés Altieri, Leonardo Rey Vega, Pablo Piantanida, Cecilia Galarza

    Abstract: This paper investigates the benefits of cooperation and proposes a relay activation strategy for a large wireless network with multiple transmitters. In this framework, some nodes cooperate with a nearby node that acts as a relay, using the decode-and-forward protocol, and others use direct transmission. The network is modeled as an independently marked Poisson point process and the source nodes m… ▽ More

    Submitted 12 June, 2013; v1 submitted 15 March, 2012; originally announced March 2012.

    Comments: Updated version. To appear in IEEE Transactions on Networking

  31. arXiv:1103.2172  [pdf, other

    cs.IT

    Cooperative Strategies for Interference-Limited Wireless Networks

    Authors: Andres Altieri, Leonardo Rey Vega, Cecilia G. Galarza, Pablo Piantanida

    Abstract: Consider the communication of a single-user aided by a nearby relay involved in a large wireless network where the nodes form an homogeneous Poisson point process. Since this network is interference-limited the asymptotic error probability is bounded from above by the outage probability experienced by the user. We investigate the outage behavior for the well-known cooperative schemes, namely, deco… ▽ More

    Submitted 10 March, 2011; originally announced March 2011.

    Comments: 5 pages and 4 figures. Submitted to ISIT 2011