Showing 1–50 of 60 results for author: Almeida, M

Searching in archive cs.
  1. arXiv:2406.13099  [pdf, other]

    cs.CV cs.LG

    Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models

    Authors: Paul Henderson, Melonie de Almeida, Daniela Ivanova, Titas Anciukevičius

    Abstract: We present a latent diffusion model over 3D scenes, that can be trained using only 2D image data. To achieve this, we first design an autoencoder that maps multi-view images to 3D Gaussian splats, and simultaneously builds a compressed latent representation of these splats. Then, we train a multi-view diffusion model over the latent space to learn an efficient generative model. This pipeline does…

    Submitted 18 June, 2024; originally announced June 2024.

  2. Momentary Stressor Logging and Reflective Visualizations: Implications for Stress Management with Wearables

    Authors: Sameer Neupane, Mithun Saha, Nasir Ali, Timothy Hnat, Shahin Alan Samiei, Anandatirtha Nandugudi, David M. Almeida, Santosh Kumar

    Abstract: Commercial wearables from Fitbit, Garmin, and Whoop have recently introduced real-time notifications based on detecting changes in physiological responses indicating potential stress. In this paper, we investigate how these new capabilities can be leveraged to improve stress management. We developed a smartwatch app, a smartphone app, and a cloud service, and conducted a 100-day field study with 1…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: In CHI '24 Proceedings of the CHI Conference on Human Factors in Computing Systems Honolulu, HI, USA

  3. A Generalized Multiscale Bundle-Based Hyperspectral Sparse Unmixing Algorithm

    Authors: Luciano Carvalho Ayres, Ricardo Augusto Borsoi, José Carlos Moreira Bermudez, Sérgio José Melo de Almeida

    Abstract: In hyperspectral sparse unmixing, a successful approach employs spectral bundles to address the variability of the endmembers in the spatial domain. However, the regularization penalties usually employed aggregate substantial computational complexity, and the solutions are very noise-sensitive. We generalize a multiscale spatial regularization approach to solve the unmixing problem by incorporatin…

    Submitted 23 January, 2024; originally announced January 2024.

  4. arXiv:2312.13784  [pdf, other]

    cs.NE cs.SI

    Benchmarking Evolutionary Community Detection Algorithms in Dynamic Networks

    Authors: Giordano Paoletti, Luca Gioacchini, Marco Mellia, Luca Vassio, Jussara M. Almeida

    Abstract: In dynamic complex networks, entities interact and form network communities that evolve over time. Among the many static Community Detection (CD) solutions, the modularity-based Louvain, or Greedy Modularity Algorithm (GMA), is widely employed in real-world applications due to its intuitiveness and scalability. Nevertheless, addressing CD in dynamic graphs remains an open problem, since the evolut…

    Submitted 11 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted at the 4th Workshop on Graphs and more Complex structures for Learning and Reasoning (GCLR) at AAAI 2024

    Journal ref: 4th Workshop on Graphs and more Complex structures for Learning and Reasoning (GCLR) at AAAI 2024

  5. arXiv:2309.08647  [pdf, other]

    cs.CL cs.AI

    Intent Detection at Scale: Tuning a Generic Model using Relevant Intents

    Authors: Nichal Narotamo, David Aparicio, Tiago Mesquita, Mariana Almeida

    Abstract: Accurately predicting the intent of customer support requests is vital for efficient support systems, enabling agents to quickly understand messages and prioritize responses accordingly. While different approaches exist for intent detection, maintaining separate client-specific or industry-specific models can be costly and impractical as the client base expands. This work proposes a system to sc…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 6 pages, 6 tables, 2 figures, ICMLA 2023

  6. arXiv:2308.14782  [pdf, other]

    cs.CY

    Helping Fact-Checkers Identify Fake News Stories Shared through Images on WhatsApp

    Authors: Julio C. S. Reis, Philipe Melo, Fabiano Belém, Fabricio Murai, Jussara M. Almeida, Fabricio Benevenuto

    Abstract: WhatsApp has introduced a novel avenue for smartphone users to engage with and disseminate news stories. The convenience of forming interest-based groups and seamlessly sharing content has rendered WhatsApp susceptible to the exploitation of misinformation campaigns. While the process of fact-checking remains a potent tool in identifying fabricated news, its efficacy falters in the face of the unp…

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: This is a preprint version of an accepted manuscript on the Brazilian Symposium on Multimedia and the Web (WebMedia). Please, consider to cite it instead of this one

  7. An explainable model to support the decision about the therapy protocol for AML

    Authors: Jade M. Almeida, Giovanna A. Castro, João A. Machado-Neto, Tiago A. Almeida

    Abstract: Acute Myeloid Leukemia (AML) is one of the most aggressive types of hematological neoplasm. To support the specialists' decision about the appropriate therapy, patients with AML receive a prognostic of outcomes according to their cytogenetic and molecular characteristics, often divided into three risk categories: favorable, intermediate, and adverse. However, the current risk classification has kn…

    Submitted 15 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Preprint of the paper accepted to be published in the Proc. of the 12th Brazilian Conference on Intelligent Systems (BRACIS'2023)

  8. arXiv:2306.15740  [pdf, other]

    cs.NI cs.CR cs.DC

    Impact of User Privacy and Mobility on Edge Offloading

    Authors: João Paulo Esper, Nadjib Achir, Kleber Vieira Cardoso, Jussara M. Almeida

    Abstract: Offloading high-demanding applications to the edge provides better quality of experience (QoE) for users with limited hardware devices. However, to maintain a competitive QoE, infrastructure, and service providers must adapt to users' different mobility patterns, which can be challenging, especially for location-based services (LBS). Another issue that needs to be tackled is the increasing demand…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 2023 Annual IEEE International Symposium on Personal, Indoor, and Mobile Radio Communications (IEEE PIMRC 2023)

  9. arXiv:2305.17321  [pdf, other]

    cs.NI cs.IT eess.SP

    Optimal Resource Allocation with Delay Guarantees for Network Slicing in Disaggregated RAN

    Authors: Flávio G. C. Rocha, Gabriel M. F. de Almeida, Kleber V. Cardoso, Cristiano B. Both, José F. de Rezende

    Abstract: In this article, we propose a novel formulation for the resource allocation problem of a sliced and disaggregated Radio Access Network (RAN) and its transport network. Our proposal assures an end-to-end delay bound for the Ultra-Reliable and Low-Latency Communication (URLLC) use case while jointly considering the number of admitted users, the transmission rate allocation per slice, the functional…

    Submitted 5 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 21 pages, 10 figures. For the associated GitHub repository, see https://github.com/LABORA-INF-UFG/paper-FGKCJ-2023

  10. arXiv:2301.02760  [pdf, other]

    cs.NI

    RIC-O: Efficient placement of a disaggregated and distributed RAN Intelligent Controller with dynamic clustering of radio nodes

    Authors: Gabriel M. Almeida, Gustavo Z. Bruno, Alexandre Huff, Matti Hiltunen, Elias P. Duarte Jr., Cristiano B. Both, Kleber V. Cardoso

    Abstract: The Radio Access Network (RAN) is the segment of cellular networks that provides wireless connectivity to end-users. O-RAN Alliance has been transforming the RAN industry by proposing open RAN specifications and the programmable Non-Real-Time and Near-Real-Time RAN Intelligent Controllers (Non-RT RIC and Near-RT RIC). Both RICs provide platforms for running applications called rApps and xApps, res…

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: 30 pages, 10 figures

  11. NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution

    Authors: Stylianos I. Venieris, Mario Almeida, Royson Lee, Nicholas D. Lane

    Abstract: In recent years, image and video delivery systems have begun integrating deep learning super-resolution (SR) approaches, leveraging their unprecedented visual enhancement capabilities while reducing reliance on networking conditions. Nevertheless, deploying these solutions on mobile devices still remains an active challenge as SR models are excessively demanding with respect to workload and memory…

    Submitted 14 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted for publication at the IEEE Transactions on Mobile Computing (TMC), 2023

  12. arXiv:2211.11928  [pdf, ps, other]

    cs.DC

    A case study of proactive auto-scaling for an ecommerce workload

    Authors: Marcella Medeiros Siqueira Coutinho de Almeida, Thiago Emmanuel Pereira, Fabio Morais

    Abstract: Preliminary data obtained from a partnership between the Federal University of Campina Grande and an ecommerce company indicates that some applications have issues when dealing with variable demand. This happens because a delay in scaling resources leads to performance degradation and, in literature, is a matter usually treated by improving the auto-scaling. To better understand the current state-…

    Submitted 21 November, 2022; originally announced November 2022.

  13. arXiv:2211.10322  [pdf, other]

    cs.LG

    Understanding the double descent curve in Machine Learning

    Authors: Luis Sa-Couto, Jose Miguel Ramos, Miguel Almeida, Andreas Wichert

    Abstract: The theory of bias-variance used to serve as a guide for model selection when applying Machine Learning algorithms. However, modern practice has shown success with over-parameterized models that were expected to overfit but did not. This led to the proposal of the double descent curve of performance by Belkin et al. Although it seems to describe a real, representative phenomenon, the field is lack…

    Submitted 18 November, 2022; originally announced November 2022.

  14. arXiv:2207.03522  [pdf, other]

    cs.LG cs.NE cs.SI physics.soc-ph stat.ML

    TF-GNN: Graph Neural Networks in TensorFlow

    Authors: Oleksandr Ferludin, Arno Eigenwillig, Martin Blais, Dustin Zelle, Jan Pfeifer, Alvaro Sanchez-Gonzalez, Wai Lok Sibon Li, Sami Abu-El-Haija, Peter Battaglia, Neslihan Bulut, Jonathan Halcrow, Filipe Miguel Gonçalves de Almeida, Pedro Gonnet, Liangze Jiang, Parth Kothari, Silvio Lattanzi, André Linhares, Brandon Mayer, Vahab Mirrokni, John Palowitch, Mihir Paradkar, Jennifer She, Anton Tsitsulin, Kevin Villela, Lisa Wang, et al. (2 additional authors not shown)

    Abstract: TensorFlow-GNN (TF-GNN) is a scalable library for Graph Neural Networks in TensorFlow. It is designed from the bottom up to support the kinds of rich heterogeneous graph data that occurs in today's information ecosystems. In addition to enabling machine learning researchers and advanced developers, TF-GNN offers low-code solutions to empower the broader developer community in graph learning. Many…

    Submitted 23 July, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

  15. arXiv:2205.10293  [pdf, other]

    cs.LG cs.SI

    DELATOR: Money Laundering Detection via Multi-Task Learning on Large Transaction Graphs

    Authors: Henrique S. Assumpção, Fabrício Souza, Leandro Lacerda Campos, Vinícius T. de Castro Pires, Paulo M. Laurentys de Almeida, Fabricio Murai

    Abstract: Money laundering has become one of the most relevant criminal activities in modern societies, as it causes massive financial losses for governments, banks and other institutions. Detecting such activities is among the top priorities when it comes to financial analysis, but current approaches are often costly and labor intensive partly due to the sheer amount of data to be analyzed. Hence, there is…

    Submitted 24 October, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in the 2022 IEEE International Conference on Big Data (IEEE BigData) as a short paper

  16. arXiv:2111.06161  [pdf, other]

    cs.NI cs.LG cs.SI

    Understanding mobility in networks: A node embedding approach

    Authors: Matheus F. C. Barros, Carlos H. G. Ferreira, Bruno Pereira dos Santos, Lourenço A. P. Júnior, Marco Mellia, Jussara M. Almeida

    Abstract: Motivated by the growing number of mobile devices capable of connecting and exchanging messages, we propose a methodology aiming to model and analyze node mobility in networks. We note that many existing solutions in the literature rely on topological measurements calculated directly on the graph of node contacts, aiming to capture the notion of the node's importance in terms of connectivity and m…

    Submitted 11 November, 2021; originally announced November 2021.

  17. arXiv:2109.13963  [pdf, other]

    cs.LG cs.PF

    Smart at what cost? Characterising Mobile Deep Neural Networks in the wild

    Authors: Mario Almeida, Stefanos Laskaridis, Abhinav Mehrotra, Lukasz Dudziak, Ilias Leontiadis, Nicholas D. Lane

    Abstract: With smartphones' omnipresence in people's pockets, Machine Learning (ML) on mobile is gaining traction as devices become more powerful. With applications ranging from visual filters to voice assistants, intelligence on mobile comes in many forms and facets. However, Deep Neural Network (DNN) inference remains a compute intensive workload, with devices struggling to support intelligence at the cos…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: Accepted at the ACM Internet Measurement Conference (IMC), 2021

  18. arXiv:2109.10462  [pdf, other]

    cs.SI cs.AI cs.CY cs.LG stat.CO

    A Hierarchical Network-Oriented Analysis of User Participation in Misinformation Spread on WhatsApp

    Authors: Gabriel Peres Nobre, Carlos H. G. Ferreira, Jussara M. Almeida

    Abstract: WhatsApp emerged as a major communication platform in many countries in the recent years. Despite offering only one-to-one and small group conversations, WhatsApp has been shown to enable the formation of a rich underlying network, crossing the boundaries of existing groups, and with structural properties that favor information dissemination at large. Indeed, WhatsApp has reportedly been used as a…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: Paper Accepted in Information Processing & Management, Elsevier

  19. On the Dynamics of Political Discussions on Instagram: A Network Perspective

    Authors: Carlos H. G. Ferreira, Fabricio Murai, Ana P. C. Silva, Jussara M. Almeida, Martino Trevisan, Luca Vassio, Marco Mellia, Idilio Drago

    Abstract: Instagram has been increasingly used as a source of information especially among the youth. As a result, political figures now leverage the platform to spread opinions and political agenda. We here analyze online discussions on Instagram, notably in political topics, from a network perspective. Specifically, we investigate the emergence of communities of co-commenters, that is, groups of users who…

    Submitted 13 September, 2022; v1 submitted 19 September, 2021; originally announced September 2021.

    Journal ref: Online Social Networks and Media, Volume 25, 2021, ISSN 2468-6964

  20. arXiv:2108.12214  [pdf, other]

    cs.DC cs.PF

    Machine Learning for Performance Prediction of Spark Cloud Applications

    Authors: Alexandre Maros, Fabricio Murai, Ana Paula Couto da Silva, Jussara M. Almeida, Marco Lattuada, Eugenio Gianniti, Marjan Hosseini, Danilo Ardagna

    Abstract: Big data applications and analytics are employed in many sectors for a variety of goals: improving customers satisfaction, predicting market behavior or improving processes in public health. These applications consist of complex software stacks that are often run on cloud systems. Predicting execution times is important for estimating the cost of cloud services and for effectively managing the und…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Published in 2019 IEEE 12th International Conference on Cloud Computing (CLOUD)

    ACM Class: B.8.2; I.2

  21. arXiv:2107.04702  [pdf]

    cs.NE

    Um Metodo para Busca Automatica de Redes Neurais Artificiais

    Authors: Anderson P. da Silva, Teresa B. Ludermir, Leandro M. Almeida

    Abstract: This paper describes a method that automatically searches Artificial Neural Networks using Cellular Genetic Algorithms. The main difference of this method for a common genetic algorithm is the use of a cellular automaton capable of providing the location for individuals, reducing the possibility of local minima in search space. This method employs an evolutionary search for simultaneous choices of…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: 13 pages, in Portuguese, 4 figures, 2 tables

  22. arXiv:2106.04805  [pdf, other]

    stat.ML cs.LG cs.SI math.PR

    Streaming Belief Propagation for Community Detection

    Authors: Yuchen Wu, MohammadHossein Bateni, Andre Linhares, Filipe Miguel Goncalves de Almeida, Andrea Montanari, Ashkan Norouzi-Fard, Jakab Tardos

    Abstract: The community detection problem requires to cluster the nodes of a network into a small number of well-connected "communities". There has been substantial recent progress in characterizing the fundamental statistical limits of community detection under simple stochastic block models. However, in real-world applications, the network structure is typically dynamic, with nodes that join over time. In…

    Submitted 10 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 36 pages, 13 figures

  23. Multi-task fully convolutional network for tree species mapping in dense forests using small training hyperspectral data

    Authors: Laura Elena Cué La Rosa, Camile Sothe, Raul Queiroz Feitosa, Cláudia Maria de Almeida, Marcos Benedito Schimalski, Dario Augusto Borges Oliveira

    Abstract: This work proposes a multi-task fully convolutional architecture for tree species mapping in dense forests from sparse and scarce polygon-level annotations using hyperspectral UAV-borne data. Our model implements a partial loss function that enables dense tree semantic labeling outcomes from non-dense training samples, and a distance regression complementary task that enforces tree crown boundary…

    Submitted 6 September, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Full version of preprint accepted at ISPRS Journal of Photogrammetry and Remote Sensing

  24. arXiv:2104.09949  [pdf, other]

    cs.DC cs.CV cs.LG

    DynO: Dynamic Onloading of Deep Neural Networks from Cloud to Device

    Authors: Mario Almeida, Stefanos Laskaridis, Stylianos I. Venieris, Ilias Leontiadis, Nicholas D. Lane

    Abstract: Recently, there has been an explosive growth of mobile and embedded applications using convolutional neural networks (CNNs). To alleviate their excessive computational demands, developers have traditionally resorted to cloud offloading, inducing high infrastructure costs and a strong dependence on networking conditions. On the other end, the emergence of powerful SoCs is gradually enabling on-devic…

    Submitted 11 January, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted for publication at the ACM Transactions on Embedded Computing Systems (TECS) in the special issue on Accelerating AI on the Edge

  25. arXiv:2103.00535  [pdf, other]

    cs.SI

    A multi-objective time series analysis of community mobility reduction comparing first and second COVID-19 waves

    Authors: Gabriela Cavalcante da Silva, Fernanda Monteiro de Almeida, Sabrina Oliveira, Leonardo C. T. Bezerra, Elizabeth F. Wanner, Ricardo H. C. Takahashi

    Abstract: With the logistic challenges faced by most countries for the production, distribution, and application of vaccines for the novel coronavirus disease (COVID-19), social distancing (SD) remains the most tangible approach to mitigate the spread of the virus. To assist SD monitoring, several tech companies have made publicly available anonymized mobility data. In this work, we conduct a multi-objectiv…

    Submitted 28 February, 2021; originally announced March 2021.

  26. arXiv:2102.13451  [pdf, other]

    cs.LG cs.DC

    FjORD: Fair and Accurate Federated Learning under heterogeneous targets with Ordered Dropout

    Authors: Samuel Horvath, Stefanos Laskaridis, Mario Almeida, Ilias Leontiadis, Stylianos I. Venieris, Nicholas D. Lane

    Abstract: Federated Learning (FL) has been gaining significant traction across different ML tasks, ranging from vision to keyboard predictions. In large-scale deployments, client heterogeneity is a fact and constitutes a primary problem for fairness, training performance and accuracy. Although significant efforts have been made into tackling statistical data heterogeneity, the diversity in the processing ca…

    Submitted 11 January, 2022; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: Accepted at the 35th Conference on Neural Information Processing Systems (NeurIPS), 2021

  27. arXiv:2102.13192  [pdf, other]

    cs.NI

    PlaceRAN: Optimal Placement of Virtualized Network Functions in the Next-generation Radio Access Networks

    Authors: Fernando Zanferrari Morais, Gabriel Matheus de Almeida, Leizer Pinto, Kleber Vieira Cardoso, Luis M. Contreras, Rodrigo da Rosa Righi, Cristiano Bonato Both

    Abstract: The fifth-generation mobile evolution enables several transformations on Next Generation Radio Access Networks (NG-RAN). The RAN protocol stack is splitting into eight possible disaggregated options combined into three network units, i.e., Central, Distributed, and Radio. Besides that, further advances allow the RAN software to be virtualized on top of general-purpose vendor-neutral hardware, deal…

    Submitted 28 March, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

  28. arXiv:2102.00461  [pdf, other]

    cs.CL stat.AP stat.ML

    Multilingual Email Zoning

    Authors: Bruno Jardim, Ricardo Rei, Mariana S. C. Almeida

    Abstract: The segmentation of emails into functional zones (also dubbed email zoning) is a relevant preprocessing step for most NLP tasks that deal with emails. However, despite the multilingual character of emails and their applications, previous literature regarding email zoning corpora and systems was developed essentially for English. In this paper, we analyse the existing email zoning corpora and pro…

    Submitted 13 February, 2021; v1 submitted 31 January, 2021; originally announced February 2021.

    Comments: Accepted at EACL 2021 SRW (https://sites.google.com/view/eaclsrw2021/home); 6 pages with 2 Figures and 8 Tables, plus references; Cleverly Multilingual Zoning Corpus available at https://github.com/cleverly-ai/multilingual-email-zoning

  29. arXiv:2011.09012  [pdf, other]

    cs.PL

    RustViz: Interactively Visualizing Ownership and Borrowing

    Authors: Gongming Luo, Vishnu Reddy, Marcelo Almeida, Yingying Zhu, Ke Du, Cyrus Omar

    Abstract: Rust is a systems programming language that guarantees memory safety without the need for a garbage collector by statically tracking ownership and borrowing events. The associated rules are subtle and unique among industry programming languages, which can make learning Rust more challenging. Motivated by the challenges that Rust learners face, we are developing RustViz, a tool that allows teachers…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: 9 pages, 3 figures. Presented at HATRA 2020 (Human Aspects of Types and Reasoning Assistants)

  30. arXiv:2010.06992  [pdf, other]

    cs.LG cs.AI cs.SI stat.ML

    InstantEmbedding: Efficient Local Node Representations

    Authors: Ştefan Postăvaru, Anton Tsitsulin, Filipe Miguel Gonçalves de Almeida, Yingtao Tian, Silvio Lattanzi, Bryan Perozzi

    Abstract: In this paper, we introduce InstantEmbedding, an efficient method for generating single-node representations using local PageRank computations. We theoretically prove that our approach produces globally consistent representations in sublinear time. We demonstrate this empirically by conducting extensive experiments on real-world datasets with over a billion edges. Our experiments confirm that Inst…

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: 23 pages, 9 figures

  31. arXiv:2009.11751  [pdf, ps, other]

    cs.CR cs.LG stat.ML

    BreachRadar: Automatic Detection of Points-of-Compromise

    Authors: Miguel Araujo, Miguel Almeida, Jaime Ferreira, Luis Silva, Pedro Bizarro

    Abstract: Bank transaction fraud results in over $13B annual losses for banks, merchants, and card holders worldwide. Much of this fraud starts with a Point-of-Compromise (a data breach or a skimming operation) where credit and debit card digital information is stolen, resold, and later used to perform fraud. We introduce this problem and present an automatic Points-of-Compromise (POC) detection procedure.…

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 9 pages, 10 figures, published in SIAM's 2017 International Conference on Data Mining (SDM17)

  32. arXiv:2008.06402  [pdf, other]

    cs.LG cs.CV cs.DC stat.ML

    SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

    Authors: Stefanos Laskaridis, Stylianos I. Venieris, Mario Almeida, Ilias Leontiadis, Nicholas D. Lane

    Abstract: Despite the soaring use of convolutional neural networks (CNNs) in mobile applications, uniformly sustaining high-performance inference on mobile has been elusive due to the excessive computational demands of modern CNNs and the increasing diversity of deployed devices. A popular alternative comprises offloading CNN processing to powerful cloud-based servers. Nevertheless, by relying on the cloud…

    Submitted 24 August, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: Accepted at the 26th Annual International Conference on Mobile Computing and Networking (MobiCom), 2020

  33. arXiv:2005.02443  [pdf, other]

    cs.CY cs.SI

    A Dataset of Fact-Checked Images Shared on WhatsApp During the Brazilian and Indian Elections

    Authors: Julio C. S. Reis, Philipe de Freitas Melo, Kiran Garimella, Jussara M. Almeida, Dean Eckles, Fabrício Benevenuto

    Abstract: Recently, messaging applications, such as WhatsApp, have been reportedly abused by misinformation campaigns, especially in Brazil and India. A notable form of abuse in WhatsApp relies on several manipulated images and memes containing all kinds of fake stories. In this work, we performed an extensive data collection from a large set of WhatsApp publicly accessible groups and fact-checking agency w…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 7 pages. This is a preprint version of an accepted paper on ICWSM'20. Please, consider to cite the conference version instead of this one

  34. arXiv:2002.09963  [pdf, other]

    cs.LG stat.ML

    Mitigating Class Boundary Label Uncertainty to Reduce Both Model Bias and Variance

    Authors: Matthew Almeida, Wei Ding, Scott Crouter, Ping Chen

    Abstract: The study of model bias and variance with respect to decision boundaries is critically important in supervised classification. There is generally a tradeoff between the two, as fine-tuning of the decision boundary of a classification model to accommodate more boundary training samples (i.e., higher model complexity) may improve training accuracy (i.e., lower bias) but hurt generalization against u…

    Submitted 23 February, 2020; originally announced February 2020.

  35. arXiv:2002.05988  [pdf, other]

    cs.LG cs.CR stat.ML

    Interleaved Sequence RNNs for Fraud Detection

    Authors: Bernardo Branco, Pedro Abreu, Ana Sofia Gomes, Mariana S. C. Almeida, João Tiago Ascensão, Pedro Bizarro

    Abstract: Payment card fraud causes multibillion dollar losses for banks and merchants worldwide, often fueling complex criminal activities. To address this, many real-time fraud detection systems use tree-based models, demanding complex feature engineering systems to efficiently enrich transactions with historical data while complying with millisecond-level latencies. In this work, we do not require thos…

    Submitted 17 June, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: 9 pages, 4 figures, to appear in SIGKDD'20 Industry Track

  36. Super-resolution of multispectral satellite images using convolutional neural networks

    Authors: M. U. Müller, N. Ekhtiari, R. M. Almeida, C. Rieke

    Abstract: Super-resolution aims at increasing image resolution by algorithmic means and has progressed over the recent years due to advances in the fields of computer vision and deep learning. Convolutional Neural Networks based on a variety of architectures have been applied to the problem, e.g. autoencoders and residual networks. While most research focuses on the processing of photographs consisting only…

    Submitted 8 April, 2020; v1 submitted 3 February, 2020; originally announced February 2020.

    Comments: To be published in the ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences: https://www.isprs.org/publications/annals.aspx, proceedings of the XXIV ISPRS Congress, 14-20 June 2020, Nice, France

    MSC Class: 68-06 ACM Class: I.4.3

    Journal ref: ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci., V-1-2020, 33-40

  37. arXiv:1908.09015  [pdf, other]

    cs.DC cs.CR

    Towards Secure and Decentralized Sharing of IoT Data

    Authors: Hien Thi Thu Truong, Miguel Almeida, Ghassan Karame, Claudio Soriente

    Abstract: The Internet of Things (IoT) bears unprecedented security and scalability challenges due to the magnitude of data produced and exchanged by IoT devices and platforms. Some of those challenges are currently being addressed by coupling IoT applications with blockchains. However, current blockchain-backed IoT systems simply use the blockchain to store access control policies, thereby underutilizing t…

    Submitted 23 August, 2019; originally announced August 2019.

  38. arXiv:1906.10513  [pdf, other]

    cs.RO

    The Role of Compute in Autonomous Aerial Vehicles

    Authors: Behzad Boroujerdian, Hasan Genc, Srivatsan Krishnan, Bardienus Pieter Duisterhof, Brian Plancher, Kayvan Mansoorshahi, Marcelino Almeida, Wenzhi Cui, Aleksandra Faust, Vijay Janapa Reddi

    Abstract: Autonomous-mobile cyber-physical machines are part of our future. Specifically, unmanned-aerial-vehicles have seen a resurgence in activity with use-cases such as package delivery. These systems face many challenges such as their low-endurance caused by limited onboard-energy, hence, improving the mission-time and energy are of importance. Such improvements traditionally are delivered through bett…

    Submitted 23 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1905.06388

  39. arXiv:1906.06240  [pdf, other]

    cs.DC cs.PF

    Diffusing Your Mobile Apps: Extending In-Network Function Virtualization to Mobile Function Offloading

    Authors: Mario Almeida, Liang Wang, Jeremy Blackburn, Konstantina Papagiannaki, Jon Crowcroft

    Abstract: Motivated by the huge disparity between the limited battery capacity of user devices and the ever-growing energy demands of modern mobile apps, we propose INFv. It is the first offloading system able to cache, migrate and dynamically execute on demand functionality from mobile devices in ISP networks. It aims to bridge this gap by extending the promising NFV paradigm to mobile applications in orde…

    Submitted 14 June, 2019; originally announced June 2019.

  40. arXiv:1905.07346  [pdf, other]

    cs.LG cs.PF stat.ML

    EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices

    Authors: Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, Nicholas D. Lane

    Abstract: In recent years, advances in deep learning have resulted in unprecedented leaps in diverse tasks spanning from speech and object recognition to context awareness and health monitoring. As a result, an increasing number of AI-enabled applications are being developed targeting ubiquitous and mobile devices. While deep neural networks (DNNs) are getting bigger and more complex, they also impose a hea…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: Accepted at MobiSys 2019: 3rd International Workshop on Embedded and Mobile Deep Learning (EMDL), 2019

  41. Towards Understanding Political Interactions on Instagram

    Authors: Martino Trevisan, Luca Vassio, Idilio Drago, Marco Mellia, Fabricio Murai, Flavio Figueiredo, Ana Paula Couto da Silva, Jussara M. Almeida

    Abstract: Online Social Networks (OSNs) allow personalities and companies to communicate directly with the public, bypassing filters of traditional media. As people rely on OSNs to stay up-to-date, the political debate has moved online too. We witness the sudden explosion of harsh political debates and the dissemination of rumours in OSNs. Identifying such behaviour requires a deep understanding on how peo…

    Submitted 4 May, 2021; v1 submitted 26 April, 2019; originally announced April 2019.

    Comments: 5 pages, 8 figures, Proceedings of the 30th ACM Conference on Hypertext and Social Media, https://dl.acm.org/doi/10.1145/3342220.3343657

    Journal ref: HT19: Proceedings of the 30th ACM Conference on Hypertext and Social Media. September 2019. Pages 247-251. Association for Computing Machinery

  42. Whole slide image registration for the study of tumor heterogeneity

    Authors: Leslie Solorzano, Gabriela M. Almeida, Bárbara Mesquita, Diana Martins, Carla Oliveira, Carolina Wählby

    Abstract: Consecutive thin sections of tissue samples make it possible to study local variation in e.g. protein expression and tumor heterogeneity by staining for a new protein in each section. In order to compare and correlate patterns of different proteins, the images have to be registered with high accuracy. The problem we want to solve is registration of gigapixel whole slide images (WSI). This presents…

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: MICCAI2018 - Computational Pathology and Ophthalmic Medical Image Analysis - COMPAY

    Journal ref: vol 11039, 2018, p95-102

  43. Analyzing Ideological Communities in Congressional Voting Networks

    Authors: Carlos H. G. Ferreira, Breno de Souza Matos, Jussara M. Almeida

    Abstract: We here study the behavior of political party members aiming at identifying how ideological communities are created and evolve over time in diverse (fragmented and non-fragmented) party systems. Using public voting data of both Brazil and the US, we propose a methodology to identify and characterize ideological communities, their member polarization, and how such communities evolve over time, cove…

    Submitted 29 October, 2018; originally announced October 2018.

  44. arXiv:1803.03448  [pdf, other]

    cs.CR

    A Family of Droids -- Android Malware Detection via Behavioral Modeling: Static vs Dynamic Analysis

    Authors: Lucky Onwuzurike, Mario Almeida, Enrico Mariconti, Jeremy Blackburn, Gianluca Stringhini, Emiliano De Cristofaro

    Abstract: Following the increasing popularity of mobile ecosystems, cybercriminals have increasingly targeted them, designing and distributing malicious apps that steal information or cause harm to the device's owner. Aiming to counter them, detection techniques based on either static or dynamic analysis that model Android malware, have been proposed. While the pros and cons of these analysis techniques are…

    Submitted 13 July, 2018; v1 submitted 9 March, 2018; originally announced March 2018.

    Comments: A preliminary version of this paper appears in the Proceedings of 16th Annual Conference on Privacy, Security and Trust (PST 2018). This is the full version

  45. arXiv:1703.06288  [pdf, other]

    cs.SI

    Gender Matters! Analyzing Global Cultural Gender Preferences for Venues Using Social Sensing

    Authors: Willi Mueller, Thiago H Silva, Jussara M Almeida, Antonio A F Loureiro

    Abstract: Gender differences is a phenomenon around the world actively researched by social scientists. Traditionally, the data used to support such studies is manually obtained, often through surveys with volunteers. However, due to their inherent high costs because of manual steps, such traditional methods do not quickly scale to large-size studies. We here investigate a particular aspect of gender differ…

    Submitted 18 March, 2017; originally announced March 2017.

  46. Reducing Nondeterministic Tree Automata by Adding Transitions

    Authors: Ricardo Manuel de Oliveira Almeida

    Abstract: We introduce saturation of nondeterministic tree automata, a technique that consists of adding new transitions to an automaton while preserving its language. We implemented our algorithm on minotaut - a module of the tree automata library libvata that reduces the size of automata by merging states and removing superfluous transitions - and we show how saturation can make subsequent merge and trans…

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: In Proceedings MEMICS 2016, arXiv:1612.04037

    Journal ref: EPTCS 233, 2016, pp. 33-51

  47. arXiv:1604.07890  [pdf, other]

    cs.SI

    Understanding Video-Ad Consumption on YouTube: A Measurement Study on User Behavior, Popularity, and Content Properties

    Authors: Mariana Arantes, Flavio Figueiredo, Jussara M. Almeida

    Abstract: Faced with the challenge of attracting user attention and revenue, social media websites have turned to video advertisements (video-ads). While in traditional media the video-ad market is mostly based on an interaction between content providers and marketers, the use of video-ads in social media has enabled a more complex interaction, that also includes content creator and viewer preferences. To b…

    Submitted 26 April, 2016; originally announced April 2016.

    Comments: To Appear at WebSci 16

  48. arXiv:1604.01303  [pdf, other]

    cs.NI cs.PF

    C3PO: Computation Congestion Control (PrOactive) - an algorithm for dynamic diffusion of ephemeral in-network services

    Authors: Liang Wang, Mario Almeida, Jeremy Blackburn, Jon Crowcroft

    Abstract: There is an obvious trend that more and more data and computation are migrating into networks nowadays. Combining mature virtualization technologies with service-centric networking, we are entering into an era where countless services reside in an ISP network to provide low-latency access. Such services are often computation intensive and are dynamically created and destroyed on demands everywhe…

    Submitted 6 April, 2016; v1 submitted 5 April, 2016; originally announced April 2016.

  49. arXiv:1408.7094  [pdf, other]

    cs.SI physics.soc-ph

    Improving the Effectiveness of Content Popularity Prediction Methods using Time Series Trends

    Authors: Flavio Figueiredo, Marcos André Gonçalves, Jussara M. Almeida

    Abstract: We here present a simple and effective model to predict the popularity of web content. Our solution, which is the winner of two of the three tasks of the ECML/PKDD 2014 Predictive Analytics Challenge, aims at predicting user engagement metrics, such as number of visits and social network engagement, that a web page will achieve 48 hours after its upload, using only information available in the fir…

    Submitted 29 August, 2014; originally announced August 2014.

    Comments: Presented on the ECML/PKDD Discovery Challenge on Predictive Analytics. Winner of two out of three tasks of the Predictive Analytics Discovery Challenge

    ACM Class: H.3.5

  50. arXiv:1405.1459  [pdf, other]

    cs.SI physics.soc-ph

    Revisit Behavior in Social Media: The Phoenix-R Model and Discoveries

    Authors: Flavio Figueiredo, Jussara M. Almeida, Yasuko Matsubara, Bruno Ribeiro, Christos Faloutsos

    Abstract: How many listens will an artist receive on an online radio? How about plays on a YouTube video? How many of these visits are new or returning users? Modeling and mining popularity dynamics of social activity has important implications for researchers, content creators and providers. We here investigate the effect of revisits (successive visits from a single user) on content popularity. Using four d…

    Submitted 22 June, 2014; v1 submitted 6 May, 2014; originally announced May 2014.

    Comments: To appear on European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases 2014