Skip to main content

Showing 1–38 of 38 results for author: Passos, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.07310  [pdf, other

    cs.CV cs.LG

    BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis

    Authors: Leandro A. Passos, Douglas Rodrigues, Danilo Jodas, Kelton A. P. Costa, Ahsan Adeel, João Paulo Papa

    Abstract: This paper presents BioNeRF, a biologically plausible architecture that models scenes in a 3D representation and synthesizes new views through radiance fields. Since NeRF relies on the network weights to store the scene's 3-dimensional representation, BioNeRF implements a cognitive-inspired mechanism that fuses inputs from multiple sources into a memory-like structure, improving the storing capaci… ▽ More

    Submitted 25 March, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  2. arXiv:2311.16711  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    LEDITS++: Limitless Image Editing using Text-to-Image Models

    Authors: Manuel Brack, Felix Friedrich, Katharina Kornmeier, Linoy Tsaban, Patrick Schramowski, Kristian Kersting, Apolinário Passos

    Abstract: Text-to-image diffusion models have recently received increasing interest for their astonishing ability to produce high-fidelity images from solely text inputs. Subsequent research efforts aim to exploit and apply their capabilities to real image editing. However, existing image-to-image methods are often inefficient, imprecise, and of limited versatility. They either require time-consuming finetu… ▽ More

    Submitted 25 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) The project page is available at https://leditsplusplus-project.static.hf.space

  3. arXiv:2311.05556  [pdf, other

    cs.CV cs.LG

    LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

    Authors: Simian Luo, Yiqin Tan, Suraj Patil, Daniel Gu, Patrick von Platen, Apolinário Passos, Longbo Huang, Jian Li, Hang Zhao

    Abstract: Latent Consistency Models (LCMs) have achieved impressive performance in accelerating text-to-image generative tasks, producing high-quality images with minimal inference steps. LCMs are distilled from pre-trained latent diffusion models (LDMs), requiring only ~32 A100 GPU training hours. This report further extends LCMs' potential in two aspects: First, by applying LoRA distillation to Stable-Dif… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Technical Report

  4. Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification

    Authors: Mateus Roder, Leandro Aparecido Passos, João Paulo Papa, André Luis Debiaso Rossi

    Abstract: Quality classification of wood boards is an essential task in the sawmill industry, which is still usually performed by human operators in small to median companies in develo** countries. Machine learning algorithms have been successfully employed to investigate the problem, offering a more affordable alternative compared to other solutions. However, such approaches usually present some drawback… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  5. arXiv:2307.00522  [pdf, other

    cs.CV cs.AI cs.LG

    LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance

    Authors: Linoy Tsaban, Apolinário Passos

    Abstract: Recent large-scale text-guided diffusion models provide powerful image-generation capabilities. Currently, a significant effort is given to enable the modification of these images using text only as means to offer intuitive and versatile editing. However, editing proves to be difficult for these generative models due to the inherent nature of editing techniques, which involves preserving certain c… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: 8 pages, 5 figures, 1 table. This report builds up on the works introduced in - arXiv:2304.06140, arXiv:2301.12247

  6. arXiv:2305.10403  [pdf, other

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yan** Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yu**g Zhang, Gustavo Hernandez Abrego , et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  7. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  8. Enhancing Hyper-To-Real Space Projections Through Euclidean Norm Meta-Heuristic Optimization

    Authors: Luiz C. F. Ribeiro, Mateus Roder, Gustavo H. de Rosa, Leandro A. Passos, João P. Papa

    Abstract: The continuous computational power growth in the last decades has made solving several optimization problems significant to humankind a tractable task; however, tackling some of them remains a challenge due to the overwhelming amount of candidate solutions to be evaluated, even by using sophisticated algorithms. In such a context, a set of nature-inspired stochastic methods, called meta-heuristic… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  9. DDIPNet and DDIPNet+: Discriminant Deep Image Prior Networks for Remote Sensing Image Classification

    Authors: Daniel F. S. Santos, Rafael G. Pires, Leandro A. Passos, João P. Papa

    Abstract: Research on remote sensing image classification significantly impacts essential human routine tasks such as urban planning and agriculture. Nowadays, the rapid advance in technology and the availability of many high-quality remote sensing images create a demand for reliable automation methods. The current paper proposes two novel deep learning-based architectures for image classification purposes,… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Published in: 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS

  10. FEMa-FS: Finite Element Machines for Feature Selection

    Authors: Lucas Biaggi, João P. Papa, Kelton A. P Costa, Danillo R. Pereira, Leandro A. Passos

    Abstract: Identifying anomalies has become one of the primary strategies towards security and protection procedures in computer networks. In this context, machine learning-based methods emerge as an elegant solution to identify such scenarios and learn irrelevant information so that a reduction in the identification time and possible gain in accuracy can be obtained. This paper proposes a novel feature sele… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  11. From Actions to Events: A Transfer Learning Approach Using Improved Deep Belief Networks

    Authors: Mateus Roder, Jurandy Almeida, Gustavo H. de Rosa, Leandro A. Passos, André L. D. Rossi, João P. Papa

    Abstract: In the last decade, exponential data growth supplied machine learning-based algorithms' capacity and enabled their usage in daily-life activities. Additionally, such an improvement is partially explained due to the advent of deep learning techniques, i.e., stacks of simple architectures that end up in more complex models. Although both factors produce outstanding results, they also pose drawbacks… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

  12. arXiv:2210.15458  [pdf, other

    cs.CL cs.LG stat.ML

    Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models

    Authors: Luke Vilnis, Yury Zemlyanskiy, Patrick Murray, Alexandre Passos, Sumit Sanghai

    Abstract: Decoding methods for large language models often trade-off between diversity of outputs and parallelism of computation. Methods such as beam search and Gumbel top-k sampling can guarantee a different output for each element of the beam, but are not easy to parallelize. Alternatively, methods such as temperature sampling and its modifications (top-k sampling, nucleus sampling, typical decoding, and… ▽ More

    Submitted 1 June, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 17 pages, to appear at ICML 2023

  13. ComplexWoundDB: A Database for Automatic Complex Wound Tissue Categorization

    Authors: Talita A. Pereira, Regina C. Popim, Leandro A. Passos, Danillo R. Pereira, Clayton R. Pereira, João P. Papa

    Abstract: Complex wounds usually face partial or total loss of skin thickness, healing by secondary intention. They can be acute or chronic, figuring infections, ischemia and tissue necrosis, and association with systemic diseases. Research institutes around the globe report countless cases, ending up in a severe public health problem, for they involve human resources (e.g., physicians and health care profe… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  14. PL-kNN: A Parameterless Nearest Neighbors Classifier

    Authors: Danilo Samuel Jodas, Leandro Aparecido Passos, Ahsan Adeel, João Paulo Papa

    Abstract: Demands for minimum parameter setup in machine learning models are desirable to avoid time-consuming optimization processes. The $k$-Nearest Neighbors is one of the most effective and straightforward models employed in numerous problems. Despite its well-known performance, it requires the value of $k$ for specific data distribution, thus demanding expensive computational efforts. This paper propos… ▽ More

    Submitted 30 September, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

  15. arXiv:2209.03275  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Multimodal Speech Enhancement Using Burst Propagation

    Authors: Mohsin Raza, Leandro A. Passos, Ahmed Khubaib, Ahsan Adeel

    Abstract: This paper proposes the MBURST, a novel multimodal solution for audio-visual speech enhancements that consider the most recent neurological discoveries regarding pyramidal cells of the prefrontal cortex and other brain regions. The so-called burst propagation implements several criteria to address the credit assignment problem in a more biologically plausible manner: steering the sign and magnitud… ▽ More

    Submitted 5 February, 2024; v1 submitted 7 September, 2022; originally announced September 2022.

  16. arXiv:2206.02671  [pdf, ps, other

    cs.SD cs.CV eess.AS

    Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Audio-Visual Hearing Aids

    Authors: Leandro A. Passos, João Paulo Papa, Amir Hussain, Ahsan Adeel

    Abstract: Despite the recent success of machine learning algorithms, most models face drawbacks when considering more complex tasks requiring interaction between different sources, such as multimodal input data and logical time sequences. On the other hand, the biological brain is highly sharpened in this sense, empowered to automatically manage and integrate such streams of information. In this context, th… ▽ More

    Submitted 31 January, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

  17. A Novel Approach for Optimum-Path Forest Classification Using Fuzzy Logic

    Authors: Renato W. R. de Souza, João V. C. de Oliveira, Leandro A. Passos, Wei** Ding, João P. Papa, Victor Hugo C. de Albuquerque

    Abstract: In the past decades, fuzzy logic has played an essential role in many research areas. Alongside, graph-based pattern recognition has shown to be of great importance due to its flexibility in partitioning the feature space using the background from graph theory. Some years ago, a new framework for both supervised, semi-supervised, and unsupervised learning named Optimum-Path Forest (OPF) was propos… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Journal ref: IEEE Transactions on Fuzzy Systems 28.12 (2019): 3076-3086

  18. arXiv:2203.17189  [pdf, other

    cs.LG cs.CL

    Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$

    Authors: Adam Roberts, Hyung Won Chung, Anselm Levskaya, Gaurav Mishra, James Bradbury, Daniel Andor, Sharan Narang, Brian Lester, Colin Gaffney, Afroz Mohiuddin, Curtis Hawthorne, Aitor Lewkowycz, Alex Salcianu, Marc van Zee, Jacob Austin, Sebastian Goodman, Livio Baldini Soares, Haitang Hu, Sasha Tsvyashchenko, Aakanksha Chowdhery, Jasmijn Bastings, Jannis Bulian, Xavier Garcia, Jianmo Ni, Andrew Chen , et al. (18 additional authors not shown)

    Abstract: Recent neural network-based language models have benefited greatly from scaling up the size of training datasets and the number of parameters in the models themselves. Scaling can be complicated due to various factors including the need to distribute computation on supercomputer clusters (e.g., TPUs), prevent bottlenecks when infeeding data, and ensure reproducible results. In this work, we presen… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

  19. arXiv:2203.13856  [pdf, other

    eess.IV cs.CV cs.LG

    Robust deep learning for eye fundus images: Bridging real and synthetic data for enhancing generalization

    Authors: Guilherme C. Oliveira, Gustavo H. Rosa, Daniel C. G. Pedronette, João P. Papa, Himeesh Kumar, Leandro A. Passos, Dinesh Kumar

    Abstract: Deep learning applications for assessing medical images are limited because the datasets are often small and imbalanced. The use of synthetic data has been proposed in the literature, but neither a robust comparison of the different methods nor generalizability has been reported. Our approach integrates a retinal image quality assessment model and StyleGAN2 architecture to enhance Age-related Macu… ▽ More

    Submitted 3 April, 2024; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted by the Biomedical Signal Processing and Control

    Journal ref: Biomedical Signal Processing and Control, 94 (2024), 106263

  20. arXiv:2203.02740  [pdf, ps, other

    cs.LG cs.CV

    MaxDropoutV2: An Improved Method to Drop out Neurons in Convolutional Neural Networks

    Authors: Claudio Filipi Goncalves do Santos, Mateus Roder, Leandro A. Passos, João P. Papa

    Abstract: In the last decade, exponential data growth supplied the machine learning-based algorithms' capacity and enabled their usage in daily life activities. Additionally, such an improvement is partially explained due to the advent of deep learning techniques, i.e., stacks of simple architectures that end up in more complex models. Although both factors produce outstanding results, they also pose drawba… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

  21. Handling Imbalanced Datasets Through Optimum-Path Forest

    Authors: Leandro Aparecido Passos, Danilo S. Jodas, Luiz C. F. Ribeiro, Marco Akio, Andre Nunes de Souza, João Paulo Papa

    Abstract: In the last decade, machine learning-based approaches became capable of performing a wide range of complex tasks sometimes better than humans, demanding a fraction of the time. Such an advance is partially due to the exponential growth in the amount of data available, which makes it possible to extract trustworthy real-world information from them. However, such data is generally imbalanced since s… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  22. A Review of Deep Learning-based Approaches for Deepfake Content Detection

    Authors: Leandro A. Passos, Danilo Jodas, Kelton A. P. da Costa, Luis A. Souza Júnior, Douglas Rodrigues, Javier Del Ser, David Camacho, João Paulo Papa

    Abstract: Recent advancements in deep learning generative models have raised concerns as they can create highly convincing counterfeit images and videos. This poses a threat to people's integrity and can lead to social instability. To address this issue, there is a pressing need to develop new computational models that can efficiently detect forged content and alert users to potential image and video manipu… ▽ More

    Submitted 15 February, 2024; v1 submitted 12 February, 2022; originally announced February 2022.

  23. Multimodal Audio-Visual Information Fusion using Canonical-Correlated Graph Neural Network for Energy-Efficient Speech Enhancement

    Authors: Leandro Aparecido Passos, João Paulo Papa, Javier Del Ser, Amir Hussain, Ahsan Adeel

    Abstract: This paper proposes a novel multimodal self-supervised architecture for energy-efficient audio-visual (AV) speech enhancement that integrates Graph Neural Networks with canonical correlation analysis (CCA-GNN). The proposed approach lays its foundations on a state-of-the-art CCA-GNN that learns representative embeddings by maximizing the correlation between pairs of augmented views of the same inp… ▽ More

    Submitted 16 September, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

  24. arXiv:2201.03323  [pdf, other

    cs.CV cs.LG

    Gait Recognition Based on Deep Learning: A Survey

    Authors: Claudio Filipi Gonçalves dos Santos, Diego de Souza Oliveira, Leandro A. Passos, Rafael Gonçalves Pires, Daniel Felipe Silva Santos, Lucas Pascotti Valem, Thierry P. Moreira, Marcos Cleison S. Santana, Mateus Roder, João Paulo Papa, Danilo Colombo

    Abstract: In general, biometry-based control systems may not rely on individual expected behavior or cooperation to operate appropriately. Instead, such systems should be aware of malicious procedures for unauthorized access attempts. Some works available in the literature suggest addressing the problem through gait recognition approaches. Such methods aim at identifying human beings through intrinsic perce… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

  25. arXiv:2112.08634  [pdf, other

    cs.CL

    FRUIT: Faithfully Reflecting Updated Information in Text

    Authors: Robert L. Logan IV, Alexandre Passos, Sameer Singh, Ming-Wei Chang

    Abstract: Textual knowledge bases such as Wikipedia require considerable effort to keep up to date and consistent. While automated writing assistants could potentially ease this burden, the problem of suggesting edits grounded in external knowledge has been under-explored. In this paper, we introduce the novel generation task of *faithfully reflecting updated information in text* (FRUIT) where the goal is t… ▽ More

    Submitted 13 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: v2.0, NAACL 2022

  26. A Layer-Wise Information Reinforcement Approach to Improve Learning in Deep Belief Networks

    Authors: Mateus Roder, Leandro A. Passos, Luiz Carlos Felix Ribeiro, Clayton Pereira, João Paulo Papa

    Abstract: With the advent of deep learning, the number of works proposing new methods or improving existent ones has grown exponentially in the last years. In this scenario, "very deep" models were emerging, once they were expected to extract more intrinsic and abstract features while supporting a better performance. However, such models suffer from the gradient vanishing problem, i.e., backpropagation valu… ▽ More

    Submitted 17 January, 2021; originally announced January 2021.

  27. Intestinal Parasites Classification Using Deep Belief Networks

    Authors: Mateus Roder, Leandro A. Passos, Luiz Carlos Felix Ribeiro, Barbara Caroline Benato, Alexandre Xavier Falcão, João Paulo Papa

    Abstract: Currently, approximately $4$ billion people are infected by intestinal parasites worldwide. Diseases caused by such infections constitute a public health problem in most tropical countries, leading to physical and mental disorders, and even death to children and immunodeficient individuals. Although subjected to high error rates, human visual inspection is still in charge of the vast majority of c… ▽ More

    Submitted 17 January, 2021; originally announced January 2021.

  28. A Metaheuristic-Driven Approach to Fine-Tune Deep Boltzmann Machines

    Authors: Leandro Aparecido Passos, João Paulo Papa

    Abstract: Deep learning techniques, such as Deep Boltzmann Machines (DBMs), have received considerable attention over the past years due to the outstanding results concerning a variable range of domains. One of the main shortcomings of these techniques involves the choice of their hyperparameters, since they have a significant impact on the final results. This work addresses the issue of fine-tuning hyperpa… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

    Comments: 30 pages, 7 figures

    Journal ref: Applied Soft Computing 97 (2020): 105717

  29. $\text{O}^2$PF: Oversampling via Optimum-Path Forest for Breast Cancer Detection

    Authors: Leandro Aparecido Passos, Danilo Samuel Jodas, Luiz C. F. Ribeiro, Thierry Pinheiro, João P. Papa

    Abstract: Breast cancer is among the most deadly diseases, distressing mostly women worldwide. Although traditional methods for detection have presented themselves as valid for the task, they still commonly present low accuracies and demand considerable time and effort from professionals. Therefore, a computer-aided diagnosis (CAD) system capable of providing early detection becomes hugely desirable. In the… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

    Comments: 6 pages, 3 figures. 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS)

  30. arXiv:1912.09910  [pdf, other

    cs.IR

    Report on the First HIPstIR Workshop on the Future of Information Retrieval

    Authors: Laura Dietz, Bhaskar Mitra, Jeremy Pickens, Hana Anber, Sandeep Avula, Asia Biega, Adrian Boteanu, Shubham Chatterjee, Jeff Dalton, Shiri Dori-Hacohen, John Foley, Henry Feild, Ben Gamari, Rosie Jones, Pallika Kanani, Sumanta Kashyapi, Widad Machmouchi, Matthew Mitsui, Steve Nole, Alexandre Tachard Passos, Jordan Ramsdell, Adam Roegiest, David Smith, Alessandro Sordoni

    Abstract: The vision of HIPstIR is that early stage information retrieval (IR) researchers get together to develop a future for non-mainstream ideas and research agendas in IR. The first iteration of this vision materialized in the form of a three day workshop in Portsmouth, New Hampshire attended by 24 researchers across academia and industry. Attendees pre-submitted one or more topics that they want to pi… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

  31. arXiv:1907.05550  [pdf, other

    cs.LG

    Faster Neural Network Training with Data Echoing

    Authors: Dami Choi, Alexandre Passos, Christopher J. Shallue, George E. Dahl

    Abstract: In the twilight of Moore's law, GPUs and other specialized hardware accelerators have dramatically sped up neural network training. However, earlier stages of the training pipeline, such as disk I/O and data preprocessing, do not run on accelerators. As accelerators continue to improve, these earlier stages will increasingly become the bottleneck. In this paper, we introduce "data echoing," which… ▽ More

    Submitted 7 May, 2020; v1 submitted 11 July, 2019; originally announced July 2019.

  32. arXiv:1903.01855  [pdf, other

    cs.PL cs.LG

    TensorFlow Eager: A Multi-Stage, Python-Embedded DSL for Machine Learning

    Authors: Akshay Agrawal, Akshay Naresh Modi, Alexandre Passos, Allen Lavoie, Ashish Agarwal, Asim Shankar, Igor Ganichev, Josh Levenberg, Mingsheng Hong, Rajat Monga, Shanqing Cai

    Abstract: TensorFlow Eager is a multi-stage, Python-embedded domain-specific language for hardware-accelerated machine learning, suitable for both interactive research and production. TensorFlow, which TensorFlow Eager extends, requires users to represent computations as dataflow graphs; this permits compiler optimizations and simplifies deployment but hinders rapid prototy** and run-time dynamism. Tensor… ▽ More

    Submitted 26 February, 2019; originally announced March 2019.

    Journal ref: Proc. of the 2nd SysML Conference, 2019

  33. arXiv:1804.03235  [pdf, other

    cs.LG cs.AI stat.ML

    Large scale distributed neural network training through online distillation

    Authors: Rohan Anil, Gabriel Pereyra, Alexandre Passos, Robert Ormandi, George E. Dahl, Geoffrey E. Hinton

    Abstract: Techniques such as ensembling and distillation promise model quality improvements when paired with almost any base model. However, due to increased test-time cost (for ensembles) and increased complexity of the training pipeline (for distillation), these techniques are challenging to use in industrial settings. In this paper we explore a variant of distillation which is relatively straightforward… ▽ More

    Submitted 20 August, 2020; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: Clarify that implementations should use available parallelism in pseudo-code

  34. arXiv:1504.06654  [pdf, other

    cs.CL stat.ML

    Efficient Non-parametric Estimation of Multiple Embeddings per Word in Vector Space

    Authors: Arvind Neelakantan, Jeevan Shankar, Alexandre Passos, Andrew McCallum

    Abstract: There is rising interest in vector-space word embeddings and their use in NLP, especially given recent methods for their fast estimation at very large scale. Nearly all this work, however, assumes a single vector per word type ignoring polysemy and thus jeopardizing their usefulness for downstream tasks. We present an extension to the Skip-gram model that efficiently learns multiple embeddings per… ▽ More

    Submitted 24 April, 2015; originally announced April 2015.

    Comments: In Conference on Empirical Methods in Natural Language Processing, 2014

  35. arXiv:1404.5367  [pdf, other

    cs.CL

    Lexicon Infused Phrase Embeddings for Named Entity Resolution

    Authors: Alexandre Passos, Vineet Kumar, Andrew McCallum

    Abstract: Most state-of-the-art approaches for named-entity recognition (NER) use semi supervised information in the form of word clusters and lexicons. Recently neural network-based language models have been explored, as they as a byproduct generate highly informative vector representations for words, known as word embeddings. In this paper we present two contributions: a new form of learning word embeddin… ▽ More

    Submitted 21 April, 2014; originally announced April 2014.

    Comments: Accepted in CoNLL 2014

  36. arXiv:1403.1349  [pdf, ps, other

    cs.CL cs.DL cs.IR

    Learning Soft Linear Constraints with Application to Citation Field Extraction

    Authors: Sam Anzaroot, Alexandre Passos, David Belanger, Andrew McCallum

    Abstract: Accurately segmenting a citation string into fields for authors, titles, etc. is a challenging task because the output typically obeys various global constraints. Previous work has shown that modeling soft constraints, where the model is encouraged, but not require to obey the constraints, can substantially improve segmentation performance. On the other hand, for imposing hard constraints, dual de… ▽ More

    Submitted 17 October, 2014; v1 submitted 6 March, 2014; originally announced March 2014.

    Comments: appears in Proc. the 52nd Annual Meeting of the Association for Computational Linguistics (ACL2014)

  37. arXiv:1206.6486  [pdf

    cs.LG stat.ML

    Flexible Modeling of Latent Task Structures in Multitask Learning

    Authors: Alexandre Passos, Piyush Rai, Jacques Wainer, Hal Daume III

    Abstract: Multitask learning algorithms are typically designed assuming some fixed, a priori known latent structure shared by all the tasks. However, it is usually unclear what type of latent task structure is the most appropriate for a given multitask learning problem. Ideally, the "right" latent task structure should be learned in a data-driven manner. We present a flexible, nonparametric Bayesian model t… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  38. arXiv:1201.0490  [pdf, ps, other

    cs.LG cs.MS

    Scikit-learn: Machine Learning in Python

    Authors: Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Andreas Müller, Joel Nothman, Gilles Louppe, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, Jake Vanderplas, Alexandre Passos, David Cournapeau, Matthieu Brucher, Matthieu Perrot, Édouard Duchesnay

    Abstract: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distribute… ▽ More

    Submitted 5 June, 2018; v1 submitted 2 January, 2012; originally announced January 2012.

    Comments: Update authors list and URLs

    Journal ref: Journal of Machine Learning Research (2011)