Skip to main content

Showing 1–50 of 52 results for author: Barbu, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14481  [pdf, other

    cs.LG cs.AI cs.NE q-bio.NC

    Revealing Vision-Language Integration in the Brain with Multimodal Networks

    Authors: Vighnesh Subramaniam, Colin Conwell, Christopher Wang, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu

    Abstract: We use (multi)modal deep neural networks (DNNs) to probe for sites of multimodal integration in the human brain by predicting stereoencephalography (SEEG) recordings taken while human subjects watched movies. We operationalize sites of multimodal integration as regions where a multimodal vision-language model predicts recordings better than unimodal language, unimodal vision, or linearly-integrate… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: ICML 2024; 23 pages, 11 figures

  2. arXiv:2406.03044  [pdf, other

    cs.LG q-bio.NC

    Population Transformer: Learning Population-level Representations of Intracranial Activity

    Authors: Geeling Chau, Christopher Wang, Sabera Talukder, Vighnesh Subramaniam, Saraswati Soedarmadji, Yisong Yue, Boris Katz, Andrei Barbu

    Abstract: We present a self-supervised framework that learns population-level codes for intracranial neural recordings at scale, unlocking the benefits of representation learning for a key neuroscience recording modality. The Population Transformer (PopT) lowers the amount of data required for decoding experiments, while increasing accuracy, even on never-before-seen subjects and tasks. We address two key c… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 17 pages, 10 figures, submitted to NeurIPS 2024

  3. arXiv:2405.09805  [pdf, other

    cs.CL cs.CR

    SecureLLM: Using Compositionality to Build Provably Secure Language Models for Private, Sensitive, and Secret Data

    Authors: Abdulrahman Alabdulkareem, Christian M Arnold, Yerim Lee, Pieter M Feenstra, Boris Katz, Andrei Barbu

    Abstract: Traditional security mechanisms isolate resources from users who should not access them. We reflect the compositional nature of such security mechanisms back into the structure of LLMs to build a provably secure LLM; that we term SecureLLM. Other approaches to LLM safety attempt to protect against bad actors or bad outcomes, but can only do so to an extent making them inappropriate for sensitive d… ▽ More

    Submitted 13 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  4. arXiv:2402.15587  [pdf, ps, other

    cs.CV stat.ML

    A Study of Shape Modeling Against Noise

    Authors: Cheng Long, Adrian Barbu

    Abstract: Shape modeling is a challenging task with many potential applications in computer vision and medical imaging. There are many shape modeling methods in the literature, each with its advantages and applications. However, many shape modeling methods have difficulties handling shapes that have missing pieces or outliers. In this regard, this paper introduces shape denoising, a fundamental problem in s… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 4 pages, 2 figures, International Conference on Image Processing (ICIP)

  5. arXiv:2305.01726  [pdf, ps, other

    stat.ML cs.LG stat.CO stat.ME

    Slow Kill for Big Data Learning

    Authors: Yiyuan She, Jianhui Shen, Adrian Barbu

    Abstract: Big-data applications often involve a vast number of observations and features, creating new challenges for variable selection and parameter estimation. This paper presents a novel technique called ``slow kill,'' which utilizes nonconvex constrained optimization, adaptive $\ell_2$-shrinkage, and increasing learning rates. The fact that the problem size can decrease during the slow kill iterations… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  6. arXiv:2304.02972  [pdf, other

    cs.LG stat.ML

    Training a Two Layer ReLU Network Analytically

    Authors: Adrian Barbu

    Abstract: Neural networks are usually trained with different variants of gradient descent based optimization algorithms such as stochastic gradient descent or the Adam optimizer. Recent theoretical work states that the critical points (where the gradient of the loss is zero) of two-layer ReLU networks with the square loss are not all local minima. However, in this work we will explore an algorithm for train… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 17 pages, 11 figures

  7. Feature Selection with Annealing for Forecasting Financial Time Series

    Authors: Hakan Pabuccu, Adrian Barbu

    Abstract: Stock market and cryptocurrency forecasting is very important to investors as they aspire to achieve even the slightest improvement to their buy or hold strategies so that they may increase profitability. However, obtaining accurate and reliable predictions is challenging, noting that accuracy does not equate to reliability, especially when financial time-series forecasting is applied owing to its… ▽ More

    Submitted 23 February, 2024; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 37 pages, 1 figures and 12 tables

    MSC Class: 68T07

  8. arXiv:2302.14599  [pdf, other

    stat.ML cs.LG

    Scalable Clustering: Large Scale Unsupervised Learning of Gaussian Mixture Models with Outliers

    Authors: Yijia Zhou, Kyle A. Gallivan, Adrian Barbu

    Abstract: Clustering is a widely used technique with a long and rich history in a variety of areas. However, most existing algorithms do not scale well to large datasets, or are missing theoretical guarantees of convergence. This paper introduces a provably robust clustering algorithm based on loss minimization that performs well on Gaussian mixture models with outliers. It provides theoretical guarantees t… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  9. arXiv:2302.14367  [pdf, other

    cs.LG eess.SP q-bio.NC

    BrainBERT: Self-supervised representation learning for intracranial recordings

    Authors: Christopher Wang, Vighnesh Subramaniam, Adam Uri Yaari, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu

    Abstract: We create a reusable Transformer, BrainBERT, for intracranial recordings bringing modern representation learning approaches to neuroscience. Much like in NLP and speech recognition, this Transformer enables classifying complex concepts, i.e., decoding neural data, with higher accuracy and with much less data by being pretrained in an unsupervised manner on a large corpus of unannotated neural reco… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 9 pages, 6 figures, ICLR 2023

  10. arXiv:2211.13087  [pdf, other

    cs.CV cs.AI

    Human or Machine? Turing Tests for Vision and Language

    Authors: Mengmi Zhang, Giorgia Dellaferrera, Ankur Sikarwar, Marcelo Armendariz, Noga Mudrik, Prachi Agrawal, Spandan Madan, Andrei Barbu, Haochen Yang, Tanishq Kumar, Meghna Sadwani, Stella Dellaferrera, Michele Pizzochero, Hanspeter Pfister, Gabriel Kreiman

    Abstract: As AI algorithms increasingly participate in daily activities that used to be the sole province of humans, we are inevitably called upon to consider how much machines are really like us. To address this question, we turn to the Turing test and systematically benchmark current AIs in their abilities to imitate humans. We establish a methodology to evaluate humans versus machines in Turing-like test… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 134 pages

  11. arXiv:2210.02585  [pdf, other

    cs.LG cs.AI

    Query The Agent: Improving sample efficiency through epistemic uncertainty estimation

    Authors: Julian Alverio, Boris Katz, Andrei Barbu

    Abstract: Curricula for goal-conditioned reinforcement learning agents typically rely on poor estimates of the agent's epistemic uncertainty or fail to consider the agents' epistemic uncertainty altogether, resulting in poor sample efficiency. We propose a novel algorithm, Query The Agent (QTA), which significantly improves sample efficiency by estimating the agent's epistemic uncertainty throughout the sta… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Submitted to ICLR 2023

  12. arXiv:2207.07033  [pdf, other

    cs.AI cs.CY

    Develo** a Series of AI Challenges for the United States Department of the Air Force

    Authors: Vijay Gadepally, Gregory Angelides, Andrei Barbu, Andrew Bowne, Laura J. Brattain, Tamara Broderick, Armando Cabrera, Glenn Carl, Ronisha Carter, Miriam Cha, Emilie Cowen, Jesse Cummings, Bill Freeman, James Glass, Sam Goldberg, Mark Hamilton, Thomas Heldt, Kuan Wei Huang, Phillip Isola, Boris Katz, Jamie Koerner, Yen-Chen Lin, David Mayo, Kyle McAlpin, Taylor Perron , et al. (17 additional authors not shown)

    Abstract: Through a series of federal initiatives and orders, the U.S. Government has been making a concerted effort to ensure American leadership in AI. These broad strategy documents have influenced organizations such as the United States Department of the Air Force (DAF). The DAF-MIT AI Accelerator is an initiative between the DAF and MIT to bridge the gap between AI researchers and DAF mission requireme… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  13. arXiv:2112.07173  [pdf, other

    cs.CV cs.AI cs.NE q-bio.NC

    On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation

    Authors: Binxu Wang, David Mayo, Arturo Deza, Andrei Barbu, Colin Conwell

    Abstract: Self-supervised learning is a powerful way to learn useful representations from natural data. It has also been suggested as one possible means of building visual representation in humans, but the specific objective and algorithm are unknown. Currently, most self-supervised methods encourage the system to learn an invariant representation of different transformations of the same image in contrast t… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 14 pages, 6 figures, 2 tables. Published in NeurIPS 2021 Workshop, Shared Visual Representations in Human & Machine Intelligence (SVRHM). For code, see https://github.com/Animadversio/Foveated_Saccade_SimCLR

    ACM Class: I.4.10; I.5.1; I.2.6; I.2.10

  14. arXiv:2110.10298  [pdf, other

    cs.RO

    Incorporating Rich Social Interactions Into MDPs

    Authors: Ravi Tejwani, Yen-Ling Kuo, Tianmin Shu, Bennett Stankovits, Dan Gutfreund, Joshua B. Tenenbaum, Boris Katz, Andrei Barbu

    Abstract: Much of what we do as humans is engage socially with other agents, a skill that robots must also eventually possess. We demonstrate that a rich theory of social interactions originating from microsociology and economics can be formalized by extending a nested MDP where agents reason about arbitrary functions of each other's hidden rewards. This extended Social MDP allows us to encode the five basi… ▽ More

    Submitted 7 February, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: Accepted to the 39th International Conference on Robotics and Automation (ICRA 2022)

  15. arXiv:2110.09741  [pdf, other

    cs.RO cs.AI cs.CL cs.LG

    Trajectory Prediction with Linguistic Representations

    Authors: Yen-Ling Kuo, Xin Huang, Andrei Barbu, Stephen G. McGill, Boris Katz, John J. Leonard, Guy Rosman

    Abstract: Language allows humans to build mental models that interpret what is happening around them resulting in more accurate long-term predictions. We present a novel trajectory prediction model that uses linguistic intermediate representations to forecast trajectories, and is trained using trajectory samples with partially-annotated captions. The model learns the meaning of each of the words without dir… ▽ More

    Submitted 9 March, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: Accepted in ICRA 2022

  16. arXiv:2110.07575  [pdf, other

    cs.CL cs.CV cs.MM eess.AS

    Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

    Authors: Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James Glass

    Abstract: Visually-grounded spoken language datasets can enable models to learn cross-modal correspondences with very weak supervision. However, modern audio-visual datasets contain biases that undermine the real-world performance of models trained on that data. We introduce Spoken ObjectNet, which is designed to remove some of these biases and provide a way to better evaluate how effectively models will pe… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Presented at Interspeech 2021. This version contains additional experiments on the Spoken ObjectNet test set

  17. arXiv:2104.02883  [pdf, other

    stat.ML cs.LG

    Online Feature Screening for Data Streams with Concept Drift

    Authors: Mingyuan Wang, Adrian Barbu

    Abstract: Screening feature selection methods are often used as a preprocessing step for reducing the number of variables before training step. Traditional screening methods only focus on dealing with complete high dimensional datasets. Modern datasets not only have higher dimension and larger sample size, but also have properties such as streaming input, sparsity and concept drift. Therefore a considerable… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: 8 figures, 30 pages

  18. The Compact Support Neural Network

    Authors: Adrian Barbu, Hongyu Mou

    Abstract: Neural networks are popular and useful in many fields, but they have the problem of giving high confidence responses for examples that are away from the training data. This makes the neural networks very confident in their prediction while making gross mistakes, thus limiting their reliability for safety-critical applications such as autonomous driving, space exploration, etc. This paper introduce… ▽ More

    Submitted 7 December, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: 13 pages, 6 figures

    Journal ref: Sensors 2021, 21(24), 8494

  19. arXiv:2103.01933  [pdf, other

    cs.AI cs.CV cs.LG stat.ML

    PHASE: PHysically-grounded Abstract Social Events for Machine Social Perception

    Authors: Aviv Netanyahu, Tianmin Shu, Boris Katz, Andrei Barbu, Joshua B. Tenenbaum

    Abstract: The ability to perceive and reason about social interactions in the context of physical environments is core to human social intelligence and human-machine cooperation. However, no prior dataset or benchmark has systematically evaluated physically grounded perception of complex social interactions that go beyond short actions, such as high-fiving, or simple group activities, such as gathering. In… ▽ More

    Submitted 19 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: The first two authors contributed equally; AAAI 2021; 13 pages, 7 figures; Project page: https://www.tshu.io/PHASE

  20. arXiv:2008.03277  [pdf, other

    cs.CL

    Learning a natural-language to LTL executable semantic parser for grounded robotics

    Authors: Christopher Wang, Candace Ross, Yen-Ling Kuo, Boris Katz, Andrei Barbu

    Abstract: Children acquire their native language with apparent ease by observing how language is used in context and attempting to use it themselves. They do so without laborious annotations, negative examples, or even direct corrections. We take a step toward robots that can do the same by training a grounded semantic parser, which discovers latent linguistic representations that can be used for the execut… ▽ More

    Submitted 16 March, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 10 pages, 2 figures, Accepted in Conference on Robot Learning (CoRL) 2020

    ACM Class: I.2.7

  21. arXiv:2008.02742  [pdf, other

    cs.CL cs.AI cs.RO

    Compositional Networks Enable Systematic Generalization for Grounded Language Understanding

    Authors: Yen-Ling Kuo, Boris Katz, Andrei Barbu

    Abstract: Humans are remarkably flexible when understanding new sentences that include combinations of concepts they have never encountered before. Recent work has shown that while deep networks can mimic some human language abilities when presented with novel sentences, systematic variation uncovers the limitations in the language-understanding abilities of networks. We demonstrate that these limitations c… ▽ More

    Submitted 19 October, 2021; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted in Findings of EMNLP 2021

  22. arXiv:2006.01110  [pdf, other

    cs.RO cs.CL

    Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas

    Authors: Yen-Ling Kuo, Boris Katz, Andrei Barbu

    Abstract: We demonstrate a reinforcement learning agent which uses a compositional recurrent neural network that takes as input an LTL formula and determines satisfying actions. The input LTL formulas have never been seen before, yet the network performs zero-shot generalization to satisfy them. This is a novel form of multi-task learning for RL agents where agents learn from one diverse set of tasks and ge… ▽ More

    Submitted 6 August, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted in IROS 2020

  23. arXiv:2002.08911  [pdf, other

    cs.CL cs.AI

    Measuring Social Biases in Grounded Vision and Language Embeddings

    Authors: Candace Ross, Boris Katz, Andrei Barbu

    Abstract: We generalize the notion of social biases from language embeddings to grounded vision and language embeddings. Biases are present in grounded embeddings, and indeed seem to be equally or more significant than for ungrounded embeddings. This is despite the fact that vision and language can suffer from different biases, which one might hope could attenuate the biases in both. Multiple ways exist to… ▽ More

    Submitted 21 August, 2023; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Camera-ready from NAACL 2021. Previous arXiv version was from before conference and was not the most recent version

  24. arXiv:2002.05201  [pdf, other

    cs.RO cs.CL

    Deep compositional robotic planners that follow natural language commands

    Authors: Yen-Ling Kuo, Boris Katz, Andrei Barbu

    Abstract: We demonstrate how a sampling-based robotic planner can be augmented to learn to understand a sequence of natural language commands in a continuous configuration space to move and manipulate objects. Our approach combines a deep network structured according to the parse of a complex command that includes objects, verbs, spatial relations, and attributes, with a sampling-based planner, RRT. A recur… ▽ More

    Submitted 19 February, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Accepted in ICRA 2020

  25. arXiv:2002.04322  [pdf, other

    cs.LG stat.ML

    A study of local optima for learning feature interactions using neural networks

    Authors: Yangzi Guo, Adrian Barbu

    Abstract: In many fields such as bioinformatics, high energy physics, power distribution, etc., it is desirable to learn non-linear models where a small number of variables are selected and the interaction between them is explicitly modeled to predict the response. In principle, neural networks (NNs) could accomplish this task since they can model non-linear feature interactions very well. However, NNs requ… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  26. arXiv:2002.04319  [pdf, other

    cs.LG stat.ML

    Neural Rule Ensembles: Encoding Sparse Feature Interactions into Neural Networks

    Authors: Gitesh Dawer, Yangzi Guo, Sida Liu, Adrian Barbu

    Abstract: Artificial Neural Networks form the basis of very powerful learning methods. It has been observed that a naive application of fully connected neural networks to data with many irrelevant variables often leads to overfitting. In an attempt to circumvent this issue, a prior knowledge pertaining to what features are relevant and their possible feature interactions can be encoded into these networks.… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  27. arXiv:2002.04301  [pdf, other

    cs.LG cs.NE stat.ML

    Network Pruning via Annealing and Direct Sparsity Control

    Authors: Yangzi Guo, Yiyuan She, Adrian Barbu

    Abstract: Artificial neural networks (ANNs) especially deep convolutional networks are very popular these days and have been proved to successfully offer quite reliable solutions to many vision problems. However, the use of deep neural networks is widely impeded by their intensive computational and memory cost. In this paper, we propose a novel efficient network pruning method that is suitable for both non-… ▽ More

    Submitted 26 July, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  28. arXiv:1909.12465  [pdf, other

    cs.AI

    Playing Atari Ball Games with Hierarchical Reinforcement Learning

    Authors: Hua Huang, Adrian Barbu

    Abstract: Human beings are particularly good at reasoning and inference from just a few examples. When facing new tasks, humans will leverage knowledge and skills learned before, and quickly integrate them with the new task. In addition to learning by experimentation, human also learn socio-culturally through instructions and learning by example. In this way humans can learn much faster compared with most c… ▽ More

    Submitted 26 September, 2019; originally announced September 2019.

  29. arXiv:1906.03728  [pdf, other

    cs.LG stat.ML

    The Generalization-Stability Tradeoff In Neural Network Pruning

    Authors: Brian R. Bartoldson, Ari S. Morcos, Adrian Barbu, Gordon Erlebacher

    Abstract: Pruning neural network parameters is often viewed as a means to compress models, but pruning has also been motivated by the desire to prevent overfitting. This motivation is particularly relevant given the perhaps surprising observation that a wide variety of pruning approaches increase test accuracy despite sometimes massive reductions in parameter counts. To better understand this phenomenon, we… ▽ More

    Submitted 22 October, 2020; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2020 conference paper

  30. arXiv:1811.06966  [pdf, other

    cs.RO

    Temporal Grounding Graphs for Language Understanding with Accrued Visual-Linguistic Context

    Authors: Rohan Paul, Andrei Barbu, Sue Felshin, Boris Katz, Nicholas Roy

    Abstract: A robot's ability to understand or ground natural language instructions is fundamentally tied to its knowledge about the surrounding world. We present an approach to grounding natural language utterances in the context of factual information gathered through natural-language interactions and past visual observations. A probabilistic model estimates, from a natural language utterance, the objects,r… ▽ More

    Submitted 16 November, 2018; originally announced November 2018.

    Comments: Published in ICJAI 2017

  31. arXiv:1810.00804  [pdf, other

    cs.RO

    Deep sequential models for sampling-based planning

    Authors: Yen-Ling Kuo, Andrei Barbu, Boris Katz

    Abstract: We demonstrate how a sequence model and a sampling-based planner can influence each other to produce efficient plans and how such a model can automatically learn to take advantage of observations of the environment. Sampling-based planners such as RRT generally know nothing of their environments even if they have traversed similar spaces many times. A sequence model, such as an HMM or LSTM, guides… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: Published in IROS 2018

  32. Are screening methods useful in feature selection? An empirical study

    Authors: Mingyuan Wang, Adrian Barbu

    Abstract: Filter or screening methods are often used as a preprocessing step for reducing the number of variables used by a learning algorithm in obtaining a classification or regression model. While there are many such filter methods, there is a need for an objective evaluation of these methods. Such an evaluation is needed to compare them with each other and also to answer whether they are at all useful,… ▽ More

    Submitted 8 July, 2019; v1 submitted 14 September, 2018; originally announced September 2018.

    Comments: 29 pages, 4 figures, 21 tables

    Journal ref: PLoS One, 09/11/2019

  33. arXiv:1805.01930  [pdf, other

    stat.ML cs.LG

    Enhancing the Regularization Effect of Weight Pruning in Artificial Neural Networks

    Authors: Brian Bartoldson, Adrian Barbu, Gordon Erlebacher

    Abstract: Artificial neural networks (ANNs) may not be worth their computational/memory costs when used in mobile phones or embedded devices. Parameter-pruning algorithms combat these costs, with some algorithms capable of removing over 90% of an ANN's weights without harming the ANN's performance. Removing weights from an ANN is a form of regularization, but existing pruning algorithms do not significantly… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  34. arXiv:1804.02744  [pdf, other

    stat.ML cs.LG

    Unsupervised Learning of GMM with a Uniform Background Component

    Authors: Sida Liu, Adrian Barbu

    Abstract: Gaussian Mixture Models are one of the most studied and mature models in unsupervised learning. However, outliers are often present in the data and could influence the cluster estimation. In this paper, we study a new model that assumes that data comes from a mixture of a number of Gaussians as well as a uniform ``background'' component assumed to contain outliers and other non-interesting observa… ▽ More

    Submitted 20 March, 2020; v1 submitted 8 April, 2018; originally announced April 2018.

    Comments: 36 pages, 16 figures and 4 tables

  35. arXiv:1803.11521  [pdf, other

    stat.ML cs.LG

    A Novel Framework for Online Supervised Learning with Feature Selection

    Authors: Lizhe Sun, Mingyuan Wang, Siquan Zhu, Adrian Barbu

    Abstract: Current online learning methods suffer issues such as lower convergence rates and limited capability to select important features compared to their offline counterparts. In this paper, a novel framework for online learning based on running averages is proposed. Many popular offline regularized methods such as Lasso, Elastic Net, Minimax Concave Penalty (MCP), and Feature Selection with Annealing (… ▽ More

    Submitted 19 May, 2024; v1 submitted 30 March, 2018; originally announced March 2018.

    Comments: This version has been accepted by Journal of Nonparametric Statistics

  36. arXiv:1802.03882  [pdf, other

    stat.ML cs.LG

    Random Hinge Forest for Differentiable Learning

    Authors: Nathan Lay, Adam P. Harrison, Sharon Schreiber, Gitesh Dawer, Adrian Barbu

    Abstract: We propose random hinge forests, a simple, efficient, and novel variant of decision forests. Importantly, random hinge forests can be readily incorporated as a general component within arbitrary computation graphs that are optimized end-to-end with stochastic gradient descent or variants thereof. We derive random hinge forest and ferns, focusing on their sparse and efficient nature, their min-max… ▽ More

    Submitted 1 March, 2018; v1 submitted 11 February, 2018; originally announced February 2018.

  37. arXiv:1709.05545  [pdf, other

    stat.ML cs.LG

    Generating Compact Tree Ensembles via Annealing

    Authors: Gitesh Dawer, Yangzi Guo, Adrian Barbu

    Abstract: Tree ensembles are flexible predictive models that can capture relevant variables and to some extent their interactions in a compact and interpretable manner. Most algorithms for obtaining tree ensembles are based on versions of boosting or Random Forest. Previous work showed that boosting algorithms exhibit a cyclic behavior of selecting the same tree again and again due to the way the loss is op… ▽ More

    Submitted 19 February, 2020; v1 submitted 16 September, 2017; originally announced September 2017.

    Comments: Comparison with Random Forest included in the results section

  38. Parameterized Principal Component Analysis

    Authors: Ajay Gupta, Adrian Barbu

    Abstract: When modeling multivariate data, one might have an extra parameter of contextual information that could be used to treat some observations as more similar to others. For example, images of faces can vary by age, and one would expect the face of a 40 year old to be more similar to the face of a 30 year old than to a baby face. We introduce a novel manifold approximation method, parameterized princi… ▽ More

    Submitted 2 May, 2017; v1 submitted 16 August, 2016; originally announced August 2016.

    Comments: 36 pages, 15 figures

    Journal ref: Pattern Recognition 78, No. 6, 215-227, 2018

  39. arXiv:1605.04481  [pdf, ps, other

    cs.CL

    Anchoring and Agreement in Syntactic Annotations

    Authors: Yevgeni Berzak, Yan Huang, Andrei Barbu, Anna Korhonen, Boris Katz

    Abstract: We present a study on two key characteristics of human syntactic annotations: anchoring and agreement. Anchoring is a well known cognitive bias in human decision making, where judgments are drawn towards pre-existing values. We study the influence of anchoring on a standard approach to creation of syntactic resources where syntactic annotations are obtained via human editing of tagger and parser o… ▽ More

    Submitted 21 September, 2016; v1 submitted 14 May, 2016; originally announced May 2016.

    Comments: EMNLP 2016

  40. arXiv:1603.08079  [pdf, other

    cs.CV cs.AI cs.CL

    Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

    Authors: Yevgeni Berzak, Andrei Barbu, Daniel Harari, Boris Katz, Shimon Ullman

    Abstract: Understanding language goes hand in hand with the ability to integrate complex contextual information obtained via perception. In this work, we present a novel task for grounded language understanding: disambiguating a sentence given a visual scene which depicts one of the possible interpretations of that sentence. To this end, we introduce a new multimodal corpus containing ambiguous sentences, r… ▽ More

    Submitted 26 March, 2016; originally announced March 2016.

    Comments: EMNLP 2015

    Journal ref: Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015, pages 1477--1487

  41. RENOIR - A Dataset for Real Low-Light Image Noise Reduction

    Authors: Josue Anaya, Adrian Barbu

    Abstract: Image denoising algorithms are evaluated using images corrupted by artificial noise, which may lead to incorrect conclusions about their performances on real noise. In this paper we introduce a dataset of color images corrupted by natural noise due to low-light conditions, together with spatially and intensity-aligned low noise images of the same scenes. We also introduce a method for estimating t… ▽ More

    Submitted 8 May, 2017; v1 submitted 29 September, 2014; originally announced September 2014.

    Comments: 27 pages, 11 figures

    Journal ref: Journal of Visual Communication and Image Representation 51, No. 2, 144-154, 2018

  42. arXiv:1408.6418  [pdf

    cs.CV cs.CL cs.IR

    Video In Sentences Out

    Authors: Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, **lian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present a system that produces sentential descriptions of video: who did what to whom, and where and how they did it. Action class is rendered as a verb, participant objects as noun phrases, properties of those objects as adjectival modifiers in those noun phrases, spatial relations between those participants as prepositional phrases, and characteristics of the event as prepositional-phrase adj… ▽ More

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-102-112

  43. arXiv:1404.3596  [pdf, other

    cs.CV

    Face Detection with a 3D Model

    Authors: Adrian Barbu, Nathan Lay, Gary Gramajo

    Abstract: This paper presents a part-based face detection approach where the spatial relationship between the face parts is represented by a hidden 3D model with six parameters. The computational complexity of the search in the six dimensional pose space is addressed by proposing meaningful 3D pose candidates by image-based regression from detected face keypoint locations. The 3D pose candidates are evaluat… ▽ More

    Submitted 3 November, 2015; v1 submitted 14 April, 2014; originally announced April 2014.

    Comments: 14 pages, 11 figures

    Journal ref: Academic Press Library in Signal Processing Volume 6: Image and Video Processing and Analysis and Computer Vision, pp 237-259, 2018. Editors: R. Chellappa and S. Theodoridis

  44. arXiv:1310.2880  [pdf, other

    stat.ML cs.CV cs.LG math.ST

    Feature Selection with Annealing for Computer Vision and Big Data Learning

    Authors: Adrian Barbu, Yiyuan She, Liang**g Ding, Gary Gramajo

    Abstract: Many computer vision and medical imaging problems are faced with learning from large-scale datasets, with millions of observations and features. In this paper we propose a novel efficient learning scheme that tightens a sparsity constraint by gradually removing variables based on a criterion and a schedule. The attractive fact that the problem size keeps drop** throughout the iterations makes it… ▽ More

    Submitted 17 March, 2016; v1 submitted 10 October, 2013; originally announced October 2013.

    Comments: 18 pages, 9 figures

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no 2, pp 272 - 286, 2017

  45. arXiv:1309.5174  [pdf, other

    cs.CV cs.CL cs.IR

    Saying What You're Looking For: Linguistics Meets Video Search

    Authors: Andrei Barbu, N. Siddharth, Jeffrey Mark Siskind

    Abstract: We present an approach to searching large video corpora for video clips which depict a natural-language query in the form of a sentence. This approach uses compositional semantics to encode subtle meaning that is lost in other systems, such as the difference between two sentences which have identical words but entirely different meaning: "The person rode the horse} vs. \emph{The horse rode the per… ▽ More

    Submitted 20 September, 2013; originally announced September 2013.

    Comments: 13 pages, 8 figures

  46. arXiv:1308.4189  [pdf, other

    cs.CV cs.AI cs.CL

    Seeing What You're Told: Sentence-Guided Activity Recognition In Video

    Authors: N. Siddharth, Andrei Barbu, Jeffrey Mark Siskind

    Abstract: We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, thereby providing a medium, not only for top-down and bottom-up integration, but also for multi-modal integration between vision and language. We show how the roles played by part… ▽ More

    Submitted 28 May, 2014; v1 submitted 19 August, 2013; originally announced August 2013.

    Comments: To appear in CVPR 2014

  47. arXiv:1204.3616  [pdf, other

    cs.CV cs.AI

    Large-Scale Automatic Labeling of Video Events with Verbs Based on Event-Participant Interaction

    Authors: Andrei Barbu, Alexander Bridge, Dan Coroian, Sven Dickinson, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, **lian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present an approach to labeling short video clips with English verbs as event descriptions. A key distinguishing aspect of this work is that it labels videos with verbs that describe the spatiotemporal interaction between event participants, humans and objects interacting with each other, abstracting away all object-class information and fine-grained image characteristics, and relying solely on… ▽ More

    Submitted 16 April, 2012; originally announced April 2012.

  48. arXiv:1204.2801  [pdf, other

    cs.CV cs.AI cs.RO

    Seeing Unseeability to See the Unseeable

    Authors: Siddharth Narayanaswamy, Andrei Barbu, Jeffrey Mark Siskind

    Abstract: We present a framework that allows an observer to determine occluded portions of a structure by finding the maximum-likelihood estimate of those occluded portions consistent with visible image evidence and a consistency model. Doing this requires determining which portions of the structure are occluded in the first place. Since each process relies on the other, we determine a solution to both prob… ▽ More

    Submitted 12 April, 2012; originally announced April 2012.

    Journal ref: Advances in Cognitive Systems, Vol. 2, pp. 77-94, 2012

  49. arXiv:1204.2742  [pdf, other

    cs.CV cs.AI

    Video In Sentences Out

    Authors: Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, **lian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present a system that produces sentential descriptions of video: who did what to whom, and where and how they did it. Action class is rendered as a verb, participant objects as noun phrases, properties of those objects as adjectival modifiers in those noun phrases,spatial relations between those participants as prepositional phrases, and characteristics of the event as prepositional-phrase adju… ▽ More

    Submitted 12 April, 2012; originally announced April 2012.

  50. arXiv:1204.2741  [pdf, other

    cs.CV cs.AI

    Simultaneous Object Detection, Tracking, and Event Recognition

    Authors: Andrei Barbu, Aaron Michaux, Siddharth Narayanaswamy, Jeffrey Mark Siskind

    Abstract: The common internal structure and algorithmic organization of object detection, detection-based tracking, and event recognition facilitates a general approach to integrating these three components. This supports multidirectional information flow between these components allowing object detection to influence tracking and event recognition and event recognition to influence tracking and object dete… ▽ More

    Submitted 12 April, 2012; originally announced April 2012.

    Journal ref: Advances in Cognitive Systems, Vol. 2, pp. 203-220, 2012