Skip to main content

Showing 1–14 of 14 results for author: Sledge, I J

.
  1. arXiv:2401.11313  [pdf, other

    cs.CV cs.LG eess.IV

    Weakly-Supervised Semantic Segmentation of Circular-Scan, Synthetic-Aperture-Sonar Imagery

    Authors: Isaac J. Sledge, Dominic M. Byrne, Jonathan L. King, Steven H. Ostertag, Denton L. Woods, James L. Prater, Jermaine L. Kennedy, Timothy M. Marston, Jose C. Principe

    Abstract: We propose a weakly-supervised framework for the semantic segmentation of circular-scan synthetic-aperture-sonar (CSAS) imagery. The first part of our framework is trained in a supervised manner, on image-level labels, to uncover a set of semi-sparse, spatially-discriminative regions in each image. The classification uncertainty of each region is then evaluated. Those areas with the lowest uncerta… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Submitted to the IEEE Journal of Oceanic Engineering

  2. arXiv:2212.11083  [pdf, other

    cs.LG cs.AI cs.IT

    Adapting the Exploration Rate for Value-of-Information-Based Reinforcement Learning

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we consider the problem of adjusting the exploration rate when using value-of-information-based exploration. We do this by converting the value-of-information optimization into a problem of finding equilibria of a flow for a changing exploration rate. We then develop an efficient path-following scheme for converging to these equilibria and hence uncovering optimal action-selection p… ▽ More

    Submitted 30 December, 2022; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Submitted to the IEEE Transactions on Information Theory

  3. arXiv:2109.11737  [pdf, other

    cs.IT cs.LG

    Estimating Rényi's $α$-Cross-Entropies in a Matrix-Based Way

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: Conventional information-theoretic quantities assume access to probability distributions. Estimating such distributions is not trivial. Here, we consider function-based formulations of cross entropy that sidesteps this a priori estimation requirement. We propose three measures of Rényi's $α$-cross-entropies in the setting of reproducing-kernel Hilbert spaces. Each measure has its appeals. We prove… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Submitted to the IEEE Transactions on Information Theory

  4. arXiv:2107.10504  [pdf, other

    cs.CV cs.LG

    External-Memory Networks for Low-Shot Learning of Targets in Forward-Looking-Sonar Imagery

    Authors: Isaac J. Sledge, Christopher D. Toole, Joseph A. Maestri, Jose C. Principe

    Abstract: We propose a memory-based framework for real-time, data-efficient target analysis in forward-looking-sonar (FLS) imagery. Our framework relies on first removing non-discriminative details from the imagery using a small-scale DenseNet-inspired network. Doing so simplifies ensuing analyses and permits generalizing from few labeled examples. We then cascade the filtered imagery into a novel NeuralRAM… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Submitted to IEEE Journal of Oceanic Engineering

  5. An Information-Theoretic Approach for Automatically Determining the Number of States when Aggregating Markov Chains

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: A fundamental problem when aggregating Markov chains is the specification of the number of state groups. Too few state groups may fail to sufficiently capture the pertinent dynamics of the original, high-order Markov chain. Too many state groups may lead to a non-parsimonious, reduced-order Markov chain whose complexity rivals that of the original. In this paper, we show that an augmented value-of… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: Submitted to IEEE ICASSP. arXiv admin note: substantial text overlap with arXiv:1903.09266

  6. arXiv:2102.12017  [pdf, other

    cs.LG cs.AI cs.RO

    Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning

    Authors: Isaac J. Sledge, Darshan W. Bryner, Jose C. Principe

    Abstract: Reinforcement learning in large-scale environments is challenging due to the many possible actions that can be taken in specific situations. We have previously developed a means of constraining, and hence speeding up, the search process through the use of motion primitives; motion primitives are sequences of pre-specified actions taken across a state series. As a byproduct of this work, we have fo… ▽ More

    Submitted 26 November, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: IEEE Transactions on Emerging Topics in Computational Intelligence

  7. Faster Convergence in Deep-Predictive-Coding Networks to Learn Deeper Representations

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: Deep-predictive-coding networks (DPCNs) are hierarchical, generative models. They rely on feed-forward and feed-back connections to modulate latent feature representations of stimuli in a dynamic and context-sensitive manner. A crucial element of DPCNs is a forward-backward inference procedure to uncover sparse, invariant features. However, this inference is a major computational bottleneck. It se… ▽ More

    Submitted 23 September, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: Submitted to the IEEE Transactions on Neural Networks and Learning Systems

  8. Target Detection and Segmentation in Circular-Scan Synthetic-Aperture-Sonar Images using Semi-Supervised Convolutional Encoder-Decoders

    Authors: Isaac J. Sledge, Matthew S. Emigh, Jonathan L. King, Denton L. Woods, J. Tory Cobb, Jose C. Principe

    Abstract: We propose a framework for saliency-based, multi-target detection and segmentation of circular-scan, synthetic-aperture-sonar (CSAS) imagery. Our framework relies on a multi-branch, convolutional encoder-decoder network (MB-CEDN). The encoder portion of the MB-CEDN extracts visual contrast features from CSAS images. These features are fed into dual decoders that perform pixel-level segmentation to… ▽ More

    Submitted 17 February, 2022; v1 submitted 10 January, 2021; originally announced January 2021.

    Comments: Submitted to IEEE Journal of Oceanic Engineering

  9. Reduction of Markov Chains using a Value-of-Information-Based Approach

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we propose an approach to obtain reduced-order models of Markov chains. Our approach is composed of two information-theoretic processes. The first is a means of comparing pairs of stationary chains on different state spaces, which is done via the negative Kullback-Leibler divergence defined on a model joint space. Model reduction is achieved by solving a value-of-information criteri… ▽ More

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: Submitted to Entropy

  10. arXiv:1901.07484  [pdf, other

    cs.LG stat.ML

    An Exact Reformulation of Feature-Vector-based Radial-Basis-Function Networks for Graph-based Observations

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: Radial-basis-function networks are traditionally defined for sets of vector-based observations. In this short paper, we reformulate such networks so that they can be applied to adjacency-matrix representations of weighted, directed graphs that represent the relationships between object pairs. We re-state the sum-of-squares objective function so that it is purely dependent on entries from the adjac… ▽ More

    Submitted 1 August, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: Submitted to the IEEE Transactions on Neural Networks and Learning Systems

  11. Guided Policy Exploration for Markov Decision Processes using an Uncertainty-Based Value-of-Information Criterion

    Authors: Isaac J. Sledge, Matthew S. Emigh, Jose C. Principe

    Abstract: Reinforcement learning in environments with many action-state pairs is challenging. At issue is the number of episodes needed to thoroughly search the policy space. Most conventional heuristics address this search problem in a stochastic manner. This can leave large portions of the policy space unvisited during the early training stages. In this paper, we propose an uncertainty-based, information-… ▽ More

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: IEEE Transactions on Neural Networks and Learning Systems

  12. arXiv:1710.10381  [pdf, other

    cs.AI cs.LG stat.ML

    Partitioning Relational Matrices of Similarities or Dissimilarities using the Value of Information

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we provide an approach to clustering relational matrices whose entries correspond to either similarities or dissimilarities between objects. Our approach is based on the value of information, a parameterized, information-theoretic criterion that measures the change in costs associated with changes in information. Optimizing the value of information yields a deterministic annealing s… ▽ More

    Submitted 27 October, 2017; originally announced October 2017.

    Comments: Submitted to the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

  13. arXiv:1710.02869  [pdf, other

    cs.AI cs.LG stat.ML

    An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: In this paper, we propose an information-theoretic exploration strategy for stochastic, discrete multi-armed bandits that achieves optimal regret. Our strategy is based on the value of information criterion. This criterion measures the trade-off between policy information and obtainable rewards. High amounts of policy information are associated with exploration-dominant searches of the space and y… ▽ More

    Submitted 3 March, 2018; v1 submitted 8 October, 2017; originally announced October 2017.

    Comments: Entropy

  14. arXiv:1702.08628  [pdf, other

    cs.LG cs.AI cs.IT

    Analysis of Agent Expertise in Ms. Pac-Man using Value-of-Information-based Policies

    Authors: Isaac J. Sledge, Jose C. Principe

    Abstract: Conventional reinforcement learning methods for Markov decision processes rely on weakly-guided, stochastic searches to drive the learning process. It can therefore be difficult to predict what agent behaviors might emerge. In this paper, we consider an information-theoretic cost function for performing constrained stochastic searches that promote the formation of risk-averse to risk-favoring beha… ▽ More

    Submitted 4 November, 2017; v1 submitted 27 February, 2017; originally announced February 2017.

    Comments: IEEE Transactions on Computational Intelligence and Artificial Intelligence in Games