Showing 1–11 of 11 results for author: Giguère, S

Searching in archive cs.
  1. arXiv:2404.10883  [pdf, other]

    cs.AI cs.LG stat.ME

    Automated Discovery of Functional Actual Causes in Complex Environments

    Authors: Caleb Chuck, Sankaran Vaidyanathan, Stephen Giguere, Amy Zhang, David Jensen, Scott Niekum

    Abstract: Reinforcement learning (RL) algorithms often struggle to learn policies that generalize to novel situations due to issues such as causal confusion, overfitting to irrelevant factors, and failure to isolate control of state factors. These issues stem from a common source: a failure to accurately identify and exploit state-specific causal relationships in the environment. While some prior works in R…

    Submitted 16 April, 2024; originally announced April 2024.

  2. arXiv:2111.03936  [pdf, other]

    cs.LG

    SOPE: Spectrum of Off-Policy Estimators

    Authors: Christina J. Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum

    Abstract: Many sequential decision making problems are high-stakes and require off-policy evaluation (OPE) of a new policy using historical data collected using some other policy. One of the most common OPE techniques that provides unbiased estimates is trajectory based importance sampling (IS). However, due to the high variance of trajectory IS estimates, importance sampling methods based on state-action v…

    Submitted 2 December, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: Accepted at Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021)

  3. arXiv:2108.05875  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Distributional Depth-Based Estimation of Object Articulation Models

    Authors: Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, Scott Niekum

    Abstract: We propose a method that efficiently learns distributions over articulation model parameters directly from depth images without the need to know articulation model categories a priori. By contrast, existing methods that learn articulation models from raw observations typically only predict point estimates of the model parameters, which are insufficient to guarantee the safe manipulation of articul…

    Submitted 25 October, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: In the proceedings of the 5th Annual Conference on Robot Learning (CoRL), 2021. Project webpage: https://pearl-utexas.github.io/DUST-net/ . 18 pages, 10 figures, 4 tables

  4. arXiv:2012.09951  [pdf, other]

    cs.LG cs.HC

    Fairkit, Fairkit, on the Wall, Who's the Fairest of Them All? Supporting Data Scientists in Training Fair Models

    Authors: Brittany Johnson, Jesse Bartola, Rico Angell, Katherine Keith, Sam Witty, Stephen J. Giguere, Yuriy Brun

    Abstract: Modern software relies heavily on data and machine learning, and affects decisions that shape our world. Unfortunately, recent studies have shown that because of biases in data, software systems frequently inject bias into their decisions, from producing better closed caption transcriptions of men's voices than of women's voices to overcharging people of color for financial loans. To address bias…

    Submitted 17 December, 2020; originally announced December 2020.

  5. arXiv:1905.11577  [pdf, other]

    cs.LG q-bio.BM stat.ML

    Towards Interpretable Sparse Graph Representation Learning with Laplacian Pooling

    Authors: Emmanuel Noutahi, Dominique Beaini, Julien Horwood, Sébastien Giguère, Prudencio Tossou

    Abstract: Recent work in graph neural networks (GNNs) has led to improvements in molecular activity and property prediction tasks. Unfortunately, GNNs often fail to capture the relative importance of interactions between molecular substructures, in part due to the absence of efficient intermediate pooling steps. To address these issues, we propose LaPool (Laplacian Pooling), a novel, data-driven, and interp…

    Submitted 2 April, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: 11 pages, with Appendices

  6. arXiv:1703.02992  [pdf, other]

    cs.LG

    A Manifold Approach to Learning Mutually Orthogonal Subspaces

    Authors: Stephen Giguere, Francisco Garcia, Sridhar Mahadevan

    Abstract: Although many machine learning algorithms involve learning subspaces with particular characteristics, optimizing a parameter matrix that is constrained to represent a subspace can be challenging. One solution is to use Riemannian optimization methods that enforce such constraints implicitly, leveraging the fact that the feasible parameter values form a manifold. While Riemannian methods exist for…

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: 9 pages, 3 figures

    ACM Class: G.1.6; I.2.6

  7. arXiv:1505.06249  [pdf, other]

    q-bio.GN cs.LG stat.ML

    Greedy Biomarker Discovery in the Genome with Applications to Antimicrobial Resistance

    Authors: Alexandre Drouin, Sébastien Giguère, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The Set Covering Machine (SCM) is a greedy learning algorithm that produces sparse classifiers. We extend the SCM for datasets that contain a huge number of features. The whole genetic material of living organisms is an example of such a case, where the number of features exceeds 10^7. Three human pathogens were used to evaluate the performance of the SCM at predicting antimicrobial resistance. Our…

    Submitted 22 May, 2015; originally announced May 2015.

    Comments: Peer-reviewed and accepted for an oral presentation in the Greed is Great workshop at the International Conference on Machine Learning, Lille, France, 2015

  8. arXiv:1412.1463  [pdf, ps, other]

    cs.LG cs.CE

    On the String Kernel Pre-Image Problem with Applications in Drug Discovery

    Authors: Sébastien Giguère, Amélie Rolland, François Laviolette, Mario Marchand

    Abstract: The pre-image problem has to be solved during inference by most structured output predictors. For string kernels, this problem corresponds to finding the string associated to a given input. An algorithm capable of solving or finding good approximations to this problem would have many applications in computational biology and other fields. This work uses a recent result on combinatorial optimizatio…

    Submitted 3 December, 2014; v1 submitted 3 December, 2014; originally announced December 2014.

    Comments: Peer-reviewed and accepted for presentation at Machine Learning in Computational Biology 2014, Montréal, Québec, Canada

    ACM Class: I.2.6; K.3.2

  9. arXiv:1412.1074  [pdf, other]

    q-bio.GN cs.CE cs.LG stat.ML

    Learning interpretable models of phenotypes from whole genome sequences with the Set Covering Machine

    Authors: Alexandre Drouin, Sébastien Giguère, Vladana Sagatovich, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The increased affordability of whole genome sequencing has motivated its use for phenotypic studies. We address the problem of learning interpretable models for discrete phenotypes from whole genomes. We propose a general approach that relies on the Set Covering Machine and a k-mer representation of the genomes. We show results for the problem of predicting the resistance of Pseudomonas aeruginosa…

    Submitted 2 December, 2014; originally announced December 2014.

    Comments: Presented at Machine Learning in Computational Biology 2014, Montréal, Québec, Canada

  10. arXiv:1405.6757  [pdf, other]

    cs.LG

    Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces

    Authors: Sridhar Mahadevan, Bo Liu, Philip Thomas, Will Dabney, Steve Giguere, Nicholas Jacek, Ian Gemp, Ji Liu

    Abstract: In this paper, we set forth a new vision of reinforcement learning developed by us over the past few years, one that yields mathematically rigorous solutions to longstanding important questions that have remained unresolved: (i) how to design reliable, convergent, and robust reinforcement learning algorithms (ii) how to guarantee that reinforcement learning satisfies pre-specified "safety" guarant…

    Submitted 26 May, 2014; originally announced May 2014.

    Comments: 121 pages

  11. arXiv:1207.7253  [pdf, other]

    q-bio.QM cs.LG q-bio.BM stat.ML

    Learning a peptide-protein binding affinity predictor with kernel ridge regression

    Authors: Sébastien Giguère, Mario Marchand, François Laviolette, Alexandre Drouin, Jacques Corbeil

    Abstract: We propose a specialized string kernel for small bio-molecules, peptides and pseudo-sequences of binding interfaces. The kernel incorporates physico-chemical properties of amino acids and elegantly generalizes eight kernels, such as the Oligo, the Weighted Degree, the Blended Spectrum, and the Radial Basis Function. We provide a low complexity dynamic programming algorithm for the exact computation…

    Submitted 31 July, 2012; originally announced July 2012.

    Comments: 22 pages, 4 figures, 5 tables

    MSC Class: 92B05 ACM Class: I.2.6; J.3; G.3; G.4; I.5.2

    Journal ref: BMC Bioinformatics 2013, 14:82