Skip to main content

Showing 1–43 of 43 results for author: Jaakkola, T S

.
  1. arXiv:2401.04082  [pdf, other

    q-bio.QM cs.LG stat.ML

    Improved motif-scaffolding with SE(3) flow matching

    Authors: Jason Yim, Andrew Campbell, Emile Mathieu, Andrew Y. K. Foong, Michael Gastegger, José Jiménez-Luna, Sarah Lewis, Victor Garcia Satorras, Bastiaan S. Veeling, Frank Noé, Regina Barzilay, Tommi S. Jaakkola

    Abstract: Protein design often begins with knowledge of a desired function from a motif which motif-scaffolding aims to construct a functional protein around. Recently, generative models have achieved breakthrough success in designing scaffolds for a diverse range of motifs. However, the generated scaffolds tend to lack structural diversity, which can hinder success in wet-lab validation. In this work, we e… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Preprint. Code: https://github.com/ microsoft/frame-flow

  2. arXiv:2306.10193  [pdf, other

    cs.CL cs.LG

    Conformal Language Modeling

    Authors: Victor Quach, Adam Fisch, Tal Schuster, Adam Yala, Jae Ho Sohn, Tommi S. Jaakkola, Regina Barzilay

    Abstract: We propose a novel approach to conformal prediction for generative language models (LMs). Standard conformal prediction produces prediction sets -- in place of single predictions -- that have rigorous, statistical performance guarantees. LM responses are typically sampled from the model's predicted distribution over the large, combinatorial output space of natural language. Translating this proces… ▽ More

    Submitted 1 June, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: ICLR 2024

  3. arXiv:2304.03889  [pdf, other

    q-bio.BM cs.LG

    DiffDock-PP: Rigid Protein-Protein Docking with Diffusion Models

    Authors: Mohamed Amine Ketata, Cedrik Laue, Ruslan Mammadov, Hannes Stärk, Menghua Wu, Gabriele Corso, Céline Marquet, Regina Barzilay, Tommi S. Jaakkola

    Abstract: Understanding how proteins structurally interact is crucial to modern biology, with applications in drug discovery and protein design. Recent machine learning methods have formulated protein-small molecule docking as a generative problem with significant performance boosts over both traditional and deep learning baselines. In this work, we propose a similar approach for rigid protein-protein docki… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: ICLR Machine Learning for Drug Discovery (MLDD) Workshop 2023

  4. arXiv:2304.00047  [pdf, other

    cs.LG cs.CR cs.IT

    PEOPL: Characterizing Privately Encoded Open Datasets with Public Labels

    Authors: Homa Esfahanizadeh, Adam Yala, Rafael G. L. D'Oliveira, Andrea J. D. Jaba, Victor Quach, Ken R. Duffy, Tommi S. Jaakkola, Vinod Vaikuntanathan, Manya Ghobadi, Regina Barzilay, Muriel Médard

    Abstract: Allowing organizations to share their data for training of machine learning (ML) models without unintended information leakage is an open problem in practice. A promising technique for this still-open problem is to train models on the encoded data. Our approach, called Privately Encoded Open Datasets with Public Labels (PEOPL), uses a certain class of randomly constructed transforms to encode sens… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: Submitted to IEEE Transactions on Information Forensics and Security

  5. arXiv:2301.02197  [pdf, other

    cond-mat.dis-nn physics.comp-ph

    Virtual Node Graph Neural Network for Full Phonon Prediction

    Authors: Ryotaro Okabe, Abhijatmedhi Chotrattanapituk, Artittaya Boonkird, Nina Andrejevic, Xiang Fu, Tommi S. Jaakkola, Qichen Song, Thanh Nguyen, Nathan Drucker, Sai Mu, Bolin Liao, Yongqiang Cheng, Mingda Li

    Abstract: The structure-property relationship plays a central role in materials science. Understanding the structure-property relationship in solid-state materials is crucial for structure design with optimized properties. The past few years witnessed remarkable progress in correlating structures with properties in crystalline materials, such as machine learning methods and particularly graph neural network… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

    Comments: 40 pages total, 4 main figures + 25 supplementary figures

  6. arXiv:2203.08908  [pdf, other

    cs.LG

    Adversarial Support Alignment

    Authors: Shangyuan Tong, Timur Garipov, Yang Zhang, Shiyu Chang, Tommi S. Jaakkola

    Abstract: We study the problem of aligning the supports of distributions. Compared to the existing work on distribution alignment, support alignment does not require the densities to be matched. We propose symmetric support difference as a divergence measure to quantify the mismatch between supports. We show that select discriminators (e.g. discriminator trained for Jensen-Shannon divergence) are able to ma… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to ICLR 2022

  7. arXiv:2201.12406  [pdf, other

    cs.LG cs.CR cs.CV

    Syfer: Neural Obfuscation for Private Data Release

    Authors: Adam Yala, Victor Quach, Homa Esfahanizadeh, Rafael G. L. D'Oliveira, Ken R. Duffy, Muriel Médard, Tommi S. Jaakkola, Regina Barzilay

    Abstract: Balancing privacy and predictive utility remains a central challenge for machine learning in healthcare. In this paper, we develop Syfer, a neural obfuscation method to protect against re-identification attacks. Syfer composes trained layers with random neural networks to encode the original data (e.g. X-rays) while maintaining the ability to predict diagnoses from the encoded data. The randomness… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

  8. arXiv:2110.13880  [pdf, other

    cs.LG cs.AI cs.CL

    Understanding Interlocking Dynamics of Cooperative Rationalization

    Authors: Mo Yu, Yang Zhang, Shiyu Chang, Tommi S. Jaakkola

    Abstract: Selective rationalization explains the prediction of complex neural networks by finding a small subset of the input that is sufficient to predict the neural model output. The selection mechanism is commonly integrated into the model itself by specifying a two-component cascaded system consisting of a rationale generator, which makes a binary selection of the input features (which is the rationale)… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021

  9. arXiv:2106.07802  [pdf, other

    physics.chem-ph cs.LG

    GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles

    Authors: Octavian-Eugen Ganea, Lagnajit Pattanaik, Connor W. Coley, Regina Barzilay, Klavs F. Jensen, William H. Green, Tommi S. Jaakkola

    Abstract: Prediction of a molecule's 3D conformer ensemble from the molecular graph holds a key role in areas of cheminformatics and drug discovery. Existing generative models have several drawbacks including lack of modeling important molecular geometry elements (e.g. torsion angles), separate optimization stages prone to error accumulation, and the need for structure fine-tuning based on approximate class… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  10. arXiv:2106.02484  [pdf, other

    cs.CR cs.AI

    NeuraCrypt: Hiding Private Health Data via Random Neural Networks for Public Training

    Authors: Adam Yala, Homa Esfahanizadeh, Rafael G. L. D' Oliveira, Ken R. Duffy, Manya Ghobadi, Tommi S. Jaakkola, Vinod Vaikuntanathan, Regina Barzilay, Muriel Medard

    Abstract: Balancing the needs of data privacy and predictive utility is a central challenge for machine learning in healthcare. In particular, privacy concerns have led to a dearth of public datasets, complicated the construction of multi-hospital cohorts and limited the utilization of external machine learning resources. To remedy this, new methods are required to enable data owners, such as hospitals, to… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  11. arXiv:2012.10713  [pdf, other

    cs.LG cs.AI stat.ML

    Fundamental Limits and Tradeoffs in Invariant Representation Learning

    Authors: Han Zhao, Chen Dan, Bryon Aragam, Tommi S. Jaakkola, Geoffrey J. Gordon, Pradeep Ravikumar

    Abstract: A wide range of machine learning applications such as privacy-preserving learning, algorithmic fairness, and domain adaptation/generalization among others, involve learning invariant representations of the data that aim to achieve two competing goals: (a) maximize information or accuracy with respect to a target response, and (b) maximize invariance or independence with respect to a set of protect… ▽ More

    Submitted 23 November, 2022; v1 submitted 19 December, 2020; originally announced December 2020.

    Comments: JMLR camera-ready version

  12. arXiv:2003.09772  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Invariant Rationalization

    Authors: Shiyu Chang, Yang Zhang, Mo Yu, Tommi S. Jaakkola

    Abstract: Selective rationalization improves neural network interpretability by identifying a small subset of input features -- the rationale -- that best explains or supports the prediction. A typical rationalization criterion, i.e. maximum mutual information (MMI), finds the rationale that maximizes the prediction performance based only on the rationale. However, MMI can be problematic because it picks up… ▽ More

    Submitted 21 March, 2020; originally announced March 2020.

    Comments: 10 pages

  13. arXiv:1911.02536  [pdf, other

    cs.LG stat.ML

    Unsupervised Hierarchy Matching with Optimal Transport over Hyperbolic Spaces

    Authors: David Alvarez-Melis, Youssef Mroueh, Tommi S. Jaakkola

    Abstract: This paper focuses on the problem of unsupervised alignment of hierarchical data such as ontologies or lexical databases. This is a problem that appears across areas, from natural language processing to bioinformatics, and is typically solved by appeal to outside knowledge bases and label-textual similarity. In contrast, we approach the problem from a purely geometric perspective: given only a vec… ▽ More

    Submitted 7 May, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: AISTATS 2020

  14. arXiv:1910.13294  [pdf, other

    cs.CL cs.LG

    Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control

    Authors: Mo Yu, Shiyu Chang, Yang Zhang, Tommi S. Jaakkola

    Abstract: Selective rationalization has become a common mechanism to ensure that predictive models reveal how they use any available features. The selection may be soft or hard, and identifies a subset of input features relevant for prediction. The setup can be viewed as a co-operate game between the selector (aka rationale generator) and the predictor making use of only the selected features. The co-operat… ▽ More

    Submitted 15 December, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted by EMNLP 2019

  15. arXiv:1910.12853  [pdf, other

    cs.LG cs.CL stat.ML

    A Game Theoretic Approach to Class-wise Selective Rationalization

    Authors: Shiyu Chang, Yang Zhang, Mo Yu, Tommi S. Jaakkola

    Abstract: Selection of input features such as relevant pieces of text has become a common technique of highlighting how complex neural predictors operate. The selection can be optimized post-hoc for trained models or incorporated directly into the method itself (self-explaining). However, an overall selection does not properly capture the multi-faceted nature of useful rationales such as pros and cons for d… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: Accepted by Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  16. arXiv:1910.09688  [pdf, other

    cs.LG stat.ML

    Learning to Make Generalizable and Diverse Predictions for Retrosynthesis

    Authors: Benson Chen, Tianxiao Shen, Tommi S. Jaakkola, Regina Barzilay

    Abstract: We propose a new model for making generalizable and diverse retrosynthetic reaction predictions. Given a target compound, the task is to predict the likely chemical reactants to produce the target. This generative task can be framed as a sequence-to-sequence problem by using the SMILES representations of the molecules. Building on top of the popular Transformer architecture, we propose two novel p… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

  17. arXiv:1909.13488  [pdf, other

    cs.LG stat.ML

    Oblique Decision Trees from Derivatives of ReLU Networks

    Authors: Guang-He Lee, Tommi S. Jaakkola

    Abstract: We show how neural models can be used to realize piece-wise constant functions such as decision trees. The proposed architecture, which we call locally constant networks, builds on ReLU networks that are piece-wise linear and hence their associated gradients with respect to the inputs are locally constant. We formally establish the equivalence between the classes of locally constant networks and d… ▽ More

    Submitted 3 May, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: Published in International Conference on Learning Representations (ICLR), 2020. Code available: https://github.com/guanghelee/iclr20-lcn

  18. arXiv:1907.03207  [pdf, other

    cs.LG stat.ML

    Towards Robust, Locally Linear Deep Networks

    Authors: Guang-He Lee, David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: Deep networks realize complex map**s that are often understood by their locally linear behavior at or around points of interest. For example, we use the derivative of the map** with respect to its inputs for sensitivity analysis, or to explain (obtain coordinate relevance for) a prediction. One key challenge is that such derivatives are themselves inherently unstable. In this paper, we propose… ▽ More

    Submitted 6 July, 2019; originally announced July 2019.

    Comments: Published in International Conference on Learning Representations (ICLR), 2019

  19. arXiv:1906.04948  [pdf, other

    cs.LG stat.ML

    Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers

    Authors: Guang-He Lee, Yang Yuan, Shiyu Chang, Tommi S. Jaakkola

    Abstract: Strong theoretical guarantees of robustness can be given for ensembles of classifiers generated by input randomization. Specifically, an $\ell_2$ bounded adversary cannot alter the ensemble prediction generated by an additive isotropic Gaussian noise, where the radius for the adversary depends on both the variance of the distribution as well as the ensemble margin at the point of interest. We buil… ▽ More

    Submitted 26 February, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Published in Advances in Neural Information Processing Systems (NeurIPS), 2019

  20. arXiv:1902.09737  [pdf, other

    cs.LG stat.ML

    Functional Transparency for Structured Data: a Game-Theoretic Approach

    Authors: Guang-He Lee, Wengong **, David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: We provide a new approach to training neural models to exhibit transparency in a well-defined, functional manner. Our approach naturally operates over structured data and tailors the predictor, functionally, towards a chosen family of (local) witnesses. The estimation problem is setup as a co-operative game between an unrestricted predictor such as a neural network, and a set of witnesses chosen f… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

  21. arXiv:1902.02037  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Bidirectional Inference Networks: A Class of Deep Bayesian Networks for Health Profiling

    Authors: Hao Wang, Chengzhi Mao, Hao He, Mingmin Zhao, Tommi S. Jaakkola, Dina Katabi

    Abstract: We consider the problem of inferring the values of an arbitrary set of variables (e.g., risk of diseases) given other observed variables (e.g., symptoms and diagnosed diseases) and high-dimensional signals (e.g., MRI images or EEG). This is a common problem in healthcare since variables of interest often differ for different patients. Existing methods including Bayesian networks and structured pre… ▽ More

    Submitted 6 February, 2019; originally announced February 2019.

    Comments: Appeared at AAAI 2019

  22. arXiv:1809.00013  [pdf, other

    cs.CL

    Gromov-Wasserstein Alignment of Word Embedding Spaces

    Authors: David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: Cross-lingual or cross-domain correspondences play key roles in tasks ranging from machine translation to transfer learning. Recently, purely unsupervised methods operating on monolingual embeddings have become effective alignment tools. Current state-of-the-art methods, however, involve multiple steps, including heuristic post-hoc refinement strategies. In this paper, we cast the correspondence p… ▽ More

    Submitted 31 August, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  23. arXiv:1807.00130  [pdf, other

    cs.LG stat.ML

    Game-Theoretic Interpretability for Temporal Modeling

    Authors: Guang-He Lee, David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: Interpretability has arisen as a key desideratum of machine learning models alongside performance. Approaches so far have been primarily concerned with fixed dimensional inputs emphasizing feature relevance or selection. In contrast, we focus on temporal modeling and the problem of tailoring the predictor, functionally, towards an interpretable family. To this end, we propose a co-operative game b… ▽ More

    Submitted 30 June, 2018; originally announced July 2018.

  24. arXiv:1806.09277  [pdf, other

    stat.ML cs.LG

    Towards Optimal Transport with Global Invariances

    Authors: David Alvarez-Melis, Stefanie Jegelka, Tommi S. Jaakkola

    Abstract: Many problems in machine learning involve calculating correspondences between sets of objects, such as point clouds or images. Discrete optimal transport provides a natural and successful approach to such tasks whenever the two sets of objects can be represented in the same space, or at least distances between them can be directly evaluated. Unfortunately neither requirement is likely to hold when… ▽ More

    Submitted 26 February, 2019; v1 submitted 24 June, 2018; originally announced June 2018.

    Comments: AISTATS 2019

  25. arXiv:1806.08049  [pdf, other

    cs.LG stat.ML

    On the Robustness of Interpretability Methods

    Authors: David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: We argue that robustness of explanations---i.e., that similar inputs should give rise to similar explanations---is a key desideratum for interpretability. We introduce metrics to quantify robustness and demonstrate that current methods do not perform well according to these metrics. Finally, we propose ways that robustness can be enforced on existing interpretability approaches.

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: presented at 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden

  26. arXiv:1806.07538  [pdf, other

    cs.LG stat.ML

    Towards Robust Interpretability with Self-Explaining Neural Networks

    Authors: David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: Most recent work on interpretability of complex machine learning models has focused on estimating $\textit{a posteriori}$ explanations for previously trained models around specific predictions. $\textit{Self-explaining}$ models where interpretability plays a key role already during learning have received much less attention. We propose three desiderata for explanations in general -- explicitness,… ▽ More

    Submitted 3 December, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: NeurIPS 2018

  27. arXiv:1712.06199  [pdf, other

    stat.ML cs.LG

    Structured Optimal Transport

    Authors: David Alvarez-Melis, Tommi S. Jaakkola, Stefanie Jegelka

    Abstract: Optimal Transport has recently gained interest in machine learning for applications ranging from domain adaptation, sentence similarities to deep learning. Yet, its ability to capture frequently occurring structure beyond the "ground metric" is limited. In this work, we develop a nonlinear generalization of (discrete) optimal transport that is able to reflect much additional structure. We demonstr… ▽ More

    Submitted 17 December, 2017; originally announced December 2017.

  28. arXiv:1707.01943  [pdf, other

    cs.LG

    A causal framework for explaining the predictions of black-box sequence-to-sequence models

    Authors: David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: We interpret the predictions of any black-box structured input-structured output model around a specific input-output pair. Our method returns an "explanation" consisting of groups of input-output tokens that are causally related. These dependencies are inferred by querying the black-box model with perturbed inputs, generating a graph over tokens from the responses, and solving a partitioning prob… ▽ More

    Submitted 14 November, 2017; v1 submitted 6 July, 2017; originally announced July 2017.

    Comments: 12 Pages, EMNLP 2017

  29. arXiv:1511.00573  [pdf, other

    stat.ML cs.AI cs.SI

    From random walks to distances on unweighted graphs

    Authors: Tatsunori B. Hashimoto, Yi Sun, Tommi S. Jaakkola

    Abstract: Large unweighted directed graphs are commonly used to capture relations between entities. A fundamental problem in the analysis of such networks is to properly define the similarity or dissimilarity between any two vertices. Despite the significance of this problem, statistical characterization of the proposed metrics has been limited. We introduce and develop a class of techniques for analyzing r… ▽ More

    Submitted 2 November, 2015; originally announced November 2015.

    Comments: To appear in NIPS 2015

  30. arXiv:1509.05808  [pdf, other

    cs.CL cs.LG stat.ML

    Word, graph and manifold embedding from Markov processes

    Authors: Tatsunori B. Hashimoto, David Alvarez-Melis, Tommi S. Jaakkola

    Abstract: Continuous vector representations of words and objects appear to carry surprisingly rich semantic content. In this paper, we advance both the conceptual and theoretical understanding of word embeddings in three ways. First, we ground embeddings in semantic spaces studied in cognitive-psychometric literature and introduce new evaluation tasks. Second, in contrast to prior work, we take metric recov… ▽ More

    Submitted 18 September, 2015; originally announced September 2015.

  31. arXiv:1411.5720  [pdf, other

    stat.ML cs.SI math.ST stat.ME

    Metric recovery from directed unweighted graphs

    Authors: Tatsunori B. Hashimoto, Yi Sun, Tommi S. Jaakkola

    Abstract: We analyze directed, unweighted graphs obtained from $x_i\in \mathbb{R}^d$ by connecting vertex $i$ to $j$ iff $|x_i - x_j| < ε(x_i)$. Examples of such graphs include $k$-nearest neighbor graphs, where $ε(x_i)$ varies from point to point, and, arguably, many real world graphs such as co-purchasing graphs. We ask whether we can recover the underlying Euclidean metric $ε(x_i)$ and the associated den… ▽ More

    Submitted 20 November, 2014; originally announced November 2014.

    Comments: Poster at NIPS workshop on networks. Submitted to AISTATS 2015

  32. arXiv:1309.6838  [pdf

    cs.LG stat.ML

    Inverse Covariance Estimation for High-Dimensional Data in Linear Time and Space: Spectral Methods for Riccati and Sparse Models

    Authors: Jean Honorio, Tommi S. Jaakkola

    Abstract: We propose maximum likelihood estimation for learning Gaussian graphical models with a Gaussian (ell_2^2) prior on the parameters. This is in contrast to the commonly used Laplace (ell_1) prior for encouraging sparseness. We show that our optimization problem leads to a Riccati matrix equation, which has a closed form solution. We propose an efficient algorithm that performs a singular value decom… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Journal ref: Uncertainty in Artificial Intelligence (UAI), 2013

  33. arXiv:1302.3586  [pdf

    cs.AI

    Computing Upper and Lower Bounds on Likelihoods in Intractable Networks

    Authors: Tommi S. Jaakkola, Michael I. Jordan

    Abstract: We present deterministic techniques for computing upper and lower bounds on marginal probabilities in sigmoid and noisy-OR networks. These techniques become useful when the size of the network (or clique size) precludes exact computations. We illustrate the tightness of the bounds by numerical experiments.

    Submitted 13 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

    Report number: UAI-P-1996-PG-340-348

  34. arXiv:1301.3875  [pdf

    cs.LG cs.AI stat.ML

    Tractable Bayesian Learning of Tree Belief Networks

    Authors: Marina Meila, Tommi S. Jaakkola

    Abstract: In this paper we present decomposable priors, a family of priors over structure and parameters of tree belief nets for which Bayesian learning with complete observations is tractable, in the sense that the posterior is also decomposable and can be completely determined analytically in polynomial time. This follows from two main results: First, we show that factored distributions over spanning tre… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-380-388

  35. arXiv:1301.3865  [pdf

    cs.LG stat.ML

    Feature Selection and Dualities in Maximum Entropy Discrimination

    Authors: Tony S. Jebara, Tommi S. Jaakkola

    Abstract: Incorporating feature selection into a classification or regression method often carries a number of advantages. In this paper we formalize feature selection specifically from a discriminative perspective of improving classification/regression accuracy. The feature selection method is developed as an extension to the recently proposed maximum entropy discrimination (MED) framework. We describe MED… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-291-300

  36. arXiv:1301.0610  [pdf

    cs.LG stat.ML

    A New Class of Upper Bounds on the Log Partition Function

    Authors: Martin Wainwright, Tommi S. Jaakkola, Alan Willsky

    Abstract: Bounds on the log partition function are important in a variety of contexts, including approximate inference, model fitting, decision theory, and large deviations analysis. We introduce a new class of upper bounds on the log partition function, based on convex combinations of distributions in the exponential domain, that is applicable to an arbitrary undirected graphical model. In the special cas… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-536-543

  37. arXiv:1301.0602  [pdf

    cs.LG stat.ML

    Unsupervised Active Learning in Large Domains

    Authors: Harald Steck, Tommi S. Jaakkola

    Abstract: Active learning is a powerful approach to analyzing data effectively. We show that the feasibility of active learning depends crucially on the choice of measure with respect to which the query is being optimized. The standard information gain, for example, does not permit an accurate evaluation with a small committee, a representative subset of the model space. We propose a surrogate measure requ… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-469-476

  38. arXiv:1301.0562  [pdf

    cs.LG stat.ML

    Continuation Methods for Mixing Heterogenous Sources

    Authors: Adrian Corduneanu, Tommi S. Jaakkola

    Abstract: A number of modern learning tasks involve estimation from heterogeneous information sources. This includes classification with labeled and unlabeled data as well as other problems with analogous structure such as competitive (game theoretic) problems. The associated estimation problems can be typically reduced to solving a set of fixed point equations (consistency conditions). We introduce a gener… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-111-118

  39. arXiv:1212.2466  [pdf

    cs.LG stat.ML

    On Information Regularization

    Authors: Adrian Corduneanu, Tommi S. Jaakkola

    Abstract: We formulate a principle for classification with the knowledge of the marginal distribution over the data points (unlabeled data). The principle is cast in terms of Tikhonov style regularization where the regularization penalty articulates the way in which the marginal density should constrain otherwise unrestricted conditional distributions. Specifically, the regularization penalty penalizes any… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-151-158

  40. arXiv:1206.5243  [pdf

    cs.LG stat.ML

    Convergent Propagation Algorithms via Oriented Trees

    Authors: Amir Globerson, Tommi S. Jaakkola

    Abstract: Inference problems in graphical models are often approximated by casting them as constrained optimization problems. Message passing algorithms, such as belief propagation, have previously been suggested as methods for solving these optimization problems. However, there are few convergence guarantees for such algorithms, and the algorithms are therefore not guaranteed to solve the corresponding opt… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-133-140

  41. arXiv:1206.3288  [pdf

    cs.DS cs.AI cs.CE

    Tightening LP Relaxations for MAP using Message Passing

    Authors: David Sontag, Talya Meltzer, Amir Globerson, Tommi S. Jaakkola, Yair Weiss

    Abstract: Linear Programming (LP) relaxations have become powerful tools for finding the most probable (MAP) configuration in graphical models. These relaxations can be solved efficiently using message-passing algorithms such as belief propagation and, when the relaxation is tight, provably find the MAP configuration. The standard LP relaxation is not tight enough in many real-world problems, however, and t… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-503-510

  42. Variational Probabilistic Inference and the QMR-DT Network

    Authors: T. S. Jaakkola, M. I. Jordan

    Abstract: We describe a variational approximation method for efficient inference in large-scale probabilistic models. Variational methods are deterministic procedures that provide approximations to marginal and conditional probabilities of interest. They provide alternatives to approximate inference methods based on stochastic sampling or search. We describe a variational approach to the p… ▽ More

    Submitted 26 May, 2011; originally announced May 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 10, pages 291-322, 1999

  43. arXiv:cs/0508070  [pdf, ps, other

    cs.IT cs.AI

    MAP estimation via agreement on (hyper)trees: Message-passing and linear programming

    Authors: Martin J. Wainwright, Tommi S. Jaakkola, Alan S. Willsky

    Abstract: We develop and analyze methods for computing provably optimal {\em maximum a posteriori} (MAP) configurations for a subclass of Markov random fields defined on graphs with cycles. By decomposing the original distribution into a convex combination of tree-structured distributions, we obtain an upper bound on the optimal value of the original problem (i.e., the log probability of the MAP assignmen… ▽ More

    Submitted 15 August, 2005; originally announced August 2005.

    Comments: Presented in part at the Allerton Conference on Communication, Computing and Control in October 2002. Full journal version appear in the IEEE Transactions on Information Theory, November 2005