Skip to main content

Showing 1–22 of 22 results for author: Bica, I

.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2401.09865  [pdf, other

    cs.CV cs.AI cs.LG

    Improving fine-grained understanding in image-text pre-training

    Authors: Ioana Bica, Anastasija Ilić, Matthias Bauer, Goker Erdogan, Matko Bošnjak, Christos Kaplanis, Alexey A. Gritsenko, Matthias Minderer, Charles Blundell, Razvan Pascanu, Jovana Mitrović

    Abstract: We introduce SPARse Fine-grained Contrastive Alignment (SPARC), a simple method for pretraining more fine-grained multimodal representations from image-text pairs. Given that multiple image patches often correspond to single words, we propose to learn a grou** of image patches for every token in the caption. To achieve this, we use a sparse similarity metric between image patches and language to… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 26 pages

  3. arXiv:2311.01489  [pdf, other

    stat.ML cs.LG

    Invariant Causal Imitation Learning for Generalizable Policies

    Authors: Ioana Bica, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Consider learning an imitation policy on the basis of demonstrated behavior from multiple environments, with an eye towards deployment in an unseen environment. Since the observable features from each setting may be different, directly learning individual policies as map**s from features to actions is prone to spurious correlations -- and may not generalize well. However, the expert's policy is… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Journal ref: In Proc. 35th International Conference on Neural Information Processing Systems (NeurIPS 2021)

  4. arXiv:2311.01388  [pdf, other

    stat.ML cs.LG

    Time-series Generation by Contrastive Imitation

    Authors: Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

    Abstract: Consider learning a generative model for time-series data. The sequential setting poses a unique challenge: Not only should the generator capture the conditional dynamics of (stepwise) transitions, but its open-loop rollouts should also preserve the joint distribution of (multi-step) trajectories. On one hand, autoregressive models trained by MLE allow learning and computing explicit transition di… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Journal ref: In Proc. 35th International Conference on Neural Information Processing Systems (NeurIPS 2021)

  5. arXiv:2310.18688  [pdf, other

    cs.LG

    Clairvoyance: A Pipeline Toolkit for Medical Time Series

    Authors: Daniel Jarrett, **sung Yoon, Ioana Bica, Zhaozhi Qian, Ari Ercole, Mihaela van der Schaar

    Abstract: Time-series learning is the bread and butter of data-driven *clinical decision support*, and the recent explosion in ML research has demonstrated great potential in various healthcare settings. At the same time, medical time-series problems in the wild are challenging due to their highly *composite* nature: They entail design choices and interactions among components that preprocess data, impute m… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Journal ref: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

  6. arXiv:2302.10258  [pdf, other

    cs.LG cs.AI stat.ME

    Neural Algorithmic Reasoning with Causal Regularisation

    Authors: Beatrice Bevilacqua, Kyriacos Nikiforou, Borja Ibarz, Ioana Bica, Michela Paganini, Charles Blundell, Jovana Mitrovic, Petar Veličković

    Abstract: Recent work on neural algorithmic reasoning has investigated the reasoning capabilities of neural networks, effectively demonstrating they can learn to execute classical algorithms on unseen data coming from the train distribution. However, the performance of existing neural reasoners significantly degrades on out-of-distribution (OOD) test data, where inputs have larger sizes. In this work, we ma… ▽ More

    Submitted 3 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: ICML 2023, Camera Ready; 17 pages, 7 figures

  7. arXiv:2210.13043  [pdf, other

    cs.LG cs.AI

    Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data

    Authors: Nabeel Seedat, Jonathan Crabbé, Ioana Bica, Mihaela van der Schaar

    Abstract: High model performance, on average, can hide that models may systematically underperform on subgroups of the data. We consider the tabular setting, which surfaces the unique issue of outcome heterogeneity - this is prevalent in areas such as healthcare, where patients with similar features can have different outcomes, thus making reliable predictions challenging. To tackle this, we propose Data-IQ… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Presented at NeurIPS 2022

  8. arXiv:2210.06183  [pdf, other

    cs.LG stat.ME

    Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation

    Authors: Ioana Bica, Mihaela van der Schaar

    Abstract: Consider the problem of improving the estimation of conditional average treatment effects (CATE) for a target domain of interest by leveraging related information from a source domain with a different feature space. This heterogeneous transfer learning problem for CATE estimation is ubiquitous in areas such as healthcare where we may wish to evaluate the effectiveness of a treatment for a new pati… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

  9. arXiv:2208.01373  [pdf, other

    cs.LG cs.AI stat.ML

    DAPDAG: Domain Adaptation via Perturbed DAG Reconstruction

    Authors: Yanke Li, Hatt Tobias, Ioana Bica, Mihaela van der Schaar

    Abstract: Leveraging labelled data from multiple domains to enable prediction in another domain without labels is a significant, yet challenging problem. To address this problem, we introduce the framework DAPDAG (\textbf{D}omain \textbf{A}daptation via \textbf{P}erturbed \textbf{DAG} Reconstruction) and propose to learn an auto-encoder that undertakes inference on population statistics given features and r… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  10. arXiv:2206.08363  [pdf, other

    cs.LG cs.AI stat.ME

    Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability

    Authors: Jonathan Crabbé, Alicia Curth, Ioana Bica, Mihaela van der Schaar

    Abstract: Estimating personalized effects of treatments is a complex, yet pervasive problem. To tackle it, recent developments in the machine learning (ML) literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools: due to their flexibility, modularity and ability to learn constrained representations, neural networks in particular have become central to this l… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  11. arXiv:2201.05119  [pdf, other

    cs.CV cs.LG stat.ML

    Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?

    Authors: Nenad Tomasev, Ioana Bica, Brian McWilliams, Lars Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic

    Abstract: Despite recent progress made by self-supervised methods in representation learning with residual networks, they still underperform supervised learning on the ImageNet classification benchmark, limiting their applicability in performance-critical settings. Building on prior theoretical insights from ReLIC [Mitrovic et al., 2021], we include additional inductive biases into self-supervised learning.… ▽ More

    Submitted 3 November, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

  12. arXiv:2112.03811  [pdf, other

    cs.LG

    Disentangled Counterfactual Recurrent Networks for Treatment Effect Inference over Time

    Authors: Jeroen Berrevoets, Alicia Curth, Ioana Bica, Eoin McKinney, Mihaela van der Schaar

    Abstract: Choosing the best treatment-plan for each individual patient requires accurate forecasts of their outcome trajectories as a function of the treatment, over time. While large observational data sets constitute rich sources of information to learn from, they also contain biases as treatments are rarely assigned randomly in practice. To provide accurate and unbiased forecasts, we introduce the Disent… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

  13. arXiv:2106.04240  [pdf, other

    cs.LG

    The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation

    Authors: Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Understanding decision-making in clinical environments is of paramount importance if we are to bring the strengths of machine learning to ultimately improve patient outcomes. Several factors including the availability of public data, the intrinsically offline nature of the problem, and the complexity of human decision making, has meant that the mainstream development of algorithms is often geared… ▽ More

    Submitted 14 March, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

  14. arXiv:2102.11500  [pdf, other

    cs.LG cs.AI

    Model-Attentive Ensemble Learning for Sequence Modeling

    Authors: Victor D. Bourgin, Ioana Bica, Mihaela van der Schaar

    Abstract: Medical time-series datasets have unique characteristics that make prediction tasks challenging. Most notably, patient trajectories often contain longitudinal variations in their input-output relationships, generally referred to as temporal conditional shift. Designing sequence models capable of adapting to such time-varying distributions remains a prevailing problem. To address this we present Mo… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

  15. arXiv:2102.06271  [pdf, other

    cs.LG

    Selecting Treatment Effects Models for Domain Adaptation Using Causal Knowledge

    Authors: Trent Kyono, Ioana Bica, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Selecting causal inference models for estimating individualized treatment effects (ITE) from observational data presents a unique challenge since the counterfactual outcomes are never observed. The problem is challenged further in the unsupervised domain adaptation (UDA) setting where we only have access to labeled samples in the source domain, but desire selecting a model that achieves good perfo… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  16. arXiv:2101.11769  [pdf, other

    stat.ML cs.LG

    Learning Matching Representations for Individualized Organ Transplantation Allocation

    Authors: Can Xu, Ahmed M. Alaa, Ioana Bica, Brent D. Ershoff, Maxime Cannesson, Mihaela van der Schaar

    Abstract: Organ transplantation is often the last resort for treating end-stage illness, but the probability of a successful transplantation depends greatly on compatibility between donors and recipients. Current medical practice relies on coarse rules for donor-recipient matching, but is short of domain knowledge regarding the complex factors underlying organ compatibility. In this paper, we formulate the… ▽ More

    Submitted 1 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted to AISTATS 2021

  17. arXiv:2007.13531  [pdf, other

    cs.LG cs.AI stat.ML

    Learning "What-if" Explanations for Sequential Decision-Making

    Authors: Ioana Bica, Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Building interpretable parameterizations of real-world decision-making on the basis of demonstrated behavior -- i.e. trajectories of observations and actions made by an expert maximizing some unknown reward function -- is essential for introspecting and auditing policies in different institutions. In this paper, we propose learning explanations of expert decisions by modeling their reward function… ▽ More

    Submitted 30 March, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

  18. arXiv:2007.09317  [pdf, other

    stat.ME

    Robust Optimal Designs when Missing Data Happen at Random

    Authors: Rui Hu, Ion Bica, Zhichun Zhai

    Abstract: In this article, we investigate the robust optimal design problem for the prediction of response when the fitted regression models are only approximately specified, and observations might be missing completely at random. The intuitive idea is as follows: We assume that data are missing at random, and the complete case analysis is applied. To account for the occurrence of missing data, the design c… ▽ More

    Submitted 17 October, 2022; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: 28 pages. Submitted

  19. arXiv:2006.14154  [pdf, other

    stat.ML cs.LG

    Strictly Batch Imitation Learning by Energy-based Distribution Matching

    Authors: Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

    Abstract: Consider learning a policy purely on the basis of demonstrated behavior -- that is, with no access to reinforcement signals, no knowledge of transition dynamics, and no further interaction with the environment. This *strictly batch imitation learning* problem arises wherever live experimentation is costly, such as in healthcare. One solution is simply to retrofit existing algorithms for apprentice… ▽ More

    Submitted 14 January, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: In Proc. 34th International Conference on Neural Information Processing Systems (NeurIPS 2020)

  20. arXiv:2002.12326  [pdf, other

    cs.LG stat.ML

    Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks

    Authors: Ioana Bica, James Jordon, Mihaela van der Schaar

    Abstract: While much attention has been given to the problem of estimating the effect of discrete interventions from observational data, relatively little work has been done in the setting of continuous-valued interventions, such as treatments associated with a dosage parameter. In this paper, we tackle this problem by building on a modification of the generative adversarial networks (GANs) framework. Our m… ▽ More

    Submitted 22 November, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

    Journal ref: Advances in Neural Information Processing Systems (2020)

  21. arXiv:2002.04083  [pdf, other

    cs.LG stat.ML

    Estimating Counterfactual Treatment Outcomes over Time Through Adversarially Balanced Representations

    Authors: Ioana Bica, Ahmed M. Alaa, James Jordon, Mihaela van der Schaar

    Abstract: Identifying when to give treatments to patients and how to select among multiple treatments over time are important medical problems with a few existing solutions. In this paper, we introduce the Counterfactual Recurrent Network (CRN), a novel sequence-to-sequence model that leverages the increasingly available patient observational data to estimate treatment effects over time and answer such medi… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Journal ref: In Proc. 8th International Conference on Learning Representations (ICLR 2020)

  22. arXiv:1902.00450  [pdf, other

    cs.LG stat.AP stat.ML

    Time Series Deconfounder: Estimating Treatment Effects over Time in the Presence of Hidden Confounders

    Authors: Ioana Bica, Ahmed M. Alaa, Mihaela van der Schaar

    Abstract: The estimation of treatment effects is a pervasive problem in medicine. Existing methods for estimating treatment effects from longitudinal observational data assume that there are no hidden confounders, an assumption that is not testable in practice and, if it does not hold, leads to biased estimates. In this paper, we develop the Time Series Deconfounder, a method that leverages the assignment o… ▽ More

    Submitted 18 September, 2020; v1 submitted 1 February, 2019; originally announced February 2019.

    Journal ref: In Proc. 37th International Conference on Machine Learning (ICML 2020)