Showing 1–6 of 6 results for author: Matoba, K

  1. arXiv:2311.16079  [pdf, other]

    cs.CL cs.AI cs.LG

    MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

    Authors: Zeming Chen, Alejandro Hernández Cano, Angelika Romanou, Antoine Bonnet, Kyle Matoba, Francesco Salvi, Matteo Pagliardini, Simin Fan, Andreas Köpf, Amirkeivan Mohtashami, Alexandre Sallinen, Alireza Sakhaeirad, Vinitra Swamy, Igor Krawczuk, Deniz Bayazit, Axel Marmet, Syrielle Montariol, Mary-Anne Hartley, Martin Jaggi, Antoine Bosselut

    Abstract: Large language models (LLMs) can potentially democratize access to medical knowledge. While many efforts have been made to harness and improve LLMs' medical knowledge and reasoning capacities, the resulting models are either closed-source (e.g., PaLM, GPT-4) or limited in scale (<= 13B parameters), which restricts their abilities. In this work, we improve access to large-scale medical LLMs by rele…

    Submitted 27 November, 2023; originally announced November 2023.

  2. arXiv:2210.11269  [pdf, other]

    cs.LG physics.ao-ph physics.flu-dyn

    Inference from Real-World Sparse Measurements

    Authors: Arnaud Pannatier, Kyle Matoba, François Fleuret

    Abstract: Real-world problems often involve complex and unstructured sets of measurements, which occur when sensors are sparsely placed in either space or time. Being able to model this irregular spatiotemporal data and extract meaningful forecasts is crucial. Deep learning architectures capable of processing sets of measurements with positions varying from set to set, and extracting readouts anywhere are m…

    Submitted 15 April, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 27 pages, 12 figures, Published at TMLR https://openreview.net/forum?id=y9IDfODRns

  3. arXiv:2206.07144  [pdf, other]

    cs.LG

    Efficiently Training Low-Curvature Neural Networks

    Authors: Suraj Srinivas, Kyle Matoba, Himabindu Lakkaraju, Francois Fleuret

    Abstract: The highly non-linear nature of deep neural networks causes them to be susceptible to adversarial examples and have unstable gradients which hinders interpretability. However, existing methods to solve these issues, such as adversarial training, are expensive and often sacrifice predictive accuracy. In this work, we consider curvature, which is a mathematical quantity which encodes the degree of…

    Submitted 10 January, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022

  4. arXiv:2203.01016  [pdf, other]

    cs.LG

    The Theoretical Expressiveness of Maxpooling

    Authors: Kyle Matoba, Nikolaos Dimitriadis, François Fleuret

    Abstract: Over the decade since deep neural networks became state of the art image classifiers there has been a tendency towards less use of max pooling: the function that takes the largest of nearby pixels in an image. Since max pooling featured prominently in earlier generations of image classifiers, we wish to understand this trend, and whether it is justified. We develop a theoretical framework analyzin…

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 31 pages, 6 figures

  5. arXiv:2101.12509  [pdf, ps, other]

    cs.LG cs.AI

    Challenges for Using Impact Regularizers to Avoid Negative Side Effects

    Authors: David Lindner, Kyle Matoba, Alexander Meulemans

    Abstract: Designing reward functions for reinforcement learning is difficult: besides specifying which behavior is rewarded for a task, the reward also has to discourage undesired outcomes. Misspecified reward functions can lead to unintended negative side effects, and overall unsafe behavior. To overcome this problem, recent work proposed to augment the specified reward function with an impact regularizer…

    Submitted 23 February, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: Presented at the SafeAI workshop at AAAI 2021

  6. arXiv:1109.3873  [pdf, other]

    math.NA

    A Computable Figure of Merit for Quasi-Monte Carlo Point Sets

    Authors: Makoto Matsumoto, Mutsuo Saito, Kyle Matoba

    Abstract: Let $\mathcal{P} \subset [0,1)^S$ be a finite point set of cardinality $N$ in an $S$-dimensional cube, and let $f:[0,1)^S \to \mathbb{R}$ be an integrable function. A QMC integration of $f$ by $\mathcal{P}$ is the average of values of $f$ at each point in $\mathcal{P}$, which approximates the integration of $f$ over the cube. Assume that $\mathcal{P}$ is constructed from an $\mathbb{F}_2$-vector sp…

    Submitted 20 February, 2012; v1 submitted 18 September, 2011; originally announced September 2011.