Skip to main content

Showing 1–10 of 10 results for author: Bingham, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2401.15814  [pdf, other

    cs.LG

    OntoMedRec: Logically-Pretrained Model-Agnostic Ontology Encoders for Medication Recommendation

    Authors: Weicong Tan, Weiqing Wang, Xin Zhou, Wray Buntine, Gordon Bingham, Hongzhi Yin

    Abstract: Most existing medication recommendation models learn representations for medical concepts based on electronic health records (EHRs) and make recommendations with learnt representations. However, most medications appear in the dataset for limited times, resulting in insufficient learning of their representations. Medical ontologies are the hierarchical classification systems for medical terms where… ▽ More

    Submitted 14 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  3. arXiv:2304.03374  [pdf, other

    cs.LG

    Optimizing Neural Networks through Activation Function Discovery and Automatic Weight Initialization

    Authors: Garrett Bingham

    Abstract: Automated machine learning (AutoML) methods improve upon existing models by optimizing various aspects of their design. While present methods focus on hyperparameters and neural network topologies, other aspects of neural network design can be optimized as well. To further the state of the art in AutoML, this dissertation introduces techniques for discovering more powerful activation functions and… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: PhD Dissertation

  4. arXiv:2301.05785  [pdf, other

    cs.LG cs.NE

    Efficient Activation Function Optimization through Surrogate Modeling

    Authors: Garrett Bingham, Risto Miikkulainen

    Abstract: Carefully designed activation functions can improve the performance of neural networks in many machine learning tasks. However, it is difficult for humans to construct optimal activation functions, and current activation function search algorithms are prohibitively expensive. This paper aims to improve the state of the art through three steps: First, the benchmark datasets Act-Bench-CNN, Act-Bench… ▽ More

    Submitted 8 November, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2023. 28 pages, 16 figures, 6 tables

  5. arXiv:2109.08958  [pdf, other

    cs.LG

    AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks

    Authors: Garrett Bingham, Risto Miikkulainen

    Abstract: Neural networks require careful weight initialization to prevent signals from exploding or vanishing. Existing initialization schemes solve this problem in specific cases by assuming that the network has a certain activation function or topology. It is difficult to derive such weight initialization strategies, and modern architectures therefore often use these same initialization schemes even thou… ▽ More

    Submitted 29 November, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: To appear in AAAI 2023. 19 pages, 10 figures, 3 tables

  6. arXiv:2006.03179  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Discovering Parametric Activation Functions

    Authors: Garrett Bingham, Risto Miikkulainen

    Abstract: Recent studies have shown that the choice of activation function can significantly affect the performance of deep learning networks. However, the benefits of novel activation functions have been inconsistent and task dependent, and therefore the rectified linear unit (ReLU) is still the most commonly used. This paper proposes a technique for customizing activation functions automatically, resultin… ▽ More

    Submitted 21 January, 2022; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: Published in Neural Networks. 34 pages, 10 figures, 11 tables

    Journal ref: Neural Networks, Volume 148, 2022, Pages 48-65, ISSN 0893-6080

  7. arXiv:2002.07224  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Evolutionary Optimization of Deep Learning Activation Functions

    Authors: Garrett Bingham, William Macke, Risto Miikkulainen

    Abstract: The choice of activation function can have a large effect on the performance of a neural network. While there have been some attempts to hand-engineer novel activation functions, the Rectified Linear Unit (ReLU) remains the most commonly-used in practice. This paper shows that evolutionary algorithms can discover novel activation functions that outperform ReLU. A tree-based search space of candida… ▽ More

    Submitted 11 April, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: 8 pages; 9 figures/tables; GECCO 2020

  8. arXiv:1906.03492  [pdf, other

    cs.CL

    Improving Low-Resource Cross-lingual Document Retrieval by Reranking with Deep Bilingual Representations

    Authors: Rui Zhang, Caitlin Westerfield, Sungrok Shim, Garrett Bingham, Alexander Fabbri, Neha Verma, William Hu, Dragomir Radev

    Abstract: In this paper, we propose to boost low-resource cross-lingual document retrieval performance with deep bilingual query-document representations. We match queries and documents in both source and target languages with four components, each of which is implemented as a term interaction-based deep neural network with cross-lingual word embeddings as input. By including query likelihood scores as extr… ▽ More

    Submitted 8 June, 2019; originally announced June 2019.

    Comments: ACL 2019, short paper

  9. arXiv:1811.06446  [pdf, other

    cs.CV

    Preliminary Studies on a Large Face Database

    Authors: Benjamin Yip, Garrett Bingham, Katherine Kempfert, Jonathan Fabish, Troy Kling, Cuixian Chen, Yishi Wang

    Abstract: We perform preliminary studies on a large longitudinal face database MORPH-II, which is a benchmark dataset in the field of computer vision and pattern recognition. First, we summarize the inconsistencies in the dataset and introduce the steps and strategy taken for cleaning. The potential implications of these inconsistencies on prior research are introduced. Next, we propose a new automatic subs… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

    Comments: It has been accepted in the 5th National Symposium for NSF REU Research in Data Science, Systems, and Security. G. Bingham and K. Kempfert contributed equally

  10. arXiv:1711.00575  [pdf, other

    cs.CV

    Random Subspace Two-dimensional LDA for Face Recognition

    Authors: Garrett Bingham

    Abstract: In this paper, a novel technique named random subspace two-dimensional LDA (RS-2DLDA) is developed for face recognition. This approach offers a number of improvements over the random subspace two-dimensional PCA (RS2DPCA) framework introduced by Nguyen et al. [5]. Firstly, the eigenvectors from 2DLDA have more discriminative power than those from 2DPCA, resulting in higher accuracy for the RS-2DLD… ▽ More

    Submitted 1 November, 2017; originally announced November 2017.