Showing 1–4 of 4 results for author: Gulluk, H I

  1. arXiv:2303.03382

    cs.LG stat.ML

    Globally Optimal Training of Neural Networks with Threshold Activation Functions

    Authors: Tolga Ergen, Halil Ibrahim Gulluk, Jonathan Lacotte, Mert Pilanci

    Abstract: Threshold activation functions are highly preferable in neural networks due to their efficiency in hardware implementations. Moreover, their mode of operation is more interpretable and resembles that of biological neurons. However, traditional gradient based algorithms such as Gradient Descent cannot be used to train the parameters of neural networks with threshold activations since the activation…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023

  2. arXiv:2302.06232

    cs.LG stat.ML

    Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data

    Authors: Ryumei Nakada, Halil Ibrahim Gulluk, Zhun Deng, Wenlong Ji, James Zou, Linjun Zhang

    Abstract: Language-supervised vision models have recently attracted great attention in computer vision. A common approach to build such models is to use contrastive learning on paired data across the two modalities, as exemplified by Contrastive Language-Image Pre-Training (CLIP). In this paper, under linear representation settings, (i) we initiate the investigation of a general class of nonlinear loss func…

    Submitted 14 March, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 42 pages, 3 figures, accepted by AISTATS 2023; a link to GitHub repository added, style corrected, acknowledgements section added

  3. arXiv:2201.06142

    cs.LG stat.ML

    Towards Sample-efficient Overparameterized Meta-learning

    Authors: Yue Sun, Adhyyan Narang, Halil Ibrahim Gulluk, Samet Oymak, Maryam Fazel

    Abstract: An overarching goal in machine learning is to build a generalizable model with few samples. To this end, overparameterization has been the subject of immense interest to explain the generalization ability of deep nets even when the size of the dataset is smaller than that of the model. While the prior literature focuses on the classical supervised setting, this paper aims to demystify overparamete…

    Submitted 16 January, 2022; originally announced January 2022.

    Journal ref: Advances in Neural Information Processing Systems, 34 (2021)

  4. arXiv:2102.07206

    cs.LG stat.ML

    Sample Efficient Subspace-based Representations for Nonlinear Meta-Learning

    Authors: Halil Ibrahim Gulluk, Yue Sun, Samet Oymak, Maryam Fazel

    Abstract: Constructing good representations is critical for learning complex tasks in a sample efficient manner. In the context of meta-learning, representations can be constructed from common patterns of previously seen tasks so that a future task can be learned quickly. While recent works show the benefit of subspace-based representations, such results are limited to linear-regression tasks. This work exp…

    Submitted 26 February, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: To appear in ICASSP 21'