Skip to main content

Showing 1–5 of 5 results for author: Zi, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.06059  [pdf, other

    quant-ph cs.AI

    Efficient Quantum Circuits for Machine Learning Activation Functions including Constant T-depth ReLU

    Authors: Wei Zi, Siyi Wang, Hyunji Kim, Xiaoming Sun, Anupam Chattopadhyay, Patrick Rebentrost

    Abstract: In recent years, Quantum Machine Learning (QML) has increasingly captured the interest of researchers. Among the components in this domain, activation functions hold a fundamental and indispensable role. Our research focuses on the development of activation functions quantum circuits for integration into fault-tolerant quantum computing architectures, with an emphasis on minimizing $T$-depth. Spec… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 13 pages

  2. arXiv:2211.05187  [pdf, other

    cs.CV

    Training a Vision Transformer from scratch in less than 24 hours with 1 GPU

    Authors: Saghar Irandoust, Thibaut Durand, Yunduz Rakhmangulova, Wenjie Zi, Hossein Hajimirsadeghi

    Abstract: Transformers have become central to recent advances in computer vision. However, training a vision Transformer (ViT) model from scratch can be resource intensive and time consuming. In this paper, we aim to explore approaches to reduce the training costs of ViT models. We introduce some algorithmic improvements to enable training a ViT model from scratch with limited hardware (1 GPU) and time (24… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 7 pages, 2 figures, 1 table, published in "Has it Trained Yet? Workshop at the Conference on Neural Information Processing Systems (NeurIPS 2022)"

    ACM Class: I.2.10

  3. arXiv:2106.04559  [pdf, other

    cs.CL

    Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface

    Authors: Peng Xu, Wenjie Zi, Hamidreza Shahidi, Ákos Kádár, Keyi Tang, Wei Yang, Jawad Ateeq, Harsh Barot, Meidan Alon, Yanshuai Cao

    Abstract: A natural language database interface (NLDB) can democratize data-driven insights for non-technical users. However, existing Text-to-SQL semantic parsers cannot achieve high enough accuracy in the cross-database setting to allow good usability in practice. This work presents Turing, a NLDB system toward bridging this gap. The cross-domain semantic parser of Turing with our novel value prediction m… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: ACL 2021 demonstration track

  4. arXiv:2012.15355  [pdf, other

    cs.CL cs.LG

    Optimizing Deeper Transformers on Small Datasets

    Authors: Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie Chi Kit Cheung, Simon J. D. Prince, Yanshuai Cao

    Abstract: It is a common belief that training deep transformers from scratch requires large datasets. Consequently, for small datasets, people usually use shallow and simple additional layers on top of pre-trained models during fine-tuning. This work shows that this does not always need to be the case: with proper initialization and optimization, the benefits of very deep transformers can carry over to chal… ▽ More

    Submitted 31 May, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: Accepted at ACL 2021 main conference

  5. arXiv:1907.05083  [pdf, ps, other

    cs.DS cs.GT

    Cake Cutting on Graphs: A Discrete and Bounded Proportional Protocol

    Authors: Xiaohui Bei, Xiaoming Sun, Hao Wu, Jialin Zhang, Zhijie Zhang, Wei Zi

    Abstract: The classical cake cutting problem studies how to find fair allocations of a heterogeneous and divisible resource among multiple agents. Two of the most commonly studied fairness concepts in cake cutting are proportionality and envy-freeness. It is well known that a proportional allocation among $n$ agents can be found efficiently via simple protocols [16]. For envy-freeness, in a recent breakthro… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.