Showing 1–3 of 3 results for author: Kastoryano, M J

  1. arXiv:2406.04332

    cs.CV cs.LG

    Coarse-To-Fine Tensor Trains for Compact Visual Representations

    Authors: Sebastian Loeschcke, Dan Wang, Christian Leth-Espensen, Serge Belongie, Michael J. Kastoryano, Sagie Benaim

    Abstract: The ability to learn compact, high-quality, and easy-to-optimize representations for visual data is paramount to many applications such as novel view synthesis and 3D reconstruction. Recent work has shown substantial success in using tensor networks to design such compact and high-quality representations. However, the ability to optimize tensor-based representations, and in particular, the highly…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project webpage: https://sebulo.github.io/PuTT_website/

  2. arXiv:2405.20322

    quant-ph cs.DS

    Quantum generalizations of Glauber and Metropolis dynamics

    Authors: András Gilyén, Chi-Fang Chen, Joao F. Doriguello, Michael J. Kastoryano

    Abstract: Classical Markov Chain Monte Carlo methods have been essential for simulating statistical physical systems and have proven well applicable to other systems with complex degrees of freedom. Motivated by the statistical physics origins, Chen, Kastoryano, and Gilyén [CKG23] proposed a continuous-time quantum thermodynamic analog to Glauber dynamics that is (i) exactly detailed balanced, (ii) efficient…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.16528

    cs.LG cs.CL

    LoQT: Low Rank Adapters for Quantized Training

    Authors: Sebastian Loeschcke, Mads Toftrup, Michael J. Kastoryano, Serge Belongie, Vésteinn Snæbjarnarson

    Abstract: Training of large neural networks requires significant computational resources. Despite advances using low-rank adapters and quantization, pretraining of models such as LLMs on consumer hardware has not been possible without model sharding, offloading during training, or per-layer gradient updates. To address these limitations, we propose LoQT, a method for efficiently training quantized models. L…

    Submitted 26 May, 2024; originally announced May 2024.