Skip to main content

Showing 1–8 of 8 results for author: Alon, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.14125  [pdf, other

    cs.CV cs.AI

    VideoPoet: A Large Language Model for Zero-Shot Video Generation

    Authors: Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Josh Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam , et al. (6 additional authors not shown)

    Abstract: We present VideoPoet, a language model capable of synthesizing high-quality video, with matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder-only transformer architecture that processes multimodal inputs -- including images, videos, text, and audio. The training protocol follows that of Large Language Models (LLMs), consisting of two stages: pretraining and tas… ▽ More

    Submitted 4 June, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear at ICML 2024; Project page: http://sites.research.google/videopoet/

  2. arXiv:2311.00056  [pdf, other

    cs.CV cs.AI cs.LG

    Diversity and Diffusion: Observations on Synthetic Image Distributions with Stable Diffusion

    Authors: David Marwood, Shumeet Baluja, Yair Alon

    Abstract: Recent progress in text-to-image (TTI) systems, such as StableDiffusion, Imagen, and DALL-E 2, have made it possible to create realistic images with simple text prompts. It is tempting to use these systems to eliminate the manual task of obtaining natural images for training a new machine learning classifier. However, in all of the experiments performed to date, classifiers trained solely with syn… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  3. arXiv:2207.14648  [pdf, other

    cs.SE cs.AI cs.LG

    Using Graph Neural Networks for Program Termination

    Authors: Yoav Alon, Cristina David

    Abstract: Termination analyses investigate the termination behavior of programs, intending to detect nontermination, which is known to cause a variety of program bugs (e.g. hanging programs, denial-of-service vulnerabilities). Beyond formal approaches, various attempts have been made to estimate the termination behavior of programs using neural networks. However, the majority of these approaches continue to… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: ESEC/FSE 2022 - The 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering

  4. arXiv:2201.03669  [pdf, other

    eess.IV cs.CV q-bio.QM

    Neuroplastic graph attention networks for nuclei segmentation in histopathology images

    Authors: Yoav Alon, Huiyu Zhou

    Abstract: Modern histopathological image analysis relies on the segmentation of cell structures to derive quantitative metrics required in biomedical research and clinical diagnostics. State-of-the-art deep learning approaches predominantly apply convolutional layers in segmentation and are typically highly customized for a specific experimental configuration; often unable to generalize to unknown data. As… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

  5. arXiv:2103.13997  [pdf

    cs.CL cs.LG

    Real-time low-resource phoneme recognition on edge devices

    Authors: Yonatan Alon

    Abstract: While speech recognition has seen a surge in interest and research over the last decade, most machine learning models for speech recognition either require large training datasets or lots of storage and memory. Combined with the prominence of English as the number one language in which audio data is available, this means most other languages currently lack good speech recognition models. The met… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: The model and code described in this paper are publicly available at https://github.com/yonatankarimish/YonaVox

    ACM Class: I.2.7

  6. arXiv:2101.11600  [pdf, other

    eess.IV cs.CV q-bio.QM

    Synthetic Generation of Three-Dimensional Cancer Cell Models from Histopathological Images

    Authors: Yoav Alon, Xiang Yu, Huiyu Zhou

    Abstract: Synthetic generation of three-dimensional cell models from histopathological images aims to enhance understanding of cell mutation, and progression of cancer, necessary for clinical assessment and optimal treatment. Classical reconstruction algorithms based on image registration of consecutive slides of stained tissues are prone to errors and often not suitable for the training of three-dimensiona… ▽ More

    Submitted 8 February, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: For submission in MICCAI2021

  7. arXiv:2012.15472  [pdf, other

    cs.RO cs.AI cs.LG

    Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization

    Authors: Yoav Alon, Huiyu Zhou

    Abstract: Recent technological progress in the development of Unmanned Aerial Vehicles (UAVs) together with decreasing acquisition costs make the application of drone fleets attractive for a wide variety of tasks. In agriculture, disaster management, search and rescue operations, commercial and military applications, the advantage of applying a fleet of drones originates from their ability to cooperate auto… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  8. arXiv:2012.01988  [pdf, other

    cs.CV

    Wisdom of Committees: An Overlooked Approach To Faster and More Accurate Models

    Authors: Xiaofang Wang, Dan Kondratyuk, Eric Christiansen, Kris M. Kitani, Yair Alon, Elad Eban

    Abstract: Committee-based models (ensembles or cascades) construct models by combining existing pre-trained ones. While ensembles and cascades are well-known techniques that were proposed before deep learning, they are not considered a core building block of deep model architectures and are rarely compared to in recent literature on develo** efficient models. In this work, we go back to basics and conduct… ▽ More

    Submitted 17 February, 2022; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: ICLR 2022