Search | arXiv e-print repository

Diffusion Models for Video Prediction and Infilling

Authors: Tobias Höppe, Arash Mehrjou, Stefan Bauer, Didrik Nielsen, Andrea Dittadi

Abstract: Predicting and anticipating future outcomes or reasoning about missing information in a sequence are critical skills for agents to be able to make intelligent decisions. This requires strong, temporally coherent generative capabilities. Diffusion models have shown remarkable success in several generative tasks, but have not been extensively explored in the video domain. We present Random-Mask Vide… ▽ More Predicting and anticipating future outcomes or reasoning about missing information in a sequence are critical skills for agents to be able to make intelligent decisions. This requires strong, temporally coherent generative capabilities. Diffusion models have shown remarkable success in several generative tasks, but have not been extensively explored in the video domain. We present Random-Mask Video Diffusion (RaMViD), which extends image diffusion models to videos using 3D convolutions, and introduces a new conditioning technique during training. By varying the mask we condition on, the model is able to perform video prediction, infilling, and upsampling. Due to our simple conditioning scheme, we can utilize the same architecture as used for unconditional training, which allows us to train the model in a conditional and unconditional fashion at the same time. We evaluate RaMViD on two benchmark datasets for video prediction, on which we achieve state-of-the-art results, and one for video generation. High-resolution videos are provided at https://sites.google.com/view/video-diffusion-prediction. △ Less

Submitted 14 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

Comments: Published in TMLR (11/2022)

arXiv:2112.01864 [pdf, other]

doi 10.1029/2022SW003196

Over 20-year global magnetohydrodynamic simulation of Earth's magnetosphere

Authors: Ilja Honkonen, Max van de Kamp, Theresa Hoppe, Kirsti Kauristie

Abstract: We present our approach to modeling over 20 years of the solar wind-magnetosphere-ionosphere system using version 5 of the Grand Unified Magnetosphere-Ionosphere Coupling Simulation (GUMICS-5). As input we use 16-s resolution magnetic field and 1-min plasma measurements by the Advanced Composition Explorer (ACE) satellite from 1998 to 2020. The modeled interval is divided into 28 h simulations, wh… ▽ More We present our approach to modeling over 20 years of the solar wind-magnetosphere-ionosphere system using version 5 of the Grand Unified Magnetosphere-Ionosphere Coupling Simulation (GUMICS-5). As input we use 16-s resolution magnetic field and 1-min plasma measurements by the Advanced Composition Explorer (ACE) satellite from 1998 to 2020. The modeled interval is divided into 28 h simulations, which include 4 h overlap. We use a maximum magnetospheric resolution of 0.5 Earth radii (Re) up to about 15 Re from Earth and decreasing resolution further away. In the ionosphere we use a maximum resolution of approximately 100 km poleward of +-58 degrees magnetic latitude and decreasing resolution towards the equator. With respect to the previous version GUMICS-4, we have parallelized the magnetosphere of GUMICS-5 using the Message Passing Interface and have made several improvements which have e.g. decreased its numerical diffusion. We compare the simulation results to several empirical models and geomagnetic indices derived from ground magnetic field measurements. GUMICS-5 reproduces observed solar cycle trends in magnetopause stand-off distance and magnetospheric lobe field strength but consistency in plasma sheet pressure and ionospheric cross-polar cap potential is lower. Comparisons with geomagnetic indices show better results for Kp index than for AE index. The simulation results are available at https://doi.org/10.23729/ca1da110-2d4e-45c4-8876-57210fbb0b0d, consisting of full ionospheric files and size-optimized magnetospheric files. The data used for Figures is available at https://doi.org/10.5281/zenodo.6641258. Our extensive results can serve e.g. as a foundation for a combined physics-based and black-box approach to real-time prediction of near-Earth space, or as input to other physics-based models of the inner magnetosphere, upper and middle atmosphere, etc. △ Less

Submitted 14 June, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

Journal ref: Space Weather, 20, e2022SW003196

arXiv:2101.00027 [pdf, other]

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

Authors: Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy

Abstract: Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mind, we present \textit{the Pile}: an 825 GiB English text corpus targeted at training large-scale language models. The Pile is constructed from 22 diverse high-quality subsets -- both existing and new… ▽ More Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mind, we present \textit{the Pile}: an 825 GiB English text corpus targeted at training large-scale language models. The Pile is constructed from 22 diverse high-quality subsets -- both existing and newly constructed -- many of which derive from academic or professional sources. Our evaluation of the untuned performance of GPT-2 and GPT-3 on the Pile shows that these models struggle on many of its components, such as academic writing. Conversely, models trained on the Pile improve significantly over both Raw CC and CC-100 on all components of the Pile, while improving performance on downstream evaluations. Through an in-depth exploratory analysis, we document potentially concerning aspects of the data for prospective users. We make publicly available the code used in its construction. △ Less

Submitted 31 December, 2020; originally announced January 2021.

arXiv:2004.12195 [pdf, other]

QURATOR: Innovative Technologies for Content and Data Curation

Authors: Georg Rehm, Peter Bourgonje, Stefanie Hegele, Florian Kintzel, Julián Moreno Schneider, Malte Ostendorff, Karolina Zaczynska, Armin Berger, Stefan Grill, Sören Räuchle, Jens Rauenbusch, Lisa Rutenburg, André Schmidt, Mikka Wild, Henry Hoffmann, Julian Fink, Sarah Schulz, Jurica Seva, Joachim Quantz, Joachim Böttger, Josefine Matthey, Rolf Fricke, Jan Thomsen, Adrian Paschke, Jamal Al Qundus , et al. (15 additional authors not shown)

Abstract: In all domains and sectors, the demand for intelligent systems to support the processing and generation of digital content is rapidly increasing. The availability of vast amounts of content and the pressure to publish new content quickly and in rapid succession requires faster, more efficient and smarter processing and generation methods. With a consortium of ten partners from research and industr… ▽ More In all domains and sectors, the demand for intelligent systems to support the processing and generation of digital content is rapidly increasing. The availability of vast amounts of content and the pressure to publish new content quickly and in rapid succession requires faster, more efficient and smarter processing and generation methods. With a consortium of ten partners from research and industry and a broad range of expertise in AI, Machine Learning and Language Technologies, the QURATOR project, funded by the German Federal Ministry of Education and Research, develops a sustainable and innovative technology platform that provides services to support knowledge workers in various industries to address the challenges they face when curating digital content. The project's vision and ambition is to establish an ecosystem for content curation technologies that significantly pushes the current state of the art and transforms its region, the metropolitan area Berlin-Brandenburg, into a global centre of excellence for curation technologies. △ Less

Submitted 25 April, 2020; originally announced April 2020.

Comments: Proceedings of QURATOR 2020: The conference for intelligent content solutions, Berlin, Germany, February 2020

arXiv:1409.1360 [pdf, other]

doi 10.1007/s11943-015-0173-x

Everything counts! - Warum kleine Gemeinden die Gewinner der Zensuserhebung 2011 sind

Authors: Björn Christensen, Sören Christensen, Tim Hoppe, Michael Spandel

Abstract: The population and housing census 2011 was an EU-wide census in all EU member states. In Germany, the basis was a largely register-based method. In this paper, it is shown that communities with less than 10.000 inhabitants have significantly less relative losses in the number of inhabitants compared to communities with more than 10.000 inhabitants. The population and housing census 2011 was an EU-wide census in all EU member states. In Germany, the basis was a largely register-based method. In this paper, it is shown that communities with less than 10.000 inhabitants have significantly less relative losses in the number of inhabitants compared to communities with more than 10.000 inhabitants. △ Less

Submitted 4 September, 2014; originally announced September 2014.

Comments: in German

arXiv:1408.3644 [pdf, other]

doi 10.1016/j.dam.2015.07.017

Integer sequence discovery from small graphs

Authors: Travis Hoppe, Anna Petrone

Abstract: We have exhaustively enumerated all simple, connected graphs of a finite order and have computed a selection of invariants over this set. Integer sequences were constructed from these invariants and checked against the Online Encyclopedia of Integer Sequences (OEIS). 141 new sequences were added and 6 sequences were appended or corrected. From the graph database, we were able to programmatically s… ▽ More We have exhaustively enumerated all simple, connected graphs of a finite order and have computed a selection of invariants over this set. Integer sequences were constructed from these invariants and checked against the Online Encyclopedia of Integer Sequences (OEIS). 141 new sequences were added and 6 sequences were appended or corrected. From the graph database, we were able to programmatically suggest relationships among the invariants. It will be shown that we can readily visualize any sequence of graphs with a given criteria. The code has been released as an open-source framework for further analysis and the database was constructed to be extensible to invariants not considered in this work. △ Less

Submitted 15 August, 2014; originally announced August 2014.

Comments: Supplemented by two electronic repositories: https://github.com/thoppe/Encyclopedia-of-Finite-Graphs (DOI 10.5281/zenodo.11304) and https://github.com/thoppe/Simple-connected-graph-invariant-database (DOI 10.5281/zenodo.11280)

MSC Class: 05A15

Showing 1–6 of 6 results for author: Höppe, T