Skip to main content

Showing 1–50 of 56 results for author: Durand, F

Searching in archive cs. Search in all archives.
.
  1. Optimal Generation of Strictly Increasing Binary Trees and Beyond

    Authors: Olivier Bodini, Francis Durand, Philippe Marchal

    Abstract: This article presents two novel algorithms for generating random increasing trees. The first algorithm efficiently generates strictly increasing binary trees using an ad hoc method. The second algorithm improves the recursive method for weighted strictly increasing unary-binary increasing trees, optimizing randomness usage.

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: In Proceedings GASCom 2024, arXiv:2406.14588

    Journal ref: EPTCS 403, 2024, pp. 60-65

  2. arXiv:2405.14867  [pdf, other

    cs.CV

    Improved Distribution Matching Distillation for Fast Image Synthesis

    Authors: Tianwei Yin, Michaël Gharbi, Taesung Park, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman

    Abstract: Recent approaches have shown promises distilling diffusion models into efficient one-step generators. Among them, Distribution Matching Distillation (DMD) produces one-step generators that match their teacher in distribution, without enforcing a one-to-one correspondence with the sampling trajectories of their teachers. However, to ensure stable training, DMD requires an additional regression loss… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Code, model, and dataset are available at https://tianweiy.github.io/dmd2

  3. arXiv:2405.08564  [pdf, other

    cs.DS

    Anytime Sorting Algorithms (Extended Version)

    Authors: Emma Caizergues, François Durand, Fabien Mathieu

    Abstract: This paper addresses the anytime sorting problem, aiming to develop algorithms providing tentative estimates of the sorted list at each execution step. Comparisons are treated as steps, and the Spearman's footrule metric evaluates estimation accuracy. We propose a general approach for making any sorting algorithm anytime and introduce two new algorithms: multizip sort and Corsort. Simulations show… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), Aug 2024, Jeju City, Jeju Island, South Korea

  4. arXiv:2312.02970  [pdf, other

    cs.CV cs.AI cs.GR

    Alchemist: Parametric Control of Material Properties with Diffusion Models

    Authors: Prafull Sharma, Varun Jampani, Yuanzhen Li, Xuhui Jia, Dmitry Lagun, Fredo Durand, William T. Freeman, Mark Matthews

    Abstract: We propose a method to control material attributes of objects like roughness, metallic, albedo, and transparency in real images. Our method capitalizes on the generative prior of text-to-image models known for photorealism, employing a scalar value and instructions to alter low-level material properties. Addressing the lack of datasets with controlled material attributes, we generated an object-ce… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  5. arXiv:2311.18828  [pdf, other

    cs.CV

    One-step Diffusion with Distribution Matching Distillation

    Authors: Tianwei Yin, Michaël Gharbi, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman, Taesung Park

    Abstract: Diffusion models generate high-quality images but require dozens of forward passes. We introduce Distribution Matching Distillation (DMD), a procedure to transform a diffusion model into a one-step image generator with minimal impact on image quality. We enforce the one-step image generator match the diffusion model at distribution level, by minimizing an approximate KL divergence whose gradient c… ▽ More

    Submitted 5 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Project page: https://tianweiy.github.io/dmd/

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  6. arXiv:2309.02005  [pdf, other

    cs.LG cs.AI cs.MA math.OC

    Aggregating Correlated Estimations with (Almost) no Training

    Authors: Theo Delemazure, François Durand, Fabien Mathieu

    Abstract: Many decision problems cannot be solved exactly and use several estimation algorithms that assign scores to the different available options. The estimation errors can have various correlations, from low (e.g. between two very different approaches) to high (e.g. when using a given algorithm with different hyperparameters). Most aggregation rules would suffer from this diversity of correlations. In… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Journal ref: 26th European Conference on Artificial Intelligence (ECAI 2023), Sep 2023, Krak{ó}w, Poland

  7. arXiv:2306.11719  [pdf, other

    cs.CV cs.GR cs.LG

    Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision

    Authors: Ayush Tewari, Tianwei Yin, George Cazenavette, Semon Rezchikov, Joshua B. Tenenbaum, Frédo Durand, William T. Freeman, Vincent Sitzmann

    Abstract: Denoising diffusion models are a powerful type of generative models used to capture complex distributions of real-world signals. However, their applicability is limited to scenarios where training samples are readily available, which is not always the case in real-world applications. For example, in inverse graphics, the goal is to generate samples from a distribution of 3D scenes that align with… ▽ More

    Submitted 16 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Project page: https://diffusion-with-forward-models.github.io/

  8. arXiv:2305.13291  [pdf, other

    cs.CV cs.GR cs.LG

    Materialistic: Selecting Similar Materials in Images

    Authors: Prafull Sharma, Julien Philip, Michaël Gharbi, William T. Freeman, Fredo Durand, Valentin Deschaintre

    Abstract: Separating an image into meaningful underlying components is a crucial first step for both editing and understanding images. We present a method capable of selecting the regions of a photograph exhibiting the same material as an artist-chosen area. Our proposed approach is robust to shading, specular highlights, and cast shadows, enabling selection in real images. As we do not rely on semantic seg… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  9. arXiv:2305.10431  [pdf, other

    cs.CV

    FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

    Authors: Guangxuan Xiao, Tianwei Yin, William T. Freeman, Frédo Durand, Song Han

    Abstract: Diffusion models excel at text-to-image generation, especially in subject-driven generation for personalized images. However, existing methods are inefficient due to the subject-specific fine-tuning, which is computationally intensive and hampers efficient deployment. Moreover, existing methods struggle with multi-subject generation as they often blend features among subjects. We present FastCompo… ▽ More

    Submitted 21 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: The first two authors contributed equally to this work

  10. arXiv:2304.11952  [pdf, ps, other

    cs.DS

    Sorting wild pigs

    Authors: Emma Caizergues, François Durand, Fabien Mathieu

    Abstract: Chjara, breeder in Carg{è}se, has n wild pigs. She would like to sort her herd by weight to better meet the demands of her buyers. Each beast has a distinct weight, alas unknown to Chjara. All she has at her disposal is a Roberval scale, which allows her to compare two pigs only at the cost of an acrobatic manoeuvre. The balance, quite old, can break at any time. Chjara therefore wants to sort his… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: in French language, AlgoTel 2023 - 25{è}mes Rencontres Francophones sur les Aspects Algorithmiques des T{é}l{é}communications, May 2023, Cargese, France

  11. arXiv:2209.10507  [pdf, other

    cs.NI cs.CV

    Gemino: Practical and Robust Neural Compression for Video Conferencing

    Authors: Vibhaalakshmi Sivaraman, Pantea Karimi, Vedantha Venkatapathy, Mehrdad Khani, Sadjad Fouladi, Mohammad Alizadeh, Frédo Durand, Vivienne Sze

    Abstract: Video conferencing systems suffer from poor user experience when network conditions deteriorate because current video codecs simply cannot operate at extremely low bitrates. Recently, several neural alternatives have been proposed that reconstruct talking head videos at very low bitrates using sparse representations of each frame such as facial landmark information. However, these approaches produ… ▽ More

    Submitted 19 October, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: 13 pages, 5 appendix

    Journal ref: USENIX NSDI 2024

  12. arXiv:2209.10077  [pdf, other

    cs.CV cs.LG

    Can Shadows Reveal Biometric Information?

    Authors: Safa C. Medin, Amir Weiss, Frédo Durand, William T. Freeman, Gregory W. Wornell

    Abstract: We study the problem of extracting biometric information of individuals by looking at shadows of objects cast on diffuse surfaces. We show that the biometric information leakage from shadows can be sufficient for reliable identity inference under representative scenarios via a maximum likelihood analysis. We then develop a learning-based method that demonstrates this phenomenon in real settings, e… ▽ More

    Submitted 4 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

  13. arXiv:2207.11232  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Neural Groundplans: Persistent Neural Scene Representations from a Single Image

    Authors: Prafull Sharma, Ayush Tewari, Yilun Du, Sergey Zakharov, Rares Ambrus, Adrien Gaidon, William T. Freeman, Fredo Durand, Joshua B. Tenenbaum, Vincent Sitzmann

    Abstract: We present a method to map 2D image observations of a scene to a persistent 3D scene representation, enabling novel view synthesis and disentangled representation of the movable and immovable components of the scene. Motivated by the bird's-eye-view (BEV) representation commonly used in vision and robotics, we propose conditional neural groundplans, ground-aligned 2D feature grids, as persistent a… ▽ More

    Submitted 9 April, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Project page: https://prafullsharma.net/neural_groundplans/

  14. arXiv:2206.05344  [pdf, other

    cs.GR cs.CV

    Differentiable Rendering of Neural SDFs through Reparameterization

    Authors: Sai Praveen Bangaru, Michaël Gharbi, Tzu-Mao Li, Fujun Luan, Kalyan Sunkavalli, Miloš Hašan, Sai Bi, Zexiang Xu, Gilbert Bernstein, Frédo Durand

    Abstract: We present a method to automatically compute correct gradients with respect to geometric scene parameters in neural SDF renderers. Recent physically-based differentiable rendering techniques for meshes have used edge-sampling to handle discontinuities, particularly at object silhouettes, but SDFs do not have a simple parametric form amenable to sampling. Instead, our approach builds on area-sampli… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  15. arXiv:2205.03923  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Unsupervised Discovery and Composition of Object Light Fields

    Authors: Cameron Smith, Hong-Xing Yu, Sergey Zakharov, Fredo Durand, Joshua B. Tenenbaum, Jiajun Wu, Vincent Sitzmann

    Abstract: Neural scene representations, both continuous and discrete, have recently emerged as a powerful new paradigm for 3D scene understanding. Recent efforts have tackled unsupervised discovery of object-centric neural scene representations. However, the high cost of ray-marching, exacerbated by the fact that each object representation has to be ray-marched separately, leads to insufficiently sampled ra… ▽ More

    Submitted 15 July, 2023; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: Project website: https://cameronosmith.github.io/colf. TMLR 2023

  16. arXiv:2203.12691  [pdf, other

    cs.CV cs.GR

    Learning to generate line drawings that convey geometry and semantics

    Authors: Caroline Chan, Fredo Durand, Phillip Isola

    Abstract: This paper presents an unpaired method for creating line drawings from photographs. Current methods often rely on high quality paired datasets to generate line drawings. However, these datasets often have limitations due to the subjects of the drawings belonging to a specific domain, or in the amount of data collected. Although recent work in unsupervised image-to-image translation has shown much… ▽ More

    Submitted 28 March, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Corrected and added references

  17. arXiv:2108.13027  [pdf, other

    cs.CV

    What You Can Learn by Staring at a Blank Wall

    Authors: Prafull Sharma, Miika Aittala, Yoav Y. Schechner, Antonio Torralba, Gregory W. Wornell, William T. Freeman, Fredo Durand

    Abstract: We present a passive non-line-of-sight method that infers the number of people or activity of a person from the observation of a blank wall in an unknown room. Our technique analyzes complex imperceptible changes in indirect illumination in a video of the wall to reveal a signal that is correlated with motion in the hidden part of a scene. We use this signal to classify between zero, one, or two m… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

  18. arXiv:2106.02634  [pdf, other

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering

    Authors: Vincent Sitzmann, Semon Rezchikov, William T. Freeman, Joshua B. Tenenbaum, Fredo Durand

    Abstract: Inferring representations of 3D scenes from 2D observations is a fundamental problem of computer graphics, computer vision, and artificial intelligence. Emerging 3D-structured neural scene representations are a promising approach to 3D scene understanding. In this work, we propose a novel neural scene representation, Light Field Networks or LFNs, which represent both geometry and appearance of the… ▽ More

    Submitted 18 January, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: First two authors contributed equally. Project website: https://vsitzmann.github.io/lfns/

  19. arXiv:2101.04822  [pdf, other

    eess.IV cs.CV

    Plug-and-Play Algorithms for Video Snapshot Compressive Imaging

    Authors: Xin Yuan, Yang Liu, **li Suo, Frédo Durand, Qionghai Dai

    Abstract: We consider the reconstruction problem of video snapshot compressive imaging (SCI), which captures high-speed videos using a low-speed 2D sensor (detector). The underlying principle of SCI is to modulate sequential high-speed frames with different masks and then these encoded frames are integrated into a snapshot on the sensor and thus the sensor can be of low-speed. On one hand, video SCI enjoys… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 18 pages, 12 figures and 4 tables. Journal extension of arXiv:2003.13654. Code available at https://github.com/liuyang12/PnP-SCI_python

  20. arXiv:2012.08141  [pdf, other

    cs.PL cs.AI cs.GR

    AsyncTaichi: On-the-fly Inter-kernel Optimizations for Imperative and Spatially Sparse Programming

    Authors: Yuanming Hu, Mingkuan Xu, Ye Kuang, Frédo Durand

    Abstract: Leveraging spatial sparsity has become a popular approach to accelerate 3D computer graphics applications. Spatially sparse data structures and efficient sparse kernels (such as parallel stencil operations on active voxels), are key to achieve high performance. Existing work focuses on improving performance within a single sparse computational kernel. We show that a system that looks beyond a sing… ▽ More

    Submitted 22 June, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: 18 pages, 20 figures, submitted to ACM SIGGRAPH Asia

    ACM Class: D.3.2; I.3.6; I.2.5

  21. arXiv:2007.15721  [pdf, ps, other

    math.DS cs.FL

    Dimension Groups and Dynamical Systems

    Authors: Fabien Durand, Dominique Perrin

    Abstract: We give a description of the link between topological dynamical systems and their dimension groups. The focus is on minimal systems and, in particular, on substitution shifts. We describe in detail the various classes of systems including Sturmian shifts and interval exchange shifts. This is a preliminary version of a book which will be published by Cambridge University Press. Any comments are of… ▽ More

    Submitted 11 December, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

  22. arXiv:2007.08032  [pdf, other

    cs.CV cs.LG

    When and how CNNs generalize to out-of-distribution category-viewpoint combinations

    Authors: Spandan Madan, Timothy Henry, Jamell Dozier, Helen Ho, Nishchal Bhandari, Tomotake Sasaki, Frédo Durand, Hanspeter Pfister, Xavier Boix

    Abstract: Object recognition and viewpoint estimation lie at the heart of visual understanding. Recent works suggest that convolutional neural networks (CNNs) fail to generalize to out-of-distribution (OOD) category-viewpoint combinations, ie. combinations not seen during training. In this paper, we investigate when and how such OOD generalization may be possible by evaluating CNNs trained to classify both… ▽ More

    Submitted 17 November, 2021; v1 submitted 15 July, 2020; originally announced July 2020.

  23. arXiv:2001.01026  [pdf, other

    cs.GR cs.CV

    Painting Many Pasts: Synthesizing Time Lapse Videos of Paintings

    Authors: Amy Zhao, Guha Balakrishnan, Kathleen M. Lewis, Frédo Durand, John V. Guttag, Adrian V. Dalca

    Abstract: We introduce a new video synthesis task: synthesizing time lapse videos depicting how a given painting might have been created. Artists paint using unique combinations of brushes, strokes, and colors. There are often many possible ways to create a given painting. Our goal is to learn to capture this rich range of possibilities. Creating distributions of long-term videos is a challenge for learni… ▽ More

    Submitted 25 April, 2020; v1 submitted 3 January, 2020; originally announced January 2020.

    Comments: 10 pages, CVPR 2020

  24. arXiv:1912.02314  [pdf, other

    cs.CV cs.LG

    Computational Mirrors: Blind Inverse Light Transport by Deep Matrix Factorization

    Authors: Miika Aittala, Prafull Sharma, Lukas Murmann, Adam B. Yedidia, Gregory W. Wornell, William T. Freeman, Fredo Durand

    Abstract: We recover a video of the motion taking place in a hidden scene by observing changes in indirect illumination in a nearby uncalibrated visible region. We solve this problem by factoring the observed video into a matrix product between the unknown hidden scene video and an unknown light transport matrix. This task is extremely ill-posed, as any non-negative factorization will satisfy the data. Insp… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: 14 pages, 5 figures, Advances in Neural Information Processing Systems 2019

    Journal ref: Aittala, Miika, et al. "Computational Mirrors: Blind Inverse Light Transport by Deep Matrix Factorization." Advances in Neural Information Processing Systems. 2019

  25. arXiv:1910.08131  [pdf, other

    cs.CV

    A Dataset of Multi-Illumination Images in the Wild

    Authors: Lukas Murmann, Michael Gharbi, Miika Aittala, Fredo Durand

    Abstract: Collections of images under a single, uncontrolled illumination have enabled the rapid advancement of core computer vision tasks like classification, detection, and segmentation. But even with modern learning techniques, many inverse problems involving lighting and material understanding remain too severely ill-posed to be solved with single-illumination datasets. To fill this gap, we introduce a… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: ICCV 2019

  26. arXiv:1910.00935  [pdf, other

    cs.LG cs.GR physics.comp-ph stat.ML

    DiffTaichi: Differentiable Programming for Physical Simulation

    Authors: Yuanming Hu, Luke Anderson, Tzu-Mao Li, Qi Sun, Nathan Carr, Jonathan Ragan-Kelley, Frédo Durand

    Abstract: We present DiffTaichi, a new differentiable programming language tailored for building high-performance differentiable physical simulators. Based on an imperative programming language, DiffTaichi generates gradients of simulation steps using source code transformations that preserve arithmetic intensity and parallelism. A light-weight tape is used to record the whole simulation program structure a… ▽ More

    Submitted 14 February, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: Published at ICLR 2020

  27. arXiv:1909.00475  [pdf, other

    cs.CV

    Visual Deprojection: Probabilistic Recovery of Collapsed Dimensions

    Authors: Guha Balakrishnan, Adrian V. Dalca, Amy Zhao, John V. Guttag, Fredo Durand, William T. Freeman

    Abstract: We introduce visual deprojection: the task of recovering an image or video that has been collapsed along a dimension. Projections arise in various contexts, such as long-exposure photography, where a dynamic scene is collapsed in time to produce a motion-blurred image, and corner cameras, where reflected light from a scene is collapsed along a spatial dimension because of an edge occluder to yield… ▽ More

    Submitted 1 September, 2019; originally announced September 2019.

    Comments: ICCV 2019

  28. arXiv:1906.11557  [pdf, other

    cs.GR

    Flexible SVBRDF Capture with a Multi-Image Deep Network

    Authors: Valentin Deschaintre, Miika Aittala, Fredo Durand, George Drettakis, Adrien Bousseau

    Abstract: Empowered by deep learning, recent methods for material capture can estimate a spatially-varying reflectance from a single photograph. Such lightweight capture is in stark contrast with the tens or hundreds of pictures required by traditional optimization-based approaches. However, a single image is often simply not enough to observe the rich appearance of real-world materials. We present a deep-l… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: Accepted to EGSR 2019 in the CGF track

    ACM Class: I.3

    Journal ref: Computer Graphics Forum (EGSR Conference Proceedings), 38, 4(July 2019), 13 pages

  29. arXiv:1904.08825  [pdf, other

    cs.CV

    Generating Training Data for Denoising Real RGB Images via Camera Pipeline Simulation

    Authors: Ronnachai Jaroensri, Camille Biscarrat, Miika Aittala, Frédo Durand

    Abstract: Image reconstruction techniques such as denoising often need to be applied to the RGB output of cameras and cellphones. Unfortunately, the commonly used additive white noise (AWGN) models do not accurately reproduce the noise and the degradation encountered on these inputs. This is particularly important for learning-based techniques, because the mismatch between training and real world data will… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

  30. arXiv:1902.09383  [pdf, other

    cs.CV

    Data augmentation using learned transformations for one-shot medical image segmentation

    Authors: Amy Zhao, Guha Balakrishnan, Frédo Durand, John V. Guttag, Adrian V. Dalca

    Abstract: Image segmentation is an important task in many medical applications. Methods based on convolutional neural networks attain state-of-the-art accuracy; however, they typically rely on supervised training with large labeled datasets. Labeling medical images requires significant expertise and time, and typical hand-tuned approaches for data augmentation fail to capture the complex variations in such… ▽ More

    Submitted 6 April, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 9 pages, CVPR 2019

  31. arXiv:1811.03942  [pdf, ps, other

    math.DS cs.DM

    Decidability, arithmetic subsequences and eigenvalues of morphic subshifts

    Authors: Fabien Durand, Valérie Goyheneche

    Abstract: We prove decidability results on the existence of constant subsequences of uniformly recurrent morphic sequences along arithmetic progressions. We use spectral properties of the subshifts they generate to give a first algorithm deciding whether, given p $\in$ N, there exists such a constant subsequence along an arithmetic progression of common difference p. In the special case of uniformly recurre… ▽ More

    Submitted 16 November, 2018; v1 submitted 9 November, 2018; originally announced November 2018.

  32. Single-Image SVBRDF Capture with a Rendering-Aware Deep Network

    Authors: Valentin Deschaintre, Miika Aittala, Fredo Durand, George Drettakis, Adrien Bousseau

    Abstract: Texture, highlights, and shading are some of many visual cues that allow humans to perceive material appearance in single pictures. Yet, recovering spatially-varying bi-directional reflectance distribution functions (SVBRDFs) from a single image based on such cues has challenged researchers in computer graphics for decades. We tackle lightweight appearance capture by training a deep neural network… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: 15 pages, presented at Siggraph 2018

    ACM Class: I.3

    Journal ref: ACM Trans. Graph. 37, 4, Article 128 (August 2018), 15 pages

  33. arXiv:1809.07851  [pdf, other

    cs.PF

    FFT Convolutions are Faster than Winograd on Modern CPUs, Here is Why

    Authors: Aleksandar Zlateski, Zhen Jia, Kai Li, Fredo Durand

    Abstract: Winograd-based convolution has quickly gained traction as a preferred approach to implement convolutional neural networks (ConvNet) on various hardware platforms because it requires fewer floating point operations than FFT-based or direct convolutions. This paper compares three highly optimized implementations (regular FFT--, Gauss--FFT--, and Winograd--based convolutions) on modern multi-- and… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

  34. Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics

    Authors: Spandan Madan, Zoya Bylinskii, Matthew Tancik, Adrià Recasens, Kimberli Zhong, Sami Alsheikh, Hanspeter Pfister, Aude Oliva, Fredo Durand

    Abstract: Widely used in news, business, and educational media, infographics are handcrafted to effectively communicate messages about complex and often abstract topics including `ways to conserve the environment' and `understanding the financial crisis'. Composed of stylistically and semantically diverse visual and textual elements, infographics pose new challenges for computer vision. While automatic text… ▽ More

    Submitted 27 July, 2018; originally announced July 2018.

  35. arXiv:1806.04891  [pdf, other

    math.DS cs.DM

    Decidability of the isomorphism and the factorization between minimal substitution subshifts

    Authors: Fabien Durand, Julien Leroy

    Abstract: Classification is a central problem for dynamical systems, in particular for families that arise in a wide range of topics, like substitution subshifts. It is important to be able to distinguish whether two such subshifts are isomorphic, but the existing invariants are not sufficient for this purpose. We first show that given two minimal substitution subshifts, there exists a computable constant… ▽ More

    Submitted 23 August, 2022; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: 65 pages

    MSC Class: 37B10; 68R15

  36. arXiv:1804.07739  [pdf, other

    cs.CV

    Synthesizing Images of Humans in Unseen Poses

    Authors: Guha Balakrishnan, Amy Zhao, Adrian V. Dalca, Fredo Durand, John Guttag

    Abstract: We address the computational problem of novel human pose synthesis. Given an image of a person and a desired pose, we produce a depiction of that person in that pose, retaining the appearance of both the person and background. We present a modular generative neural network that synthesizes unseen poses using training pairs of images and poses taken from human action videos. Our network separates a… ▽ More

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: CVPR 2018

  37. arXiv:1804.02684  [pdf, other

    cs.CV cs.GR

    Learning-based Video Motion Magnification

    Authors: Tae-Hyun Oh, Ronnachai Jaroensri, Changil Kim, Mohamed Elgharib, Frédo Durand, William T. Freeman, Wojciech Matusik

    Abstract: Video motion magnification techniques allow us to see small motions previously invisible to the naked eyes, such as those of vibrating airplane wings, or swaying buildings under the influence of the wind. Because the motion is small, the magnification results are prone to noise or excessive blurring. The state of the art relies on hand-designed filters to extract representations that may not be op… ▽ More

    Submitted 31 July, 2018; v1 submitted 8 April, 2018; originally announced April 2018.

    Comments: Accepted as ECCV 2018 Oral. The 1st and 2nd authors equally contributed. Video result: https://youtu.be/GrMLeEcSNzY , Project page: http://people.csail.mit.edu/tiam/deepmag/ Some bibliography information was fixed

  38. arXiv:1709.09215  [pdf, other

    cs.CV

    Understanding Infographics through Textual and Visual Tag Prediction

    Authors: Zoya Bylinskii, Sami Alsheikh, Spandan Madan, Adria Recasens, Kimberli Zhong, Hanspeter Pfister, Fredo Durand, Aude Oliva

    Abstract: We introduce the problem of visual hashtag discovery for infographics: extracting visual elements from an infographic that are diagnostic of its topic. Given an infographic as input, our computational approach automatically outputs textual and visual elements predicted to be representative of the infographic content. Concretely, from a curated dataset of 29K large infographic images sampled across… ▽ More

    Submitted 26 September, 2017; originally announced September 2017.

  39. Learning Visual Importance for Graphic Designs and Data Visualizations

    Authors: Zoya Bylinskii, Nam Wook Kim, Peter O'Donovan, Sami Alsheikh, Spandan Madan, Hanspeter Pfister, Fredo Durand, Bryan Russell, Aaron Hertzmann

    Abstract: Knowing where people look and click on visual designs can provide clues about how the designs are perceived, and where the most important or relevant content lies. The most important content of a visual design can be used for effective summarization or to facilitate retrieval from a database. We present automated models that predict the relative importance of different elements in data visualizati… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

    ACM Class: H.5.1

    Journal ref: UIST 2017

  40. Deep Bilateral Learning for Real-Time Image Enhancement

    Authors: Michaël Gharbi, Jiawen Chen, Jonathan T. Barron, Samuel W. Hasinoff, Frédo Durand

    Abstract: Performance is a critical challenge in mobile image processing. Given a reference imaging pipeline, or even human-adjusted pairs of images, we seek to reproduce the enhancements and enable real-time evaluation. For this, we introduce a new neural network architecture inspired by bilateral grid processing and local affine color transforms. Using pairs of input/output images, we train a convolutiona… ▽ More

    Submitted 22 August, 2017; v1 submitted 10 July, 2017; originally announced July 2017.

    Comments: 12 pages, 14 figures, Siggraph 2017

    Journal ref: ACM Trans. Graph. 36, 4, Article 118 (2017)

  41. arXiv:1702.05150  [pdf, other

    cs.HC cs.CV

    BubbleView: an interface for crowdsourcing image importance maps and tracking visual attention

    Authors: Nam Wook Kim, Zoya Bylinskii, Michelle A. Borkin, Krzysztof Z. Gajos, Aude Oliva, Fredo Durand, Hanspeter Pfister

    Abstract: In this paper, we present BubbleView, an alternative methodology for eye tracking using discrete mouse clicks to measure which information people consciously choose to examine. BubbleView is a mouse-contingent, moving-window interface in which participants are presented with a series of blurred images and click to reveal "bubbles" - small, circular areas of the image at original resolution, simila… ▽ More

    Submitted 9 August, 2017; v1 submitted 16 February, 2017; originally announced February 2017.

    Journal ref: TOCHI 2017

  42. arXiv:1612.04007  [pdf, other

    cs.CV

    A Video-Based Method for Objectively Rating Ataxia

    Authors: Ronnachai Jaroensri, Amy Zhao, Guha Balakrishnan, Derek Lo, Jeremy Schmahmann, John Guttag, Fredo Durand

    Abstract: For many movement disorders, such as Parkinson's disease and ataxia, disease progression is visually assessed by a clinician using a numerical disease rating scale. These tests are subjective, time-consuming, and must be administered by a professional. This can be problematic where specialists are not available, or when a patient is not consistently evaluated by the same clinician. We present an a… ▽ More

    Submitted 7 September, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: MLHC 2017

  43. arXiv:1610.05577  [pdf, ps, other

    cs.DM math.CO

    The constant of recognizability is computable for primitive morphisms

    Authors: Fabien Durand, Julien Leroy

    Abstract: Mossé proved that primitive morphisms are recognizable. In this paper we give a computable upper bound for the constant of recognizability of such a morphism. This bound can be expressed only using the cardinality of the alphabet and the length of the longest image under the morphism of a letter.

    Submitted 18 October, 2016; originally announced October 2016.

    MSC Class: 68R15

  44. arXiv:1610.02769  [pdf, other

    cs.GR

    Inverse Diffusion Curves using Shape Optimization

    Authors: Shuang Zhao, Fredo Durand, Changxi Zheng

    Abstract: The inverse diffusion curve problem focuses on automatic creation of diffusion curve images that resemble user provided color fields. This problem is challenging since the 1D curves have a nonlinear and global impact on resulting color fields via a partial differential equation (PDE). We introduce a new approach complementary to previous methods by optimizing curve geometry. In particular, we prop… ▽ More

    Submitted 9 October, 2016; originally announced October 2016.

    Comments: 14 pages

  45. arXiv:1604.03605  [pdf, other

    cs.CV

    What do different evaluation metrics tell us about saliency models?

    Authors: Zoya Bylinskii, Tilke Judd, Aude Oliva, Antonio Torralba, Frédo Durand

    Abstract: How best to evaluate a saliency model's ability to predict where humans look in images is an open research question. The choice of evaluation metric depends on how saliency is defined and how the ground truth is represented. Metrics differ in how they rank saliency models, and this results from how false positives and false negatives are treated, whether viewing biases are accounted for, whether s… ▽ More

    Submitted 6 April, 2017; v1 submitted 12 April, 2016; originally announced April 2016.

  46. Geometry on the Utility Space

    Authors: François Durand, Benoît Kloeckner, Fabien Mathieu, Ludovic Noirie

    Abstract: We study the geometrical properties of the utility space (the space of expected utilities over a finite set of options), which is commonly used to model the preferences of an agent in a situation of uncertainty. We focus on the case where the model is neutral with respect to the available options, i.e. treats them, a priori, as being symmetrical from one another. Specifically, we prove that the on… ▽ More

    Submitted 4 November, 2015; originally announced November 2015.

    Comments: in Fourth International Conference on Algorithmic Decision Theory, Sep 2015, Lexington, United States. pp.16, 2015, Fourth International Conference on Algorithmic Decision Theory

  47. arXiv:1208.6376  [pdf, ps, other

    cs.DM math.CO

    Towards a statement of the S-adic conjecture through examples

    Authors: Fabien Durand, Julien Leroy, Gwénaël Richomme

    Abstract: The $S$-adic conjecture claims that there exists a condition $C$ such that a sequence has a sub-linear complexity if and only if it is an $S$-adic sequence satisfying Condition $C$ for some finite set $S$ of morphisms. We present an overview of the factor complexity of $S$-adic sequences and we give some examples that either illustrate some interesting properties or that are counter-examples to wh… ▽ More

    Submitted 31 August, 2012; originally announced August 2012.

    Comments: 25

  48. arXiv:1204.6455  [pdf, other

    cs.NI

    On the Manipulability of Voting Systems: Application to Multi-Carrier Networks

    Authors: François Durand, Fabien Mathieu, Ludovic Noirie

    Abstract: Today, Internet involves many actors who are making revenues on it (operators, companies, service providers,...). It is therefore important to be able to make fair decisions in this large-scale and highly competitive economical ecosystem. One of the main issues is to prevent actors from manipulating the natural outcome of the decision process. For that purpose, game theory is a natural framework.… ▽ More

    Submitted 29 April, 2012; originally announced April 2012.

    Journal ref: N° 2012-04-001 (2012)

  49. arXiv:1204.5393  [pdf, ps, other

    math.CO cs.DM

    Decidability of uniform recurrence of morphic sequences

    Authors: Fabien Durand

    Abstract: We prove that the uniform recurrence of morphic sequences is decidable. For this we show that the number of derived sequences of uniformly recurrent morphic sequences is bounded. As a corollary we obtain that uniformly recurrent morphic sequences are primitive substitutive sequences.

    Submitted 31 August, 2012; v1 submitted 24 April, 2012; originally announced April 2012.

  50. arXiv:1111.3268  [pdf, ps, other

    math.CO cs.FL

    Decidability of the HD0L ultimate periodicity problem

    Authors: Fabien Durand

    Abstract: In this paper we prove the decidability of the HD0L ultimate periodicity problem.

    Submitted 2 January, 2013; v1 submitted 14 November, 2011; originally announced November 2011.