Showing 1–11 of 11 results for author: McKinnon, D

Searching in archive cs.

  1. arXiv:2405.13874  [pdf, other]

    cs.CV

    Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching

    Authors: Hongkai Chen, Zixin Luo, Yurun Tian, Xuyang Bai, Ziyu Wang, Lei Zhou, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

    Abstract: Identifying robust and accurate correspondences across images is a fundamental problem in computer vision that enables various downstream tasks. Recent semi-dense matching methods emphasize the effectiveness of fusing relevant cross-view information through Transformer. In this paper, we propose several improvements upon this paradigm. Firstly, we introduce affine-based local attention to model cr…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR2024 Image Matching Workshop

  2. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2311.15980  [pdf, other]

    cs.CV

    Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion

    Authors: Yuanxun Lu, Jingyang Zhang, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Xun Cao, Yao Yao

    Abstract: Recent advances in generative AI have unveiled significant potential for the creation of 3D content. However, current methods either apply a pre-trained 2D diffusion model with the time-consuming score distillation sampling (SDS), or a direct 3D diffusion model trained on limited 3D data losing generation diversity. In this work, we approach the problem by employing a multi-view 2.5D diffusion fin…

    Submitted 21 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 camera ready, including more evaluations and discussions. Project webpage: https://nju-3dv.github.io/projects/direct25

  4. arXiv:2310.06347  [pdf, other]

    cs.CV

    JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling

    Authors: Jingyang Zhang, Shiwei Li, Yuanxun Lu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan, Yao Yao

    Abstract: We introduce JointNet, a novel neural network architecture for modeling the joint distribution of images and an additional dense modality (e.g., depth maps). JointNet is extended from a pre-trained text-to-image diffusion model, where a copy of the original network is created for the new dense modality branch and is densely connected with the RGB branch. The RGB branch is locked during network fin…

    Submitted 10 October, 2023; originally announced October 2023.

  5. arXiv:2303.17147  [pdf, other]

    cs.CV

    NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation

    Authors: Jingyang Zhang, Yao Yao, Shiwei Li, Jingbo Liu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

    Abstract: We present a novel differentiable rendering framework for joint geometry, material, and lighting estimation from multi-view images. In contrast to previous methods which assume a simplified environment map or co-located flashlights, in this work, we formulate the lighting of a static scene as one neural incident light field (NeILF) and one outgoing neural radiance field (NeRF). The key insight of…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: Project page: https://yoyo000.github.io/NeILF_pp

  6. arXiv:2208.14201  [pdf, other]

    cs.CV

    ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer

    Authors: Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

    Abstract: Generating robust and reliable correspondences across images is a fundamental task for a diversity of applications. To capture context at both global and local granularity, we propose ASpanFormer, a Transformer-based detector-free matcher that is built on hierarchical attention structure, adopting a novel attention operation which is capable of adjusting attention span in a self-adaptive manner. T…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted to ECCV2022, project page at https://aspanformer.github.io/

  7. arXiv:2206.03087  [pdf, other]

    cs.CV

    Critical Regularizations for Neural Surface Reconstruction in the Wild

    Authors: Jingyang Zhang, Yao Yao, Shiwei Li, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

    Abstract: Neural implicit functions have recently shown promising results on surface reconstructions from multiple views. However, current methods still suffer from excessive time complexity and poor robustness when reconstructing unbounded or complex scenes. In this paper, we present RegSDF, which shows that proper point cloud supervisions and geometry regularizations are sufficient to produce high-quality…

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: CVPR 2022

  8. arXiv:2203.07182  [pdf, other]

    cs.CV

    NeILF: Neural Incident Light Field for Physically-based Material Estimation

    Authors: Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan

    Abstract: We present a differentiable rendering framework for material and lighting estimation from multi-view images and a reconstructed geometry. In the framework, we represent scene lightings as the Neural Incident Light Field (NeILF) and material properties as the surface BRDF modelled by multi-layer perceptrons. Compared with recent approaches that approximate scene lightings as the 2D environment map,…

    Submitted 18 March, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

  9. arXiv:1810.06681  [pdf, other]

    cs.RO

    Learn Fast, Forget Slow: Safe Predictive Learning Control for Systems with Unknown and Changing Dynamics Performing Repetitive Tasks

    Authors: Christopher D. McKinnon, Angela P. Schoellig

    Abstract: We present a control method for improved repetitive path following for a ground vehicle that is geared towards long-term operation where the operating conditions can change over time and are initially unknown. We use weighted Bayesian Linear Regression (wBLR) to model the unknown dynamics, and show how this simple model is more accurate in both its estimate of the mean behaviour and model uncertai…

    Submitted 9 April, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

  10. arXiv:1803.04065  [pdf, other]

    cs.RO

    Experience Recommendation for Long Term Safe Learning-based Model Predictive Control in Changing Operating Conditions

    Authors: Christopher D. McKinnon, Angela P. Schoellig

    Abstract: Learning has propelled the cutting edge of performance in robotic control to new heights, allowing robots to operate with high performance in conditions that were previously unimaginable. The majority of the work, however, assumes that the unknown parts are static or slowly changing. This limits them to static or slowly changing environments. However, in the real world, a robot may experience vari…

    Submitted 11 March, 2018; originally announced March 2018.

  11. arXiv:1603.02772  [pdf, other]

    cs.RO

    Unscented External Force and Torque Estimation for Quadrotors

    Authors: Christopher D. McKinnon, Angela P. Schoellig

    Abstract: In this paper, we describe an algorithm, based on the well-known Unscented Quaternion Estimator, to estimate external forces and torques acting on a quadrotor. This formulation uses a non-linear model for the quadrotor dynamics, naturally incorporates process and measurement noise, requires only a few parameters to be tuned manually, and uses singularity-free unit quaternions to represent attitude…

    Submitted 2 August, 2016; v1 submitted 8 March, 2016; originally announced March 2016.