Skip to main content

Showing 1–14 of 14 results for author: Simao, C

.
  1. arXiv:2402.04627  [pdf, other

    cs.AI cs.CL cs.DB cs.IR

    SPARQL Generation: an analysis on fine-tuning OpenLLaMA for Question Answering over a Life Science Knowledge Graph

    Authors: Julio C. Rangel, Tarcisio Mendes de Farias, Ana Claudia Sima, Norio Kobayashi

    Abstract: The recent success of Large Language Models (LLM) in a wide range of Natural Language Processing applications opens the path towards novel Question Answering Systems over Knowledge Graphs leveraging LLMs. However, one of the main obstacles preventing their implementation is the scarcity of training data for the task of translating questions into corresponding SPARQL queries, particularly in the ca… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: To appear in Proceedings of SWAT4HCLS 2024: Semantic Web Tools and Applications for Healthcare and Life Sciences

  2. arXiv:2312.14150  [pdf, other

    cs.CV

    DriveLM: Driving with Graph Visual Question Answering

    Authors: Chonghao Sima, Katrin Renz, Kashyap Chitta, Li Chen, Hanxue Zhang, Chengen Xie, ** Luo, Andreas Geiger, Hongyang Li

    Abstract: We study how vision-language models (VLMs) trained on web-scale data can be integrated into end-to-end driving systems to boost generalization and enable interactivity with human users. While recent approaches adapt VLMs to driving via single-round visual question answering (VQA), human drivers reason about decisions in multiple steps. Starting from the localization of key objects, humans estimate… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  3. arXiv:2310.15670  [pdf, other

    cs.CV

    Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection

    Authors: Linyan Huang, Zhiqi Li, Chonghao Sima, Wenhai Wang, **gdong Wang, Yu Qiao, Hongyang Li

    Abstract: Current research is primarily dedicated to advancing the accuracy of camera-only 3D object detectors (apprentice) through the knowledge transferred from LiDAR- or multi-modal-based counterparts (expert). However, the presence of the domain gap between LiDAR and camera features, coupled with the inherent incompatibility in temporal fusion, significantly hinders the effectiveness of distillation-bas… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  4. arXiv:2306.02851  [pdf, other

    cs.CV cs.RO

    Scene as Occupancy

    Authors: Chonghao Sima, Wenwen Tong, Tai Wang, Li Chen, Silei Wu, Hanming Deng, Yi Gu, Lewei Lu, ** Luo, Dahua Lin, Hongyang Li

    Abstract: Human driver can easily describe the complex traffic scene by visual system. Such an ability of precise perception is essential for driver's planning. To achieve this, a geometry-aware representation that quantizes the physical 3D scene into structured grid map with semantic labels per cell, termed as 3D Occupancy, would be desirable. Compared to the form of bounding box, a key insight behind occu… ▽ More

    Submitted 26 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Project link: https://github.com/OpenDriveLab/OccNet

  5. arXiv:2304.10440  [pdf, other

    cs.CV

    OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Map**

    Authors: Huijie Wang, Tianyu Li, Yang Li, Li Chen, Chonghao Sima, Zhenbo Liu, Bangjun Wang, Pei** Jia, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, ** Luo, Junchi Yan, Wei Zhang, Hongyang Li

    Abstract: Accurately depicting the complex traffic scene is a vital component for autonomous vehicles to execute correct judgments. However, existing benchmarks tend to oversimplify the scene by solely focusing on lane perception tasks. Observing that human drivers rely on both lanes and traffic signals to operate their vehicles safely, we present OpenLane-V2, the first dataset on topology reasoning for tra… ▽ More

    Submitted 28 October, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted by NeurIPS 2023 Track on Datasets and Benchmarks | OpenLane-V2 Dataset: https://github.com/OpenDriveLab/OpenLane-V2

  6. arXiv:2304.04179  [pdf, other

    cs.CV

    Sparse Dense Fusion for 3D Object Detection

    Authors: Yulu Gao, Chonghao Sima, Shaoshuai Shi, Shangzhe Di, Si Liu, Hongyang Li

    Abstract: With the prevalence of multimodal learning, camera-LiDAR fusion has gained popularity in 3D object detection. Although multiple fusion approaches have been proposed, they can be classified into either sparse-only or dense-only fashion based on the feature representation in the fusion module. In this paper, we analyze them in a common taxonomy and thereafter observe two challenges: 1) sparse-only s… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  7. arXiv:2212.10156  [pdf, other

    cs.CV cs.RO

    Planning-oriented Autonomous Driving

    Authors: Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li

    Abstract: Modern autonomous driving system is characterized as modular tasks in sequential order, i.e., perception, prediction, and planning. In order to perform a wide diversity of tasks and achieve advanced-level intelligence, contemporary approaches either deploy standalone models for individual tasks, or design a multi-task paradigm with separate heads. However, they might suffer from accumulative error… ▽ More

    Submitted 23 March, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: CVPR 2023 award candidate. Project page: https://opendrivelab.github.io/UniAD/

  8. arXiv:2209.05324  [pdf, other

    cs.CV cs.LG cs.RO

    Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

    Authors: Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jian** Shi, Dahua Lin, Yu Qiao

    Abstract: Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending and drawing extensive attention both from industry and academia. Conventional approaches for most autonomous driving algorithms perform detection, segmentation, tracking, etc., in a front or perspective view. As sensor configurations get more complex, integrating multi-source information from different sens… ▽ More

    Submitted 27 September, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: https://github.com/OpenDriveLab/Birds-eye-view-Perception

  9. arXiv:2203.17270  [pdf, other

    cs.CV

    BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

    Authors: Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Qiao Yu, Jifeng Dai

    Abstract: 3D visual perception tasks, including 3D detection and map segmentation based on multi-camera images, are essential for autonomous driving systems. In this work, we present a new framework termed BEVFormer, which learns unified BEV representations with spatiotemporal transformers to support multiple autonomous driving perception tasks. In a nutshell, BEVFormer exploits both spatial and temporal in… ▽ More

    Submitted 13 July, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted to ECCV 2022

  10. arXiv:2203.11089  [pdf, other

    cs.CV

    PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark

    Authors: Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jian** Shi, Yu Qiao, Junchi Yan

    Abstract: Methods for 3D lane detection have been recently proposed to address the issue of inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.). Previous work struggled in complex cases due to their simple designs of the spatial transformation between front view and bird's eye view (BEV) and the lack of a realistic dataset. Towards these issues, we present PersFormer:… ▽ More

    Submitted 19 July, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by ECCV 2022 (Oral). Project page: https://github.com/OpenPerceptionX/PersFormer_3DLane | OpenLane dataset: https://github.com/OpenPerceptionX/OpenLane

  11. BV equivalence with boundary

    Authors: Francisco Manuel Castela Simão, Alberto S. Cattaneo, Michele Schiavina

    Abstract: An extension of the notion of classical equivalence of equivalence in the Batalin--(Fradkin)--Vilkovisky (BV) and (BFV) framework for local Lagrangian field theory on manifolds possibly with boundary is discussed. Equivalence is phrased in both a strict and a lax sense, distinguished by the compatibility between the BV data for a field theory and its boundary BFV data, necessary for quantisation.… ▽ More

    Submitted 7 March, 2023; v1 submitted 11 September, 2021; originally announced September 2021.

    Comments: Published version

    MSC Class: 81T70; 83C47; 70S15; 70B05

    Journal ref: Letters in Mathematical Physics volume 113 (25), 2023

  12. arXiv:2104.13744  [pdf, other

    cs.DB

    Bio-SODA: Enabling Natural Language Question Answering over Knowledge Graphs without Training Data

    Authors: Ana Claudia Sima, Tarcisio Mendes de Farias, Maria Anisimova, Christophe Dessimoz, Marc Robinson-Rechavi, Erich Zbinden, Kurt Stockinger

    Abstract: The problem of natural language processing over structured data has become a growing research field, both within the relational database and the Semantic Web community, with significant efforts involved in question answering over knowledge graphs (KGQA). However, many of these approaches are either specifically targeted at open-domain question answering using DBpedia, or require large training dat… ▽ More

    Submitted 14 June, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Journal ref: 33rd International Conference on Scientific and Statistical Database Management (SSDBM 2021)

  13. Optical and mechanical properties of nanofibrillated cellulose: towards a robust platform for next-generation green technologies

    Authors: Claudia D. Simao, Juan S. Reparaz, Markus. R. Wagner, Bartlomiej Graczykowski, Martin Kreuzer, Yasser B. Ruiz-Blanco, Yamila Garcia, Jani-Markus Malho, Alejandro R. Goni, Jouni Ahopelto, Clivia M. Sotomayor Torres

    Abstract: Nanofibrillated cellulose, a polymer that can be obtained from one of the most abundant biopolymers in Nature, is being increasingly explored due to its outstanding properties for packaging and device applications. Still, open challenges in engineering its intrinsic properties remain to address. The results obtained show the precise determination of significant properties as elastic properties and… ▽ More

    Submitted 1 April, 2015; originally announced April 2015.

    Comments: in press in Carbohydrate Polymers (2015)

  14. Order quantification of hexagonal periodic arrays fabricated by in situ solvent-assisted nanoimprint lithography of block copolymers

    Authors: Claudia Simao, Worawut Khunsin, Nikolaos Kehagias, Mathieu Salaun, Marc Zelsmann, Michael A. Morris, Clivia M. Sotomayor Torres

    Abstract: Directed self-assembly of block copolymer polystyrene-b-polyethylene oxide (PS-b-PEO) thin film was achieved by one-pot methodology of solvent vapour assisted nanoimprint lithography (SAIL).

    Submitted 10 March, 2014; originally announced March 2014.

    Comments: 12 pages, 4 figures, paper accepted

    Journal ref: Nanotechnology (2014) 25 (7) 175703