Skip to main content

Showing 1–28 of 28 results for author: Qian, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11483  [pdf

    cs.CE

    Analysis of water injection heat recovery potential of abandoned oil wells to geothermal wells in northern Shaanxi

    Authors: Yu Huagui, Liu Shi, Pang Yanyan, Wang Peng, Gao Qian

    Abstract: The Chang 2 bottom water reservoir area in the western part of northern Shaanxi is one of the core oil-producing areas in the Ordos Basin.One of the main reservoirs is the Chang 2 reservoir of the Triassic Yanchang Formation, which has good physical conditions, active edge and bottom water, and high geothermal gradient. In this paper, the reservoir numerical simulation software CMG is used to simu… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Journal ref: Modern Electric Power, 2023, 1-9

  2. arXiv:2405.00728  [pdf

    cs.CL cs.AI cs.HC

    Evaluating the Application of ChatGPT in Outpatient Triage Guidance: A Comparative Study

    Authors: Dou Liu, Ying Han, Xiandi Wang, Xiaomei Tan, Di Liu, Guangwu Qian, Kang Li, Dan Pu, Rong Yin

    Abstract: The integration of Artificial Intelligence (AI) in healthcare presents a transformative potential for enhancing operational efficiency and health outcomes. Large Language Models (LLMs), such as ChatGPT, have shown their capabilities in supporting medical decision-making. Embedding LLMs in medical systems is becoming a promising trend in healthcare development. The potential of ChatGPT to address t… ▽ More

    Submitted 27 April, 2024; originally announced May 2024.

    Comments: 8 pages, 1 figure, conference(International Ergonomics Association)

  3. arXiv:2402.10128  [pdf, other

    cs.CV cs.GR cs.LG

    GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering

    Authors: Abdullah Hamdi, Luke Melas-Kyriazi, **jie Mai, Guocheng Qian, Ruoshi Liu, Carl Vondrick, Bernard Ghanem, Andrea Vedaldi

    Abstract: Advancements in 3D Gaussian Splatting have significantly accelerated 3D reconstruction and generation. However, it may require a large number of Gaussians, which creates a substantial memory footprint. This paper introduces GES (Generalized Exponential Splatting), a novel representation that employs Generalized Exponential Function (GEF) to model 3D scenes, requiring far fewer particles to represe… ▽ More

    Submitted 24 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: CVPR 2024 paper. project website https://abdullahamdi.com/ges

  4. arXiv:2402.05235  [pdf, other

    cs.CV

    SPAD : Spatially Aware Multiview Diffusers

    Authors: Yash Kant, Ziyi Wu, Michael Vasilkovsky, Guocheng Qian, Jian Ren, Riza Alp Guler, Bernard Ghanem, Sergey Tulyakov, Igor Gilitschenski, Aliaksandr Siarohin

    Abstract: We present SPAD, a novel approach for creating consistent multi-view images from text prompts or single images. To enable multi-view generation, we repurpose a pretrained 2D diffusion model by extending its self-attention layers with cross-view interactions, and fine-tune it on a high quality subset of Objaverse. We find that a naive extension of the self-attention proposed in prior work (e.g. MVD… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Webpage: https://yashkant.github.io/spad

  5. arXiv:2402.00867  [pdf, other

    cs.CV

    AToM: Amortized Text-to-Mesh using 2D Diffusion

    Authors: Guocheng Qian, Junli Cao, Aliaksandr Siarohin, Yash Kant, Chaoyang Wang, Michael Vasilkovsky, Hsin-Ying Lee, Yuwei Fang, Ivan Skorokhodov, Peiye Zhuang, Igor Gilitschenski, Jian Ren, Bernard Ghanem, Kfir Aberman, Sergey Tulyakov

    Abstract: We introduce Amortized Text-to-Mesh (AToM), a feed-forward text-to-mesh framework optimized across multiple text prompts simultaneously. In contrast to existing text-to-3D methods that often entail time-consuming per-prompt optimization and commonly output representations other than polygonal meshes, AToM directly generates high-quality textured meshes in less than 1 second with around 10 times re… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 19 pages with appendix and references. Webpage: https://snap-research.github.io/AToM/

  6. arXiv:2401.05583  [pdf, other

    cs.CV

    Diffusion Priors for Dynamic View Synthesis from Monocular Videos

    Authors: Chaoyang Wang, Peiye Zhuang, Aliaksandr Siarohin, Junli Cao, Guocheng Qian, Hsin-Ying Lee, Sergey Tulyakov

    Abstract: Dynamic novel view synthesis aims to capture the temporal evolution of visual content within videos. Existing methods struggle to distinguishing between motion and structure, particularly in scenarios where camera poses are either unknown or constrained compared to object motion. Furthermore, with information solely from reference images, it is extremely challenging to hallucinate unseen regions t… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  7. arXiv:2401.04105  [pdf, other

    cs.CV cs.AI

    Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning

    Authors: Chen Zhao, Shuming Liu, Karttikeya Mangalam, Guocheng Qian, Fatimah Zohra, Abdulmohsen Alghannam, Jitendra Malik, Bernard Ghanem

    Abstract: Large pretrained models are increasingly crucial in modern computer vision tasks. These models are typically used in downstream tasks by end-to-end finetuning, which is highly memory-intensive for tasks with high-resolution data, e.g., video understanding, small object detection, and point cloud analysis. In this paper, we propose Dynamic Reversible Dual-Residual Networks, or Dr$^2$Net, a novel fa… ▽ More

    Submitted 30 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Journal ref: the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  8. arXiv:2309.01839  [pdf, other

    cs.CR

    Designing a Security System Administration Course for Cybersecurity with a Companion Project

    Authors: Fei Zuo, Junghwan Rhee, Myungah Park, Gang Qian

    Abstract: In the past few years, an incident response-oriented cybersecurity program has been constructed at University of Central Oklahoma. As a core course in the newly-established curricula, Secure System Administration focuses on the essential knowledge and skill set for system administration. To enrich students with hands-on experience, we also develop a companion coursework project, named PowerGrader.… ▽ More

    Submitted 13 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted by the 37th Annual CCSC: Southeastern Conference

  9. arXiv:2306.17843  [pdf, other

    cs.CV

    Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

    Authors: Guocheng Qian, **jie Mai, Abdullah Hamdi, Jian Ren, Aliaksandr Siarohin, Bing Li, Hsin-Ying Lee, Ivan Skorokhodov, Peter Wonka, Sergey Tulyakov, Bernard Ghanem

    Abstract: We present Magic123, a two-stage coarse-to-fine approach for high-quality, textured 3D meshes generation from a single unposed image in the wild using both2D and 3D priors. In the first stage, we optimize a neural radiance field to produce a coarse geometry. In the second stage, we adopt a memory-efficient differentiable mesh representation to yield a high-resolution mesh with a visually appealing… ▽ More

    Submitted 23 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: webpage: https://guochengqian.github.io/project/magic123/

  10. arXiv:2306.04927  [pdf, other

    cs.CV

    An Efficient Transformer for Simultaneous Learning of BEV and Lane Representations in 3D Lane Detection

    Authors: Ziye Chen, Kate Smith-Miles, Bo Du, Guoqi Qian, Mingming Gong

    Abstract: Accurately detecting lane lines in 3D space is crucial for autonomous driving. Existing methods usually first transform image-view features into bird-eye-view (BEV) by aid of inverse perspective map** (IPM), and then detect lane lines based on the BEV features. However, IPM ignores the changes in road height, leading to inaccurate view transformations. Additionally, the two separate stages of th… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  11. arXiv:2306.00450  [pdf, other

    cs.CV

    Exploring Open-Vocabulary Semantic Segmentation without Human Labels

    Authors: Jun Chen, Deyao Zhu, Guocheng Qian, Bernard Ghanem, Zhicheng Yan, Chenchen Zhu, Fanyi Xiao, Mohamed Elhoseiny, Sean Chang Culatana

    Abstract: Semantic segmentation is a crucial task in computer vision that involves segmenting images into semantically meaningful regions at the pixel level. However, existing approaches often rely on expensive human annotations as supervision for model training, limiting their scalability to large, unlabeled datasets. To address this challenge, we present ZeroSeg, a novel method that leverages the existing… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  12. arXiv:2304.09349   

    cs.AI cs.CL cs.RO

    LLM as A Robotic Brain: Unifying Egocentric Memory and Control

    Authors: **jie Mai, Jun Chen, Bing Li, Guocheng Qian, Mohamed Elhoseiny, Bernard Ghanem

    Abstract: Embodied AI focuses on the study and development of intelligent systems that possess a physical or virtual embodiment (i.e. robots) and are able to dynamically interact with their environment. Memory and control are the two essential parts of an embodied system and usually require separate frameworks to model each of them. In this paper, we propose a novel and generalizable framework called LLM-Br… ▽ More

    Submitted 12 June, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: This early project is now integrated to: Mindstorms in Natural Language-Based Societies of Mind, arXiv:2305.17066

  13. arXiv:2302.10035  [pdf, other

    cs.CV cs.AI cs.MM

    Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey

    Authors: Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian, Wen Gao

    Abstract: With the urgent demand for generalized deep models, many pre-trained big models are proposed, such as BERT, ViT, GPT, etc. Inspired by the success of these models in single domains (like computer vision and natural language processing), the multi-modal pre-trained big models have also drawn more and more attention in recent years. In this work, we give a comprehensive survey of these models and ho… ▽ More

    Submitted 10 April, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted by Machine Intelligence Research (MIR)

  14. arXiv:2208.12259  [pdf, other

    cs.CV

    Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding

    Authors: Guocheng Qian, Abdullah Hamdi, Xingdi Zhang, Bernard Ghanem

    Abstract: While Transformers have achieved impressive success in natural language processing and computer vision, their performance on 3D point clouds is relatively poor. This is mainly due to the limitation of Transformers: a demanding need for extensive training data. Unfortunately, in the realm of 3D point clouds, the availability of large datasets is a challenge, exacerbating the issue of training Trans… ▽ More

    Submitted 2 February, 2024; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: camera-ready version at 3DV 2024

  15. arXiv:2206.04670  [pdf, other

    cs.CV cs.AI

    PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

    Authors: Guocheng Qian, Yuchen Li, Houwen Peng, **jie Mai, Hasan Abed Al Kader Hammoud, Mohamed Elhoseiny, Bernard Ghanem

    Abstract: PointNet++ is one of the most influential neural architectures for point cloud understanding. Although the accuracy of PointNet++ has been largely surpassed by recent networks such as PointMLP and Point Transformer, we find that a large portion of the performance gain is due to improved training strategies, i.e. data augmentation and optimization techniques, and increased model sizes rather than a… ▽ More

    Submitted 12 October, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted by NeurIPS'22. Code and models are available at https://github.com/guochengqian/pointnext

  16. arXiv:2204.04918  [pdf, other

    cs.AI

    When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search

    Authors: Guocheng Qian, Xuanyang Zhang, Guohao Li, Chen Zhao, Yukang Chen, Xiangyu Zhang, Bernard Ghanem, Jian Sun

    Abstract: The key challenge in neural architecture search (NAS) is designing how to explore wisely in the huge search space. We propose a new NAS method called TNAS (NAS with trees), which improves search efficiency by exploring only a small number of architectures while also achieving a higher search accuracy. TNAS introduces an architecture tree and a binary operation tree, to factorize the search space a… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 4 pages, accepted at CVPR Workshop 2022 (ECV2022)

  17. arXiv:2201.08636  [pdf, ps, other

    cs.CV cs.AI

    Conceptor Learning for Class Activation Map**

    Authors: Guangwu Qian, Zhen-Qun Yang, Xu-Lu Zhang, Yaowei Wang, Qing Li, Xiao-Yong Wei

    Abstract: Class Activation Map** (CAM) has been widely adopted to generate saliency maps which provides visual explanations for deep neural networks (DNNs). The saliency maps are conventionally generated by fusing the channels of the target feature map using a weighted average scheme. It is a weak model for the inter-channel relation, in the sense that it only models the relation among channels in a contr… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

  18. arXiv:2201.00259  [pdf, other

    eess.IV cs.CV cs.MM

    Subspace modeling for fast and high-sensitivity X-ray chemical imaging

    Authors: Jizhou Li, Bin Chen, Guibin Zan, Guannan Qian, Piero Pianetta, Yi** Liu

    Abstract: Resolving morphological chemical phase transformations at the nanoscale is of vital importance to many scientific and industrial applications across various disciplines. The TXM-XANES imaging technique, by combining full field transmission X-ray microscopy (TXM) and X-ray absorption near edge structure (XANES), has been an emerging tool which operates by acquiring a series of microscopy images wit… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

  19. arXiv:2112.14327  [pdf, other

    cs.CV cs.LG

    Multi-Head Deep Metric Learning Using Global and Local Representations

    Authors: Mohammad K. Ebrahimpour, Gang Qian, Allison Beach

    Abstract: Deep Metric Learning (DML) models often require strong local and global representations, however, effective integration of local and global features in DML model training is a challenge. DML models are often trained with specific loss functions, including pairwise-based and proxy-based losses. The pairwise-based loss functions leverage rich semantic relations among data points, however, they often… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

    Comments: To appear in WACV 2022

  20. arXiv:2110.10538  [pdf, other

    cs.CV cs.LG

    ASSANet: An Anisotropic Separable Set Abstraction for Efficient Point Cloud Representation Learning

    Authors: Guocheng Qian, Hasan Abed Al Kader Hammoud, Guohao Li, Ali Thabet, Bernard Ghanem

    Abstract: Access to 3D point cloud representations has been widely facilitated by LiDAR sensors embedded in various mobile devices. This has led to an emerging need for fast and accurate point cloud processing techniques. In this paper, we revisit and dive deeper into PointNet++, one of the most influential yet under-explored networks, and develop faster and more accurate variants of the model. We first pre… ▽ More

    Submitted 24 October, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: ASSANet gets accepted to NeurIPS'21 as a Spotlight paper. code available at https://github.com/guochengqian/ASSANet

  21. arXiv:1912.03264  [pdf, other

    cs.CV cs.CG cs.LG

    PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks

    Authors: Guocheng Qian, Abdulellah Abualshour, Guohao Li, Ali Thabet, Bernard Ghanem

    Abstract: The effectiveness of learning-based point cloud upsampling pipelines heavily relies on the upsampling modules and feature extractors used therein. For the point upsampling module, we propose a novel model called NodeShuffle, which uses a Graph Convolutional Network (GCN) to better encode local point information from point neighborhoods. NodeShuffle is versatile and can be incorporated into any poi… ▽ More

    Submitted 29 March, 2021; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: Get accepted to CVPR 2021. The source code of this work is available at https://github.com/guochengqian/PU-GCN

  22. arXiv:1912.00195  [pdf, other

    cs.LG cs.CV stat.ML

    SGAS: Sequential Greedy Architecture Search

    Authors: Guohao Li, Guocheng Qian, Itzel C. Delgadillo, Matthias Müller, Ali Thabet, Bernard Ghanem

    Abstract: Architecture design has become a crucial component of successful deep learning. Recent progress in automatic neural architecture search (NAS) shows a lot of promise. However, discovered architectures often fail to generalize in the final evaluation. Architectures with a higher validation accuracy during the search phase may perform worse in the evaluation. Aiming to alleviate this common issue, we… ▽ More

    Submitted 2 April, 2020; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: Accepted at CVPR'2020. Project website: https://www.deepgcns.org/auto/sgas

  23. arXiv:1910.06849  [pdf, other

    cs.CV cs.LG eess.IV

    DeepGCNs: Making GCNs Go as Deep as CNNs

    Authors: Guohao Li, Matthias Müller, Guocheng Qian, Itzel C. Delgadillo, Abdulellah Abualshour, Ali Thabet, Bernard Ghanem

    Abstract: Convolutional Neural Networks (CNNs) have been very successful at solving a variety of computer vision tasks such as object classification and detection, semantic segmentation, activity understanding, to name just a few. One key enabling factor for their great performance has been the ability to train very deep networks. Despite their huge success in many tasks, CNNs do not work well with non-Eucl… ▽ More

    Submitted 14 May, 2021; v1 submitted 15 October, 2019; originally announced October 2019.

    Comments: Accepted at TPAMI. This work is a journal extension of our ICCV'19 paper arXiv:1904.03751. The first three authors contributed equally

  24. arXiv:1905.02538  [pdf, other

    eess.IV cs.CV

    Rethinking Learning-based Demosaicing, Denoising, and Super-Resolution Pipeline

    Authors: Guocheng Qian, Yuanhao Wang, **** Gu, Chao Dong, Wolfgang Heidrich, Bernard Ghanem, Jimmy S. Ren

    Abstract: Imaging is usually a mixture problem of incomplete color sampling, noise degradation, and limited resolution. This mixture problem is typically solved by a sequential solution that applies demosaicing (DM), denoising (DN), and super-resolution (SR) sequentially in a fixed and predefined pipeline (execution order of tasks), DM$\to$DN$\to$SR. The most recent work on image processing focuses on devel… ▽ More

    Submitted 24 March, 2023; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: Accepted at ICCP'22. Code is available at: https://github.com/guochengqian/TENet

  25. arXiv:1901.08013  [pdf, other

    cs.NE cs.LG stat.ML

    DarwinML: A Graph-based Evolutionary Algorithm for Automated Machine Learning

    Authors: Fei Qi, Zhaohui Xia, Gaoyang Tang, Hang Yang, Yu Song, Guangrui Qian, Xiong An, Chunhuan Lin, Guangming Shi

    Abstract: As an emerging field, Automated Machine Learning (AutoML) aims to reduce or eliminate manual operations that require expertise in machine learning. In this paper, a graph-based architecture is employed to represent flexible combinations of ML models, which provides a large searching space compared to tree-based and stacking-based architectures. Based on this, an evolutionary algorithm is proposed… ▽ More

    Submitted 20 November, 2018; originally announced January 2019.

    Comments: 8 pages, 7 figures, 3 tables

  26. arXiv:1901.05593  [pdf

    cs.LG eess.SP stat.ML

    Quadratic Autoencoder (Q-AE) for Low-dose CT Denoising

    Authors: Fenglei Fan, Hongming Shan, Mannudeep K. Kalra, Ramandeep Singh, Guhan Qian, Matthew Getzin, Yueyang Teng, Juergen Hahn, Ge Wang

    Abstract: Inspired by complexity and diversity of biological neurons, our group proposed quadratic neurons by replacing the inner product in current artificial neurons with a quadratic operation on input data, thereby enhancing the capability of an individual neuron. Along this direction, we are motivated to evaluate the power of quadratic neurons in popular network architectures, simulating human-like lear… ▽ More

    Submitted 30 October, 2019; v1 submitted 16 January, 2019; originally announced January 2019.

  27. arXiv:1310.0611  [pdf

    cs.IT

    Map** and Coding Design for Channel Coded Physical-layer Network Coding

    Authors: Xu Li, Shengli Zhang, Gongbin Qian

    Abstract: Although BICM can significantly improves the BER performance by iteration processing between the demap** and the decoding in a traditional receiver, its design and performance in PNC system has fewer studied. This paper investigates a bit interleaved coded modulation (BICM) scheme in a Gaussian two-way relay channel operated with physical layer network coding (PNC). In particular, we first prese… ▽ More

    Submitted 2 October, 2013; originally announced October 2013.

    Comments: 6 pages and will appear in a conference

  28. arXiv:1206.6938  [pdf, ps, other

    cs.IT

    MIMO Physical Layer Network Coding Based on VBLAST Detection

    Authors: Shengli Zhang, Can** Nie, Liya Lu, Gongbin Qian

    Abstract: For MIMO two-way relay channel, this paper proposes a novel scheme, VBLAST-PNC, to transform the two superimposed packets received by the relay to their network coding form. Different from traditional schemes, which tries to detect each packet before network coding them, VBLAST-PNC detects the summation of the two packets before network coding. In particular, after firstly detecting the second lay… ▽ More

    Submitted 29 June, 2012; originally announced June 2012.