Skip to main content

Showing 1–14 of 14 results for author: Gao, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18549  [pdf

    eess.IV cs.CV

    Advancements in Feature Extraction Recognition of Medical Imaging Systems Through Deep Learning Technique

    Authors: Qishi Zhan, Dan Sun, Erdi Gao, Yuhan Ma, Yaxin Liang, Haowei Yang

    Abstract: This study introduces a novel unsupervised medical image feature extraction method that employs spatial stratification techniques. An objective function based on weight is proposed to achieve the purpose of fast image recognition. The algorithm divides the pixels of the image into multiple subdomains and uses a quadtree to access the image. A technique for threshold optimization utilizing a simple… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

    Comments: conference

  2. arXiv:2406.08838  [pdf

    cs.CL cs.AI cs.LG

    Research on Optimization of Natural Language Processing Model Based on Multimodal Deep Learning

    Authors: Dan Sun, Yaxin Liang, Yining Yang, Yuhan Ma, Qishi Zhan, Erdi Gao

    Abstract: This project intends to study the image representation based on attention mechanism and multimodal data. By adding multiple pattern layers to the attribute model, the semantic and hidden layers of image content are integrated. The word vector is quantified by the Word2Vec method and then evaluated by a word embedding convolutional neural network. The published experimental results of the two group… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2403.13430  [pdf, other

    cs.CV

    MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

    Authors: Di Wang, **g Zhang, Minqiang Xu, Lin Liu, Dongsheng Wang, Erzhong Gao, Chengxi Han, Haonan Guo, Bo Du, Dacheng Tao, Liangpei Zhang

    Abstract: Foundation models have reshaped the landscape of Remote Sensing (RS) by enhancing various image interpretation tasks. Pretraining is an active research topic, encompassing supervised and self-supervised learning methods to initialize model weights effectively. However, transferring the pretrained models to downstream tasks may encounter task discrepancy due to their formulation of pretraining as i… ▽ More

    Submitted 29 May, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE JSTARS Special issue on "Large-Scale Pretraining for Interpretation Promotion in Remote Sensing Domain". The codes and pretrained models are available at https://github.com/ViTAE-Transformer/MTP

  4. arXiv:2401.08537  [pdf

    cs.CL

    Spatial Entity Resolution between Restaurant Locations and Transportation Destinations in Southeast Asia

    Authors: Emily Gao, Dominic Widdows

    Abstract: As a tech company, Grab has expanded from transportation to food delivery, aiming to serve Southeast Asia with hyperlocalized applications. Information about places as transportation destinations can help to improve our knowledge about places as restaurants, so long as the spatial entity resolution problem between these datasets can be solved. In this project, we attempted to recognize identical p… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Journal ref: 6th International Conference on Geospatial Information Systems Theory, Applications, and Management. GISTAM 2020, Prague, Czech Republic, May 7-9, 2020

  5. arXiv:2401.04739  [pdf

    cs.CV cs.AI

    Content-Conditioned Generation of Stylized Free hand Sketches

    Authors: Jiajun Liu, Siyuan Wang, Guangming Zhu, Liang Zhang, Ning Li, Eryang Gao

    Abstract: In recent years, the recognition of free-hand sketches has remained a popular task. However, in some special fields such as the military field, free-hand sketches are difficult to sample on a large scale. Common data augmentation and image generation techniques are difficult to produce images with various free-hand sketching styles. Therefore, the recognition and segmentation tasks in related fiel… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 6 pages, 7 figures, ICSMD

  6. arXiv:2401.03828  [pdf

    cs.CV

    A multimodal gesture recognition dataset for desktop human-computer interaction

    Authors: Qi Wang, Fengchao Zhu, Guangming Zhu, Liang Zhang, Ning Li, Eryang Gao

    Abstract: Gesture recognition is an indispensable component of natural and efficient human-computer interaction technology, particularly in desktop-level applications, where it can significantly enhance people's productivity. However, the current gesture recognition community lacks a suitable desktop-level (top-view perspective) dataset for lightweight gesture capture devices. In this study, we have establi… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  7. arXiv:2312.07999  [pdf, other

    cs.GT econ.TH

    Random Serial Dictatorship with Transfers

    Authors: Sudharsan Sundar, Eric Gao, Trevor Chow, Matthew Ding

    Abstract: It is well known that Random Serial Dictatorship is strategy-proof and leads to a Pareto-Efficient outcome. We show that this result breaks down when individuals are allowed to make transfers, and adapt Random Serial Dictatorship to encompass trades between individuals. Strategic analysis of play under the new mechanisms we define is given, accompanied by simulations to quantify the gains from tra… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  8. arXiv:2302.02394  [pdf, other

    cs.CV

    Eliminating Contextual Prior Bias for Semantic Image Editing via Dual-Cycle Diffusion

    Authors: Zuopeng Yang, Tianshu Chu, Xin Lin, Erdun Gao, Daqing Liu, Jie Yang, Chaoyue Wang

    Abstract: The recent success of text-to-image generation diffusion models has also revolutionized semantic image editing, enabling the manipulation of images based on query/target texts. Despite these advancements, a significant challenge lies in the potential introduction of contextual prior bias in pre-trained models during image editing, e.g., making unexpected modifications to inappropriate regions. To… ▽ More

    Submitted 5 October, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: This paper has been accepted by the IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  9. arXiv:2209.02946  [pdf, other

    stat.ML cs.LG

    On the Sparse DAG Structure Learning Based on Adaptive Lasso

    Authors: Danru Xu, Erdun Gao, Wei Huang, Menghan Wang, Andy Song, Mingming Gong

    Abstract: Learning the underlying Bayesian Networks (BNs), represented by directed acyclic graphs (DAGs), of the concerned events from purely-observational data is a crucial part of evidential reasoning. This task remains challenging due to the large and discrete search space. A recent flurry of developments followed NOTEARS[1] recast this combinatorial problem into a continuous optimization problem by leve… ▽ More

    Submitted 17 February, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 11 pages, 8 figures

  10. arXiv:2205.13869  [pdf, other

    cs.LG stat.ML

    MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models

    Authors: Erdun Gao, Ignavier Ng, Mingming Gong, Li Shen, Wei Huang, Tongliang Liu, Kun Zhang, Howard Bondell

    Abstract: State-of-the-art causal discovery methods usually assume that the observational data is complete. However, the missing data problem is pervasive in many practical scenarios such as clinical trials, economics, and biology. One straightforward way to address the missing data problem is first to impute the data using off-the-shelf imputation methods and then apply existing causal discovery methods. H… ▽ More

    Submitted 16 January, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS22

  11. arXiv:2112.03555  [pdf, other

    cs.LG stat.ML

    FedDAG: Federated DAG Structure Learning

    Authors: Erdun Gao, Junjia Chen, Li Shen, Tongliang Liu, Mingming Gong, Howard Bondell

    Abstract: To date, most directed acyclic graphs (DAGs) structure learning approaches require data to be stored in a central server. However, due to the consideration of privacy protection, data owners gradually refuse to share their personalized raw data to avoid private information leakage, making this task more troublesome by cutting off the first step. Thus, a puzzle arises: \textit{how do we discover th… ▽ More

    Submitted 16 January, 2023; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to Transactions on Machine Learning Research

  12. arXiv:2107.03227  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Scalable Data Balancing for Unlabeled Satellite Imagery

    Authors: Deep Patel, Erin Gao, Anirudh Koul, Siddha Ganju, Meher Anand Kasam

    Abstract: Data imbalance is a ubiquitous problem in machine learning. In large scale collected and annotated datasets, data imbalance is either mitigated manually by undersampling frequent classes and oversampling rare classes, or planned for with imputation and augmentation techniques. In both cases balancing data requires labels. In other words, only annotated data can be balanced. Collecting fully annota… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted to COSPAR 2021 Workshop on Machine Learning for Space Sciences. 5 pages, 9 figures

  13. arXiv:2102.03629  [pdf

    cs.HC eess.SP

    EEG-based Investigation of the Impact of Classroom Design on Cognitive Performance of Students

    Authors: Jesus G. Cruz-Garza, Michael Darfler, James D. Rounds, Elita Gao, Saleh Kalantari

    Abstract: This study investigated the neural dynamics associated with short-term exposure to different virtual classroom designs with different window placement and room dimension. Participants engaged in five brief cognitive tasks in each design condition including the Stroop Test, the Digit Span Test, the Benton Test, a Visual Memory Test, and an Arithmetic Test. Performance on the cognitive tests and Ele… ▽ More

    Submitted 6 February, 2021; originally announced February 2021.

  14. arXiv:1908.05965  [pdf

    cs.MM

    Adaptive Embedding Pattern for Grayscale-Invariance Reversible Data Hiding

    Authors: Erdun Gao, Zhibin Pan, Xinyi Gao

    Abstract: In traditional reversible data hiding (RDH) methods, researchers pay attention to enlarge the embedding capacity (EC) and to reduce the embedding distortion (ED). Recently, a completely novel RDH algorithm was developed to embed secret data into color image without changing the corresponding grayscale [1], which largely expands the applications of RDH. In [1], for color image, channel R and channe… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.