Showing 1–50 of 57 results for author: Nie, Z

Searching in archive cs.

  1. arXiv:2406.18254  [pdf, other]

    cs.IR cs.AI cs.MM

    Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning

    Authors: Zhijie Nie, Richong Zhang, Zhangchi Feng, Hailang Huang, Xudong Liu

    Abstract: Cross-lingual Cross-modal Retrieval (CCR) is an essential task in web search, which aims to break the barriers between modality and language simultaneously and achieve image-text retrieval in the multi-lingual scenario with a single model. In recent years, excellent progress has been made based on cross-lingual cross-modal pre-training; particularly, the methods based on contrastive learning on l…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024 Research Track

  2. arXiv:2406.17378  [pdf, other]

    cs.CL cs.IR

    A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens

    Authors: Zhijie Nie, Richong Zhang, Zhanyu Wu

    Abstract: Text embeddings from large language models (LLMs) have achieved excellent results in tasks such as information retrieval, semantic textual similarity, etc. In this work, we show an interesting finding: when feeding a text into the embedding LLMs, the obtained text embedding will be able to be aligned with the key tokens in the input text. We first fully analyze this phenomenon on eight embedding L…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  3. arXiv:2406.09841  [pdf, other]

    cs.LG q-bio.BM

    Learning Multi-view Molecular Representations with Structured and Unstructured Knowledge

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zikun Nie, Hao Zhou, Zaiqing Nie

    Abstract: Capturing molecular knowledge with representation learning approaches holds significant potential in vast scientific fields such as chemistry and life science. An effective and generalizable molecular representation is expected to capture the consensus and complementary molecular expertise from diverse views and perspectives. However, existing works fall short in learning multi-view molecular repr…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  4. arXiv:2405.15158  [pdf, other]

    q-bio.BM cs.LG

    ProtFAD: Introducing function-aware domains as implicit modality towards protein function perception

    Authors: Mingqing Wang, Zhiwei Nie, Yonghong He, Zhixiang Ren

    Abstract: Protein function prediction is currently achieved by encoding its sequence or structure, where the sequence-to-function transcendence and high-quality structural data scarcity lead to obvious performance bottlenecks. Protein domains are "building blocks" of proteins that are functionally independent, and their combinations determine the diverse biological functions. However, most existing studies…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 16 pages, 6 figures, 5 tables

  5. arXiv:2405.06708  [pdf, other]

    q-bio.GN cs.AI cs.CL

    LangCell: Language-Cell Pre-training for Cell Identity Understanding

    Authors: Suyuan Zhao, Jiahuan Zhang, Yushuai Wu, Yizhen Luo, Zaiqing Nie

    Abstract: Cell identity encompasses various semantic aspects of a cell, including cell type, pathway information, disease information, and more, which are essential for biologists to gain insights into its biological characteristics. Understanding cell identity from the transcriptomic data, such as annotating cell types, has become an important task in bioinformatics. As these semantic aspects are determine…

    Submitted 11 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024, code released

  6. arXiv:2404.11317  [pdf, other]

    cs.CV cs.AI

    Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives

    Authors: Zhangchi Feng, Richong Zhang, Zhijie Nie

    Abstract: The Composed Image Retrieval (CIR) task aims to retrieve target images using a composed query consisting of a reference image and a modified text. Advanced methods often utilize contrastive learning as the optimization objective, which benefits from adequate positive and negative examples. However, the triplet for CIR incurs high manual annotation costs, resulting in limited positive examples. Fur…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 12 pages, 11 figures

  7. arXiv:2404.09729  [pdf]

    eess.SP cs.IT cs.LG stat.ME

    Amplitude-Phase Fusion for Enhanced Electrocardiogram Morphological Analysis

    Authors: Shuaicong Hu, Yanan Wang, Jian Liu, Jingyu Lin, Shengmei Qin, Zhenning Nie, Zhifeng Yao, Wenjie Cai, Cuiwei Yang

    Abstract: Considering the variability of amplitude and phase patterns in electrocardiogram (ECG) signals due to cardiac activity and individual differences, existing entropy-based studies have not fully utilized these two patterns and lack integration. To address this gap, this paper proposes a novel fusion entropy metric, morphological ECG entropy (MEE) for the first time, specifically designed for ECG mor…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 16 pages, 12 figures

    ACM Class: I.5.2

  8. arXiv:2404.04395  [pdf, ps, other]

    cs.CC

    A Critique of Du's "A Polynomial-Time Algorithm for 3-SAT"

    Authors: Yumeng He, Matan Kotler-Berkowitz, Harry Liuson, Zeyu Nie

    Abstract: In this paper, we examine the claims made by the paper "A polynomial-time algorithm for 3-SAT" by Lizhi Du. The paper claims to provide a polynomial-time algorithm for solving the NP-complete problem 3-SAT. In examining the paper's argument, we find a flaw in one of the main sections of its algorithm. We argue that this flaw causes the paper's algorithm to incorrectly decide that an infinite famil…

    Submitted 5 April, 2024; originally announced April 2024.

  9. arXiv:2404.00717  [pdf, other]

    cs.RO cs.CV cs.MA

    End-to-End Autonomous Driving through V2X Cooperation

    Authors: Haibao Yu, Wenxian Yang, Jiaru Zhong, Zhenwei Yang, Siqi Fan, Ping Luo, Zaiqing Nie

    Abstract: Cooperatively utilizing both ego-vehicle and infrastructure sensor data via V2X communication has emerged as a promising approach for advanced autonomous driving. However, current research mainly focuses on improving individual modules, rather than taking end-to-end learning to optimize final planning performance, resulting in underutilized data potential. In this paper, we introduce UniV2X, a pio…

    Submitted 19 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  10. arXiv:2403.12995  [pdf, other]

    q-bio.BM cs.CE cs.LG

    ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling

    Authors: Kangjie Zheng, Siyu Long, Tianyu Lu, Junwei Yang, Xinyu Dai, Ming Zhang, Zaiqing Nie, Wei-Ying Ma, Hao Zhou

    Abstract: Protein language models have demonstrated significant potential in the field of protein engineering. However, current protein language models primarily operate at the residue scale, which limits their ability to provide information at the atom level. This limitation prevents us from fully exploiting the capabilities of protein language models for applications involving both proteins and small mole…

    Submitted 12 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: ICML2024 camera-ready, update some experimental results, add github url, fix some typos

  11. arXiv:2403.10145  [pdf, other]

    cs.CV cs.RO

    RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception

    Authors: Ruiyang Hao, Siqi Fan, Yingru Dai, Zhenlin Zhang, Chenxi Li, Yuntian Wang, Haibao Yu, Wenxian Yang, Jirui Yuan, Zaiqing Nie

    Abstract: The value of roadside perception, which could extend the boundaries of autonomous driving and traffic management, has gradually become more prominent and acknowledged in recent years. However, existing roadside perception approaches only focus on the single-infrastructure sensor system, which cannot realize a comprehensive understanding of a traffic area because of the limited sensing range and bl…

    Submitted 31 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024. 10 pages with 6 figures

    ACM Class: I.4.8; I.5.4

  12. Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval

    Authors: Hailang Huang, Zhijie Nie, Ziqiao Wang, Ziyu Shang

    Abstract: Current image-text retrieval methods have demonstrated impressive performance in recent years. However, they still face two problems: the inter-modal matching missing problem and the intra-modal semantic loss problem. These problems can significantly affect the accuracy of image-text retrieval. To address these challenges, we propose a novel method called Cross-modal and Uni-modal Soft-label Align…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 9 pages, Accepted by AAAI2024

  13. arXiv:2403.03768  [pdf, other]

    cs.AI cs.LG q-bio.QM

    DeepCRE: Transforming Drug R&D via AI-Driven Cross-drug Response Evaluation

    Authors: Yushuai Wu, Ting Zhang, Hao Zhou, Hainan Wu, Hanwen Sunchu, Lei Hu, Xiaofang Chen, Suyuan Zhao, Gaochao Liu, Chao Sun, Jiahuan Zhang, Yizhen Luo, Peng Liu, Zaiqing Nie, Yushuai Wu

    Abstract: The fields of therapeutic application and drug research and development (R&D) both face substantial challenges, i.e., the therapeutic domain calls for more treatment alternatives, while numerous promising pre-clinical drugs have failed in clinical trials. One of the reasons is the inadequacy of Cross-drug Response Evaluation (CRE) during the late stages of drug R&D. Although in-silico CRE models b…

    Submitted 18 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  14. arXiv:2402.18281  [pdf, other]

    cs.CL

    Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for Gradient

    Authors: Mingxin Li, Richong Zhang, Zhijie Nie

    Abstract: Sentence Representation Learning (SRL) is a crucial task in Natural Language Processing (NLP), where contrastive Self-Supervised Learning (SSL) is currently a mainstream approach. However, the reasons behind its remarkable effectiveness remain unclear. Specifically, many studies have investigated the similarities between contrastive and non-contrastive SSL from a theoretical perspective. Such simi…

    Submitted 5 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted at ACL 2024 Main Conference

  15. arXiv:2312.06706  [pdf, other]

    cs.CV

    UNeR3D: Versatile and Scalable 3D RGB Point Cloud Generation from 2D Images in Unsupervised Reconstruction

    Authors: Hongbin Lin, Juangui Xu, Qingfeng Xu, Zhengyu Hu, Handing Xu, Yunzhi Chen, Yongjun Hu, Zhenguo Nie

    Abstract: In the realm of 3D reconstruction from 2D images, a persisting challenge is to achieve high-precision reconstructions devoid of 3D Ground Truth data reliance. We present UNeR3D, a pioneering unsupervised methodology that sets a new standard for generating detailed 3D reconstructions solely from 2D views. Our model significantly cuts down the training costs tied to supervised approaches and introdu…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 17 pages

  16. arXiv:2312.02071  [pdf, ps, other]

    cs.CC

    Evaluating the Claims of "SAT Requires Exhaustive Search"

    Authors: Michael C. Chavrimootoo, Yumeng He, Matan Kotler-Berkowitz, Harry Liuson, Zeyu Nie

    Abstract: In this paper, we take a closer look at the claims made by Xu and Zhou in their paper "SAT Requires Exhaustive Search" [XZ23], which claims to provide a lower bound on the complexity of the so-called Model RB. Xu and Zhou conclude that their result implies a separation between P and NP, since the lower bound purportedly proves that the Strong Exponential Time Hypothesis (SETH) is true. In examinin…

    Submitted 4 December, 2023; originally announced December 2023.

  17. arXiv:2311.01682  [pdf, other]

    cs.CV

    Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection

    Authors: Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Ping Luo, Zaiqing Nie

    Abstract: Cooperatively utilizing both ego-vehicle and infrastructure sensor data can significantly enhance autonomous driving perception abilities. However, the uncertain temporal asynchrony and limited communication conditions can lead to fusion misalignment and constrain the exploitation of infrastructure data. To address these issues in vehicle-infrastructure cooperative 3D (VIC3D) object detection, we…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023. arXiv admin note: text overlap with arXiv:2303.10552

  18. arXiv:2311.00371  [pdf, other]

    cs.CV

    Learning Cooperative Trajectory Representations for Motion Forecasting

    Authors: Hongzhi Ruan, Haibao Yu, Wenxian Yang, Siqi Fan, Yingjuan Tang, Zaiqing Nie

    Abstract: Motion forecasting is an essential task for autonomous driving, and the effective information utilization from infrastructure and other vehicles can enhance motion forecasting capabilities. Existing research has primarily focused on leveraging single-frame cooperative information to enhance the limited perception capability of the ego vehicle, while underutilizing the motion and interaction infor…

    Submitted 1 November, 2023; originally announced November 2023.

  19. arXiv:2309.06453  [pdf, other]

    cs.CL cs.LG

    Narrowing the Gap between Supervised and Unsupervised Sentence Representation Learning with Large Language Model

    Authors: Mingxin Li, Richong Zhang, Zhijie Nie, Yongyi Mao

    Abstract: Sentence Representation Learning (SRL) is a fundamental task in Natural Language Processing (NLP), with the Contrastive Learning of Sentence Embeddings (CSE) being the mainstream technique due to its superior performance. An intriguing phenomenon in CSE is the significant performance gap between supervised and unsupervised methods, with their only difference lying in the training data. Previous wo…

    Submitted 19 December, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted at AAAI24

  20. arXiv:2309.04695  [pdf, other]

    cs.CL cs.AI

    Code-Style In-Context Learning for Knowledge-Based Question Answering

    Authors: Zhijie Nie, Richong Zhang, Zhongyuan Wang, Xudong Liu

    Abstract: Current methods for Knowledge-Based Question Answering (KBQA) usually rely on complex training techniques and model frameworks, leading to many limitations in practical applications. Recently, the emergence of In-Context Learning (ICL) capabilities in Large Language Models (LLMs) provides a simple and training-free semantic parsing paradigm for KBQA: Given a small number of questions and their lab…

    Submitted 5 January, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: AAAI2024 Camera Ready

  21. arXiv:2308.09442  [pdf, other]

    cs.CE

    BioMedGPT: Open Multimodal Generative Pre-trained Transformer for BioMedicine

    Authors: Yizhen Luo, Jiahuan Zhang, Siqi Fan, Kai Yang, Yushuai Wu, Mu Qiao, Zaiqing Nie

    Abstract: Foundation models (FMs) have exhibited remarkable performance across a wide range of downstream tasks in many domains. Nevertheless, general-purpose FMs often face challenges when confronted with domain-specific problems, due to their limited access to the proprietary training data in a particular domain. In biomedicine, there are various biological modalities, such as molecules, proteins, and cel…

    Submitted 21 August, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 12 pages, 4 figures

  22. arXiv:2308.09021  [pdf, ps, other]

    cs.DS

    Simpler Analyses of Union-Find

    Authors: Zhiyi Huang, Chris Lambert, Zipei Nie, Richard Peng

    Abstract: We analyze union-find using potential functions motivated by continuous algorithms, and give alternate proofs of the $O(\log\log{n})$, $O(\log^{*}n)$, $O(\log^{**}n)$, and $O(\alpha(n))$ amortized cost upper bounds. The proof of the $O(\log\log{n})$ amortized bound goes as follows. Let each node's potential be the square root of its size, i.e., the size of the subtree rooted from it. The overall potent…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 13 pages, 1 figure

  23. arXiv:2308.01804  [pdf, other]

    cs.CV

    QUEST: Query Stream for Practical Cooperative Perception

    Authors: Siqi Fan, Haibao Yu, Wenxian Yang, Jirui Yuan, Zaiqing Nie

    Abstract: Cooperative perception can effectively enhance individual perception performance by providing additional viewpoint and expanding the sensing field. Existing cooperation paradigms are either interpretable (result cooperation) or flexible (feature cooperation). In this paper, we propose the concept of query cooperation to enable interpretable instance-level flexible feature interaction. To specifica…

    Submitted 22 May, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: ICRA 2024

  24. arXiv:2307.12213  [pdf, other]

    cs.HC

    LiveRetro: Visual Analytics for Strategic Retrospect in Livestream E-Commerce

    Authors: Yuchen Wu, Yuansong Xu, Shenghan Gao, Xingbo Wang, Wenkai Song, Zhiheng Nie, Xiaomeng Fan, Quan Li

    Abstract: Livestream e-commerce integrates live streaming and online shopping, allowing viewers to make purchases while watching. However, effective marketing strategies remain a challenge due to limited empirical research and subjective biases from the absence of quantitative data. Current tools fail to capture the interdependence between live performances and feedback. This study identified computational…

    Submitted 2 August, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE VIS 2023

  25. arXiv:2307.09484  [pdf, other]

    q-bio.BM cs.CE cs.LG physics.chem-ph

    MolFM: A Multimodal Molecular Foundation Model

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zaiqing Nie

    Abstract: Molecular knowledge resides within three different modalities of information sources: molecular structures, biomedical documents, and knowledge bases. Effective incorporation of molecular knowledge from these modalities holds paramount significance in facilitating biomedical research. However, existing multimodal molecular foundation models exhibit limitations in capturing intricate connections be…

    Submitted 21 July, 2023; v1 submitted 6 June, 2023; originally announced July 2023.

    Comments: 31 pages, 15 figures, and 15 tables

  26. arXiv:2306.04371  [pdf, other]

    cs.CE

    Large-Scale Cell Representation Learning via Divide-and-Conquer Contrastive Learning

    Authors: Suyuan Zhao, Jiahuan Zhang, Zaiqing Nie

    Abstract: Single-cell RNA sequencing (scRNA-seq) data is a potent tool for comprehending the "language of life" and can provide insights into various downstream biomedical tasks. Large-scale language models (LLMs) are starting to be used for cell representation learning. However, current LLM-based cell representation learning methods depend solely on the BERT architecture, causing an anisotropic embedding s…

    Submitted 7 June, 2023; originally announced June 2023.

  27. arXiv:2305.05938  [pdf, other]

    cs.CV cs.AI

    V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting

    Authors: Haibao Yu, Wenxian Yang, Hongzhi Ruan, Zhenwei Yang, Yingjuan Tang, Xu Gao, Xin Hao, Yifeng Shi, Yifeng Pan, Ning Sun, Juan Song, Jirui Yuan, Ping Luo, Zaiqing Nie

    Abstract: Utilizing infrastructure and vehicle-side information to track and forecast the behaviors of surrounding traffic participants can significantly improve decision-making and safety in autonomous driving. However, the lack of real-world sequential datasets limits research in this area. To address this issue, we introduce V2X-Seq, the first large-scale sequential V2X dataset, which includes data frame…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: CVPR2023

  28. arXiv:2305.01523  [pdf, other]

    cs.LG cs.AI cs.CE

    Towards Unified AI Drug Discovery with Multiple Knowledge Modalities

    Authors: Yizhen Luo, Xing Yi Liu, Kai Yang, Kui Huang, Massimo Hong, Jiahuan Zhang, Yushuai Wu, Zaiqing Nie

    Abstract: In recent years, AI models that mine intrinsic patterns from molecular structures and protein sequences have shown promise in accelerating drug discovery. However, these methods partly lag behind real-world pharmaceutical approaches of human experts that additionally grasp structured knowledge from knowledge bases and unstructured knowledge from biomedical literature. To bridge this gap, we propos…

    Submitted 14 October, 2023; v1 submitted 17 April, 2023; originally announced May 2023.

    Comments: 10 pages, 6 figures

  29. arXiv:2304.11281  [pdf, other]

    cs.DS

    Euclidean Capacitated Vehicle Routing in Random Setting: A $1.55$-Approximation Algorithm

    Authors: Zipei Nie, Hang Zhou

    Abstract: We study the unit-demand capacitated vehicle routing problem in the random setting of the Euclidean plane. The objective is to visit $n$ random terminals in a square using a set of tours of minimum total length, such that each tour visits the depot and at most $k$ terminals. We design an elegant algorithm combining the classical sweep heuristic and Arora's framework for the Euclidean traveling s…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 21 pages, 0 figures

  30. arXiv:2304.08502  [pdf, other]

    cs.LG cs.AI

    CyFormer: Accurate State-of-Health Prediction of Lithium-Ion Batteries via Cyclic Attention

    Authors: Zhiqiang Nie, Jiankun Zhao, Qicheng Li, Yong Qin

    Abstract: Predicting the State-of-Health (SoH) of lithium-ion batteries is a fundamental task of battery management systems on electric vehicles. It aims at estimating future SoH based on historical aging data. Most existing deep learning methods rely on filter-based feature extractors (e.g., CNN or Kalman filters) and recurrent time sequence models. Though efficient, they generally ignore cyclic features a…

    Submitted 16 April, 2023; originally announced April 2023.

  31. arXiv:2303.10552  [pdf, other]

    cs.CV

    Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction

    Authors: Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Jirui Yuan, Ping Luo, Zaiqing Nie

    Abstract: Cooperatively utilizing both ego-vehicle and infrastructure sensor data can significantly enhance autonomous driving perception abilities. However, temporal asynchrony and limited wireless communication in traffic environments can lead to fusion misalignment and impact detection performance. This paper proposes Feature Flow Net (FFNet), a novel cooperative detection framework that uses a feature f…

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Under Review

  32. arXiv:2303.05730  [pdf, other]

    cs.CV

    IC classifier: a classifier for 3D industrial components based on geometric prior using GNN

    Authors: Zipeng Lin, Zhenguo Nie

    Abstract: In this paper, we propose an approach to address the problem of classifying 3D industrial components by introducing a novel framework named IC-classifier (Industrial Component classifier). Our framework is designed to focus on the object's local and global structures, emphasizing the former by incorporating specific local features for embedding the model. By utilizing graphical neural networks and…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 15 pages including citations, 3 pages of figures

  33. arXiv:2303.02967  [pdf, other]

    eess.IV cs.CV

    Automated Peripancreatic Vessel Segmentation and Labeling Based on Iterative Trunk Growth and Weakly Supervised Mechanism

    Authors: Liwen Zou, Zhenghua Cai, Liang Mao, Ziwei Nie, Yudong Qiu, ** Yang

    Abstract: Peripancreatic vessel segmentation and anatomical labeling play extremely important roles to assist the early diagnosis, surgery planning and prognosis for patients with pancreatic tumors. However, most current techniques cannot achieve satisfactory segmentation performance for peripancreatic veins and usually make predictions with poor integrity and connectivity. Besides, unsupervised labeling al…

    Submitted 6 March, 2023; originally announced March 2023.

  34. arXiv:2301.05704  [pdf, ps, other]

    math.CO cs.DS

    On a conjecture of Knuth about forward and back arcs

    Authors: Zipei Nie

    Abstract: Following Janson's method, we prove a conjecture of Knuth: the numbers of forward and back arcs for the depth-first search (DFS) in a digraph with a geometric outdegree distribution have the same distribution.

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 6 pages, 0 figures

  35. arXiv:2207.06345  [pdf, other]

    cs.CV

    You Only Align Once: Bidirectional Interaction for Spatial-Temporal Video Super-Resolution

    Authors: Mengshun Hu, Kui Jiang, Zhixiang Nie, Zheng Wang

    Abstract: Spatial-Temporal Video Super-Resolution (ST-VSR) technology generates high-quality videos with higher resolution and higher frame rates. Existing advanced methods accomplish ST-VSR tasks through the association of Spatial and Temporal video super-resolution (S-VSR and T-VSR). These methods require two alignments and fusions in S-VSR and T-VSR, which is obviously redundant and fails to sufficiently…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: ACMMM 2022

  36. arXiv:2204.05575  [pdf, other]

    cs.CV cs.AI

    DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection

    Authors: Haibao Yu, Yizhen Luo, Mao Shu, Yiyi Huo, Zebang Yang, Yifeng Shi, Zhenglong Guo, Hanyu Li, Xing Hu, Jirui Yuan, Zaiqing Nie

    Abstract: Autonomous driving faces great safety challenges for a lack of global perspective and the limitation of long-range perception capabilities. It has been widely agreed that vehicle-infrastructure cooperation is required to achieve Level 5 autonomy. However, there is still NO dataset from real scenarios available for computer vision researchers to work on vehicle-infrastructure cooperation-related pr…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: CVPR2022

  37. arXiv:2111.05553  [pdf, other]

    math.PR cs.DS

    Matrix anti-concentration inequalities with applications

    Authors: Zipei Nie

    Abstract: We provide a polynomial lower bound on the minimum singular value of an $m\times m$ random matrix $M$ with jointly Gaussian entries, under a polynomial bound on the matrix norm and a global small-ball probability bound $$\inf_{x,y\in S^{m-1}}\mathbb{P}\left(\left|x^* M y\right|>m^{-O(1)}\right)\ge \frac{1}{2}.$$ With the additional assumption that $M$ is self-adjoint, the global small-ball probabi…

    Submitted 2 December, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: 42 pages, 1 figure, more references for better introduction, pseudocode for simplified block Krylov space algorithm added

  38. arXiv:2108.13239  [pdf, ps, other]

    cs.LG cs.AI

    Adaptive perturbation adversarial training: based on reinforcement learning

    Authors: Zhishen Nie, Ying Lin, Sp Ren, Lan Zhang

    Abstract: Adversarial training has become the primary method to defend against adversarial samples. However, it is hard to practically apply due to many shortcomings. One of the shortcomings of adversarial training is that it will reduce the recognition accuracy of normal samples. Adaptive perturbation adversarial training is proposed to alleviate this problem. It uses marginal adversarial samples that are…

    Submitted 30 August, 2021; originally announced August 2021.

  39. arXiv:2106.04224  [pdf, ps, other]

    cs.DS

    Improved Online Correlated Selection

    Authors: Ruiquan Gao, Zhongtian He, Zhiyi Huang, Zipei Nie, Bijun Yuan, Yan Zhong

    Abstract: This paper studies the online correlated selection (OCS) problem. It was introduced by Fahrbach, Huang, Tao, and Zadimoghaddam (2020) to obtain the first edge-weighted online bipartite matching algorithm that breaks the $0.5$ barrier. Suppose that we receive a pair of elements in each round and immediately select one of them. Can we select with negative correlation to be more effective than indepe…

    Submitted 15 December, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Compared to the first version, this version adds a discussion on two concurrent works on the same topic, gives a more accurate description of previous results, and improves the presentation based on the feedback from anonymous reviewers. The conference version appears in FOCS 2021

  40. Anabranch Network for Camouflaged Object Segmentation

    Authors: Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, Akihiro Sugimoto

    Abstract: Camouflaged objects attempt to conceal their texture into the background and discriminating them from the background is hard even for human beings. The main objective of this paper is to explore the camouflaged object segmentation problem, namely, segmenting the camouflaged object(s) for a given image. This problem has not been well studied in spite of a wide range of potential applications includ…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: Published in CVIU 2019. Project page: https://sites.google.com/view/ltnghia/research/camo

    Journal ref: Computer Vision and Image Understanding 184 (2019) 45-56

  41. arXiv:2104.09276  [pdf, other]

    cs.CE cs.LG

    SuperMeshing: A New Deep Learning Architecture for Increasing the Mesh Density of Metal Forming Stress Field with Attention Mechanism and Perceptual Features

    Authors: Qingfeng Xu, Zhenguo Nie, Handing Xu, Haosu Zhou, Xinjun Liu

    Abstract: In stress field analysis, the finite element analysis is a crucial approach, in which the mesh-density has a significant impact on the results. High mesh density usually contributes authentic to simulation results but costs more computing resources, leading to curtailing efficiency during the design process. To eliminate this drawback, we propose a new data-driven mesh-density boost model named Su…

    Submitted 12 March, 2021; originally announced April 2021.

    Comments: 15 pages, 12 figures

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary)

    ACM Class: F.2.2; I.2.7

  42. arXiv:2102.10284  [pdf, other]

    cs.LG cs.AI

    Artificial Intelligence Enhanced Rapid and Efficient Diagnosis of Mycoplasma Pneumoniae Pneumonia in Children Patients

    Authors: Chenglin Pan, Kuan Yan, Xiao Liu, Yanjie Chen, Yanyan Luo, Xiaoming Li, Zhenguo Nie, Xinjun Liu

    Abstract: Artificial intelligence methods have been increasingly turning into a potentially powerful tool in the diagnosis and management of diseases. In this study, we utilized logistic regression (LR), decision tree (DT), gradient boosted decision tree (GBDT), support vector machine (SVM), and multilayer perceptron (MLP) as machine learning models to rapidly diagnose the mycoplasma pneumoniae pneumonia (M…

    Submitted 20 February, 2021; originally announced February 2021.

    Comments: 23 pages

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary) ACM Class: F.2.2; I.2.7

  43. arXiv:2012.15136  [pdf, other]

    eess.IV cs.CV

    Exploring Large Context for Cerebral Aneurysm Segmentation

    Authors: Jun Ma, Ziwei Nie

    Abstract: Automated segmentation of aneurysms from 3D CT is important for the diagnosis, monitoring, and treatment planning of the cerebral aneurysm disease. This short paper briefly presents the main technique details of the aneurysm segmentation method in the MICCAI 2020 CADA challenge. The main contribution is that we configure the 3D U-Net with a large patch size, which can obtain the large context. Our…

    Submitted 30 December, 2020; originally announced December 2020.

    Comments: 2nd place in MICCAI 2020 CADA challenge

  44. arXiv:2012.09610  [pdf]

    cs.LG

    Validate and Enable Machine Learning in Industrial AI

    Authors: Hongbo Zou, Guang**g Chen, Pengtao Xie, Sean Chen, Yongtian He, Hochih Huang, Zheng Nie, Hongbao Zhang, Tristan Bala, Kazi Tulip, Yuqi Wang, Shenlin Qin, Eric P. Xing

    Abstract: Industrial Artificial Intelligence (Industrial AI) is an emerging concept which refers to the application of artificial intelligence to industry. Industrial AI promises more efficient future industrial control systems. However, manufacturers and solution partners need to understand how to implement and integrate an AI model into the existing industrial control system. A well-trained machine learni…

    Submitted 30 October, 2020; originally announced December 2020.

    Comments: 9 pages, 8 figures

  45. arXiv:2012.01675  [pdf, other]

    cs.CL

    Federated Learning for Personalized Humor Recognition

    Authors: Xu Guo, Han Yu, Boyang Li, Hao Wang, Pengwei Xing, Siwei Feng, Zaiqing Nie, Chunyan Miao

    Abstract: Computational understanding of humor is an important topic under creative language understanding and modeling. It can play a key role in complex human-AI interactions. The challenge here is that human perception of humorous content is highly subjective. The same joke may receive different funniness ratings from different readers. This makes it highly challenging for humor recognition models to ach…

    Submitted 6 April, 2022; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: 18 pages

  46. arXiv:2010.06694  [pdf, other]

    cs.HC

    Easy, Reproducible and Quality-Controlled Data Collection with Crowdaq

    Authors: Qiang Ning, Hao Wu, Pradeep Dasigi, Dheeru Dua, Matt Gardner, Robert L. Logan IV, Ana Marasovic, Zhen Nie

    Abstract: High-quality and large-scale data are key to success for AI systems. However, large-scale data annotation efforts are often confronted with a set of common challenges: (1) designing a user-friendly annotation interface; (2) training enough annotators efficiently; and (3) reproducibility. To address these problems, we introduce Crowdaq, an open-source platform that standardizes the data collection…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to the demo track of EMNLP 2020

  47. arXiv:2010.05522  [pdf, other]

    cs.CL

    Pre-trained Language Model Based Active Learning for Sentence Matching

    Authors: Guirong Bai, Shizhu He, Kang Liu, Jun Zhao, Zaiqing Nie

    Abstract: Active learning is able to significantly reduce the annotation cost for data-driven techniques. However, previous active learning approaches for natural language processing mainly depend on the entropy-based uncertainty criterion, and ignore the characteristics of natural language. In this paper, we propose a pre-trained language model based active learning approach for sentence matching. Differin…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Accepted by COLING 2020

  48. arXiv:2006.11376  [pdf, other]

    cs.CV cs.LG eess.IV

    StressGAN: A Generative Deep Learning Model for 2D Stress Distribution Prediction

    Authors: Haoliang Jiang, Zhenguo Nie, Roselyn Yeo, Amir Barati Farimani, Levent Burak Kara

    Abstract: Using deep learning to analyze mechanical stress distributions has been gaining interest with the demand for fast stress analysis methods. Deep learning approaches have achieved excellent outcomes when utilized to speed up stress computation and learn the physics without prior knowledge of underlying equations. However, most studies restrict the variation of geometry or boundary conditions, making…

    Submitted 29 May, 2020; originally announced June 2020.

  49. arXiv:2004.12537  [pdf, other]

    eess.IV cs.CV cs.LG

    Towards Data-Efficient Learning: A Benchmark for COVID-19 CT Lung and Infection Segmentation

    Authors: Jun Ma, Yixin Wang, Xingle An, Cheng Ge, Ziqi Yu, Jianan Chen, Qiongjie Zhu, Guoqiang Dong, Jian He, Zhiqiang He, Yuntao Zhu, Ziwei Nie, ** Yang

    Abstract: Purpose: Accurate segmentation of lung and infection in COVID-19 CT scans plays an important role in the quantitative management of patients. Most of the existing studies are based on large and private annotated datasets that are impractical to obtain from a single institution, especially when radiologists are busy fighting the coronavirus disease. Furthermore, it is hard to compare current COVID-…

    Submitted 3 December, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in Medical Physics

  50. arXiv:2004.11588  [pdf, other]

    cs.IR

    Learning Hierarchical Review Graph Representations for Recommendation

    Authors: Yong Liu, Susen Yang, Yinan Zhang, Chunyan Miao, Zaiqing Nie, Juyong Zhang

    Abstract: The user review data have been demonstrated to be effective in solving different recommendation problems. Previous review-based recommendation methods usually employ sophisticated compositional models, such as Recurrent Neural Networks (RNN) and Convolutional Neural Networks (CNN), to learn semantic representations from the review data for recommendation. However, these methods mainly capture the…

    Submitted 24 January, 2021; v1 submitted 24 April, 2020; originally announced April 2020.