Skip to main content

Showing 1–50 of 90 results for author: Lian, X

.
  1. arXiv:2406.18820  [pdf, other

    cs.DC cs.LG

    Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training

    Authors: Xinyu Lian, Sam Ade Jacobs, Lev Kurilenko, Masahiro Tanaka, Stas Bekman, Olatunji Ruwase, Minjia Zhang

    Abstract: Existing checkpointing approaches seem ill-suited for distributed training even though hardware limitations make model parallelism, i.e., sharding model state across multiple accelerators, a requirement for model scaling. Consolidating distributed model state into a single checkpoint unacceptably slows down training, and is impractical at extreme scales. Distributed checkpoints, in contrast, are t… ▽ More

    Submitted 27 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.04593  [pdf, other

    physics.chem-ph q-bio.BM

    SynAsk: Unleashing the Power of Large Language Models in Organic Synthesis

    Authors: Chonghuan Zhang, Qianghua Lin, Biwei Zhu, Haopeng Yang, Xiao Lian, Hao Deng, Jiajun Zheng, Kuangbiao Liao

    Abstract: The field of natural language processing (NLP) has witnessed a transformative shift with the emergence of large language models (LLMs), revolutionizing various language tasks and applications, and the integration of LLM into specialized domains enhances their capabilities for domain-specific applications. Notably, NLP has made significant strides in organic chemistry, particularly in predicting sy… ▽ More

    Submitted 13 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.19623  [pdf, other

    cs.SE

    A Novel Approach for Automated Design Information Mining from Issue Logs

    Authors: Jiuang Zhao, Zitian Yang, Li Zhang, Xiaoli Lian, Donghao Yang

    Abstract: Software architectures are usually meticulously designed to address multiple quality concerns and support long-term maintenance. However, due to the imbalance between the cost and value for developers to document design rationales (i.e., the design alternatives and the underlying arguments for making or rejecting decisions), these rationales are often obsolete or even missing. The lack of design k… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.01510  [pdf, other

    cs.SI cs.DB

    Reverse Influential Community Search Over Social Networks (Technical Report)

    Authors: Qi Wen, Nan Zhang, Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: As an important fundamental task of numerous real-world applications such as social network analysis and online advertising/marketing, several prior works studied influential community search, which retrieves a community with high structural cohesiveness and maximum influences on other users in social networks. However, previous works usually considered the influences of the community on arbitrary… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  5. arXiv:2403.03835  [pdf, other

    cs.LG cs.AI cs.IR

    Cobweb: An Incremental and Hierarchical Model of Human-Like Category Learning

    Authors: Xin Lian, Sashank Varma, Christopher J. MacLellan

    Abstract: Cobweb, a human-like category learning system, differs from most cognitive science models in incrementally constructing hierarchically organized tree-like structures guided by the category utility measure. Prior studies have shown that Cobweb can capture psychological effects such as basic-level, typicality, and fan effects. However, a broader evaluation of Cobweb as a model of human categorizatio… ▽ More

    Submitted 8 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by CogSci-24

  6. arXiv:2402.16933  [pdf, other

    cs.LG cs.AI cs.CV cs.IR

    Avoiding Catastrophic Forgetting in Visual Classification Using Human Concept Formation

    Authors: Nicki Barari, Xin Lian, Christopher J. MacLellan

    Abstract: Deep neural networks have excelled in machine learning, particularly in vision tasks, however, they often suffer from catastrophic forgetting when learning new tasks sequentially. In this work, we propose Cobweb4V, a novel visual classification approach that builds on Cobweb, a human like learning system that is inspired by the way humans incrementally learn new concepts over time. In this researc… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  7. arXiv:2311.14987  [pdf

    physics.geo-ph astro-ph.IM

    Reconstruction of a Long-term spatially Contiguous Solar-Induced Fluorescence (LCSIF) over 1982-2022

    Authors: Jianing Fang, Xu Lian, Youngryel Ryu, Sungchan Jeong, Chongya Jiang, Pierre Gentine

    Abstract: Satellite-observed solar-induced chlorophyll fluorescence (SIF) is a powerful proxy for diagnosing the photosynthetic characteristics of terrestrial ecosystems. Despite the increasing spatial and temporal resolutions of these satellite retrievals, records of SIF are primarily limited to the recent decade, impeding their application in detecting long-term dynamics of ecosystem function and structur… ▽ More

    Submitted 19 June, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

  8. arXiv:2311.13162  [pdf, other

    cs.SI cs.DB

    Top-L Most Influential Community Detection Over Social Networks (Technical Report)

    Authors: Nan Zhang, Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: In many real-world applications such as social network analysis and online marketing/advertising, the community detection is a fundamental task to identify communities (subgraphs) in social networks with high structural cohesiveness. While previous works focus on detecting communities alone, they do not consider the collective influences of users in these communities on other user nodes in social… ▽ More

    Submitted 1 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  9. arXiv:2311.02914  [pdf, other

    math.CO

    Tight upper bound on the clique size in the square of 2-degenerate graphs

    Authors: Seog-** Kim, Xiaopan Lian

    Abstract: The {\em square} of a graph $G$, denoted $G^2$, has the same vertex set as $G$ and has an edge between two vertices if the distance between them in $G$ is at most $2$. In general, $Δ(G) + 1 \leq χ(G^2) \leq Δ(G)^2 +1$ for every graph $G$. Charpentier [1] asked whether $χ(G^2) \leq 2 Δ(G)$ if $mad(G) < 4$. But Hocquard, Kim, and Pierron [6] answered his question negatively. For every even value of… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 32 pages

  10. arXiv:2310.09690  [pdf, other

    cs.SE cs.AI cs.OS

    Configuration Validation with Large Language Models

    Authors: Xinyu Lian, Yinfang Chen, Runxiang Cheng, Jie Huang, Parth Thakkar, Minjia Zhang, Tianyin Xu

    Abstract: Misconfigurations are major causes of software failures. Existing practices rely on developer-written rules or test cases to validate configurations, which are expensive. Machine learning (ML) for configuration validation is considered a promising direction, but has been facing challenges such as the need of large-scale field data and system-specific models. Recent advances in Large Language Model… ▽ More

    Submitted 2 April, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

  11. arXiv:2309.15641  [pdf, other

    cs.DB

    Efficient Exact Subgraph Matching via GNN-based Path Dominance Embedding (Technical Report)

    Authors: Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: The classic problem of exact subgraph matching returns those subgraphs in a large-scale data graph that are isomorphic to a given query graph, which has gained increasing importance in many real-world applications such as social network analysis, knowledge graph discovery in the Semantic Web, bibliographical network mining, and so on. In this paper, we propose a novel and effective graph neural ne… ▽ More

    Submitted 15 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  12. arXiv:2307.05897  [pdf, ps, other

    math.CO

    On a variant of dichromatic number for digraphs with prescribed sets of arcs

    Authors: O-joung Kwon, Xiaopan Lian

    Abstract: In this paper, we consider a variant of dichromatic number on digraphs with prescribed sets of arcs. Let $D$ be a digraph and let $Z_1, Z_2$ be two sets of arcs in $D$. For a subdigraph $H$ of $D$, let $A(H)$ denote the set of all arcs of $H$. Let $μ(D, Z_1, Z_2)$ be the minimum number of parts in a vertex partition $\mathcal{P}$ of $D$ such that for every $X\in \mathcal{P}$, the subdigraph of… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 13 pages, 2 figures

  13. arXiv:2305.05194  [pdf, other

    math.CO

    The square of every subcubic planar graph of girth at least 6 is 7-choosable

    Authors: Seog-** Kim, Xiaopan Lian

    Abstract: The square of a graph $G$, denoted $G^2$, has the same vertex set as $G$ and has an edge between two vertices if the distance between them in $G$ is at most $2$. Thomassen (2018) and Hartke, Jahanbekam and Thomas (2016) proved that $χ(G^2) \leq 7$ if $G$ is a subcubic planar graph. A natural question is whether $χ_{\ell}(G^2) \leq 7$ or not if $G$ is a subcubic planar graph. Cranston and Kim (2008… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 9 pages, 1 figure

  14. arXiv:2305.01472  [pdf, other

    math.CO

    Group vertex-arboricity of group-labelled graphs

    Authors: O-joung Kwon, Xiaopan Lian

    Abstract: We introduce the vertex-arboricity of group-labelled graphs. For an abelian group $Γ$, a $Γ$-labelled graph is a graph whose edges are labelled by elements of $Γ$. For an abelian group $Γ$ and $A\subseteq Γ$, the $(Γ, A)$-vertex-arboricity of a $Γ$-labelled graph is the minimum integer $k$ such that its vertex set can be partitioned into $k$ parts where each part induces a subgraph having no cycle… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 15 pages, 1 figure

  15. arXiv:2304.11171  [pdf, other

    cs.LG cs.AI

    Granular-ball computing: an efficient, robust, and interpretable adaptive multi-granularity representation and computation method

    Authors: Shuyin Xia, Guoyin Wang, Xinbo Gao, Xiaoyu Lian

    Abstract: Human cognition operates on a "Global-first" cognitive mechanism, prioritizing information processing based on coarse-grained details. This mechanism inherently possesses an adaptive multi-granularity description capacity, resulting in computational traits such as efficiency, robustness, and interpretability. The analysis pattern reliance on the finest granularity and single-granularity makes most… ▽ More

    Submitted 18 January, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

  16. arXiv:2301.12850  [pdf, other

    cs.CL cs.AI

    GE-Blender: Graph-Based Knowledge Enhancement for Blender

    Authors: Xiaolei Lian, Xunzhu Tang, Yue Wang

    Abstract: Although the great success of open-domain dialogue generation, unseen entities can have a large impact on the dialogue generation task. It leads to performance degradation of the model in the dialog generation. Previous researches used retrieved knowledge of seen entities as the auxiliary data to enhance the representation of the model. Nevertheless, logical explanation of unseen entities remains… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  17. arXiv:2212.12948  [pdf, other

    cs.CV cs.AI

    Human Health Indicator Prediction from Gait Video

    Authors: Ziqing Li, Xuexin Yu, Xiaocong Lian, Yifeng Wang, Xiangyang Ji

    Abstract: Body Mass Index (BMI), age, height and weight are important indicators of human health conditions, which can provide useful information for plenty of practical purposes, such as health care, monitoring and re-identification. Most existing methods of health indicator prediction mainly use front-view body or face images. These inputs are hard to be obtained in daily life and often lead to the lack o… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

  18. arXiv:2211.16716  [pdf, other

    cs.SE

    Automated Generating Natural Language Requirements based on Domain Ontology

    Authors: Ziyan Zhao, Li Zhang, Xiaoyun Gao, Xiaoli Lian, Heyang Lv, Lin Shi

    Abstract: Software requirements specification is undoubtedly critical for the whole software life-cycle. Nowadays, writing software requirements specifications primarily depends on human work. Although massive studies have been proposed to fasten the process via proposing advanced elicitation and analysis techniques, it is still a time-consuming and error-prone task that needs to take domain knowledge and b… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  19. arXiv:2210.17479  [pdf, other

    cs.DB

    kt-Safety: Graph Release via k-Anonymity and t-Closeness (Technical Report)

    Authors: Weilong Ren, Kambiz Ghazinour, Xiang Lian

    Abstract: In a wide spectrum of real-world applications, it is very important to analyze and mine graph data such as social networks, communication networks, citation networks, and so on. However, the release of such graph data often raises privacy issue, and the graph privacy preservation has recently drawn much attention from the database community. While prior works on graph privacy preservation mainly f… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: 22 pages, 31 figures, the technical report of a TKDE paper entitled "kt-Safety: Graph Release via k-Anonymity and t-Closeness"

  20. arXiv:2210.11675  [pdf, other

    cs.LG cs.AI

    Granular-Ball Fuzzy Set and Its Implementation in SVM

    Authors: Shuyin Xia, Xiaoyu Lian, Guoyin Wang, Xinbo Gao, Yabin Shao

    Abstract: Most existing fuzzy set methods use points as their input, which is the finest granularity from the perspective of granular computing. Consequently, these methods are neither efficient nor robust to label noise. Therefore, we propose a frame-work called granular-ball fuzzy set by introducing granular-ball computing into fuzzy set. The computational framework is based on the granular-balls input ra… ▽ More

    Submitted 26 November, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

  21. arXiv:2210.06247  [pdf, ps, other

    math.CO

    Some Mader-perfect graph classes

    Authors: Hui Lei, Siyan Li, Xiaopan Lian, Susu Wang

    Abstract: The dichromatic number of $D$, denoted by $\overrightarrowχ(D)$, is the smallest integer $k$ such that $D$ admits an acyclic $k$-coloring. We use $mader_{\overrightarrowχ}(F)$ to denote the smallest integer $k$ such that if $\overrightarrowχ(D)\ge k$, then $D$ contains a subdivision of $F$. A digraph $F$ is called Mader-perfect if for every subdigraph $F'$ of $F$,… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 12 pages, 2 figures

  22. arXiv:2210.03120  [pdf, other

    cs.LG cs.AI

    GBSVM: Granular-ball Support Vector Machine

    Authors: Shuyin Xia, Xiaoyu Lian, Guoyin Wang, Xinbo Gao, Jiancu Chen, Xiaoli Peng

    Abstract: GBSVM (Granular-ball Support Vector Machine) is a significant attempt to construct a classifier using the coarse-to-fine granularity of a granular-ball as input, rather than a single data point. It is the first classifier whose input contains no points. However, the existing model has some errors, and its dual model has not been derived. As a result, the current algorithm cannot be implemented or… ▽ More

    Submitted 11 February, 2024; v1 submitted 6 October, 2022; originally announced October 2022.

  23. Boosting the Discriminant Power of Naive Bayes

    Authors: Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang

    Abstract: Naive Bayes has been widely used in many applications because of its simplicity and ability in handling both numerical data and categorical data. However, lack of modeling of correlations between features limits its performance. In addition, noise and outliers in the real-world dataset also greatly degrade the classification performance. In this paper, we propose a feature augmentation method empl… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted by 2022 International Conference on Pattern Recognition

  24. arXiv:2208.12986  [pdf, other

    cs.RO cs.CV

    6D Robotic Assembly Based on RGB-only Object Pose Estimation

    Authors: Bowen Fu, Sek Kun Leong, Xiaocong Lian, Xiangyang Ji

    Abstract: Vision-based robotic assembly is a crucial yet challenging task as the interaction with multiple objects requires high levels of precision. In this paper, we propose an integrated 6D robotic system to perceive, grasp, manipulate and assemble blocks with tight tolerances. Aiming to provide an off-the-shelf RGB-only solution, our system is built upon a monocular 6D object pose estimation network tra… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: Accepted by IROS 2022

  25. arXiv:2208.06757  [pdf, other

    cs.SE

    A Preliminary Study on the Potential Usefulness of Open Domain Model for Missing Software Requirements Recommendation

    Authors: Ziyan Zhao, Li Zhang, Xiaoli Lian

    Abstract: Completeness is one of the most important attributes of software requirement specifications. Unfortunately, incompleteness is meanwhile one of the most difficult problems to detect. Some approaches have been proposed to detect missing requirements based on the requirement-oriented domain model. However, this kind of models are lacking for lots of domains. Fortunately, the domain models constructed… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

  26. arXiv:2206.05778  [pdf, other

    cs.DB cs.LG

    Learning-Based Data Storage [Vision] (Technical Report)

    Authors: Xiang Lian, Xiaofei Zhang

    Abstract: Deep neural network (DNN) and its variants have been extensively used for a wide spectrum of real applications such as image classification, face/speech recognition, fraud detection, and so on. In addition to many important machine learning tasks, as artificial networks emulating the way brain cells function, DNNs also show the capability of storing non-linear relationships between input and outpu… ▽ More

    Submitted 22 January, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: 14 pages, 16 figures

    ACM Class: E.2; H.2.1; I.2.0; I.2.11

  27. arXiv:2206.02281  [pdf, other

    cs.CV

    E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles

    Authors: Zhenyu Hu, Zhenyu Wu, Pengcheng Pi, Yunhe Xue, Jiayi Shen, Jianchao Tan, Xiangru Lian, Zhangyang Wang, Ji Liu

    Abstract: Unmanned Aerial Vehicles (UAVs) based video text spotting has been extensively used in civil and military domains. UAV's limited battery capacity motivates us to develop an energy-efficient video text spotting solution. In this paper, we first revisit RCNN's crop & resize training strategy and empirically find that it outperforms aligned RoI sampling on a real-world video text dataset captured by… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  28. arXiv:2206.02114  [pdf, other

    cs.CL

    Speech Detection Task Against Asian Hate: BERT the Central, While Data-Centric Studies the Crucial

    Authors: Xin Lian

    Abstract: With the COVID-19 pandemic continuing, hatred against Asians is intensifying in countries outside Asia, especially among the Chinese. There is an urgent need to detect and prevent hate speech towards Asians effectively. In this work, we first create COVID-HATE-2022, an annotated dataset including 2,025 annotated tweets fetched in early February 2022, which are labeled based on specific criteria, a… ▽ More

    Submitted 21 August, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  29. arXiv:2205.11717  [pdf

    physics.optics physics.app-ph

    Realization of ultra-broadband IR up-conversion imaging

    Authors: X. H. Li, P. Bai, S. H. Huang, X. Q. Bai, W. J. Song, X. R. Lian, C. Hu, Z. W. Shi, W. Z. Shen, Y. H. Zhang, Z. L. Fu, D. X. Shao, Z. Y. Tan, J. C. Cao, C. Tan, G. Y. Xu

    Abstract: Ultra-broadband imaging devices with high performance are in great demand for a variety of technological applications, including imaging, remote sensing, and communications. An ultra-broadband up-converter is realized based on a p-GaAs homojunction interfacial workfunction internal photoemission (HIWIP) detector-light emitting diode (LED) device. The device demonstrates an ultra-broad response ran… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 23 pages, 5 figures

  30. arXiv:2204.13224  [pdf, other

    cs.DB

    Top-k Community Similarity Search Over Large-Scale Road Networks (Technical Report)

    Authors: Niranjan Rai, Xiang Lian

    Abstract: With the urbanization and development of infrastructure, the community search over road networks has become increasingly important in many real applications such as urban/city planning, social study on local communities, and community recommendations by real estate agencies. In this paper, we propose a novel problem, namely top-k community similarity search (Top-kCS2) over road networks, which eff… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  31. Research Status of Deep Learning Methods for Rumor Detection

    Authors: Li Tan, Ge Wang, Feiyang Jia, Xiaofeng Lian

    Abstract: To manage the rumors in social media to reduce the harm of rumors in society. Many studies used methods of deep learning to detect rumors in open networks. To comprehensively sort out the research status of rumor detection from multiple perspectives, this paper analyzes the highly focused work from three perspectives: Feature Selection, Model Structure, and Research Methods. From the perspective o… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted by MTAP

  32. arXiv:2204.05538  [pdf, other

    cs.CV

    NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night

    Authors: Xueqing Deng, Peng Wang, Xiaochen Lian, Shawn Newsam

    Abstract: The semantic segmentation of nighttime scenes is a challenging problem that is key to impactful applications like self-driving cars. Yet, it has received little attention compared to its daytime counterpart. In this paper, we propose NightLab, a novel nighttime segmentation framework that leverages multiple deep learning models imbued with night-aware features to yield State-of-The-Art (SoTA) perf… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 8pages, 6 figures, accept at CVPR 2022

  33. arXiv:2202.08427  [pdf, ps, other

    math.CO

    Weak-odd chromatic index of special digraph classes

    Authors: Ruijuan Gu, Hui Lei, Xiaopan Lian, Zhenyu Taoqiu

    Abstract: Give a digraph $D=(V(D),A(D))$, let $\partial^+_D(v)=\{vw|w\in N^+_D(v)\}$ and $\partial^-_D(v)=\{uv|u\in N^-_D(v)\}$ be semi-cuts of $v$. A map** $\varphi:A(D)\rightarrow [k]$ is called a weak-odd $k$-edge coloring of $D$ if it satisfies the condition: for each $v\in V(D)$, there is at least one color with an odd number of occurrences on each non-empty semi-cut of $v$. We call the minimum integ… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 14 pages, 1 figures

  34. arXiv:2201.08616  [pdf, other

    cs.GT

    Diffusion Multi-unit Auctions with Diminishing Marginal Utility Buyers

    Authors: Haolin Liu, Xinyuan Lian, Dengji Zhao

    Abstract: We consider an auction design problem where a seller sells multiple homogeneous items to a set of connected buyers. Each buyer only knows the buyers she directly connects with and has a diminishing marginal utility valuation for the items. The seller initially only connects to some buyers who can be directly invited to the sale by the seller. Our goal is to design an auction to incentivize the buy… ▽ More

    Submitted 26 February, 2023; v1 submitted 21 January, 2022; originally announced January 2022.

  35. arXiv:2112.13387  [pdf, ps, other

    math.CO

    On critical graphs for the chromatic edge-stability number

    Authors: Hui Lei, Xiaopan Lian, Xianhao Meng, Yongtang Shi, Yiqiao Wang

    Abstract: The {\em chromatic edge-stability number} $es_χ(G)$ of a graph $G$ is the minimum number of edges whose removal results in a spanning subgraph with the chromatic number smaller than that of $G$. A graph $G$ is called {\em $(3,2)$-critical} if $χ(G)=3$, $es_χ(G)=2$ and for any edge $e\in E(G)$, $es_χ(G-e)<es_χ(G)$. In this paper, we characterize $(3,2)$-critical graphs which contain at least five o… ▽ More

    Submitted 26 December, 2021; originally announced December 2021.

    Comments: 12 pages, 2 figures

    MSC Class: 05C15

  36. arXiv:2111.05897  [pdf, other

    cs.LG cs.DC

    Persia: An Open, Hybrid System Scaling Deep Learning-based Recommenders up to 100 Trillion Parameters

    Authors: Xiangru Lian, Binhang Yuan, Xuefeng Zhu, Yulong Wang, Yongjun He, Honghuan Wu, Lei Sun, Haodong Lyu, Chengjun Liu, Xing Dong, Yiqiao Liao, Mingnan Luo, Congfei Zhang, **gru Xie, Haonan Li, Lei Chen, Renjie Huang, Jianying Lin, Chengchun Shu, Xuezhong Qiu, Zhishan Liu, Dongying Kong, Lei Yuan, Hai Yu, Sen Yang , et al. (2 additional authors not shown)

    Abstract: Deep learning based models have dominated the current landscape of production recommender systems. Furthermore, recent years have witnessed an exponential growth of the model scale--from Google's 2016 model with 1 billion parameters to the latest Facebook's model with 12 trillion parameters. Significant quality boost has come with each jump of the model capacity, which makes us believe the era of… ▽ More

    Submitted 23 November, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

  37. arXiv:2110.07218  [pdf, other

    physics.optics physics.bio-ph

    Deep-3D Microscope: 3D volumetric microscopy of thick scattering samples using a wide-field microscope and machine learning

    Authors: Bowen Li, Shiyu Tan, Jiuyang Dong, Xiaocong Lian, Yongbing Zhang, Xiangyang Ji, Ashok Veeraraghavan

    Abstract: Confocal microscopy is the standard approach for obtaining volumetric images of a sample with high axial and lateral resolution, especially when dealing with scattering samples. Unfortunately, a confocal microscope is quite expensive compared to traditional microscopes. In addition, the point scanning in a confocal leads to slow imaging speed and photobleaching due to the high dose of laser energy… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

  38. arXiv:2110.05827  [pdf, ps, other

    math.CO

    A characterization of 4-$χ_S$-vertex-critical graphs for packing sequences with $s_1 =1$ and $s_2\ge 3$

    Authors: Sandi Klavžar, Hui Lei, Xiaopan Lian, Yongtang Shi

    Abstract: If $S=(s_1,s_2,\ldots)$ is a non-decreasing sequence of positive integers, then the $S$-packing $k$-coloring of a graph $G$ is a map** $c: V(G)\rightarrow[k]$ such that if $c(u)=c(v)=i$ for $u\neq v\in V(G)$, then $d_G(u,v)>s_i$. The $S$-packing chromatic number of $G$ is the smallest integer $k$ such that $G$ admits an $S$-packing $k$-coloring. A graph $G$ is $χ_S$-vertex-critical if… ▽ More

    Submitted 13 May, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: 19 pages, 2 figures

  39. arXiv:2110.02461  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Epitaxial Growth of Ultraflat Bismuthene with Large Topological Band Inversion Enabled by Substrate-Orbital-Filtering Effect

    Authors: Shuo Sun, **g-Yang You, Sisheng Duan, Jian Gou, Yongzheng Luo, Weinan Lin, Xu Lian, Tengyu **, Jiawei Liu, Yuli Huang, Yihe Wang, Andrew T. S. Wee, Yuan ** Feng, Lei Shen, Jia Lin Zhang, **gsheng Chen, Wei Chen

    Abstract: Quantum spin Hall (QSH) systems hold promises of low-power-consuming spintronic devices, yet their practical applications are extremely impeded by the small energy gaps. Fabricating QSH materials with large gaps, especially under the guidance of design principles, is essential for both scientific research and practical applications. Here, we demonstrate that large on-site atomic spin-orbit couplin… ▽ More

    Submitted 18 December, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

  40. arXiv:2107.01499  [pdf, other

    cs.LG cs.DC

    BAGUA: Scaling up Distributed Learning with System Relaxations

    Authors: Shaoduo Gan, Xiangru Lian, Rui Wang, Jianbin Chang, Chengjun Liu, Hongmei Shi, Shengzhuo Zhang, Xianghong Li, Tengxu Sun, Jiawei Jiang, Binhang Yuan, Sen Yang, Ji Liu, Ce Zhang

    Abstract: Recent years have witnessed a growing list of systems for distributed data-parallel training. Existing systems largely fit into two paradigms, i.e., parameter server and MPI-style collective operations. On the algorithmic side, researchers have proposed a wide range of techniques to lower the communication via system relaxations: quantization, decentralization, and communication delay. However, mo… ▽ More

    Submitted 25 November, 2021; v1 submitted 3 July, 2021; originally announced July 2021.

  41. arXiv:2106.06560  [pdf, other

    cs.CV

    HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers

    Authors: Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie **, Zhiwu Lu, ** Luo

    Abstract: High-resolution representations (HR) are essential for dense prediction tasks such as segmentation, detection, and pose estimation. Learning HR representations is typically ignored in previous Neural Architecture Search (NAS) methods that focus on image classification. This work proposes a novel NAS method, called HR-NAS, which is able to find efficient and accurate networks for different tasks, b… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: Accepted by CVPR 2021 (Oral)

  42. arXiv:2106.06135  [pdf, other

    cs.AI cs.LG

    DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning

    Authors: Daochen Zha, **gru Xie, Wenye Ma, Sheng Zhang, Xiangru Lian, Xia Hu, Ji Liu

    Abstract: Games are abstractions of the real world, where artificial agents learn to compete and cooperate with other agents. While significant achievements have been made in various perfect- and imperfect-information games, DouDizhu (a.k.a. Fighting the Landlord), a three-player card game, is still unsolved. DouDizhu is a very challenging domain with competition, collaboration, imperfect information, large… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted by ICML 2021

  43. arXiv:2105.04486  [pdf, other

    cs.DB

    Probabilistic Top-k Dominating Queries in Distributed Uncertain Databases (Technical Report)

    Authors: Niranjan Rai, Xiang Lian

    Abstract: In many real-world applications such as business planning and sensor data monitoring, one important, yet challenging, the task is to rank objects(e.g., products, documents, or spatial objects) based on their ranking scores and efficiently return those objects with the highest scores. In practice, due to the unreliability of data sources, many real-world objects often contain noises and are thus im… ▽ More

    Submitted 12 May, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

  44. arXiv:2103.11886  [pdf, other

    cs.CV

    DeepViT: Towards Deeper Vision Transformer

    Authors: Daquan Zhou, Bingyi Kang, Xiaojie **, Linjie Yang, Xiaochen Lian, Zihang Jiang, Qibin Hou, Jiashi Feng

    Abstract: Vision transformers (ViTs) have been successfully applied in image classification tasks recently. In this paper, we show that, unlike convolution neural networks (CNNs)that can be improved by stacking more convolutional layers, the performance of ViTs saturate fast when scaled to be deeper. More specifically, we empirically observe that such scaling difficulty is caused by the attention collapse i… ▽ More

    Submitted 19 April, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

  45. arXiv:2103.11833  [pdf, other

    cs.CV

    AutoSpace: Neural Architecture Search with Less Human Interference

    Authors: Daquan Zhou, Xiaojie **, Xiaochen Lian, Linjie Yang, Yu**g Xue, Qibin Hou, Jiashi Feng

    Abstract: Current neural architecture search (NAS) algorithms still require expert knowledge and effort to design a search space for network construction. In this paper, we consider automating the search space design to minimize human interference, which however faces two challenges: the explosive complexity of the exploration space and the expensive computation cost to evaluate the quality of different sea… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

  46. arXiv:2103.08720  [pdf, other

    cs.DB

    Online Topic-Aware Entity Resolution Over Incomplete Data Streams (Technical Report)

    Authors: Weilong Ren, Xiang Lian, Kambiz Ghazinour

    Abstract: In many real applications such as the data integration, social network analysis, and the Semantic Web, the entity resolution (ER) is an important and fundamental problem, which identifies and links the same real-world entities from various data sources. While prior works usually consider ER over static and complete data, in practice, application data are usually collected in a streaming fashion, a… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: Technical report of the paper entitled "Online Topic-Aware Entity Resolution Over Incomplete Data Streams", published on SIGMOD 2021

  47. arXiv:2103.02255  [pdf, other

    cs.SE cs.AI

    Automatically detecting the conflicts between software requirements based on finer semantic analysis

    Authors: Weize Guo, Li Zhang, Xiaoli Lian

    Abstract: Context: Conflicts between software requirements bring uncertainties to product development. Some great approaches have been proposed to identify these conflicts. However, they usually require the software requirements represented with specific templates and/or depend on other external source which is often uneasy to build for lots of projects in practice. Objective: We aim to propose an approach… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 17 pages, 2 figures

    MSC Class: 68N30 ACM Class: D.2.1

  48. arXiv:2102.02888  [pdf, other

    cs.LG cs.DC

    1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed

    Authors: Hanlin Tang, Shaoduo Gan, Ammar Ahmad Awan, Samyam Rajbhandari, Conglong Li, Xiangru Lian, Ji Liu, Ce Zhang, Yuxiong He

    Abstract: Scalable training of large models (like BERT and GPT-3) requires careful optimization rooted in model design, architecture, and system capabilities. From a system standpoint, communication has become a major bottleneck, especially on commodity systems with standard TCP interconnects that offer limited network bandwidth. Communication compression is an important technique to reduce training time on… ▽ More

    Submitted 29 June, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: text overlap with arXiv:2008.11343

  49. arXiv:2101.11446  [pdf

    stat.AP cs.IR

    A study on information behavior of scholars for article keywords selection

    Authors: Z. X. Lian

    Abstract: This project takes the factors of keyword selection behavior as the research object. Qualitative analysis methods such as interview and grounded theory were used to construct causal influence path model. Combined with computer simulation technology such as multi-agent simulation experiment method was used to study the factors of keyword selection from two dimensions of individual to group. The res… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: 10 pages

    MSC Class: 62-11

  50. arXiv:2008.11343  [pdf, other

    cs.DC cs.LG stat.ML

    APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm

    Authors: Hanlin Tang, Shaoduo Gan, Samyam Rajbhandari, Xiangru Lian, Ji Liu, Yuxiong He, Ce Zhang

    Abstract: Adam is the important optimization algorithm to guarantee efficiency and accuracy for training many important tasks such as BERT and ImageNet. However, Adam is generally not compatible with information (gradient) compression technology. Therefore, the communication usually becomes the bottleneck for parallelizing Adam. In this paper, we propose a communication efficient {\bf A}DAM {\bf p}reconditi… ▽ More

    Submitted 27 August, 2020; v1 submitted 25 August, 2020; originally announced August 2020.