Showing 1–50 of 62 results for author: Lian, X

  1. arXiv:2406.18820  [pdf, other]

    cs.DC cs.LG

    Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training

    Authors: Xinyu Lian, Sam Ade Jacobs, Lev Kurilenko, Masahiro Tanaka, Stas Bekman, Olatunji Ruwase, Minjia Zhang

    Abstract: Existing checkpointing approaches seem ill-suited for distributed training even though hardware limitations make model parallelism, i.e., sharding model state across multiple accelerators, a requirement for model scaling. Consolidating distributed model state into a single checkpoint unacceptably slows down training, and is impractical at extreme scales. Distributed checkpoints, in contrast, are t…

    Submitted 27 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2405.19623  [pdf, other]

    cs.SE

    A Novel Approach for Automated Design Information Mining from Issue Logs

    Authors: Jiuang Zhao, Zitian Yang, Li Zhang, Xiaoli Lian, Donghao Yang

    Abstract: Software architectures are usually meticulously designed to address multiple quality concerns and support long-term maintenance. However, due to the imbalance between the cost and value for developers to document design rationales (i.e., the design alternatives and the underlying arguments for making or rejecting decisions), these rationales are often obsolete or even missing. The lack of design k…

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.01510  [pdf, other]

    cs.SI cs.DB

    Reverse Influential Community Search Over Social Networks (Technical Report)

    Authors: Qi Wen, Nan Zhang, Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: As an important fundamental task of numerous real-world applications such as social network analysis and online advertising/marketing, several prior works studied influential community search, which retrieves a community with high structural cohesiveness and maximum influences on other users in social networks. However, previous works usually considered the influences of the community on arbitrary…

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2403.03835  [pdf, other]

    cs.LG cs.AI cs.IR

    Cobweb: An Incremental and Hierarchical Model of Human-Like Category Learning

    Authors: Xin Lian, Sashank Varma, Christopher J. MacLellan

    Abstract: Cobweb, a human-like category learning system, differs from most cognitive science models in incrementally constructing hierarchically organized tree-like structures guided by the category utility measure. Prior studies have shown that Cobweb can capture psychological effects such as basic-level, typicality, and fan effects. However, a broader evaluation of Cobweb as a model of human categorizatio…

    Submitted 8 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by CogSci-24

  5. arXiv:2402.16933  [pdf, other]

    cs.LG cs.AI cs.CV cs.IR

    Avoiding Catastrophic Forgetting in Visual Classification Using Human Concept Formation

    Authors: Nicki Barari, Xin Lian, Christopher J. MacLellan

    Abstract: Deep neural networks have excelled in machine learning, particularly in vision tasks; however, they often suffer from catastrophic forgetting when learning new tasks sequentially. In this work, we propose Cobweb4V, a novel visual classification approach that builds on Cobweb, a human-like learning system that is inspired by the way humans incrementally learn new concepts over time. In this researc…

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2311.13162  [pdf, other]

    cs.SI cs.DB

    Top-L Most Influential Community Detection Over Social Networks (Technical Report)

    Authors: Nan Zhang, Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: In many real-world applications such as social network analysis and online marketing/advertising, the community detection is a fundamental task to identify communities (subgraphs) in social networks with high structural cohesiveness. While previous works focus on detecting communities alone, they do not consider the collective influences of users in these communities on other user nodes in social…

    Submitted 1 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  7. arXiv:2310.09690  [pdf, other]

    cs.SE cs.AI cs.OS

    Configuration Validation with Large Language Models

    Authors: Xinyu Lian, Yinfang Chen, Runxiang Cheng, Jie Huang, Parth Thakkar, Minjia Zhang, Tianyin Xu

    Abstract: Misconfigurations are major causes of software failures. Existing practices rely on developer-written rules or test cases to validate configurations, which are expensive. Machine learning (ML) for configuration validation is considered a promising direction, but has been facing challenges such as the need of large-scale field data and system-specific models. Recent advances in Large Language Model…

    Submitted 2 April, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

  8. arXiv:2309.15641  [pdf, other]

    cs.DB

    Efficient Exact Subgraph Matching via GNN-based Path Dominance Embedding (Technical Report)

    Authors: Yutong Ye, Xiang Lian, Mingsong Chen

    Abstract: The classic problem of exact subgraph matching returns those subgraphs in a large-scale data graph that are isomorphic to a given query graph, which has gained increasing importance in many real-world applications such as social network analysis, knowledge graph discovery in the Semantic Web, bibliographical network mining, and so on. In this paper, we propose a novel and effective graph neural ne…

    Submitted 15 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  9. arXiv:2304.11171  [pdf, other]

    cs.LG cs.AI

    Granular-ball computing: an efficient, robust, and interpretable adaptive multi-granularity representation and computation method

    Authors: Shuyin Xia, Guoyin Wang, Xinbo Gao, Xiaoyu Lian

    Abstract: Human cognition operates on a "Global-first" cognitive mechanism, prioritizing information processing based on coarse-grained details. This mechanism inherently possesses an adaptive multi-granularity description capacity, resulting in computational traits such as efficiency, robustness, and interpretability. The analysis pattern reliance on the finest granularity and single-granularity makes most…

    Submitted 18 January, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

  10. arXiv:2301.12850  [pdf, other]

    cs.CL cs.AI

    GE-Blender: Graph-Based Knowledge Enhancement for Blender

    Authors: Xiaolei Lian, Xunzhu Tang, Yue Wang

    Abstract: Despite the great success of open-domain dialogue generation, unseen entities can have a large impact on the dialogue generation task, leading to degraded model performance. Previous research used retrieved knowledge of seen entities as auxiliary data to enhance the representation of the model. Nevertheless, logical explanation of unseen entities remains…

    Submitted 30 January, 2023; originally announced January 2023.

  11. arXiv:2212.12948  [pdf, other]

    cs.CV cs.AI

    Human Health Indicator Prediction from Gait Video

    Authors: Ziqing Li, Xuexin Yu, Xiaocong Lian, Yifeng Wang, Xiangyang Ji

    Abstract: Body Mass Index (BMI), age, height and weight are important indicators of human health conditions, which can provide useful information for plenty of practical purposes, such as health care, monitoring and re-identification. Most existing methods of health indicator prediction mainly use front-view body or face images. These inputs are hard to obtain in daily life and often lead to the lack o…

    Submitted 25 December, 2022; originally announced December 2022.

  12. arXiv:2211.16716  [pdf, other]

    cs.SE

    Automated Generating Natural Language Requirements based on Domain Ontology

    Authors: Ziyan Zhao, Li Zhang, Xiaoyun Gao, Xiaoli Lian, Heyang Lv, Lin Shi

    Abstract: Software requirements specification is undoubtedly critical for the whole software life-cycle. Nowadays, writing software requirements specifications primarily depends on human work. Although many studies have sought to speed up the process by proposing advanced elicitation and analysis techniques, it is still a time-consuming and error-prone task that needs to take domain knowledge and b…

    Submitted 29 November, 2022; originally announced November 2022.

  13. arXiv:2210.17479  [pdf, other]

    cs.DB

    kt-Safety: Graph Release via k-Anonymity and t-Closeness (Technical Report)

    Authors: Weilong Ren, Kambiz Ghazinour, Xiang Lian

    Abstract: In a wide spectrum of real-world applications, it is very important to analyze and mine graph data such as social networks, communication networks, citation networks, and so on. However, the release of such graph data often raises privacy issues, and graph privacy preservation has recently drawn much attention from the database community. While prior works on graph privacy preservation mainly f…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: 22 pages, 31 figures, the technical report of a TKDE paper entitled "kt-Safety: Graph Release via k-Anonymity and t-Closeness"

  14. arXiv:2210.11675  [pdf, other]

    cs.LG cs.AI

    Granular-Ball Fuzzy Set and Its Implementation in SVM

    Authors: Shuyin Xia, Xiaoyu Lian, Guoyin Wang, Xinbo Gao, Yabin Shao

    Abstract: Most existing fuzzy set methods use points as their input, which is the finest granularity from the perspective of granular computing. Consequently, these methods are neither efficient nor robust to label noise. Therefore, we propose a framework called granular-ball fuzzy set by introducing granular-ball computing into fuzzy set. The computational framework is based on the granular-balls input ra…

    Submitted 26 November, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

  15. arXiv:2210.03120  [pdf, other]

    cs.LG cs.AI

    GBSVM: Granular-ball Support Vector Machine

    Authors: Shuyin Xia, Xiaoyu Lian, Guoyin Wang, Xinbo Gao, Jiancu Chen, Xiaoli Peng

    Abstract: GBSVM (Granular-ball Support Vector Machine) is a significant attempt to construct a classifier using the coarse-to-fine granularity of a granular-ball as input, rather than a single data point. It is the first classifier whose input contains no points. However, the existing model has some errors, and its dual model has not been derived. As a result, the current algorithm cannot be implemented or…

    Submitted 11 February, 2024; v1 submitted 6 October, 2022; originally announced October 2022.

  16. Boosting the Discriminant Power of Naive Bayes

    Authors: Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang

    Abstract: Naive Bayes has been widely used in many applications because of its simplicity and ability in handling both numerical data and categorical data. However, lack of modeling of correlations between features limits its performance. In addition, noise and outliers in the real-world dataset also greatly degrade the classification performance. In this paper, we propose a feature augmentation method empl…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted by 2022 International Conference on Pattern Recognition

  17. arXiv:2208.12986  [pdf, other]

    cs.RO cs.CV

    6D Robotic Assembly Based on RGB-only Object Pose Estimation

    Authors: Bowen Fu, Sek Kun Leong, Xiaocong Lian, Xiangyang Ji

    Abstract: Vision-based robotic assembly is a crucial yet challenging task as the interaction with multiple objects requires high levels of precision. In this paper, we propose an integrated 6D robotic system to perceive, grasp, manipulate and assemble blocks with tight tolerances. Aiming to provide an off-the-shelf RGB-only solution, our system is built upon a monocular 6D object pose estimation network tra…

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: Accepted by IROS 2022

  18. arXiv:2208.06757  [pdf, other]

    cs.SE

    A Preliminary Study on the Potential Usefulness of Open Domain Model for Missing Software Requirements Recommendation

    Authors: Ziyan Zhao, Li Zhang, Xiaoli Lian

    Abstract: Completeness is one of the most important attributes of software requirement specifications. Unfortunately, incompleteness is also one of the most difficult problems to detect. Some approaches have been proposed to detect missing requirements based on the requirement-oriented domain model. However, such models are lacking for many domains. Fortunately, the domain models constructed…

    Submitted 13 August, 2022; originally announced August 2022.

  19. arXiv:2206.05778  [pdf, other]

    cs.DB cs.LG

    Learning-Based Data Storage [Vision] (Technical Report)

    Authors: Xiang Lian, Xiaofei Zhang

    Abstract: Deep neural network (DNN) and its variants have been extensively used for a wide spectrum of real applications such as image classification, face/speech recognition, fraud detection, and so on. In addition to many important machine learning tasks, as artificial networks emulating the way brain cells function, DNNs also show the capability of storing non-linear relationships between input and outpu…

    Submitted 22 January, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: 14 pages, 16 figures

    ACM Class: E.2; H.2.1; I.2.0; I.2.11

  20. arXiv:2206.02281  [pdf, other]

    cs.CV

    E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles

    Authors: Zhenyu Hu, Zhenyu Wu, Pengcheng Pi, Yunhe Xue, Jiayi Shen, Jianchao Tan, Xiangru Lian, Zhangyang Wang, Ji Liu

    Abstract: Unmanned Aerial Vehicles (UAVs) based video text spotting has been extensively used in civil and military domains. UAV's limited battery capacity motivates us to develop an energy-efficient video text spotting solution. In this paper, we first revisit RCNN's crop & resize training strategy and empirically find that it outperforms aligned RoI sampling on a real-world video text dataset captured by…

    Submitted 5 June, 2022; originally announced June 2022.

  21. arXiv:2206.02114  [pdf, other]

    cs.CL

    Speech Detection Task Against Asian Hate: BERT the Central, While Data-Centric Studies the Crucial

    Authors: Xin Lian

    Abstract: With the COVID-19 pandemic continuing, hatred against Asians is intensifying in countries outside Asia, especially among the Chinese. There is an urgent need to detect and prevent hate speech towards Asians effectively. In this work, we first create COVID-HATE-2022, an annotated dataset including 2,025 annotated tweets fetched in early February 2022, which are labeled based on specific criteria, a…

    Submitted 21 August, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  22. arXiv:2204.13224  [pdf, other]

    cs.DB

    Top-k Community Similarity Search Over Large-Scale Road Networks (Technical Report)

    Authors: Niranjan Rai, Xiang Lian

    Abstract: With the urbanization and development of infrastructure, the community search over road networks has become increasingly important in many real applications such as urban/city planning, social study on local communities, and community recommendations by real estate agencies. In this paper, we propose a novel problem, namely top-k community similarity search (Top-kCS2) over road networks, which eff…

    Submitted 27 April, 2022; originally announced April 2022.

  23. Research Status of Deep Learning Methods for Rumor Detection

    Authors: Li Tan, Ge Wang, Feiyang Jia, Xiaofeng Lian

    Abstract: To manage rumors on social media and reduce their harm to society, many studies have used deep learning methods to detect rumors in open networks. To comprehensively sort out the research status of rumor detection from multiple perspectives, this paper analyzes the highly focused work from three perspectives: Feature Selection, Model Structure, and Research Methods. From the perspective o…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted by MTAP

  24. arXiv:2204.05538  [pdf, other]

    cs.CV

    NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night

    Authors: Xueqing Deng, Peng Wang, Xiaochen Lian, Shawn Newsam

    Abstract: The semantic segmentation of nighttime scenes is a challenging problem that is key to impactful applications like self-driving cars. Yet, it has received little attention compared to its daytime counterpart. In this paper, we propose NightLab, a novel nighttime segmentation framework that leverages multiple deep learning models imbued with night-aware features to yield State-of-The-Art (SoTA) perf…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 8 pages, 6 figures, accepted at CVPR 2022

  25. arXiv:2201.08616  [pdf, other]

    cs.GT

    Diffusion Multi-unit Auctions with Diminishing Marginal Utility Buyers

    Authors: Haolin Liu, Xinyuan Lian, Dengji Zhao

    Abstract: We consider an auction design problem where a seller sells multiple homogeneous items to a set of connected buyers. Each buyer only knows the buyers she directly connects with and has a diminishing marginal utility valuation for the items. The seller initially only connects to some buyers who can be directly invited to the sale by the seller. Our goal is to design an auction to incentivize the buy…

    Submitted 26 February, 2023; v1 submitted 21 January, 2022; originally announced January 2022.

  26. arXiv:2111.05897  [pdf, other]

    cs.LG cs.DC

    Persia: An Open, Hybrid System Scaling Deep Learning-based Recommenders up to 100 Trillion Parameters

    Authors: Xiangru Lian, Binhang Yuan, Xuefeng Zhu, Yulong Wang, Yongjun He, Honghuan Wu, Lei Sun, Haodong Lyu, Chengjun Liu, Xing Dong, Yiqiao Liao, Mingnan Luo, Congfei Zhang, Jingru Xie, Haonan Li, Lei Chen, Renjie Huang, Jianying Lin, Chengchun Shu, Xuezhong Qiu, Zhishan Liu, Dongying Kong, Lei Yuan, Hai Yu, Sen Yang, et al. (2 additional authors not shown)

    Abstract: Deep learning based models have dominated the current landscape of production recommender systems. Furthermore, recent years have witnessed an exponential growth of the model scale--from Google's 2016 model with 1 billion parameters to the latest Facebook's model with 12 trillion parameters. Significant quality boost has come with each jump of the model capacity, which makes us believe the era of…

    Submitted 23 November, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

  27. arXiv:2107.01499  [pdf, other]

    cs.LG cs.DC

    BAGUA: Scaling up Distributed Learning with System Relaxations

    Authors: Shaoduo Gan, Xiangru Lian, Rui Wang, Jianbin Chang, Chengjun Liu, Hongmei Shi, Shengzhuo Zhang, Xianghong Li, Tengxu Sun, Jiawei Jiang, Binhang Yuan, Sen Yang, Ji Liu, Ce Zhang

    Abstract: Recent years have witnessed a growing list of systems for distributed data-parallel training. Existing systems largely fit into two paradigms, i.e., parameter server and MPI-style collective operations. On the algorithmic side, researchers have proposed a wide range of techniques to lower the communication via system relaxations: quantization, decentralization, and communication delay. However, mo…

    Submitted 25 November, 2021; v1 submitted 3 July, 2021; originally announced July 2021.

  28. arXiv:2106.06560  [pdf, other]

    cs.CV

    HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers

    Authors: Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo

    Abstract: High-resolution representations (HR) are essential for dense prediction tasks such as segmentation, detection, and pose estimation. Learning HR representations is typically ignored in previous Neural Architecture Search (NAS) methods that focus on image classification. This work proposes a novel NAS method, called HR-NAS, which is able to find efficient and accurate networks for different tasks, b…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: Accepted by CVPR 2021 (Oral)

  29. arXiv:2106.06135  [pdf, other]

    cs.AI cs.LG

    DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning

    Authors: Daochen Zha, Jingru Xie, Wenye Ma, Sheng Zhang, Xiangru Lian, Xia Hu, Ji Liu

    Abstract: Games are abstractions of the real world, where artificial agents learn to compete and cooperate with other agents. While significant achievements have been made in various perfect- and imperfect-information games, DouDizhu (a.k.a. Fighting the Landlord), a three-player card game, is still unsolved. DouDizhu is a very challenging domain with competition, collaboration, imperfect information, large…

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted by ICML 2021

  30. arXiv:2105.04486  [pdf, other]

    cs.DB

    Probabilistic Top-k Dominating Queries in Distributed Uncertain Databases (Technical Report)

    Authors: Niranjan Rai, Xiang Lian

    Abstract: In many real-world applications such as business planning and sensor data monitoring, one important, yet challenging, task is to rank objects (e.g., products, documents, or spatial objects) based on their ranking scores and efficiently return those objects with the highest scores. In practice, due to the unreliability of data sources, many real-world objects often contain noises and are thus im…

    Submitted 12 May, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

  31. arXiv:2103.11886  [pdf, other]

    cs.CV

    DeepViT: Towards Deeper Vision Transformer

    Authors: Daquan Zhou, Bingyi Kang, Xiaojie Jin, Linjie Yang, Xiaochen Lian, Zihang Jiang, Qibin Hou, Jiashi Feng

    Abstract: Vision transformers (ViTs) have been successfully applied in image classification tasks recently. In this paper, we show that, unlike convolutional neural networks (CNNs) that can be improved by stacking more convolutional layers, the performance of ViTs saturates fast when scaled to be deeper. More specifically, we empirically observe that such scaling difficulty is caused by the attention collapse i…

    Submitted 19 April, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

  32. arXiv:2103.11833  [pdf, other]

    cs.CV

    AutoSpace: Neural Architecture Search with Less Human Interference

    Authors: Daquan Zhou, Xiaojie Jin, Xiaochen Lian, Linjie Yang, Yujing Xue, Qibin Hou, Jiashi Feng

    Abstract: Current neural architecture search (NAS) algorithms still require expert knowledge and effort to design a search space for network construction. In this paper, we consider automating the search space design to minimize human interference, which however faces two challenges: the explosive complexity of the exploration space and the expensive computation cost to evaluate the quality of different sea…

    Submitted 22 March, 2021; originally announced March 2021.

  33. arXiv:2103.08720  [pdf, other]

    cs.DB

    Online Topic-Aware Entity Resolution Over Incomplete Data Streams (Technical Report)

    Authors: Weilong Ren, Xiang Lian, Kambiz Ghazinour

    Abstract: In many real applications such as the data integration, social network analysis, and the Semantic Web, the entity resolution (ER) is an important and fundamental problem, which identifies and links the same real-world entities from various data sources. While prior works usually consider ER over static and complete data, in practice, application data are usually collected in a streaming fashion, a…

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: Technical report of the paper entitled "Online Topic-Aware Entity Resolution Over Incomplete Data Streams", published on SIGMOD 2021

  34. arXiv:2103.02255  [pdf, other]

    cs.SE cs.AI

    Automatically detecting the conflicts between software requirements based on finer semantic analysis

    Authors: Weize Guo, Li Zhang, Xiaoli Lian

    Abstract: Context: Conflicts between software requirements bring uncertainties to product development. Some great approaches have been proposed to identify these conflicts. However, they usually require the software requirements to be represented with specific templates and/or depend on other external sources, which are often not easy to build for many projects in practice. Objective: We aim to propose an approach…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 17 pages, 2 figures

    MSC Class: 68N30 ACM Class: D.2.1

  35. arXiv:2102.02888  [pdf, other]

    cs.LG cs.DC

    1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed

    Authors: Hanlin Tang, Shaoduo Gan, Ammar Ahmad Awan, Samyam Rajbhandari, Conglong Li, Xiangru Lian, Ji Liu, Ce Zhang, Yuxiong He

    Abstract: Scalable training of large models (like BERT and GPT-3) requires careful optimization rooted in model design, architecture, and system capabilities. From a system standpoint, communication has become a major bottleneck, especially on commodity systems with standard TCP interconnects that offer limited network bandwidth. Communication compression is an important technique to reduce training time on…

    Submitted 29 June, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: text overlap with arXiv:2008.11343

  36. arXiv:2101.11446  [pdf]

    stat.AP cs.IR

    A study on information behavior of scholars for article keywords selection

    Authors: Z. X. Lian

    Abstract: This project takes the factors of keyword selection behavior as the research object. Qualitative analysis methods such as interviews and grounded theory were used to construct a causal influence path model. Computer simulation techniques such as multi-agent simulation experiments were then used to study the factors of keyword selection along two dimensions, from individual to group. The res…

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: 10 pages

    MSC Class: 62-11

  37. arXiv:2008.11343  [pdf, other]

    cs.DC cs.LG stat.ML

    APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm

    Authors: Hanlin Tang, Shaoduo Gan, Samyam Rajbhandari, Xiangru Lian, Ji Liu, Yuxiong He, Ce Zhang

    Abstract: Adam is an important optimization algorithm to guarantee efficiency and accuracy for training many important tasks such as BERT and ImageNet. However, Adam is generally not compatible with information (gradient) compression technology. Therefore, the communication usually becomes the bottleneck for parallelizing Adam. In this paper, we propose a communication efficient {\bf A}DAM {\bf p}reconditi…

    Submitted 27 August, 2020; v1 submitted 25 August, 2020; originally announced August 2020.

  38. arXiv:2007.03014  [pdf, other]

    cs.DB

    Topic-based Community Search over Spatial-Social Networks (Technical Report)

    Authors: Ahmed Al-Baghdadi, Xiang Lian

    Abstract: Recently, the community search problem has attracted significant attention, due to its wide spectrum of real-world applications such as event organization, friend recommendation, advertisement in e-commerce, and so on. Given a query vertex, the community search problem finds a dense subgraph that contains the query vertex. In social networks, users have multiple check-in locations, influence score,…

    Submitted 25 November, 2021; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 15 pages, 12 figures, and 3 tables. To appear in the PVLDB'20

  39. arXiv:2004.01961  [pdf, other]

    cs.CV

    Neural Architecture Search for Lightweight Non-Local Networks

    Authors: Yingwei Li, Xiaojie Jin, Jieru Mei, Xiaochen Lian, Linjie Yang, Cihang Xie, Qihang Yu, Yuyin Zhou, Song Bai, Alan Yuille

    Abstract: Non-Local (NL) blocks have been widely studied in various vision tasks. However, it has been rarely explored to embed the NL blocks in mobile neural networks, mainly due to the following challenges: 1) NL blocks generally have heavy computation cost which makes it difficult to be applied in applications where computational resources are limited, and 2) it is an open problem to discover an optimal…

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: CVPR 2020. Project page: https://github.com/LiYingwei/AutoNL

  40. arXiv:2003.04302  [pdf, other]

    stat.ML cs.LG

    Stochastic Recursive Momentum for Policy Gradient Methods

    Authors: Huizhuo Yuan, Xiangru Lian, Ji Liu, Yuren Zhou

    Abstract: In this paper, we propose a novel algorithm named STOchastic Recursive Momentum for Policy Gradient (STORM-PG), which operates a SARAH-type stochastic recursive variance-reduced policy gradient in an exponential moving average fashion. STORM-PG enjoys a provably sharp $O(1/ε^3)$ sample complexity bound for STORM-PG, matching the best-known convergence rate for policy gradient algorithm. In the mea…

    Submitted 9 March, 2020; originally announced March 2020.

  41. arXiv:2002.00743  [pdf, other]

    cs.CL cs.LG stat.ML

    Unsupervised Multilingual Alignment using Wasserstein Barycenter

    Authors: Xin Lian, Kshitij Jain, Jakub Truszkowski, Pascal Poupart, Yaoliang Yu

    Abstract: We study unsupervised multilingual alignment, the problem of finding word-to-word translations between multiple languages without using any parallel data. One popular strategy is to reduce multilingual alignment to the much simplified bilingual setting, by picking one of the input languages as the pivot language that we transit through. However, it is well-known that transiting through a poorly ch…

    Submitted 28 July, 2020; v1 submitted 28 January, 2020; originally announced February 2020.

    Comments: Code is available at https://github.com/alixxxin/multi-lang

    ACM Class: I.2.7

    Journal ref: Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), 2020

  42. arXiv:1912.13515  [pdf, other]

    stat.ML cs.LG math.OC

    Stochastic Recursive Variance Reduction for Efficient Smooth Non-Convex Compositional Optimization

    Authors: Huizhuo Yuan, Xiangru Lian, Ji Liu

    Abstract: Stochastic compositional optimization arises in many important machine learning tasks such as value function evaluation in reinforcement learning and portfolio management. The objective function is the composition of two expectations of stochastic functions, and is more challenging to optimize than vanilla stochastic optimization problems. In this paper, we investigate the stochastic compositional…

    Submitted 25 January, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

  43. arXiv:1912.09640  [pdf, other]

    cs.CV

    AtomNAS: Fine-Grained End-to-End Neural Architecture Search

    Authors: Jieru Mei, Yingwei Li, Xiaochen Lian, Xiaojie Jin, Linjie Yang, Alan Yuille, Jianchao Yang

    Abstract: Search space design is very critical to neural architecture search (NAS) algorithms. We propose a fine-grained search space comprised of atomic blocks, a minimal search unit that is much smaller than the ones used in recent NAS algorithms. This search space allows a mix of operations by composing different types of atomic blocks, while the search space in previous methods only allows homogeneous o…

    Submitted 23 February, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: ICLR 2020 camera ready version

  44. arXiv:1910.04786  [pdf, other]

    cs.DB

    Efficient Path Routing Over Road Networks in the Presence of Ad-Hoc Obstacles (Technical Report)

    Authors: Ahmed Al-Baghdadi, Xiang Lian, En Cheng

    Abstract: Nowadays, the path routing over road networks has become increasingly important, yet challenging, in many real-world applications such as location-based services (LBS), logistics and supply chain management, transportation systems, map utilities, and so on. While many prior works aimed to find a path between a source and a destination with the smallest traveling distance/time, they do not take int…

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: Accepted for Information Systems (IS) Journal, 2019

  45. arXiv:1909.11224  [pdf, other]

    cs.DB

    Skyline Queries Over Incomplete Data Streams (Technical Report)

    Authors: Weilong Ren, Xiang Lian, Kambiz Ghazinour

    Abstract: Nowadays, efficient and effective processing over massive stream data has attracted much attention from the database community, which are useful in many real applications such as sensor data monitoring, network intrusion detection, and so on. In practice, due to the malfunction of sensing devices or imperfect data collection techniques, real-world stream data may often contain missing or incomplet…

    Submitted 25 September, 2019; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: 26 pages, 20 figures, VLDB Journal

  46. arXiv:1908.08654  [pdf, other]

    cs.DB

    Efficient Join Processing Over Incomplete Data Streams (Technical Report)

    Authors: Weilong Ren, Xiang Lian, Kambiz Ghazinour

    Abstract: For decades, the join operator over fast data streams has always drawn much attention from the database community, due to its wide spectrum of real-world applications, such as online clustering, intrusion detection, sensor data monitoring, and so on. Existing works usually assume that the underlying streams to be joined are complete (without any missing values). However, this assumption may not al…

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: 11 pages, 11 figures, accepted conference paper for CIKM19

  47. arXiv:1907.07346  [pdf, other]

    cs.DC cs.LG stat.ML

    DeepSqueeze: Decentralization Meets Error-Compensated Compression

    Authors: Hanlin Tang, Xiangru Lian, Shuang Qiu, Lei Yuan, Ce Zhang, Tong Zhang, Ji Liu

    Abstract: Communication is a key bottleneck in distributed training. Recently, an error-compensated compression technology was particularly designed for the centralized learning and receives huge successes, by showing significant advantages over state-of-the-art compression based methods in saving the communication cost. Since the decentralized training has been witnessed to be superior…

    Submitted 3 August, 2019; v1 submitted 17 July, 2019; originally announced July 2019.

  48. arXiv:1905.05957  [pdf, other]

    cs.DC cs.LG

    DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-Pass Error-Compensated Compression

    Authors: Hanlin Tang, Xiangru Lian, Chen Yu, Tong Zhang, Ji Liu

    Abstract: A standard approach in large scale machine learning is distributed stochastic gradient training, which requires the computation of aggregated stochastic gradients over multiple nodes on a network. Communication is a major bottleneck in such applications, and in recent years, compressed stochastic gradient methods such as QSGD (quantized SGD) and sparse SGD have been proposed to reduce communicatio…

    Submitted 21 March, 2020; v1 submitted 15 May, 2019; originally announced May 2019.

  49. arXiv:1810.06177  [pdf, other]

    math.OC cs.LG

    Revisit Batch Normalization: New Understanding from an Optimization View and a Refinement via Composition Optimization

    Authors: Xiangru Lian, Ji Liu

    Abstract: Batch Normalization (BN) has been used extensively in deep learning to achieve faster training process and better resulting models. However, whether BN works strongly depends on how the batches are constructed during training and it may not converge to a desired solution if the statistics on a batch are not close to the statistics over the whole dataset. In this paper, we try to understand BN from…

    Submitted 15 October, 2018; originally announced October 2018.

  50. arXiv:1810.04915  [pdf, other]

    cs.DB

    A Comparative Study of Consistent Snapshot Algorithms for Main-Memory Database Systems

    Authors: Liang Li, Guoren Wang, Gang Wu, Ye Yuan, Lei Chen, Xiang Lian

    Abstract: In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Specifically, it is necessary for IMDBs to have efficient snapshot performance to support certain special applications (e.g., consistent checkpoint, HTAP). Formally, the in-memory consistent snapshot problem refers to taking an in-memory consistent time-in-point snapsho…

    Submitted 11 October, 2018; originally announced October 2018.