Search | arXiv e-print repository

A-ACT: Action Anticipation through Cycle Transformations

Authors: Akash Gupta, **gen Liu, Liefeng Bo, Amit K. Roy-Chowdhury, Tao Mei

Abstract: While action anticipation has garnered a lot of research interest recently, most of the works focus on anticipating future action directly through observed visual cues only. In this work, we take a step back to analyze how the human capability to anticipate the future can be transferred to machine learning algorithms. To incorporate this ability in intelligent systems a question worth pondering up… ▽ More While action anticipation has garnered a lot of research interest recently, most of the works focus on anticipating future action directly through observed visual cues only. In this work, we take a step back to analyze how the human capability to anticipate the future can be transferred to machine learning algorithms. To incorporate this ability in intelligent systems a question worth pondering upon is how exactly do we anticipate? Is it by anticipating future actions from past experiences? Or is it by simulating possible scenarios based on cues from the present? A recent study on human psychology explains that, in anticipating an occurrence, the human brain counts on both systems. In this work, we study the impact of each system for the task of action anticipation and introduce a paradigm to integrate them in a learning framework. We believe that intelligent systems designed by leveraging the psychological anticipation models will do a more nuanced job at the task of human action prediction. Furthermore, we introduce cyclic transformation in the temporal dimension in feature and semantic label space to instill the human ability of reasoning of past actions based on the predicted future. Experiments on Epic-Kitchen, Breakfast, and 50Salads dataset demonstrate that the action anticipation model learned using a combination of the two systems along with the cycle transformation performs favorably against various state-of-the-art approaches. △ Less

Submitted 2 April, 2022; originally announced April 2022.

arXiv:2203.02660 [pdf, other]

doi 10.1145/3510003.3510219

MVD: Memory-Related Vulnerability Detection Based on Flow-Sensitive Graph Neural Networks

Authors: Sicong Cao, Xiaobing Sun, Lili Bo, Rongxin Wu, Bin Li, Chuanqi Tao

Abstract: Memory-related vulnerabilities constitute severe threats to the security of modern software. Despite the success of deep learning-based approaches to generic vulnerability detection, they are still limited by the underutilization of flow information when applied for detecting memory-related vulnerabilities, leading to high false positives. In this paper,we propose MVD, a statement-level Memory-r… ▽ More Memory-related vulnerabilities constitute severe threats to the security of modern software. Despite the success of deep learning-based approaches to generic vulnerability detection, they are still limited by the underutilization of flow information when applied for detecting memory-related vulnerabilities, leading to high false positives. In this paper,we propose MVD, a statement-level Memory-related Vulnerability Detection approach based on flow-sensitive graph neural networks (FS-GNN). FS-GNN is employed to jointly embed both unstructured information (i.e., source code) and structured information (i.e., control- and data-flow) to capture implicit memory-related vulnerability patterns. We evaluate MVD on the dataset which contains 4,353 real-world memory-related vulnerabilities, and compare our approach with three state-of-the-art deep learning-based approaches as well as five popular static analysisbased memory detectors. The experiment results show that MVD achieves better detection accuracy, outperforming both state-of-theart DL-based and static analysis-based approaches. Furthermore, MVD makes a great trade-off between accuracy and efficiency. △ Less

Submitted 5 March, 2022; originally announced March 2022.

Comments: To appear in the Technical Track of ICSE 2022

arXiv:2202.02930 [pdf, other]

Towards Micro-video Thumbnail Selection via a Multi-label Visual-semantic Embedding Model

Authors: Liu Bo

Abstract: The thumbnail, as the first sight of a micro-video, plays a pivotal role in attracting users to click and watch. While in the real scenario, the more the thumbnails satisfy the users, the more likely the micro-videos will be clicked. In this paper, we aim to select the thumbnail of a given micro-video that meets most users` interests. Towards this end, we present a multi-label visual-semantic embe… ▽ More The thumbnail, as the first sight of a micro-video, plays a pivotal role in attracting users to click and watch. While in the real scenario, the more the thumbnails satisfy the users, the more likely the micro-videos will be clicked. In this paper, we aim to select the thumbnail of a given micro-video that meets most users` interests. Towards this end, we present a multi-label visual-semantic embedding model to estimate the similarity between the pair of each frame and the popular topics that users are interested in. In this model, the visual and textual information is embedded into a shared semantic space, whereby the similarity can be measured directly, even the unseen words. Moreover, to compare the frame to all words from the popular topics, we devise an attention embedding space associated with the semantic-attention projection. With the help of these two embedding spaces, the popularity score of a frame, which is defined by the sum of similarity scores over the corresponding visual information and popular topic pairs, is achieved. Ultimately, we fuse the visual representation score and the popularity score of each frame to select the attractive thumbnail for the given micro-video. Extensive experiments conducted on a real-world dataset have well-verified that our model significantly outperforms several state-of-the-art baselines. △ Less

Submitted 6 February, 2022; originally announced February 2022.

arXiv:2201.10761 [pdf, other]

An Efficient and Robust System for Vertically Federated Random Forest

Authors: Houpu Yao, Jiazhou Wang, Peng Dai, Liefeng Bo, Yanqing Chen

Abstract: As there is a growing interest in utilizing data across multiple resources to build better machine learning models, many vertically federated learning algorithms have been proposed to preserve the data privacy of the participating organizations. However, the efficiency of existing vertically federated learning algorithms remains to be a big problem, especially when applied to large-scale real-worl… ▽ More As there is a growing interest in utilizing data across multiple resources to build better machine learning models, many vertically federated learning algorithms have been proposed to preserve the data privacy of the participating organizations. However, the efficiency of existing vertically federated learning algorithms remains to be a big problem, especially when applied to large-scale real-world datasets. In this paper, we present a fast, accurate, scalable and yet robust system for vertically federated random forest. With extensive optimization, we achieved $5\times$ and $83\times$ speed up over the SOTA SecureBoost model \cite{cheng2019secureboost} for training and serving tasks. Moreover, the proposed system can achieve similar accuracy but with favorable scalability and partition tolerance. Our code has been made public to facilitate the development of the community and the protection of user data privacy. △ Less

Submitted 26 January, 2022; originally announced January 2022.

arXiv:2201.09406

Power Forward Performance in Semimartingale Markets with Stochastic Integrated Factors

Authors: Lijun Bo, Agostino Capponi, Chao Zhou

Abstract: We study the forward investment performance process (FIPP) in an incomplete semimartingale market model with closed and convex portfolio constraints, when the investor's risk preferences are of the power form. We provide necessary and sufficient conditions for the existence of such FIPP. In a semimartingale factor model, we show that the FIPP can be recovered as a triplet of processes which admit… ▽ More We study the forward investment performance process (FIPP) in an incomplete semimartingale market model with closed and convex portfolio constraints, when the investor's risk preferences are of the power form. We provide necessary and sufficient conditions for the existence of such FIPP. In a semimartingale factor model, we show that the FIPP can be recovered as a triplet of processes which admit an integral representation with respect to semimartingales. Using an integrated stochastic factor model, we relate the factor representation of the triplet of processes to the smooth solution of an ill-posed partial integro-differential Hamilton-Jacobi-Bellman (HJB) equation. We develop explicit constructions for the class of time-monotone FIPPs, generalizing existing results from Brownian to semimartingale market models. △ Less

Submitted 25 January, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

Comments: This work was intended as a replacement of arXiv:1811.11899 and any subsequent updates will appear there

MSC Class: 3E20; 60J20

arXiv:2201.02435 [pdf, other]

doi 10.24963/ijcai.2021/225

Spatial-Temporal Sequential Hypergraph Network for Crime Prediction with Dynamic Multiplex Relation Learning

Authors: Lianghao Xia, Chao Huang, Yong Xu, Peng Dai, Liefeng Bo, Xiyue Zhang, Tianyi Chen

Abstract: Crime prediction is crucial for public safety and resource optimization, yet is very challenging due to two aspects: i) the dynamics of criminal patterns across time and space, crime events are distributed unevenly on both spatial and temporal domains; ii) time-evolving dependencies between different types of crimes (e.g., Theft, Robbery, Assault, Damage) which reveal fine-grained semantics of cri… ▽ More Crime prediction is crucial for public safety and resource optimization, yet is very challenging due to two aspects: i) the dynamics of criminal patterns across time and space, crime events are distributed unevenly on both spatial and temporal domains; ii) time-evolving dependencies between different types of crimes (e.g., Theft, Robbery, Assault, Damage) which reveal fine-grained semantics of crimes. To tackle these challenges, we propose Spatial-Temporal Sequential Hypergraph Network (ST-SHN) to collectively encode complex crime spatial-temporal patterns as well as the underlying category-wise crime semantic relationships. In specific, to handle spatial-temporal dynamics under the long-range and global context, we design a graph-structured message passing architecture with the integration of the hypergraph learning paradigm. To capture category-wise crime heterogeneous relations in a dynamic environment, we introduce a multi-channel routing mechanism to learn the time-evolving structural dependency across crime types. We conduct extensive experiments on two real-world datasets, showing that our proposed ST-SHN framework can significantly improve the prediction performance as compared to various state-of-the-art baselines. The source code is available at: https://github.com/akaxlh/ST-SHN. △ Less

Submitted 23 April, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

Comments: This paper has been published as a research paper at IJCAI 2021

arXiv:2201.02307 [pdf, other]

doi 10.1109/ICDE51399.2021.00179

Multi-Behavior Enhanced Recommendation with Cross-Interaction Collaborative Relation Modeling

Authors: Lianghao Xia, Chao Huang, Yong Xu, Peng Dai, Mengyin Lu, Liefeng Bo

Abstract: Many previous studies aim to augment collaborative filtering with deep neural network techniques, so as to achieve better recommendation performance. However, most existing deep learning-based recommender systems are designed for modeling singular type of user-item interaction behavior, which can hardly distill the heterogeneous relations between user and item. In practical recommendation scenario… ▽ More Many previous studies aim to augment collaborative filtering with deep neural network techniques, so as to achieve better recommendation performance. However, most existing deep learning-based recommender systems are designed for modeling singular type of user-item interaction behavior, which can hardly distill the heterogeneous relations between user and item. In practical recommendation scenarios, there exist multityped user behaviors, such as browse and purchase. Due to the overlook of user's multi-behavioral patterns over different items, existing recommendation methods are insufficient to capture heterogeneous collaborative signals from user multi-behavior data. Inspired by the strength of graph neural networks for structured data modeling, this work proposes a Graph Neural Multi-Behavior Enhanced Recommendation (GNMR) framework which explicitly models the dependencies between different types of user-item interactions under a graph-based message passing architecture. GNMR devises a relation aggregation network to model interaction heterogeneity, and recursively performs embedding propagation between neighboring nodes over the user-item interaction graph. Experiments on real-world recommendation datasets show that our GNMR consistently outperforms state-of-the-art methods. The source code is available at https://github.com/akaxlh/GNMR. △ Less

Submitted 6 January, 2022; originally announced January 2022.

Comments: Published on ICDE 2021

arXiv:2112.14958 [pdf, other]

A Benchmark Dataset for Micro-video Thumbnail Selection

Authors: Liu Bo

Abstract: The thumbnail, as the first sight of a micro-video, plays a pivotal role in attracting users to click and watch. Although several pioneer efforts have been dedicated to jointly considering the quality and representativeness for selecting the thumbnail, they are limited in exploring the influence of users` interests. While in the real scenario, the more the thumbnails satisfy the users, the more li… ▽ More The thumbnail, as the first sight of a micro-video, plays a pivotal role in attracting users to click and watch. Although several pioneer efforts have been dedicated to jointly considering the quality and representativeness for selecting the thumbnail, they are limited in exploring the influence of users` interests. While in the real scenario, the more the thumbnails satisfy the users, the more likely the micro-videos will be clicked. In this paper, we aim to select the thumbnail of a given micro-video that meets most users` interests. Towards this end, we construct a large-scale dataset for the micro-video thumbnails. Ultimately, we conduct several baselines on the dataset and demonstrate the effectiveness of our dataset. △ Less

Submitted 30 December, 2021; originally announced December 2021.

arXiv:2112.12207 [pdf]

Nutritional blood concentration biomarkers in the Hispanic Community Health Study/Study of Latinos: Measurement characteristics and power

Authors: Lillian A. Boe, Yasmin Mossavar-Rahmani, Daniela Sotres-Alvarez, Martha L. Daviglus, Ramon A. Durazo-Arvizu, Bharat Thyagarajan, Robert C. Kaplan, Pamela A. Shaw

Abstract: Measurement error is a major issue in self-reported diet that can distort diet-disease relationships. Use of blood concentration biomarkers has the potential to mitigate the subjective bias inherent in self-report. As part of the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) baseline visit (2008-2011), self-reported diet was collected on all participants (N=16,415). Blood concentrati… ▽ More Measurement error is a major issue in self-reported diet that can distort diet-disease relationships. Use of blood concentration biomarkers has the potential to mitigate the subjective bias inherent in self-report. As part of the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) baseline visit (2008-2011), self-reported diet was collected on all participants (N=16,415). Blood concentration biomarkers for carotenoids, tocopherols, retinol, vitamin B12 and folate were collected on a subset (N=476), as part of the Study of Latinos: Nutrition and Physical Activity Assessment Study (SOLNAS). We examine the relationship between biomarker levels, self-reported intake, Hispanic/Latino background, and other participant characteristics in this diverse cohort. We build regression calibration-based prediction equations for ten nutritional biomarkers and use a simulation to study the power of detecting a diet-disease association in a multivariable Cox model using a predicted concentration level. Good power was observed for some nutrients with high prediction model R2 values, but further research is needed to understand how best to realize the potential of these dietary biomarkers. This study provides a comprehensive examination of several nutritional biomarkers within the HCHS/SOL, characterizing their associations with subject characteristics and the influence of the measurement characteristics on the power to detect associations with health outcomes. △ Less

Submitted 20 September, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

Comments: 20 pages in main manuscript including 5 tables and 2 figures; 14 pages of supplement including 5 tables and 1 figure

arXiv:2111.12760 [pdf, ps, other]

An Augmented Likelihood Approach for the Discrete Proportional Hazards Model Using Auxiliary and Validated Outcome Data -- with Application to the HCHS/SOL Study

Authors: Lillian A. Boe, Pamela A. Shaw

Abstract: In large epidemiologic studies, it is typical for an inexpensive, non-invasive procedure to be used to record disease status during regular follow-up visits, with less frequent assessment by a gold standard test. Inexpensive outcome measures like self-reported disease status are practical to obtain, but can be error-prone. Association analysis reliant on error-prone outcomes may lead to biased res… ▽ More In large epidemiologic studies, it is typical for an inexpensive, non-invasive procedure to be used to record disease status during regular follow-up visits, with less frequent assessment by a gold standard test. Inexpensive outcome measures like self-reported disease status are practical to obtain, but can be error-prone. Association analysis reliant on error-prone outcomes may lead to biased results; however, restricting analyses to only data from the less frequently observed error-free outcome could be inefficient. We have developed an augmented likelihood that incorporates data from both error-prone outcomes and a gold standard assessment. We conduct a numerical study to show how we can improve statistical efficiency by using the proposed method over standard approaches for interval-censored survival data that do not leverage auxiliary data. We extend this method for the complex survey design setting so that it can be applied in our motivating data example. Our method is applied to data from the Hispanic Community Health Study/Study of Latinos to assess the association between energy and protein intake and the risk of incident diabetes. In our application, we demonstrate how our method can be used in combination with regression calibration to additionally address the covariate measurement error in self-reported diet. △ Less

Submitted 20 September, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

Comments: Main manuscript: 31 pages including 6 pages of figures; 27 pages including 5 pages of figures and references

arXiv:2110.04038 [pdf, other]

Traffic Flow Forecasting with Spatial-Temporal Graph Diffusion Network

Authors: Xiyue Zhang, Chao Huang, Yong Xu, Lianghao Xia, Peng Dai, Liefeng Bo, Junbo Zhang, Yu Zheng

Abstract: Accurate forecasting of citywide traffic flow has been playing critical role in a variety of spatial-temporal mining applications, such as intelligent traffic control and public risk assessment. While previous work has made significant efforts to learn traffic temporal dynamics and spatial dependencies, two key limitations exist in current models. First, only the neighboring spatial correlations a… ▽ More Accurate forecasting of citywide traffic flow has been playing critical role in a variety of spatial-temporal mining applications, such as intelligent traffic control and public risk assessment. While previous work has made significant efforts to learn traffic temporal dynamics and spatial dependencies, two key limitations exist in current models. First, only the neighboring spatial correlations among adjacent regions are considered in most existing methods, and the global inter-region dependency is ignored. Additionally, these methods fail to encode the complex traffic transition regularities exhibited with time-dependent and multi-resolution in nature. To tackle these challenges, we develop a new traffic prediction framework-Spatial-Temporal Graph Diffusion Network (ST-GDN). In particular, ST-GDN is a hierarchically structured graph neural architecture which learns not only the local region-wise geographical dependencies, but also the spatial semantics from a global perspective. Furthermore, a multi-scale attention network is developed to empower ST-GDN with the capability of capturing multi-level temporal dynamics. Experiments on several real-life traffic datasets demonstrate that ST-GDN outperforms different types of state-of-the-art baselines. Source codes of implementations are available at https://github.com/jill001/ST-GDN. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Comments: Published as a paper at AAAI 2021

arXiv:2110.04002 [pdf, other]

doi 10.1145/3397271.3401445

Multiplex Behavioral Relation Learning for Recommendation via Memory Augmented Transformer Network

Authors: Lianghao Xia, Chao Huang, Yong Xu, Peng Dai, Bo Zhang, Liefeng Bo

Abstract: Capturing users' precise preferences is of great importance in various recommender systems (eg., e-commerce platforms), which is the basis of how to present personalized interesting product lists to individual users. In spite of significant progress has been made to consider relations between users and items, most of the existing recommendation techniques solely focus on singular type of user-item… ▽ More Capturing users' precise preferences is of great importance in various recommender systems (eg., e-commerce platforms), which is the basis of how to present personalized interesting product lists to individual users. In spite of significant progress has been made to consider relations between users and items, most of the existing recommendation techniques solely focus on singular type of user-item interactions. However, user-item interactive behavior is often exhibited with multi-type (e.g., page view, add-to-favorite and purchase) and inter-dependent in nature. The overlook of multiplex behavior relations can hardly recognize the multi-modal contextual signals across different types of interactions, which limit the feasibility of current recommendation methods. To tackle the above challenge, this work proposes a Memory-Augmented Transformer Networks (MATN), to enable the recommendation with multiplex behavioral relational information, and joint modeling of type-specific behavioral context and type-wise behavior inter-dependencies, in a fully automatic manner. In our MATN framework, we first develop a transformer-based multi-behavior relation encoder, to make the learned interaction representations be reflective of the cross-type behavior relations. Furthermore, a memory attention network is proposed to supercharge MATN capturing the contextual signals of different types of behavior into the category-specific latent embedding space. Finally, a cross-behavior aggregation component is introduced to promote the comprehensive collaboration across type-aware interaction behavior representations, and discriminate their inherent contributions in assisting recommendations. Extensive experiments on two benchmark datasets and a real-world e-commence user behavior data demonstrate significant improvements obtained by MATN over baselines. Codes are available at: https://github.com/akaxlh/MATN. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Comments: Published as a full paper at SIGIR 2020

arXiv:2110.04000 [pdf, other]

Knowledge-Enhanced Hierarchical Graph Transformer Network for Multi-Behavior Recommendation

Authors: Lianghao Xia, Chao Huang, Yong Xu, Peng Dai, Xiyue Zhang, Hongsheng Yang, Jian Pei, Liefeng Bo

Abstract: Accurate user and item embedding learning is crucial for modern recommender systems. However, most existing recommendation techniques have thus far focused on modeling users' preferences over singular type of user-item interactions. Many practical recommendation scenarios involve multi-typed user interactive behaviors (e.g., page view, add-to-favorite and purchase), which presents unique challenge… ▽ More Accurate user and item embedding learning is crucial for modern recommender systems. However, most existing recommendation techniques have thus far focused on modeling users' preferences over singular type of user-item interactions. Many practical recommendation scenarios involve multi-typed user interactive behaviors (e.g., page view, add-to-favorite and purchase), which presents unique challenges that cannot be handled by current recommendation solutions. In particular: i) complex inter-dependencies across different types of user behaviors; ii) the incorporation of knowledge-aware item relations into the multi-behavior recommendation framework; iii) dynamic characteristics of multi-typed user-item interactions. To tackle these challenges, this work proposes a Knowledge-Enhanced Hierarchical Graph Transformer Network (KHGT), to investigate multi-typed interactive patterns between users and items in recommender systems. Specifically, KHGT is built upon a graph-structured neural architecture to i) capture type-specific behavior characteristics; ii) explicitly discriminate which types of user-item interactions are more important in assisting the forecasting task on the target behavior. Additionally, we further integrate the graph attention layer with the temporal encoding strategy, to empower the learned embeddings be reflective of both dedicated multiplex user-item and item-item relations, as well as the underlying interaction dynamics. Extensive experiments conducted on three real-world datasets show that KHGT consistently outperforms many state-of-the-art recommendation methods across various evaluation settings. Our implementation code is available at https://github.com/akaxlh/KHGT. △ Less

Submitted 8 October, 2021; originally announced October 2021.

arXiv:2110.03996 [pdf, other]

Graph-Enhanced Multi-Task Learning of Multi-Level Transition Dynamics for Session-based Recommendation

Authors: Chao Huang, Jiahui Chen, Lianghao Xia, Yong Xu, Peng Dai, Yanqing Chen, Liefeng Bo, Jiashu Zhao, Jimmy Xiangji Huang

Abstract: Session-based recommendation plays a central role in a wide spectrum of online applications, ranging from e-commerce to online advertising services. However, the majority of existing session-based recommendation techniques (e.g., attention-based recurrent network or graph neural network) are not well-designed for capturing the complex transition dynamics exhibited with temporally-ordered and multi… ▽ More Session-based recommendation plays a central role in a wide spectrum of online applications, ranging from e-commerce to online advertising services. However, the majority of existing session-based recommendation techniques (e.g., attention-based recurrent network or graph neural network) are not well-designed for capturing the complex transition dynamics exhibited with temporally-ordered and multi-level inter-dependent relation structures. These methods largely overlook the relation hierarchy of item transitional patterns. In this paper, we propose a multi-task learning framework with Multi-level Transition Dynamics (MTD), which enables the jointly learning of intra- and inter-session item transition dynamics in automatic and hierarchical manner. Towards this end, we first develop a position-aware attention mechanism to learn item transitional regularities within individual session. Then, a graph-structured hierarchical relation encoder is proposed to explicitly capture the cross-session item transitions in the form of high-order connectivities by performing embedding propagation with the global graph context. The learning process of intra- and inter-session transition dynamics are integrated, to preserve the underlying low- and high-level item relationships in a common latent space. Extensive experiments on three real-world datasets demonstrate the superiority of MTD as compared to state-of-the-art baselines. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Comments: Published as a paper at AAAI 2021

arXiv:2110.03987 [pdf, other]

Knowledge-aware Coupled Graph Neural Network for Social Recommendation

Authors: Chao Huang, Huance Xu, Yong Xu, Peng Dai, Lianghao Xia, Mengyin Lu, Liefeng Bo, Hao Xing, ** Lai, Yanfang Ye

Abstract: Social recommendation task aims to predict users' preferences over items with the incorporation of social connections among users, so as to alleviate the sparse issue of collaborative filtering. While many recent efforts show the effectiveness of neural network-based social recommender systems, several important challenges have not been well addressed yet: (i) The majority of models only consider… ▽ More Social recommendation task aims to predict users' preferences over items with the incorporation of social connections among users, so as to alleviate the sparse issue of collaborative filtering. While many recent efforts show the effectiveness of neural network-based social recommender systems, several important challenges have not been well addressed yet: (i) The majority of models only consider users' social connections, while ignoring the inter-dependent knowledge across items; (ii) Most of existing solutions are designed for singular type of user-item interactions, making them infeasible to capture the interaction heterogeneity; (iii) The dynamic nature of user-item interactions has been less explored in many social-aware recommendation techniques. To tackle the above challenges, this work proposes a Knowledge-aware Coupled Graph Neural Network (KCGN) that jointly injects the inter-dependent knowledge across items and users into the recommendation framework. KCGN enables the high-order user- and item-wise relation encoding by exploiting the mutual information for global graph structure awareness. Additionally, we further augment KCGN with the capability of capturing dynamic multi-typed user-item interactive patterns. Experimental studies on real-world datasets show the effectiveness of our method against many strong baselines in a variety of settings. Source codes are available at: https://github.com/xhcdream/KCGN. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Comments: Published as a paper at AAAI 2021

arXiv:2110.03969 [pdf, other]

doi 10.1145/3404835.3462972

Graph Meta Network for Multi-Behavior Recommendation

Authors: Lianghao Xia, Yong Xu, Chao Huang, Peng Dai, Liefeng Bo

Abstract: Modern recommender systems often embed users and items into low-dimensional latent representations, based on their observed interactions. In practical recommendation scenarios, users often exhibit various intents which drive them to interact with items with multiple behavior types (e.g., click, tag-as-favorite, purchase). However, the diversity of user behaviors is ignored in most of the existing… ▽ More Modern recommender systems often embed users and items into low-dimensional latent representations, based on their observed interactions. In practical recommendation scenarios, users often exhibit various intents which drive them to interact with items with multiple behavior types (e.g., click, tag-as-favorite, purchase). However, the diversity of user behaviors is ignored in most of the existing approaches, which makes them difficult to capture heterogeneous relational structures across different types of interactive behaviors. Exploring multi-typed behavior patterns is of great importance to recommendation systems, yet is very challenging because of two aspects: i) The complex dependencies across different types of user-item interactions; ii) Diversity of such multi-behavior patterns may vary by users due to their personalized preference. To tackle the above challenges, we propose a Multi-Behavior recommendation framework with Graph Meta Network to incorporate the multi-behavior pattern modeling into a meta-learning paradigm. Our developed MB-GMN empowers the user-item interaction learning with the capability of uncovering type-dependent behavior representations, which automatically distills the behavior heterogeneity and interaction diversity for recommendations. Extensive experiments on three real-world datasets show the effectiveness of MB-GMN by significantly boosting the recommendation performance as compared to various state-of-the-art baselines. The source code is available athttps://github.com/akaxlh/MB-GMN. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Comments: Published as a full paper at SIGIR 2021

arXiv:2110.03958 [pdf, other]

doi 10.1145/3459637.3482480

Social Recommendation with Self-Supervised Metagraph Informax Network

Authors: Xiaoling Long, Chao Huang, Yong Xu, Huance Xu, Peng Dai, Lianghao Xia, Liefeng Bo

Abstract: In recent years, researchers attempt to utilize online social information to alleviate data sparsity for collaborative filtering, based on the rationale that social networks offers the insights to understand the behavioral patterns. However, due to the overlook of inter-dependent knowledge across items (e.g., categories of products), existing social recommender systems are insufficient to distill… ▽ More In recent years, researchers attempt to utilize online social information to alleviate data sparsity for collaborative filtering, based on the rationale that social networks offers the insights to understand the behavioral patterns. However, due to the overlook of inter-dependent knowledge across items (e.g., categories of products), existing social recommender systems are insufficient to distill the heterogeneous collaborative signals from both user and item sides. In this work, we propose a Self-Supervised Metagraph Infor-max Network (SMIN) which investigates the potential of jointly incorporating social- and knowledge-aware relational structures into the user preference representation for recommendation. To model relation heterogeneity, we design a metapath-guided heterogeneous graph neural network to aggregate feature embeddings from different types of meta-relations across users and items, em-powering SMIN to maintain dedicated representations for multi-faceted user- and item-wise dependencies. Additionally, to inject high-order collaborative signals, we generalize the mutual information learning paradigm under the self-supervised graph-based collaborative filtering. This endows the expressive modeling of user-item interactive patterns, by exploring global-level collaborative relations and underlying isomorphic transformation property of graph topology. Experimental results on several real-world datasets demonstrate the effectiveness of our SMIN model over various state-of-the-art recommendation methods. We release our source code at https://github.com/SocialRecsys/SMIN. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Comments: Published as a full paper in CIKM 2021

arXiv:2109.12519 [pdf, other]

doi 10.1145/3447548.3467169

AsySQN: Faster Vertical Federated Learning Algorithms with Better Computation Resource Utilization

Authors: Qingsong Zhang, Bin Gu, Cheng Deng, Songxiang Gu, Liefeng Bo, Jian Pei, Heng Huang

Abstract: Vertical federated learning (VFL) is an effective paradigm of training the emerging cross-organizational (e.g., different corporations, companies and organizations) collaborative learning with privacy preserving. Stochastic gradient descent (SGD) methods are the popular choices for training VFL models because of the low per-iteration computation. However, existing SGD-based VFL algorithms are comm… ▽ More Vertical federated learning (VFL) is an effective paradigm of training the emerging cross-organizational (e.g., different corporations, companies and organizations) collaborative learning with privacy preserving. Stochastic gradient descent (SGD) methods are the popular choices for training VFL models because of the low per-iteration computation. However, existing SGD-based VFL algorithms are communication-expensive due to a large number of communication rounds. Meanwhile, most existing VFL algorithms use synchronous computation which seriously hamper the computation resource utilization in real-world applications. To address the challenges of communication and computation resource utilization, we propose an asynchronous stochastic quasi-Newton (AsySQN) framework for VFL, under which three algorithms, i.e. AsySQN-SGD, -SVRG and -SAGA, are proposed. The proposed AsySQN-type algorithms making descent steps scaled by approximate (without calculating the inverse Hessian matrix explicitly) Hessian information convergence much faster than SGD-based methods in practice and thus can dramatically reduce the number of communication rounds. Moreover, the adopted asynchronous computation can make better use of the computation resource. We theoretically prove the convergence rates of our proposed algorithms for strongly convex problems. Extensive numerical experiments on real-word datasets demonstrate the lower communication costs and better computation resource utilization of our algorithms compared with state-of-the-art VFL algorithms. △ Less

Submitted 26 September, 2021; originally announced September 2021.

Comments: Accepted by KDD 2021, 33 pages, 4 figs

arXiv:2108.11048 [pdf, other]

Memory-Augmented Non-Local Attention for Video Super-Resolution

Authors: Jiyang Yu, **gen Liu, Liefeng Bo, Tao Mei

Abstract: In this paper, we propose a novel video super-resolution method that aims at generating high-fidelity high-resolution (HR) videos from low-resolution (LR) ones. Previous methods predominantly leverage temporal neighbor frames to assist the super-resolution of the current frame. Those methods achieve limited performance as they suffer from the challenge in spatial frame alignment and the lack of us… ▽ More In this paper, we propose a novel video super-resolution method that aims at generating high-fidelity high-resolution (HR) videos from low-resolution (LR) ones. Previous methods predominantly leverage temporal neighbor frames to assist the super-resolution of the current frame. Those methods achieve limited performance as they suffer from the challenge in spatial frame alignment and the lack of useful information from similar LR neighbor frames. In contrast, we devise a cross-frame non-local attention mechanism that allows video super-resolution without frame alignment, leading to be more robust to large motions in the video. In addition, to acquire the information beyond neighbor frames, we design a novel memory-augmented attention module to memorize general video details during the super-resolution training. Experimental results indicate that our method can achieve superior performance on large motion videos comparing to the state-of-the-art methods without aligning frames. Our source code will be released. △ Less

Submitted 25 August, 2021; originally announced August 2021.

arXiv:2108.05652 [pdf, other]

Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm

Authors: Lin Bo, Liang Pang, Gang Wang, Jun Xu, XiuQiang He, Ji-Rong Wen

Abstract: Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval, which first pre-train a general language model on an unlabeled large corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. User's view j… ▽ More Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval, which first pre-train a general language model on an unlabeled large corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. User's view judges the relevance based on the activities of "real users" while the system's view focuses on the relevance signals from the system side, e.g., from the experts or algorithms, etc. Inspired by the user-system relevance views and the success of pre-trained language models, in this paper we propose a novel ranking framework called Pre-Rank that takes both user's view and system's view into consideration, under the pre-training and fine-tuning paradigm. Specifically, to model the user's view of relevance, Pre-Rank pre-trains the initial query-document representations based on large-scale user activities data such as the click log. To model the system's view of relevance, Pre-Rank further fine-tunes the model on expert-labeled relevance data. More importantly, the pre-trained representations, are fine-tuned together with handcrafted learning-to-rank features under a wide and deep network architecture. In this way, Pre-Rank can model the relevance by incorporating the relevant knowledge and signals from both real search users and the IR experts. To verify the effectiveness of Pre-Rank, we showed two implementations by using BERT and SetRank as the underlying ranking model, respectively. Experimental results base on three publicly available benchmarks showed that in both of the implementations, Pre-Rank can respectively outperform the underlying ranking models and achieved state-of-the-art performances. △ Less

Submitted 12 August, 2021; originally announced August 2021.

arXiv:2108.00799 [pdf, other]

Mean Field Game of Optimal Relative Investment with Jump Risk

Authors: Lijun Bo, Shihua Wang, Xiang Yu

Abstract: This paper studies the n-player game and the mean field game under the CRRA relative performance on terminal wealth, in which the interaction occurs by peer competition. In the model with n agents, the price dynamics of underlying risky assets depend on a common noise and contagious jump risk modelled by a multi-dimensional nonlinear Hawkes process. With a continuum of agents, we formulate the MFG… ▽ More This paper studies the n-player game and the mean field game under the CRRA relative performance on terminal wealth, in which the interaction occurs by peer competition. In the model with n agents, the price dynamics of underlying risky assets depend on a common noise and contagious jump risk modelled by a multi-dimensional nonlinear Hawkes process. With a continuum of agents, we formulate the MFG problem and characterize a deterministic mean field equilibrium in an analytical form under some conditions, allowing us to investigate some impacts of model parameters in the limiting model and discuss some financial implications. Moreover, based on the mean field equilibrium, we construct an approximate Nash equilibrium for the n-player game when n is sufficiently large. The explicit order of the approximation error is also derived. △ Less

Submitted 9 February, 2023; v1 submitted 2 August, 2021; originally announced August 2021.

Comments: Final version, forthcoming in Science China Mathematics

arXiv:2107.04129 [pdf, other]

Fedlearn-Algo: A flexible open-source privacy-preserving machine learning platform

Authors: Bo Liu, Chaowei Tan, Jiazhou Wang, Tao Zeng, Huasong Shan, Houpu Yao, Heng Huang, Peng Dai, Liefeng Bo, Yanqing Chen

Abstract: In this paper, we present Fedlearn-Algo, an open-source privacy preserving machine learning platform. We use this platform to demonstrate our research and development results on privacy preserving machine learning algorithms. As the first batch of novel FL algorithm examples, we release vertical federated kernel binary classification model and vertical federated random forest model. They have been… ▽ More In this paper, we present Fedlearn-Algo, an open-source privacy preserving machine learning platform. We use this platform to demonstrate our research and development results on privacy preserving machine learning algorithms. As the first batch of novel FL algorithm examples, we release vertical federated kernel binary classification model and vertical federated random forest model. They have been tested to be more efficient than existing vertical federated learning models in our practice. Besides the novel FL algorithm examples, we also release a machine communication module. The uniform data transfer interface supports transferring widely used data formats between machines. We will maintain this platform by adding more functional modules and algorithm examples. The code is available at https://github.com/fedlearnAI/fedlearn-algo. △ Less

Submitted 30 July, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

arXiv:2106.09978 [pdf, ps, other]

Centralized systemic risk control in the interbank system: Weak formulation and Gamma-convergence

Authors: Lijun Bo, Tongqing Li, Xiang Yu

Abstract: This paper studies a systemic risk control problem by the central bank, which dynamically plans monetary supply to stabilize the interbank system with borrowing and lending activities. Facing both heterogeneity among banks and the common noise, the central bank aims to find an optimal strategy to minimize the average distance between log-monetary reserves of all banks and the benchmark of some tar… ▽ More This paper studies a systemic risk control problem by the central bank, which dynamically plans monetary supply to stabilize the interbank system with borrowing and lending activities. Facing both heterogeneity among banks and the common noise, the central bank aims to find an optimal strategy to minimize the average distance between log-monetary reserves of all banks and the benchmark of some target steady levels. A weak formulation is adopted, and an optimal randomized control can be obtained in the system with finite banks by applying Ekeland's variational principle. As the number of banks grows large, we prove the convergence of optimal strategies using the Gamma-convergence argument, which yields an optimal weak control in the mean field model. It is shown that this mean field optimal control is associated to the solution of a stochastic Fokker-Planck-Kolmogorov (FPK) equation, for which the uniqueness of the solution is established under some mild conditions. △ Less

Submitted 17 May, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

Comments: Final version, forthcoming in Stochastic Processes and their Applications

arXiv:2106.00264 [pdf, other]

Hardness Sampling for Self-Training Based Transductive Zero-Shot Learning

Authors: Liu Bo, Qiulei Dong, Zhanyi Hu

Abstract: Transductive zero-shot learning (T-ZSL) which could alleviate the domain shift problem in existing ZSL works, has received much attention recently. However, an open problem in T-ZSL: how to effectively make use of unseen-class samples for training, still remains. Addressing this problem, we first empirically analyze the roles of unseen-class samples with different degrees of hardness in the traini… ▽ More Transductive zero-shot learning (T-ZSL) which could alleviate the domain shift problem in existing ZSL works, has received much attention recently. However, an open problem in T-ZSL: how to effectively make use of unseen-class samples for training, still remains. Addressing this problem, we first empirically analyze the roles of unseen-class samples with different degrees of hardness in the training process based on the uneven prediction phenomenon found in many ZSL methods, resulting in three observations. Then, we propose two hardness sampling approaches for selecting a subset of diverse and hard samples from a given unseen-class dataset according to these observations. The first one identifies the samples based on the class-level frequency of the model predictions while the second enhances the former by normalizing the class frequency via an approximate class prior estimated by an explored prior estimation algorithm. Finally, we design a new Self-Training framework with Hardness Sampling for T-ZSL, called STHS, where an arbitrary inductive ZSL method could be seamlessly embedded and it is iteratively trained with unseen-class samples selected by the hardness sampling approach. We introduce two typical ZSL methods into the STHS framework and extensive experiments demonstrate that the derived T-ZSL methods outperform many state-of-the-art methods on three public benchmarks. Besides, we note that the unseen-class dataset is separately used for training in some existing transductive generalized ZSL (T-GZSL) methods, which is not strict for a GZSL task. Hence, we suggest a more strict T-GZSL data setting and establish a competitive baseline on this setting by introducing the proposed STHS framework to T-GZSL. △ Less

Submitted 1 June, 2021; originally announced June 2021.

Comments: 11 pages, 4 figures

arXiv:2105.02440 [pdf, other]

Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark

Authors: Longyin Wen, Dawei Du, Pengfei Zhu, Qinghua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu

Abstract: To promote the developments of object detection, tracking and counting algorithms in drone-captured videos, we construct a benchmark with a new drone-captured largescale dataset, named as DroneCrowd, formed by 112 video clips with 33,600 HD frames in various scenarios. Notably, we annotate 20,800 people trajectories with 4.8 million heads and several video-level attributes. Meanwhile, we design th… ▽ More To promote the developments of object detection, tracking and counting algorithms in drone-captured videos, we construct a benchmark with a new drone-captured largescale dataset, named as DroneCrowd, formed by 112 video clips with 33,600 HD frames in various scenarios. Notably, we annotate 20,800 people trajectories with 4.8 million heads and several video-level attributes. Meanwhile, we design the Space-Time Neighbor-Aware Network (STNNet) as a strong baseline to solve object detection, tracking and counting jointly in dense crowds. STNNet is formed by the feature extraction module, followed by the density map estimation heads, and localization and association subnets. To exploit the context information of neighboring objects, we design the neighboring context loss to guide the association subnet training, which enforces consistent relative position of nearby objects in temporal domain. Extensive experiments on our DroneCrowd dataset demonstrate that STNNet performs favorably against the state-of-the-arts. △ Less

Submitted 6 May, 2021; originally announced May 2021.

Comments: Accpted to CVPR 2021. Dataset and codes can be found in https://github.com/VisDrone/DroneCrowd. arXiv admin note: text overlap with arXiv:1912.01811

arXiv:2103.02852 [pdf, other]

Data Augmentation for Object Detection via Differentiable Neural Rendering

Authors: Guanghan Ning, Guang Chen, Chaowei Tan, Si Luo, Liefeng Bo, Heng Huang

Abstract: It is challenging to train a robust object detector under the supervised learning setting when the annotated data are scarce. Thus, previous approaches tackling this problem are in two categories: semi-supervised learning models that interpolate labeled data from unlabeled data, and self-supervised learning approaches that exploit signals within unlabeled data via pretext tasks. To seamlessly inte… ▽ More It is challenging to train a robust object detector under the supervised learning setting when the annotated data are scarce. Thus, previous approaches tackling this problem are in two categories: semi-supervised learning models that interpolate labeled data from unlabeled data, and self-supervised learning approaches that exploit signals within unlabeled data via pretext tasks. To seamlessly integrate and enhance existing supervised object detection methods, in this work, we focus on addressing the data scarcity problem from a fundamental viewpoint without changing the supervised learning paradigm. We propose a new offline data augmentation method for object detection, which semantically interpolates the training data with novel views. Specifically, our new system generates controllable views of training images based on differentiable neural rendering, together with corresponding bounding box annotations which involve no human intervention. Firstly, we extract and project pixel-aligned image features into point clouds while estimating depth maps. We then re-project them with a target camera pose and render a novel-view 2d image. Objects in the form of keypoints are marked in point clouds to recover annotations in new views. Our new method is fully compatible with online data augmentation methods, such as affine transform, image mixup, etc. Extensive experiments show that our method, as a cost-free tool to enrich images and labels, can significantly boost the performance of object detection systems with scarce training data. Code is available at \url{https://github.com/Guanghan/DANR}. △ Less

Submitted 5 April, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

Comments: 15 pages, 15 figures

arXiv:2101.00828 [pdf, other]

Transformer-based Conditional Variational Autoencoder for Controllable Story Generation

Authors: Le Fang, Tao Zeng, Chaochun Liu, Liefeng Bo, Wen Dong, Changyou Chen

Abstract: We investigate large-scale latent variable models (LVMs) for neural story generation -- an under-explored application for open-domain long text -- with objectives in two threads: generation effectiveness and controllability. LVMs, especially the variational autoencoder (VAE), have achieved both effective and controllable generation through exploiting flexible distributional latent representations.… ▽ More We investigate large-scale latent variable models (LVMs) for neural story generation -- an under-explored application for open-domain long text -- with objectives in two threads: generation effectiveness and controllability. LVMs, especially the variational autoencoder (VAE), have achieved both effective and controllable generation through exploiting flexible distributional latent representations. Recently, Transformers and its variants have achieved remarkable effectiveness without explicit latent representation learning, thus lack satisfying controllability in generation. In this paper, we advocate to revive latent variable modeling, essentially the power of representation learning, in the era of Transformers to enhance controllability without hurting state-of-the-art generation effectiveness. Specifically, we integrate latent representation vectors with a Transformer-based pre-trained architecture to build conditional variational autoencoder (CVAE). Model components such as encoder, decoder and the variational posterior are all built on top of pre-trained language models -- GPT2 specifically in this paper. Experiments demonstrate state-of-the-art conditional generation ability of our model, as well as its excellent representation learning capability and controllability. △ Less

Submitted 8 July, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

arXiv:2101.00822 [pdf, other]

Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events

Authors: Le Fang, Tao Zeng, Chaochun Liu, Liefeng Bo, Wen Dong, Changyou Chen

Abstract: Large-scale pretrained language models have shown thrilling generation capabilities, especially when they generate consistent long text in thousands of words with ease. However, users of these models can only control the prefix of sentences or certain global aspects of generated text. It is challenging to simultaneously achieve fine-grained controllability and preserve the state-of-the-art uncondi… ▽ More Large-scale pretrained language models have shown thrilling generation capabilities, especially when they generate consistent long text in thousands of words with ease. However, users of these models can only control the prefix of sentences or certain global aspects of generated text. It is challenging to simultaneously achieve fine-grained controllability and preserve the state-of-the-art unconditional text generation capability. In this paper, we first propose a new task named "Outline to Story" (O2S) as a test bed for fine-grained controllable generation of long text, which generates a multi-paragraph story from cascaded events, i.e. a sequence of outline events that guide subsequent paragraph generation. We then create dedicate datasets for future benchmarks, built by state-of-the-art keyword extraction techniques. Finally, we propose an extremely simple yet strong baseline method for the O2S task, which fine tunes pre-trained language models on augmented sequences of outline-story pairs with simple language modeling objective. Our method does not introduce any new parameters or perform any architecture modification, except several special tokens as delimiters to build augmented sequences. Extensive experiments on various datasets demonstrate state-of-the-art conditional story generation performance with our model, achieving better fine-grained controllability and user flexibility. Our paper is among the first ones by our knowledge to propose a model and to create datasets for the task of "outline to story". Our work also instantiates research interest of fine-grained controllable generation of open-domain long text, where controlling inputs are represented by short text. △ Less

Submitted 4 January, 2021; originally announced January 2021.

arXiv:2011.00204 [pdf, ps, other]

Nonexistence of NNSC-cobordism of Bartnik data

Authors: Leyang Bo, Yuguang Shi

Abstract: In this paper, we consider the problem of nonnegative scalar curvature (NNSC) cobordism of Bartnik data $(Σ_1^{n-1}, γ_1, H_1)$ and $(Σ_2^{n-1}, γ_2, H_2)$. We prove that given two metrics $γ_1$ and $γ_2$ on $S^{n-1}$ ($3\le n\le 7$) with $H_1$ fixed, then $(S^{n-1}, γ_1, H_1)$ and $(S^{n-1}, γ_2, H_2)$ admit no NNSC cobordism provided the prescribed mean curvature $H_2$ is large enough(Theorem \r… ▽ More In this paper, we consider the problem of nonnegative scalar curvature (NNSC) cobordism of Bartnik data $(Σ_1^{n-1}, γ_1, H_1)$ and $(Σ_2^{n-1}, γ_2, H_2)$. We prove that given two metrics $γ_1$ and $γ_2$ on $S^{n-1}$ ($3\le n\le 7$) with $H_1$ fixed, then $(S^{n-1}, γ_1, H_1)$ and $(S^{n-1}, γ_2, H_2)$ admit no NNSC cobordism provided the prescribed mean curvature $H_2$ is large enough(Theorem \ref{highdimnoncob0}). Moreover, we show that for $n=3$, a much weaker condition that the total mean curvature $\int_{S^2}H_2dμ_{γ_2}$ is large enough rules out NNSC cobordisms(Theorem \ref{2-d0}); if we require the Gaussian curvature of $γ_2$ to be positive, we get a criterion for non existence of trivial NNSC-cobordism by using Hawking mass and Brown-York mass(Theorem \ref{cobordism20}). For the general topology case, we prove that $(Σ_1^{n-1}, γ_1, 0)$ and $(Σ_2^{n-1}, γ_2, H_2)$ admit no NNSC cobordism provided the prescribed mean curvature $H_2$ is large enough(Theorem \ref{highdimnoncob10}). △ Less

Submitted 6 February, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

Comments: 17pages, All comments are welcome! The paper has been accepted for publication in SCIENCE CHINA Mathematics

MSC Class: 53C20; 83C99

arXiv:2007.07341 [pdf, other]

doi 10.3389/fmolb.2021.607443

Molecular mechanisms behind anti SARS-CoV-2 action of lactoferrin

Authors: Mattia Miotto, Lorenzo Di Rienzo, Leonardo Bò, Alberto Boffi, Giancarlo Ruocco, Edoardo Milanetti

Abstract: Despite the huge effort to contain the infection, the novel SARS-CoV-2 coronavirus has rapidly become pandemics, mainly due to its extremely high human-to-human transmission capability, and a surprisingly high viral charge of symptom-less people. While the seek of a vaccine is still ongoing, promising results have been obtained with antiviral compounds. In particular, lactoferrin is found to have… ▽ More Despite the huge effort to contain the infection, the novel SARS-CoV-2 coronavirus has rapidly become pandemics, mainly due to its extremely high human-to-human transmission capability, and a surprisingly high viral charge of symptom-less people. While the seek of a vaccine is still ongoing, promising results have been obtained with antiviral compounds. In particular, lactoferrin is found to have beneficial effects both in preventing and soothing the infection. Here, we explore the possible molecular mechanisms with which lactoferrin interferes with SARS-CoV-2 cell invasion, preventing attachment and/or entry of the virus. To this aim, we search for possible interactions lactoferrin may have with virus structural proteins and host receptors. Representing the molecular iso-electron surface of proteins in terms of 2D-Zernike descriptors, we (i) identified putative regions on the lactoferrin surface able to bind sialic acid receptors on the host cell membrane, sheltering the cell from the virus attachment; (ii) showed that no significant shape complementarity is present between lactoferrin and the ACE2 receptor, while (iii) two high complementarity regions are found on the N- and C-terminal domains of the SARS-CoV-2 spike protein, hinting at a possible competition between lactoferrin and ACE2 for the binding to the spike protein. △ Less

Submitted 14 July, 2020; originally announced July 2020.

Comments: 9 pages, 4 figures

Journal ref: Front Mol Biosci. 2021; 8: 607443

arXiv:2006.13661 [pdf, ps, other]

Optimal Tracking Portfolio with A Ratcheting Capital Benchmark

Authors: Lijun Bo, Huafu Liao, Xiang Yu

Abstract: This paper studies the finite horizon portfolio management by optimally tracking a ratcheting capital benchmark process. It is assumed that the fund manager can dynamically inject capital into the portfolio account such that the total capital dominates a non-decreasing benchmark floor process at each intermediate time. The tracking problem is formulated to minimize the cost of accumulated capital… ▽ More This paper studies the finite horizon portfolio management by optimally tracking a ratcheting capital benchmark process. It is assumed that the fund manager can dynamically inject capital into the portfolio account such that the total capital dominates a non-decreasing benchmark floor process at each intermediate time. The tracking problem is formulated to minimize the cost of accumulated capital injection. We first transform the original problem with floor constraints into an unconstrained control problem, however, under a running maximum cost. By identifying a controlled state process with reflection, the problem is further shown to be equivalent to an auxiliary problem, which leads to a nonlinear Hamilton-Jacobi-Bellman (HJB) equation with a Neumann boundary condition. By employing the dual transform, the probabilistic representation and some stochastic flow analysis, the existence of the unique classical solution to the HJB equation is established. The verification theorem is carefully proved, which gives the complete characterization of the feedback optimal portfolio. The application to market index tracking is also discussed when the index process is modeled by a geometric Brownian motion. △ Less

Submitted 30 April, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

Comments: Final version, forthcoming in SIAM Journal on Control and Optimization

arXiv:2005.13131 [pdf, other]

Efficient Pig Counting in Crowds with Keypoints Tracking and Spatial-aware Temporal Response Filtering

Authors: Guang Chen, Shiwen Shen, Longyin Wen, Si Luo, Liefeng Bo

Abstract: Pig counting is a crucial task for large-scale pig farming, which is usually completed by human visually. But this process is very time-consuming and error-prone. Few studies in literature developed automated pig counting method. Existing methods only focused on pig counting using single image, and its accuracy is challenged by several factors, including pig movements, occlusion and overlap**. E… ▽ More Pig counting is a crucial task for large-scale pig farming, which is usually completed by human visually. But this process is very time-consuming and error-prone. Few studies in literature developed automated pig counting method. Existing methods only focused on pig counting using single image, and its accuracy is challenged by several factors, including pig movements, occlusion and overlap**. Especially, the field of view of a single image is very limited, and could not meet the requirements of pig counting for large pig grou** houses. To that end, we presented a real-time automated pig counting system in crowds using only one monocular fisheye camera with an inspection robot. Our system showed that it produces accurate results surpassing human. Our pipeline began with a novel bottom-up pig detection algorithm to avoid false negatives due to overlap**, occlusion and deformation of pigs. A deep convolution neural network (CNN) is designed to detect keypoints of pig body part and associate the keypoints to identify individual pigs. After that, an efficient on-line tracking method is used to associate pigs across video frames. Finally, a novel spatial-aware temporal response filtering (STRF) method is proposed to predict the counts of pigs, which is effective to suppress false positives caused by pig or camera movements or tracking failures. The whole pipeline has been deployed in an edge computing device, and demonstrated the effectiveness. △ Less

Submitted 26 May, 2020; originally announced May 2020.

Comments: IEEE International Conference on Robotics and Automation (ICRA) 2020

arXiv:2004.01112 [pdf, other]

An Approximate Quasi-Likelihood Approach for Error-Prone Failure Time Outcomes and Exposures

Authors: Lillian A. Boe, Lesley F. Tinker, Pamela A. Shaw

Abstract: Measurement error arises commonly in clinical research settings that rely on data from electronic health records or large observational cohorts. In particular, self-reported outcomes are typical in cohort studies for chronic diseases such as diabetes in order to avoid the burden of expensive diagnostic tests. Dietary intake, which is also commonly collected by self-report and subject to measuremen… ▽ More Measurement error arises commonly in clinical research settings that rely on data from electronic health records or large observational cohorts. In particular, self-reported outcomes are typical in cohort studies for chronic diseases such as diabetes in order to avoid the burden of expensive diagnostic tests. Dietary intake, which is also commonly collected by self-report and subject to measurement error, is a major factor linked to diabetes and other chronic diseases. These errors can bias exposure-disease associations that ultimately can mislead clinical decision-making. We have extended an existing semiparametric likelihood-based method for handling error-prone, discrete failure time outcomes to also address covariate error. We conduct an extensive numerical study to compare the proposed method to the naive approach that ignores measurement error in terms of bias and efficiency in the estimation of the regression parameter of interest. In all settings considered, the proposed method showed minimal bias and maintained coverage probability, thus outperforming the naive analysis which showed extreme bias and low coverage. This method is applied to data from the Women's Health Initiative to assess the association between energy and protein intake and the risk of incident diabetes mellitus. Our results show that correcting for errors in both the self-reported outcome and dietary exposures leads to considerably different hazard ratio estimates than those from analyses that ignore measurement error, which demonstrates the importance of correcting for both outcome and covariate error. Computational details and R code for implementing the proposed method are presented in Section S1 of the Supplementary Materials. △ Less

Submitted 4 February, 2021; v1 submitted 2 April, 2020; originally announced April 2020.

Comments: 61 pages, 1 figure, 14 tables in total. Main manuscript: first 38 pages including references and 6 tables, followed by supplementary materials with remaining 23 pages including 1 figure and 8 tables

arXiv:2003.13230 [pdf, ps, other]

AliCoCo: Alibaba E-commerce Cognitive Concept Net

Authors: Xusheng Luo, Luxin Liu, Yonghua Yang, Le Bo, Yuanpeng Cao, **hang Wu, Qiang Li, Ke** Yang, Kenny Q. Zhu

Abstract: One of the ultimate goals of e-commerce platforms is to satisfy various shop** needs for their customers. Much efforts are devoted to creating taxonomies or ontologies in e-commerce towards this goal. However, user needs in e-commerce are still not well defined, and none of the existing ontologies has the enough depth and breadth for universal user needs understanding. The semantic gap in-betwee… ▽ More One of the ultimate goals of e-commerce platforms is to satisfy various shop** needs for their customers. Much efforts are devoted to creating taxonomies or ontologies in e-commerce towards this goal. However, user needs in e-commerce are still not well defined, and none of the existing ontologies has the enough depth and breadth for universal user needs understanding. The semantic gap in-between prevents shop** experience from being more intelligent. In this paper, we propose to construct a large-scale e-commerce cognitive concept net named "AliCoCo", which is practiced in Alibaba, the largest Chinese e-commerce platform in the world. We formally define user needs in e-commerce, then conceptualize them as nodes in the net. We present details on how AliCoCo is constructed semi-automatically and its successful, ongoing and potential applications in e-commerce. △ Less

Submitted 30 March, 2020; originally announced March 2020.

Comments: 15 pages. Accepted by SIGMOD 2020 Industry Track

arXiv:2003.05143 [pdf, ps, other]

Probabilistic Analysis of Replicator-Mutator Equations

Authors: Lijun Bo, Huafu Liao

Abstract: This paper introduces a general class of Replicator-Mutator equations on a multi-dimensional fitness space. We establish a novel probabilistic representation of weak solutions of the equation by using the theory of Fockker-Planck-Kolmogorov (FPK) equations and a martingale extraction approach. The examples with closed-form probabilistic solutions for different fitness functions considered in the e… ▽ More This paper introduces a general class of Replicator-Mutator equations on a multi-dimensional fitness space. We establish a novel probabilistic representation of weak solutions of the equation by using the theory of Fockker-Planck-Kolmogorov (FPK) equations and a martingale extraction approach. The examples with closed-form probabilistic solutions for different fitness functions considered in the existing literature are provided. We also construct a particle system and prove a general convergence result to any solution to the FPK equation associated with the extended Replicator-Mutator equation with respect to a Wasserstein-like distance adapted to our probabilistic framework. △ Less

Submitted 12 March, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

Comments: 25 pages, 0 figure

MSC Class: 92B05; 35K15; 60H10; 60G46

arXiv:1912.11113 [pdf, other]

EnsemFDet: An Ensemble Approach to Fraud Detection based on Bipartite Graph

Authors: Yuxiang Ren, Hao Zhu, Jiawei Zhang, Peng Dai, Liefeng Bo

Abstract: Fraud detection is extremely critical for e-commerce business. It is the intent of the companies to detect and prevent fraud as early as possible. Existing fraud detection methods try to identify unexpected dense subgraphs and treat related nodes as suspicious. Spectral relaxation-based methods solve the problem efficiently but hurt the performance due to the relaxed constraints. Besides, many met… ▽ More Fraud detection is extremely critical for e-commerce business. It is the intent of the companies to detect and prevent fraud as early as possible. Existing fraud detection methods try to identify unexpected dense subgraphs and treat related nodes as suspicious. Spectral relaxation-based methods solve the problem efficiently but hurt the performance due to the relaxed constraints. Besides, many methods cannot be accelerated with parallel computation or control the number of returned suspicious nodes because they provide a set of subgraphs with diverse node sizes. These drawbacks affect the real-world applications of existing methods. In this paper, we propose an Ensemble-based Fraud Detection (EnsemFDet) method to scale up fraud detection in bipartite graphs by decomposing the original problem into subproblems on small-sized subgraphs. By oversampling the graph and solving the subproblems, the ensemble approach further votes suspicious nodes without sacrificing the prediction accuracy. Extensive experiments have been done on real transaction data from JD.com, which is one of the world's largest e-commerce platforms. Experimental results demonstrate the effectiveness, practicability, and scalability of EnsemFDet. More specifically, EnsemFDet is up to 100x faster than the state-of-the-art methods due to its parallelism with all aspects of data. △ Less

Submitted 5 November, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

Comments: Accepted by ICDE 2021

arXiv:1912.01811 [pdf, other]

Drone-based Joint Density Map Estimation, Localization and Tracking with Space-Time Multi-Scale Attention Network

Authors: Longyin Wen, Dawei Du, Pengfei Zhu, Qinghua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu

Abstract: This paper proposes a space-time multi-scale attention network (STANet) to solve density map estimation, localization and tracking in dense crowds of video clips captured by drones with arbitrary crowd density, perspective, and flight altitude. Our STANet method aggregates multi-scale feature maps in sequential frames to exploit the temporal coherency, and then predict the density maps, localize t… ▽ More This paper proposes a space-time multi-scale attention network (STANet) to solve density map estimation, localization and tracking in dense crowds of video clips captured by drones with arbitrary crowd density, perspective, and flight altitude. Our STANet method aggregates multi-scale feature maps in sequential frames to exploit the temporal coherency, and then predict the density maps, localize the targets, and associate them in crowds simultaneously. A coarse-to-fine process is designed to gradually apply the attention module on the aggregated multi-scale feature maps to enforce the network to exploit the discriminative space-time features for better performance. The whole network is trained in an end-to-end manner with the multi-task loss, formed by three terms, i.e., the density map loss, localization loss and association loss. The non-maximal suppression followed by the min-cost flow framework is used to generate the trajectories of targets' in scenarios. Since existing crowd counting datasets merely focus on crowd counting in static cameras rather than density map estimation, counting and tracking in crowds on drones, we have collected a new large-scale drone-based dataset, DroneCrowd, formed by 112 video clips with 33,600 high resolution frames (i.e., 1920x1080) captured in 70 different scenarios. With intensive amount of effort, our dataset provides 20,800 people trajectories with 4.8 million head annotations and several video-level attributes in sequences. Extensive experiments are conducted on two challenging public datasets, i.e., Shanghaitech and UCF-QNRF, and our DroneCrowd, to demonstrate that STANet achieves favorable performance against the state-of-the-arts. The datasets and codes can be found at https://github.com/VisDrone. △ Less

Submitted 4 December, 2019; originally announced December 2019.

arXiv:1911.08538 [pdf, other]

Heterogeneous Deep Graph Infomax

Authors: Yuxiang Ren, Bo Liu, Chao Huang, Peng Dai, Liefeng Bo, Jiawei Zhang

Abstract: Graph representation learning is to learn universal node representations that preserve both node attributes and structural information. The derived node representations can be used to serve various downstream tasks, such as node classification and node clustering. When a graph is heterogeneous, the problem becomes more challenging than the homogeneous graph node learning problem. Inspired by the e… ▽ More Graph representation learning is to learn universal node representations that preserve both node attributes and structural information. The derived node representations can be used to serve various downstream tasks, such as node classification and node clustering. When a graph is heterogeneous, the problem becomes more challenging than the homogeneous graph node learning problem. Inspired by the emerging information theoretic-based learning algorithm, in this paper we propose an unsupervised graph neural network Heterogeneous Deep Graph Infomax (HDGI) for heterogeneous graph representation learning. We use the meta-path structure to analyze the connections involving semantics in heterogeneous graphs and utilize graph convolution module and semantic-level attention mechanism to capture local representations. By maximizing local-global mutual information, HDGI effectively learns high-level node representations that can be utilized in downstream graph-related tasks. Experiment results show that HDGI remarkably outperforms state-of-the-art unsupervised graph representation learning methods on both classification and clustering tasks. By feeding the learned representations into a parametric model, such as logistic regression, we even achieve comparable performance in node classification tasks when comparing with state-of-the-art supervised end-to-end GNN models. △ Less

Submitted 13 November, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

arXiv:1906.08894 [pdf, ps, other]

Large Sample Mean-Field Stochastic Optimization

Authors: Lijun Bo, Agostino Capponi, Huafu Liao

Abstract: We study a class of sampled stochastic optimization problems, where the underlying state process has diffusive dynamics of the mean-field type. We establish the existence of optimal relaxed controls when the sample set has finite size. The core of our paper is to prove, via $Γ$-convergence, that the minimizer of the finite sample relaxed problem converges to that of the limiting optimization probl… ▽ More We study a class of sampled stochastic optimization problems, where the underlying state process has diffusive dynamics of the mean-field type. We establish the existence of optimal relaxed controls when the sample set has finite size. The core of our paper is to prove, via $Γ$-convergence, that the minimizer of the finite sample relaxed problem converges to that of the limiting optimization problem. We connect the limit of the sampled objective functional to the unique solution, in the trajectory sense, of a nonlinear Fokker-Planck-Kolmogorov (FPK) equation in a random environment. We highlight the connection between the minimizers of our optimization problems and the optimal training weights of a deep residual neural network. △ Less

Submitted 5 June, 2022; v1 submitted 20 June, 2019; originally announced June 2019.

Comments: 30 pages. To appear in SIAM Journal on Control and Optimization

MSC Class: 3E20; 60J20; 93E35; 93E20; 60F05

arXiv:1905.08004 [pdf, ps, other]

Risk-Sensitive Credit Portfolio Optimization under Partial Information and Contagion Risk

Authors: Lijun Bo, Huafu Liao, Xiang Yu

Abstract: This paper investigates the finite horizon risk-sensitive portfolio optimization in a regime-switching credit market with physical and information-induced default contagion. It is assumed that the underlying regime-switching process has countable states and is unobservable. The stochastic control problem is formulated under partial observations of asset prices and sequential default events. By est… ▽ More This paper investigates the finite horizon risk-sensitive portfolio optimization in a regime-switching credit market with physical and information-induced default contagion. It is assumed that the underlying regime-switching process has countable states and is unobservable. The stochastic control problem is formulated under partial observations of asset prices and sequential default events. By establishing a martingale representation theorem based on incomplete and phasing out filtration, we connect the control problem to a quadratic BSDE with jumps, in which the driver term is non-standard and carries the conditional filter as an infinite-dimensional parameter. By proposing some truncation techniques and proving a uniform a priori estimates, we obtain the existence of a solution to the BSDE using the convergence of solutions associated to some truncated BSDEs. The verification theorem can be concluded with the aid of our BSDE results, which in turn yields the uniqueness of the solution to the BSDE. △ Less

Submitted 27 July, 2021; v1 submitted 20 May, 2019; originally announced May 2019.

Comments: Final version, forthcoming in the Annals of Applied Probability

arXiv:1904.07399 [pdf, other]

Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression

Authors: Xinyao Wang, Liefeng Bo, Li Fuxin

Abstract: Heatmap regression with a deep network has become one of the mainstream approaches to localize facial landmarks. However, the loss function for heatmap regression is rarely studied. In this paper, we analyze the ideal loss function properties for heatmap regression in face alignment problems. Then we propose a novel loss function, named Adaptive Wing loss, that is able to adapt its shape to differ… ▽ More Heatmap regression with a deep network has become one of the mainstream approaches to localize facial landmarks. However, the loss function for heatmap regression is rarely studied. In this paper, we analyze the ideal loss function properties for heatmap regression in face alignment problems. Then we propose a novel loss function, named Adaptive Wing loss, that is able to adapt its shape to different types of ground truth heatmap pixels. This adaptability penalizes loss more on foreground pixels while less on background pixels. To address the imbalance between foreground and background pixels, we also propose Weighted Loss Map, which assigns high weights on foreground and difficult background pixels to help training process focus more on pixels that are crucial to landmark localization. To further improve face alignment accuracy, we introduce boundary prediction and CoordConv with boundary coordinates. Extensive experiments on different benchmarks, including COFW, 300W and WFLW, show our approach outperforms the state-of-the-art by a significant margin on various evaluation metrics. Besides, the Adaptive Wing loss also helps other heatmap regression tasks. Code will be made publicly available at https://github.com/protossw512/AdaptiveWingLoss. △ Less

Submitted 19 May, 2020; v1 submitted 15 April, 2019; originally announced April 2019.

Comments: [v2] Camera-ready version for ICCV 2019. [v3] Corrected AUC(fr10%) on table 2

arXiv:1904.02363 [pdf, other]

Spatiotemporal CNN for Video Object Segmentation

Authors: Kai Xu, Longyin Wen, Guorong Li, Liefeng Bo, Qingming Huang

Abstract: In this paper, we present a unified, end-to-end trainable spatiotemporal CNN model for VOS, which consists of two branches, i.e., the temporal coherence branch and the spatial segmentation branch. Specifically, the temporal coherence branch pretrained in an adversarial fashion from unlabeled video data, is designed to capture the dynamic appearance and motion cues of video sequences to guide objec… ▽ More In this paper, we present a unified, end-to-end trainable spatiotemporal CNN model for VOS, which consists of two branches, i.e., the temporal coherence branch and the spatial segmentation branch. Specifically, the temporal coherence branch pretrained in an adversarial fashion from unlabeled video data, is designed to capture the dynamic appearance and motion cues of video sequences to guide object segmentation. The spatial segmentation branch focuses on segmenting objects accurately based on the learned appearance and motion cues. To obtain accurate segmentation results, we design a coarse-to-fine process to sequentially apply a designed attention module on multi-scale feature maps, and concatenate them to produce the final prediction. In this way, the spatial segmentation branch is enforced to gradually concentrate on object regions. These two branches are jointly fine-tuned on video segmentation sequences in an end-to-end manner. Several experiments are carried out on three challenging datasets (i.e., DAVIS-2016, DAVIS-2017 and Youtube-Object) to show that our method achieves favorable performance against the state-of-the-arts. Code is available at https://github.com/longyin880815/STCNN. △ Less

Submitted 4 April, 2019; originally announced April 2019.

Comments: 10 pages, 3 figures, 6 tables, CVPR 2019

arXiv:1902.10398 [pdf]

High-performance dendritic metamaterial absorber for broadband and near-meter wave radar

Authors: Song Jiaoyan, Zhao **g, Li Yimin, Li Bo, Zhao Xiaopeng

Abstract: Absorbing materials in ultra-high frequency (UHF) band has constantly been a major challenge. The size of the absorber in UHF band is large, whereas the resonant frequency band is narrow. According to Rozanov's theory, two kinds of composite metamaterial absorbers are designed to realize the requirements of low-frequency broadband metamaterial microwave absorber: the magnetic-metamaterial composit… ▽ More Absorbing materials in ultra-high frequency (UHF) band has constantly been a major challenge. The size of the absorber in UHF band is large, whereas the resonant frequency band is narrow. According to Rozanov's theory, two kinds of composite metamaterial absorbers are designed to realize the requirements of low-frequency broadband metamaterial microwave absorber: the magnetic-metamaterial composite absorber1 (MA1) and the dielectric-metamaterial composite absorber 2 (MA2). In the range of approximately 300-1000MHz, both absorbers achieve absorption of over 90% and feature good adaptability to the incident angle of the incident wave. The absorbers also present good absorption rate of over 80% in the range of 0-45 degree. Processing samples of indium tin oxide (ITO) resistance film and polymethacrylimide (PMI) foam board feature simple preparation and low cost, and the most important thing is to consider the weight problem, which features certain advantages in terms of use. △ Less

Submitted 27 February, 2019; originally announced February 2019.

arXiv:1811.11899 [pdf, ps, other]

Power Forward Performance in Semimartingale Markets with Stochastic Integrated Factors

Authors: Lijun Bo, Agostino Capponi, Chao Zhou

Abstract: We study the forward investment performance process (FIPP) in an incomplete semimartingale market model with closed and convex portfolio constraints, when the investor's risk preferences are of the power form. We provide necessary and sufficient conditions for the construction of such a performance process, and show that it can be recovered as the unique solution of an infinite horizon quadratic b… ▽ More We study the forward investment performance process (FIPP) in an incomplete semimartingale market model with closed and convex portfolio constraints, when the investor's risk preferences are of the power form. We provide necessary and sufficient conditions for the construction of such a performance process, and show that it can be recovered as the unique solution of an infinite horizon quadratic backward stochastic differential equation (BSDE) with a nonmonotone driver. In an integrated stochastic factor model, we relate the factor representation of the BSDE solution to the smooth solution of an ill-posed partial integro-differential Hamilton-Jacobi-Bellman (HJB) equation. We provide an explicit construction of the BSDE solution for the class of time-monotone FIPPs, generalizing existing results from Brownian to semimartingale market models. △ Less

Submitted 25 January, 2022; v1 submitted 28 November, 2018; originally announced November 2018.

Comments: 40 pages, 0 figures. To appear in Mathematics of Operations Research; Previously this version appeared as arXiv:2201:09406 which was submitted as a new work by accident

MSC Class: 3E20; 60J20

arXiv:1811.01646 [pdf, ps, other]

On existence of the prescribing $k$-curvature of the Einstein tensor

Authors: Leyang Bo, Weimin Sheng

Abstract: In this paper, we study the problem of conformally deforming a metric on a $3$-dimensional manifold $M^3$ such that its $k$-curvature equals to a prescribed function, where the $k$-curvature is defined by the $k$-th elementary symmetric function of the eigenvalues of the Einstein tensor, $1\le k\le 3$. We prove the solvability of the problem and the compactness of the solution sets on manifolds wh… ▽ More In this paper, we study the problem of conformally deforming a metric on a $3$-dimensional manifold $M^3$ such that its $k$-curvature equals to a prescribed function, where the $k$-curvature is defined by the $k$-th elementary symmetric function of the eigenvalues of the Einstein tensor, $1\le k\le 3$. We prove the solvability of the problem and the compactness of the solution sets on manifolds when $k=2$ and $3$, provided the conformal class admits a negative $k$-admissible metric with respect to the Einstein tensor. △ Less

Submitted 5 November, 2018; originally announced November 2018.

Comments: 14 pages. All comments are welcome

MSC Class: 53C21; 35J60

arXiv:1810.08425 [pdf, other]

ScratchDet: Training Single-Shot Object Detectors from Scratch

Authors: Rui Zhu, Shifeng Zhang, Xiaobo Wang, Longyin Wen, Hailin Shi, Liefeng Bo, Tao Mei

Abstract: Current state-of-the-art object objectors are fine-tuned from the off-the-shelf networks pretrained on large-scale classification dataset ImageNet, which incurs some additional problems: 1) The classification and detection have different degrees of sensitivity to translation, resulting in the learning objective bias; 2) The architecture is limited by the classification network, leading to the inco… ▽ More Current state-of-the-art object objectors are fine-tuned from the off-the-shelf networks pretrained on large-scale classification dataset ImageNet, which incurs some additional problems: 1) The classification and detection have different degrees of sensitivity to translation, resulting in the learning objective bias; 2) The architecture is limited by the classification network, leading to the inconvenience of modification. To cope with these problems, training detectors from scratch is a feasible solution. However, the detectors trained from scratch generally perform worse than the pretrained ones, even suffer from the convergence issue in training. In this paper, we explore to train object detectors from scratch robustly. By analysing the previous work on optimization landscape, we find that one of the overlooked points in current trained-from-scratch detector is the BatchNorm. Resorting to the stable and predictable gradient brought by BatchNorm, detectors can be trained from scratch stably while kee** the favourable performance independent to the network architecture. Taking this advantage, we are able to explore various types of networks for object detection, without suffering from the poor convergence. By extensive experiments and analyses on downsampling factor, we propose the Root-ResNet backbone network, which makes full use of the information from original images. Our ScratchDet achieves the state-of-the-art accuracy on PASCAL VOC 2007, 2012 and MS COCO among all the train-from-scratch detectors and even performs better than several one-stage pretrained methods. Codes will be made publicly available at https://github.com/KimSoybean/ScratchDet. △ Less

Submitted 5 May, 2019; v1 submitted 19 October, 2018; originally announced October 2018.

Comments: CVPR2019 Oral Presentation. Camera Ready Version

arXiv:1807.05513 [pdf, other]

Optimal Credit Investment and Risk Control for an Insurer with Regime-Switching

Authors: Lijun Bo, Huafu Liao, Yong** Wang

Abstract: This paper studies an optimal investment and risk control problem for an insurer with default contagion and regime-switching. The insurer in our model allocates his/her wealth across multi-name defaultable stocks and a riskless bond under regime-switching risk. Default events have an impact on the distress state of the surviving stocks in the portfolio. The aim of the insurer is to maximize the ex… ▽ More This paper studies an optimal investment and risk control problem for an insurer with default contagion and regime-switching. The insurer in our model allocates his/her wealth across multi-name defaultable stocks and a riskless bond under regime-switching risk. Default events have an impact on the distress state of the surviving stocks in the portfolio. The aim of the insurer is to maximize the expected utility of the terminal wealth by selecting optimal investment and risk control strategies. We characterize the optimal trading strategy of defaultable stocks and risk control for the insurer. By develo** a truncation technique, we analyze the existence and uniqueness of global (classical) solutions to the recursive HJB system. We prove the verification theorem based on the (classical) solutions of the recursive HJB system. △ Less

Submitted 15 July, 2018; originally announced July 2018.

Comments: 30 pages, 16 figures

MSC Class: 3E20; 60J20

arXiv:1806.07175 [pdf, other]

Portfolio Choice with Market-Credit Risk Dependencies

Authors: Lijun Bo, Agostino Capponi

Abstract: We study an optimal investment/consumption problem in a model capturing market and credit risk dependencies. Stochastic factors drive both the default intensity and the volatility of the stocks in the portfolio. We use the martingale approach and analyze the recursive system of nonlinear Hamilton-Jacobi-Bellman equations associated with the dual problem. We transform such a system into an equivale… ▽ More We study an optimal investment/consumption problem in a model capturing market and credit risk dependencies. Stochastic factors drive both the default intensity and the volatility of the stocks in the portfolio. We use the martingale approach and analyze the recursive system of nonlinear Hamilton-Jacobi-Bellman equations associated with the dual problem. We transform such a system into an equivalent system of semi-linear PDEs, for which we establish existence and uniqueness of a bounded global classical solution. We obtain explicit representations for the optimal strategy, consumption path and wealth process, in terms of the solution to the recursive system of semi-linear PDEs. We numerically analyze the sensitivity of the optimal investment strategies to risk aversion, default risk and volatility. △ Less

Submitted 19 June, 2018; originally announced June 2018.

Comments: 38 pages, 12 figures, Forthcoming in SIAM Journal on Control and Optimization

MSC Class: 91G10; 91G40; 60J20

arXiv:1712.05676 [pdf, ps, other]

Risk Sensitive Portfolio Optimization with Default Contagion and Regime-Switching

Authors: Lijun Bo, Huafu Liao, Xiang Yu

Abstract: We study an open problem of risk-sensitive portfolio allocation in a regime-switching credit market with default contagion. The state space of the Markovian regime-switching process is assumed to be a countably infinite set. To characterize the value function, we investigate the corresponding recursive infinite-dimensional nonlinear dynamical programming equations (DPEs) based on default states. W… ▽ More We study an open problem of risk-sensitive portfolio allocation in a regime-switching credit market with default contagion. The state space of the Markovian regime-switching process is assumed to be a countably infinite set. To characterize the value function, we investigate the corresponding recursive infinite-dimensional nonlinear dynamical programming equations (DPEs) based on default states. We propose to work in the following procedure: Applying the theory of monotone dynamical system, we first establish the existence and uniqueness of classical solutions to the recursive DPEs by a truncation argument in the finite state space. The associated optimal feedback strategy is characterized by develo** a rigorous verification theorem. Building upon results in the first stage, we construct a sequence of approximating risk sensitive control problems with finite states and prove that the resulting smooth value functions will converge to the classical solution of the original system of DPEs. The construction and approximation of the optimal feedback strategy for the original problem are also thoroughly discussed. △ Less

Submitted 24 October, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

Comments: Final version. SIAM Journal on Control and Optimization, forthcoming; Keywords: Default contagion; regime switching; countably infinite states; risk sensitive control; recursive dynamical programming equations; verification theorems

arXiv:1709.01115 [pdf, ps, other]

Risk-Minimizing Hedging of Counterparty Risk

Authors: Lijun Bo, Agostino Capponi, Claudia Ceci

Abstract: We study dynamic hedging of counterparty risk for a portfolio of credit derivatives. Our empirically driven credit model consists of interacting default intensities which ramp up and then decay after the occurrence of credit events. Using the Galtchouk-Kunita-Watanabe decomposition of the counterparty risk price payment stream, we recover a closed-form representation for the risk minimizing strate… ▽ More We study dynamic hedging of counterparty risk for a portfolio of credit derivatives. Our empirically driven credit model consists of interacting default intensities which ramp up and then decay after the occurrence of credit events. Using the Galtchouk-Kunita-Watanabe decomposition of the counterparty risk price payment stream, we recover a closed-form representation for the risk minimizing strategy in terms of classical solutions to nonlinear recursive systems of Cauchy problems. We discuss applications of our framework to the most prominent class of credit derivatives, including credit swap and risky bond portfolios, as well as first-to-default claims. △ Less

Submitted 4 September, 2017; originally announced September 2017.

Comments: 32 pages

MSC Class: 60J25; 60J75; 60H30; 91B28

Showing 51–100 of 124 results for author: Bo, L