Skip to main content

Showing 151–200 of 240 results for author: Pu, S

.
  1. arXiv:2005.13118  [pdf, other

    cs.CV

    TRIE: End-to-End Text Reading and Information Extraction for Document Understanding

    Authors: Peng Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, **g Lu, Liang Qiao, Yi Niu, Fei Wu

    Abstract: Since real-world ubiquitous documents (e.g., invoices, tickets, resumes and leaflets) contain rich information, automatic document image understanding has become a hot topic. Most existing works decouple the problem into two separate tasks, (1) text reading for detecting and recognizing texts in images and (2) information extraction for analyzing and extracting key elements from previously extract… ▽ More

    Submitted 25 October, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted to ACM MM2020. Code is available at https://davar-lab.github.io/publication.html or https://github.com/hikopensource/DAVAR-Lab-OCR

  2. arXiv:2005.13117  [pdf, other

    cs.CV

    SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

    Authors: Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Fei Wu, Futai Zou

    Abstract: Arbitrary text appearance poses a great challenge in scene text recognition tasks. Existing works mostly handle with the problem in consideration of the shape distortion, including perspective distortions, line curvature or other style variations. Therefore, methods based on spatial transformers are extensively studied. However, chromatic difficulties in complex scenes have not been paid much atte… ▽ More

    Submitted 25 October, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted to AAAI21. Code is available at https://davar-lab.github.io/publication.html or https://github.com/hikopensource/DAVAR-Lab-OCR

  3. arXiv:2005.13116  [pdf, other

    cs.CV

    Object-QA: Towards High Reliable Object Quality Assessment

    Authors: **g Lu, Baorui Zou, Zhanzhan Cheng, Shiliang Pu, Shuigeng Zhou, Yi Niu, Fei Wu

    Abstract: In object recognition applications, object images usually appear with different quality levels. Practically, it is very important to indicate object image qualities for better application performance, e.g. filtering out low-quality object image frames to maintain robust video object recognition results and speed up inference. However, no previous works are explicitly proposed for addressing the pr… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

  4. arXiv:2005.10598  [pdf, other

    q-bio.NC math.NA

    Fast and Accurate Langevin Simulations of Stochastic Hodgkin-Huxley Dynamics

    Authors: Shusen Pu, Peter J. Thomas

    Abstract: Fox and Lu introduced a Langevin framework for discrete-time stochastic models of randomly gated ion channels such as the Hodgkin-Huxley (HH) system. They derived a Fokker-Planck equation with state-dependent diffusion tensor $D$ and suggested a Langevin formulation with noise coefficient matrix $S$ such that $SS^\intercal=D$. Subsequently, several authors introduced a variety of Langevin equation… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: 55 pages, 9 figures

    MSC Class: 65C30 (Primary) 92C20; 37N25; 92B25 (Secondary)

  5. arXiv:2005.10432  [pdf, other

    hep-ph nucl-ex nucl-th

    Recent developments in chiral and spin polarization effects in heavy-ion collisions

    Authors: Jian-Hua Gao, Guo-Liang Ma, Shi Pu, Qun Wang

    Abstract: We give a brief overview of recent theoretical and experimental results on the chiral magnetic effect and spin polarization effect in heavy-ion collisions. We present updated experimental results for the chiral magnetic effect and related phenomena. The time evolution of the magnetic fields in different models is discussed. The newly developed quantum kinetic theory for massive fermions is reviewe… ▽ More

    Submitted 3 August, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: 22 pages, 9 figures. A review article for Nucl. Sci. Tech; more references are added

  6. arXiv:2004.14774  [pdf, other

    cs.CV cs.LG cs.RO eess.IV stat.ML

    IROS 2019 Lifelong Robotic Vision Challenge -- Lifelong Object Recognition Report

    Authors: Qi She, Fan Feng, Qi Liu, Rosa H. M. Chan, Xinyue Hao, Chuanlin Lan, Qihan Yang, Vincenzo Lomonaco, German I. Parisi, Heechul Bae, Eoin Brophy, Baoquan Chen, Gabriele Graffieti, Vidit Goel, Hyonyoung Han, Sathursan Kanagarajah, Somesh Kumar, Siew-Kei Lam, Tin Lun Lam, Liang Ma, Davide Maltoni, Lorenzo Pellegrini, Duvindu Piyasena, Shiliang Pu, Debdoot Sheet , et al. (11 additional authors not shown)

    Abstract: This report summarizes IROS 2019-Lifelong Robotic Vision Competition (Lifelong Object Recognition Challenge) with methods and results from the top $8$ finalists (out of over~$150$ teams). The competition dataset (L)ifel(O)ng (R)obotic V(IS)ion (OpenLORIS) - Object Recognition (OpenLORIS-object) is designed for driving lifelong/continual learning research and application in robotic vision domain, w… ▽ More

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: 9 pages, 11 figures, 3 tables, accepted into IEEE Robotics and Automation Magazine. arXiv admin note: text overlap with arXiv:1911.06487

  7. Anomalous magnetohydrodynamics with constant anisotropic electric conductivities

    Authors: Ren-jie Wang, Patrick Co**er, Shi Pu

    Abstract: We study anomalous magnetohydrodynamics in a longitudinal boost invariant Bjorken flow with constant anisotropic electric conductivities as outlined in Ref. [1]. For simplicity, we consider a neutral fluid and a force-free magnetic field in the transverse direction. We derived analytic solutions of the electromagnetic fields in the laboratory frame, the chiral density, and the energy density as fu… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: 5 pages, proceedings for Quark Matter 2019

  8. arXiv:2003.13980  [pdf, ps, other

    math.OC cs.DC cs.MA cs.SI

    A Robust Gradient Tracking Method for Distributed Optimization over Directed Networks

    Authors: Shi Pu

    Abstract: In this paper, we consider the problem of distributed consensus optimization over multi-agent networks with directed network topology. Assuming each agent has a local cost function that is smooth and strongly convex, the global objective is to minimize the average of all the local cost functions. To solve the problem, we introduce a robust gradient tracking method (R-Push-Pull) adapted from the re… ▽ More

    Submitted 20 August, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

  9. arXiv:2003.06576  [pdf, other

    cs.CV cs.CL cs.MM

    Counterfactual Samples Synthesizing for Robust Visual Question Answering

    Authors: Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, Shiliang Pu, Yueting Zhuang

    Abstract: Despite Visual Question Answering (VQA) has realized impressive progress over the last few years, today's VQA models tend to capture superficial linguistic correlations in the train set and fail to generalize to the test set with different QA distributions. To reduce the language biases, several recent works introduce an auxiliary question-only model to regularize the training of targeted VQA mode… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: Appear in CVPR 2020; Codes in https://github.com/yanxinzju/CSS-VQA

  10. arXiv:2002.12580  [pdf, other

    cs.CV

    Neural Inheritance Relation Guided One-Shot Layer Assignment Search

    Authors: Rang Meng, Weijie Chen, Di Xie, Yuan Zhang, Shiliang Pu

    Abstract: Layer assignment is seldom picked out as an independent research topic in neural architecture search. In this paper, for the first time, we systematically investigate the impact of different layer assignments to the network performance by building an architecture dataset of layer assignment on CIFAR-100. Through analyzing this dataset, we discover a neural inheritance relation among the networks w… ▽ More

    Submitted 28 February, 2020; originally announced February 2020.

    Comments: AAAI2020

  11. arXiv:2002.11338  [pdf, other

    cs.CV cs.LG cs.NE

    Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units

    Authors: Zhanzhan Cheng, Yunlu Xu, Mingjian Cheng, Yu Qiao, Shiliang Pu, Yi Niu, Fei Wu

    Abstract: Recurrent neural network (RNN) has been widely studied in sequence learning tasks, while the mainstream models (e.g., LSTM and GRU) rely on the gating mechanism (in control of how information flows between hidden states). However, the vanilla gates in RNN (e.g., the input gate in LSTM) suffer from the problem of gate undertraining, which can be caused by various factors, such as the saturating act… ▽ More

    Submitted 26 May, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

  12. arXiv:2002.09356  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Interplay between fractional quantum Hall liquid and crystal phases at low filling

    Authors: Zheng-Wei Zuo, Ajit C. Balram, Songyang Pu, Jianyun Zhao, Thierry Jolicoeur, A. Wójs, J. K. Jain

    Abstract: The nature of the state at low Landau-level filling factors has been a longstanding puzzle in the field of the fractional quantum Hall effect. While theoretical calculations suggest that a crystal is favored at filling factors $ν\lesssim 1/6$, experiments show, at somewhat elevated temperatures, minima in the longitudinal resistance that are associated with fractional quantum Hall effect at $ν=$ 1… ▽ More

    Submitted 19 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: 13 pages, 10 figures, published version

    Journal ref: Phys. Rev. B 102, 075307 (2020)

  13. arXiv:2002.06820  [pdf, other

    cs.CV

    Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting

    Authors: Liang Qiao, Sanli Tang, Zhanzhan Cheng, Yunlu Xu, Yi Niu, Shiliang Pu, Fei Wu

    Abstract: Many approaches have recently been proposed to detect irregular scene text and achieved promising results. However, their localization results may not well satisfy the following text recognition part mainly because of two reasons: 1) recognizing arbitrary shaped text is still a challenging task, and 2) prevalent non-trainable pipeline strategies between text detection and text recognition will lea… ▽ More

    Submitted 25 October, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: Accepted by AAAI2020. Code is available at https://davar-lab.github.io/publication.html or https://github.com/hikopensource/DAVAR-Lab-OCR

  14. Relativistic decomposition of the orbital and the spin angular momentum in chiral physics and Feynman's angular momentum paradox

    Authors: Kenji Fukushima, Shi Pu

    Abstract: Over recent years we have witnessed tremendous progresses in our understanding on the angular momentum decomposition. In the context of the proton spin problem in high energy processes the angular momentum decomposition by Jaffe and Manohar, which is based on the canonical definition, and the alternative by Ji, which is based on the Belinfante improved one, have been revisited under light shed by… ▽ More

    Submitted 19 April, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: 16 pages, 3 figures, 1 table; contribution to Lecture Notes in Physics volume on "Strongly Interacting Matter under Rotation"; references added in revision

    Journal ref: Lect.Notes Phys. 987 (2021) 381-396

  15. arXiv:1912.04457  [pdf, other

    hep-ph hep-ex nucl-th physics.comp-ph

    Towards a full solution of relativistic Boltzmann equation for quark-gluon matter on GPUs

    Authors: Jun-Jie Zhang, Hong-Zhong Wu, Shi Pu, Guang-You Qin, Qun Wang

    Abstract: We have developed a numerical framework for a full solution of the relativistic Boltzmann equations for the quark-gluon matter using the multiple Graphics Processing Units (GPUs) on distributed clusters. Including all the $2 \to 2$ scattering processes of 3-flavor quarks and gluons, we compute the time evolution of distribution functions in both coordinate and momentum spaces for the cases of pure… ▽ More

    Submitted 9 December, 2019; originally announced December 2019.

    Comments: 32 pages, 12 figures

    Journal ref: Phys. Rev. D 102, 074011 (2020)

  16. arXiv:1911.09349  [pdf, other

    eess.AS cs.CV cs.LG cs.MM

    An End-to-End Audio Classification System based on Raw Waveforms and Mix-Training Strategy

    Authors: Jiaxu Chen, **g Hao, Kai Chen, Di Xie, Shicai Yang, Shiliang Pu

    Abstract: Audio classification can distinguish different kinds of sounds, which is helpful for intelligent applications in daily life. However, it remains a challenging task since the sound events in an audio clip is probably multiple, even overlap**. This paper introduces an end-to-end audio classification system based on raw waveforms and mix-training strategy. Compared to human-designed features which… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: InterSpeech 2019

  17. arXiv:1910.06496  [pdf, other

    cond-mat.str-el cond-mat.mes-hall math-ph

    Hall Viscosity of Composite Fermions

    Authors: Songyang Pu, Mikael Fremling, J. K. Jain

    Abstract: Hall viscosity, also known as the Lorentz shear modulus, has been proposed as a topological property of a quantum Hall fluid. Using a recent formulation of the composite fermion theory on the torus, we evaluate the Hall viscosities for a large number of fractional quantum Hall states at filling factors of the form $ν=n/(2pn\pm 1)$, where $n$ and $p$ are integers, from the explicit wave functions f… ▽ More

    Submitted 14 July, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: 19 pages, 9 figures

    Journal ref: Phys. Rev. Research 2, 013139 (2020)

  18. arXiv:1908.02422  [pdf, other

    cs.CV

    Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization

    Authors: Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Fei Wu, Futai Zou

    Abstract: Temporal action localization is an important yet challenging research topic due to its various applications. Since the frame-level or segment-level annotations of untrimmed videos require amounts of labor expenditure, studies on the weakly-supervised action detection have been springing up. However, most of existing frameworks rely on Class Activation Sequence (CAS) to localize actions by minimizi… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: To be appeared in ACM MM2019

  19. arXiv:1906.12345  [pdf, other

    math.OC cs.DC cs.LG cs.MA

    Asymptotic Network Independence in Distributed Stochastic Optimization for Machine Learning

    Authors: Shi Pu, Alex Olshevsky, Ioannis Ch. Paschalidis

    Abstract: We provide a discussion of several recent results which, in certain scenarios, are able to overcome a barrier in distributed stochastic optimization for machine learning. Our focus is the so-called asymptotic network independence property, which is achieved whenever a distributed method executed over a network of n nodes asymptotically converges to the optimal solution at a comparable rate to a ce… ▽ More

    Submitted 18 February, 2020; v1 submitted 28 June, 2019; originally announced June 2019.

  20. arXiv:1906.02702  [pdf, other

    math.OC cs.DC cs.LG cs.MA

    A Sharp Estimate on the Transient Time of Distributed Stochastic Gradient Descent

    Authors: Shi Pu, Alex Olshevsky, Ioannis Ch. Paschalidis

    Abstract: This paper is concerned with minimizing the average of $n$ cost functions over a network in which agents may communicate and exchange information with each other. We consider the setting where only noisy gradient information is available. To solve the problem, we study the distributed stochastic gradient descent (DSGD) method and perform a non-asymptotic convergence analysis. For strongly convex a… ▽ More

    Submitted 29 January, 2021; v1 submitted 6 June, 2019; originally announced June 2019.

  21. arXiv:1905.01025  [pdf, other

    eess.IV cs.CV

    Learned Quality Enhancement via Multi-Frame Priors for HEVC Compliant Low-Delay Applications

    Authors: Ming Lu, Ming Cheng, Yiling Xu, Shiliang Pu, Qiu Shen, Zhan Ma

    Abstract: Networked video applications, e.g., video conferencing, often suffer from poor visual quality due to unexpected network fluctuation and limited bandwidth. In this paper, we have developed a Quality Enhancement Network (QENet) to reduce the video compression artifacts, leveraging the spatial and temporal priors generated by respective multi-scale convolutions spatially and warped temporal predictio… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

  22. arXiv:1904.08051  [pdf, other

    cs.CL cs.LG

    Posterior-regularized REINFORCE for Instance Selection in Distant Supervision

    Authors: Qi Zhang, Siliang Tang, Xiang Ren, Fei Wu, Shiliang Pu, Yueting Zhuang

    Abstract: This paper provides a new way to improve the efficiency of the REINFORCE training process. We apply it to the task of instance selection in distant supervision. Modeling the instance selection in one bag as a sequential decision process, a reinforcement learning agent is trained to determine whether an instance is valuable or not and construct a new bag with less noisy instances. However unbiased… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

    Comments: Five pages

    Journal ref: naacl 2019

  23. Anomalous magnetohydrodynamics with longitudinal boost invariance and chiral magnetic effect

    Authors: Irfan Siddique, Ren-jie Wang, Shi Pu, Qun Wang

    Abstract: We study relativistic magnetohydrodynamics with longitudinal boost invariance in the presence of chiral magnetic effects and finite electric conductivity. With initial magnetic fields parallel or anti-parallel to electric fields, we derive the analytic solutions of electromagnetic fields and the chiral number and energy density in an expansion of several parameters determined by initial conditions… ▽ More

    Submitted 3 April, 2019; originally announced April 2019.

    Comments: 27 pages, 5 figures

    Journal ref: Phys. Rev. D 99, 114029 (2019)

  24. arXiv:1903.05285  [pdf, other

    cs.CV

    All You Need is a Few Shifts: Designing Efficient Convolutional Neural Networks for Image Classification

    Authors: Weijie Chen, Di Xie, Yuan Zhang, Shiliang Pu

    Abstract: Shift operation is an efficient alternative over depthwise separable convolution. However, it is still bottlenecked by its implementation manner, namely memory movement. To put this direction forward, a new and novel basic component named Sparse Shift Layer (SSL) is introduced in this paper to construct efficient convolutional neural networks. In this family of architectures, the basic block is on… ▽ More

    Submitted 12 March, 2019; originally announced March 2019.

    Comments: CVPR2019

  25. arXiv:1903.03299  [pdf, other

    cs.CV

    You Only Recognize Once: Towards Fast Video Text Spotting

    Authors: Zhanzhan Cheng, **g Lu, Yi Niu, Shiliang Pu, Fei Wu, Shuigeng Zhou

    Abstract: Video text spotting is still an important research topic due to its various real-applications. Previous approaches usually fall into the four-staged pipeline: text detection in individual images, framewisely recognizing localized text regions, tracking text streams and generating final results with complicated post-processing skills, which might suffer from the huge computational cost as well as t… ▽ More

    Submitted 25 October, 2021; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: Accepted by ACM Multimedia 2019. Code is available at https://davar-lab.github.io/publication.html or https://github.com/hikopensource/DAVAR-Lab-OCR

  26. arXiv:1903.01197  [pdf, other

    cs.CV

    Collaborative Spatio-temporal Feature Learning for Video Action Recognition

    Authors: Chao Li, Qiaoyong Zhong, Di Xie, Shiliang Pu

    Abstract: Spatio-temporal feature learning is of central importance for action recognition in videos. Existing deep neural network models either learn spatial and temporal features independently (C2D) or jointly with unconstrained parameters (C3D). In this paper, we propose a novel neural operation which encodes spatio-temporal features collaboratively by imposing a weight-sharing constraint on the learnabl… ▽ More

    Submitted 4 March, 2019; originally announced March 2019.

    Comments: CVPR 2019

  27. arXiv:1812.10604  [pdf, other

    cs.CL cs.LG

    Cross-relation Cross-bag Attention for Distantly-supervised Relation Extraction

    Authors: Yu** Yuan, Liyuan Liu, Siliang Tang, Zhongfei Zhang, Yueting Zhuang, Shiliang Pu, Fei Wu, Xiang Ren

    Abstract: Distant supervision leverages knowledge bases to automatically label instances, thus allowing us to train relation extractor without human annotations. However, the generated training data typically contain massive noise, and may result in poor performances with the vanilla supervised learning. In this paper, we propose to conduct multi-instance learning with a novel Cross-relation Cross-bag Selec… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.

    Comments: AAAI 2019

  28. arXiv:1812.06611  [pdf, other

    cs.CV

    A Layer Decomposition-Recomposition Framework for Neuron Pruning towards Accurate Lightweight Networks

    Authors: Weijie Chen, Yuan Zhang, Di Xie, Shiliang Pu

    Abstract: Neuron pruning is an efficient method to compress the network into a slimmer one for reducing the computational cost and storage overhead. Most of state-of-the-art results are obtained in a layer-by-layer optimization mode. It discards the unimportant input neurons and uses the survived ones to reconstruct the output neurons approaching to the original ones in a layer-by-layer manner. However, an… ▽ More

    Submitted 16 December, 2018; originally announced December 2018.

    Comments: accepted by AAAI19 as oral

  29. arXiv:1812.06576  [pdf, other

    cs.CV cs.LG

    Learning Incremental Triplet Margin for Person Re-identification

    Authors: Yingying Zhang, Qiaoyong Zhong, Liang Ma, Di Xie, Shiliang Pu

    Abstract: Person re-identification (ReID) aims to match people across multiple non-overlap** video cameras deployed at different locations. To address this challenging problem, many metric learning approaches have been proposed, among which triplet loss is one of the state-of-the-arts. In this work, we explore the margin between positive and negative pairs of triplets and prove that large margin is benefi… ▽ More

    Submitted 16 December, 2018; originally announced December 2018.

    Comments: accepted by AAAI19 as spotlight

  30. arXiv:1812.02347  [pdf, other

    cs.CV

    Counterfactual Critic Multi-Agent Training for Scene Graph Generation

    Authors: Long Chen, Hanwang Zhang, Jun Xiao, Xiangnan He, Shiliang Pu, Shih-Fu Chang

    Abstract: Scene graphs -- objects as nodes and visual relationships as edges -- describe the whereabouts and interactions of the things and stuff in an image for comprehensive scene understanding. To generate coherent scene graphs, almost all existing methods exploit the fruitful visual context by modeling message passing among objects, fitting the dynamic nature of reasoning with visual context, eg, "perso… ▽ More

    Submitted 9 August, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

    Comments: International Conference on Computer Vision (ICCV), 2019 (oral)

  31. arXiv:1811.07460  [pdf, other

    cs.CV

    Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection

    Authors: Yunlu Xu, Chengwei Zhang, Zhanzhan Cheng, Jianwen Xie, Yi Niu, Shiliang Pu, Fei Wu

    Abstract: This paper proposes a segregated temporal assembly recurrent (STAR) network for weakly-supervised multiple action detection. The model learns from untrimmed videos with only supervision of video-level labels and makes prediction of intervals of multiple actions. Specifically, we first assemble video clips according to class labels by an attention mechanism that learns class-variable attention weig… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: Accepted to Proc. AAAI Conference on Artificial Intelligence 2019

  32. arXiv:1810.06653  [pdf, other

    math.OC cs.DC cs.MA cs.SI

    Push-Pull Gradient Methods for Distributed Optimization in Networks

    Authors: Shi Pu, Wei Shi, **ming Xu, Angelia Nedić

    Abstract: In this paper, we focus on solving a distributed convex optimization problem in a network, where each agent has its own convex cost function and the goal is to minimize the sum of the agents' cost functions while obeying the network connectivity structure. In order to minimize the sum of the cost functions, we consider new distributed gradient-based methods where each node maintains two estimates,… ▽ More

    Submitted 6 February, 2020; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Parts of the results appear in Proceedings of the 57th IEEE Conference on Decision and Control (see arXiv:1803.07588)

  33. arXiv:1810.03851  [pdf, other

    cs.CV

    Deep Attentive Tracking via Reciprocative Learning

    Authors: Shi Pu, Yibing Song, Chao Ma, Honggang Zhang, Ming-Hsuan Yang

    Abstract: Visual attention, derived from cognitive neuroscience, facilitates human perception on the most pertinent subset of the sensory data. Recently, significant efforts have been made to exploit attention schemes to advance computer vision systems. For visual tracking, it is often challenging to track target objects undergoing large appearance changes. Attention maps facilitate visual tracking by selec… ▽ More

    Submitted 15 October, 2018; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: In NIPS 2018

  34. arXiv:1808.08016  [pdf, other

    hep-ph cond-mat.quant-gas

    Eddy magnetization from the chiral Barnett effect

    Authors: Kenji Fukushima, Shi Pu, Zebin Qiu

    Abstract: We discuss the spin, the angular momentum, and the magnetic moment of rotating chiral fermions using a kinetic theory. We find that, in addition to the chiral vortical contribution along the rotation axis, finite circular spin polarization is induced by the spin-momentum correlation of chiral fermions, which is canceled by a change in the orbital angular momentum. We point out that the eddy magnet… ▽ More

    Submitted 11 April, 2019; v1 submitted 24 August, 2018; originally announced August 2018.

    Comments: 9 pages, 1 figure; Some typos are fixed and some reference are added

    Journal ref: Phys. Rev. A 99, 032105 (2019)

  35. arXiv:1807.11254  [pdf, other

    cs.CV

    Extreme Network Compression via Filter Group Approximation

    Authors: Bo Peng, Wenming Tan, Zheyang Li, Shun Zhang, Di Xie, Shiliang Pu

    Abstract: In this paper we propose a novel decomposition method based on filter group approximation, which can significantly reduce the redundancy of deep convolutional neural networks (CNNs) while maintaining the majority of feature representation. Unlike other low-rank decomposition algorithms which operate on spatial or channel dimension of filters, our proposed method mainly focuses on exploiting the fi… ▽ More

    Submitted 31 July, 2018; v1 submitted 30 July, 2018; originally announced July 2018.

    Comments: Accepted by ECCV2018

  36. Non-Equilibrium Quantum Transport of Chiral Fluids from Kinetic Theory

    Authors: Yoshimasa Hidaka, Shi Pu, Di-Lun Yang

    Abstract: We introduce the quantum-field-theory (QFT) derivation of chiral kinetic theory (CKT) from the Wigner-function approach, which manifests side jumps and non-scalar distribution functions associated with Lorentz covariance and incorporates both background fields and collisions. The formalism is utilized to investigate second-order responses of chiral fluids near local equilibrium. Such non-equilibri… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 4 pages, 1 figure, Quark Matter 2018 Proceedings, parallel talk presented by Di-Lun Yang

  37. arXiv:1807.04416  [pdf, other

    hep-th cond-mat.str-el hep-ph

    Axial Ward identity and the Schwinger mechanism -- Applications to the real-time chiral magnetic effect and condensates

    Authors: Patrick Co**er, Kenji Fukushima, Shi Pu

    Abstract: We elucidate chirality production under parity breaking constant electromagnetic fields, with which we clarify qualitative differences in and out of equilibrium. For a strong magnetic field the pair production from the Schwinger mechanism increments the chirality. The pair production rate is exponentially suppressed with mass according to the Schwinger formula, while the mass dependence of chirali… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: 5 pages, 2 figures

    Journal ref: Phys. Rev. Lett. 121, 261602 (2018)

  38. arXiv:1807.01438  [pdf, other

    cs.CV

    Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation

    Authors: Tao Song, Leiyu Sun, Di Xie, Haiming Sun, Shiliang Pu

    Abstract: A critical issue in pedestrian detection is to detect small-scale objects that will introduce feeble contrast and motion blur in images and videos, which in our opinion should partially resort to deep-rooted annotation bias. Motivated by this, we propose a novel method integrated with somatic topological line localization (TLL) and temporal feature aggregation for detecting multi-scale pedestrians… ▽ More

    Submitted 3 July, 2018; originally announced July 2018.

    Comments: Accepted by ECCV18

  39. arXiv:1806.04207  [pdf, ps, other

    math.OC cs.DC cs.MA cs.SI stat.ML

    Swarming for Faster Convergence in Stochastic Optimization

    Authors: Shi Pu, Alfredo Garcia

    Abstract: We study a distributed framework for stochastic optimization which is inspired by models of collective motion found in nature (e.g., swarming) with mild communication requirements. Specifically, we analyze a scheme in which each one of $N > 1$ independent threads, implements in a distributed and unsynchronized fashion, a stochastic gradient-descent algorithm which is perturbed by a swarming potent… ▽ More

    Submitted 6 August, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

  40. arXiv:1805.11454  [pdf, ps, other

    math.OC cs.DC cs.SI stat.ML

    Distributed Stochastic Gradient Tracking Methods

    Authors: Shi Pu, Angelia Nedić

    Abstract: In this paper, we study the problem of distributed multi-agent optimization over a network, where each agent possesses a local cost function that is smooth and strongly convex. The global objective is to find a common solution that minimizes the average of all cost functions. Assuming agents only have access to unbiased estimates of the gradients of their local cost functions, we consider a distri… ▽ More

    Submitted 10 March, 2020; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: Accepted in Mathematical Programming. This article draws heavily from arXiv:1803.07741 (conference submission)

  41. arXiv:1805.09237  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Berry phase of the composite-fermion Fermi Sea: Effect of Landau-level mixing

    Authors: Songyang Pu, Mikael Fremling, J. K. Jain

    Abstract: We construct explicit lowest-Landau-level wave functions for the composite-fermion Fermi sea and its low energy excitations following a recently developed approach [Pu, Wu and Jain, Phys. Rev. B 96, 195302 (2018)] and demonstrate them to be very accurate representations of the Coulomb eigenstates. We further ask how the Berry phase associated with a closed loop around the Fermi circle, predicted t… ▽ More

    Submitted 1 October, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: 13 pages, 6 figures

    Journal ref: Phys. Rev. B 98, 075304 (2018)

  42. arXiv:1805.06121  [pdf, other

    cs.MM

    A practical convolutional neural network as loop filter for intra frame

    Authors: Xiaodan Song, Jiabao Yao, Lulu Zhou, Li Wang, Xiaoyang Wu, Di Xie, Shiliang Pu

    Abstract: Loop filters are used in video coding to remove artifacts or improve performance. Recent advances in deploying convolutional neural network (CNN) to replace traditional loop filters show large gains but with problems for practical application. First, different model is used for frames encoded with different quantization parameter (QP), respectively. It is expensive for hardware. Second, float poin… ▽ More

    Submitted 16 May, 2018; originally announced May 2018.

    Comments: Accepted by ICIP 2018

  43. arXiv:1805.03384  [pdf, ps, other

    cs.CV

    Edit Probability for Scene Text Recognition

    Authors: Fan Bai, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Shuigeng Zhou

    Abstract: We consider the scene text recognition problem under the attention-based encoder-decoder framework, which is the state of the art. The existing methods usually employ a frame-wise maximal likelihood loss to optimize the models. When we train the model, the misalignment between the ground truth strings and the attention's output sequences of probability distribution, which is caused by missing or s… ▽ More

    Submitted 9 May, 2018; originally announced May 2018.

  44. arXiv:1804.06055  [pdf, other

    cs.CV

    Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation

    Authors: Chao Li, Qiaoyong Zhong, Di Xie, Shiliang Pu

    Abstract: Skeleton-based human action recognition has recently drawn increasing attentions with the availability of large-scale skeleton datasets. The most crucial factors for this task lie in two aspects: the intra-frame representation for joint co-occurrences and the inter-frame representation for skeletons' temporal evolutions. In this paper we propose an end-to-end convolutional co-occurrence feature le… ▽ More

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: IJCAI18 oral

  45. arXiv:1803.07741  [pdf, ps, other

    math.OC cs.DC cs.MA

    A Distributed Stochastic Gradient Tracking Method

    Authors: Shi Pu, Angelia Nedić

    Abstract: In this paper, we study the problem of distributed multi-agent optimization over a network, where each agent possesses a local cost function that is smooth and strongly convex. The global objective is to find a common solution that minimizes the average of all cost functions. Assuming agents only have access to unbiased estimates of the gradients of their local cost functions, we consider a distri… ▽ More

    Submitted 1 August, 2019; v1 submitted 21 March, 2018; originally announced March 2018.

    Comments: Accepted in CDC 2018. Extended (journal) version can be found at arXiv:1805.11454

  46. arXiv:1803.07588  [pdf, ps, other

    math.OC cs.DC cs.NI

    A Push-Pull Gradient Method for Distributed Optimization in Networks

    Authors: Shi Pu, Wei Shi, **ming Xu, Angelia Nedić

    Abstract: In this paper, we focus on solving a distributed convex optimization problem in a network, where each agent has its own convex cost function and the goal is to minimize the sum of the agents' cost functions while obeying the network connectivity structure. In order to minimize the sum of the cost functions, we consider a new distributed gradient-based method where each node maintains two estimates… ▽ More

    Submitted 1 August, 2019; v1 submitted 20 March, 2018; originally announced March 2018.

    Comments: Accepted in CDC 2018

  47. Abelian and non-Abelian Berry curvatures in lattice QCD

    Authors: Shi Pu, Arata Yamamoto

    Abstract: We studied the Berry curvature of the massive Dirac fermion in 3+1 dimensions. For the non-interacting Dirac fermion, the Berry curvature is non-Abelian because of the degeneracy of positive and negative helicity modes. We calculated the non-Abelian Berry curvature analytically and numerically. For the interacting Dirac fermion in QCD, the degeneracy is lost because gluons carry helicity and color… ▽ More

    Submitted 5 June, 2018; v1 submitted 6 December, 2017; originally announced December 2017.

    Journal ref: Nucl. Phys. B933 (2018) 53

  48. arXiv:1711.04226  [pdf, other

    cs.CV

    AON: Towards Arbitrarily-Oriented Text Recognition

    Authors: Zhanzhan Cheng, Yangliu Xu, Fan Bai, Yi Niu, Shiliang Pu, Shuigeng Zhou

    Abstract: Recognizing text from natural images is a hot research topic in computer vision due to its various applications. Despite the enduring research of several decades on optical character recognition (OCR), recognizing texts from natural images is still a challenging task. This is because scene texts are often in irregular (e.g. curved, arbitrarily-oriented or seriously distorted) arrangements, which h… ▽ More

    Submitted 22 March, 2018; v1 submitted 11 November, 2017; originally announced November 2017.

    Comments: Accepted by CVPR2018

  49. arXiv:1710.10749  [pdf, other

    cs.CV

    Cascade Region Proposal and Global Context for Deep Object Detection

    Authors: Qiaoyong Zhong, Chao Li, Yingying Zhang, Di Xie, Shicai Yang, Shiliang Pu

    Abstract: Deep region-based object detector consists of a region proposal step and a deep object recognition step. In this paper, we make significant improvements on both of the two steps. For region proposal we propose a novel lightweight cascade structure which can effectively improve RPN proposal quality. For object recognition we re-implement global context modeling with a few modications and obtain a p… ▽ More

    Submitted 29 October, 2017; originally announced October 2017.

    Comments: Preprint to appear in Neurocomputing

  50. arXiv:1710.00278  [pdf, ps, other

    hep-th cond-mat.mes-hall cond-mat.stat-mech nucl-th

    Nonlinear Responses of Chiral Fluids from Kinetic Theory

    Authors: Yoshimasa Hidaka, Shi Pu, Di-Lun Yang

    Abstract: The second-order nonlinear responses of inviscid chiral fluids near local equilibrium are investigated by applying the chiral kinetic theory (CKT) incorporating side-jump effects. It is shown that the local equilibrium distribution function can be non-trivially introduced in a co-moving frame with respect to the fluid velocity when the quantum corrections in collisions are involved. For the study… ▽ More

    Submitted 29 May, 2018; v1 submitted 30 September, 2017; originally announced October 2017.

    Comments: 34 pages, a missing term of collisions in Eq.(8) and relevant parts added, results and conclusions remain unchanged

    Report number: RIKEN-QHP-260, RIKEN-iTHEMS-Report-17

    Journal ref: Phys. Rev. D 97, 016004 (2018)