-
NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer
Authors:
Meng You,
Zhiyu Zhu,
Hui Liu,
Junhui Hou
Abstract:
By harnessing the potent generative capabilities of pre-trained large video diffusion models, we propose NVS-Solver, a new novel view synthesis (NVS) paradigm that operates without the need for training. NVS-Solver adaptively modulates the diffusion sampling process with the given views to enable the creation of remarkable visual experiences from single or multiple views of static scenes or monocular videos of dynamic scenes. Specifically, built upon our theoretical modeling, we iteratively modulate the score function with the given scene priors represented with warped input views to control the video diffusion process. Moreover, by theoretically exploring the boundary of the estimation error, we achieve the modulation in an adaptive fashion according to the view pose and the number of diffusion steps. Extensive evaluations on both static and dynamic scenes substantiate the significant superiority of our NVS-Solver over state-of-the-art methods both quantitatively and qualitatively. Source code: https://github.com/ZHU-Zhiyu/NVS_Solver.
Submitted 24 May, 2024;
originally announced May 2024.
-
On the orbits of a finite solvable primitive linear group
Authors:
Yong Yang,
Mengxi You
Abstract:
In this paper, we strengthen a result of Seager regarding the number of orbits of a solvable primitive linear group.
Submitted 14 May, 2024;
originally announced May 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seong** Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Evaluating and Enhancing Large Language Models Performance in Domain-specific Medicine: Osteoarthritis Management with DocOA
Authors:
Xi Chen,
MingKe You,
Li Wang,
WeiZhi Liu,
Yu Fu,
Jie Xu,
Shaoting Zhang,
Gang Chen,
Kang Li,
Jian Li
Abstract:
The efficacy of large language models (LLMs) in domain-specific medicine, particularly for managing complex diseases such as osteoarthritis (OA), remains largely unexplored. This study focused on evaluating and enhancing the clinical capabilities of LLMs in specific domains, using OA management as a case study. A domain-specific benchmark framework was developed, which evaluates LLMs across a spectrum from domain-specific knowledge to clinical applications in real-world clinical scenarios. DocOA, a specialized LLM tailored for OA management that integrates retrieval-augmented generation (RAG) and instruction prompts, was developed. The study compared the performance of GPT-3.5, GPT-4, and a specialized assistant, DocOA, using objective and human evaluations. Results showed that general LLMs like GPT-3.5 and GPT-4 were less effective in the specialized domain of OA management, particularly in providing personalized treatment recommendations. However, DocOA showed significant improvements. This study introduces a novel benchmark framework which assesses the domain-specific abilities of LLMs in multiple aspects, highlights the limitations of generalized LLMs in clinical contexts, and demonstrates the potential of tailored approaches for developing domain-specific medical LLMs.
Submitted 19 January, 2024;
originally announced January 2024.
-
Modular extension of topological orders from congruence representations
Authors:
Donghae Seo,
Minyoung You,
Gil Young Cho,
Hee-Cheol Kim
Abstract:
We present an efficient method to compute the modular extension of both fermionic topological orders and $\mathbb{Z}_2$-symmetric bosonic topological orders in two spatial dimensions, based on congruence representations of $\mathrm{SL}_2(\mathbb{Z})$ and its subgroups. To demonstrate the validity of our approach, we provide explicit calculations for topological orders with rank up to 10 for the fermionic cases and up to 6 for the bosonic cases. Along the way, we clarify the relation between fermionic rational conformal field theories, which live on the boundary of the corresponding fermionic topological orders, and modular extensions. In particular, we show that the $\mathrm{SL}_2(\mathbb{Z})$ representation of the R-R sector can be determined from the NS-NS sector using the modular extensions.
Submitted 1 December, 2023;
originally announced December 2023.
-
Duality and Stacking of Bosonic and Fermionic SPT Phases
Authors:
Alex Turzillo,
Minyoung You
Abstract:
We study the interplay of duality and stacking of bosonic and fermionic symmetry-protected topological phases in one spatial dimension. In general the classifications of bosonic and fermionic phases have different group structures under the operation of stacking, but we argue that they are often isomorphic and give an explicit isomorphism when it exists. This occurs for all unitary symmetry groups and many groups with antiunitary symmetries, which we characterize. We find that this isomorphism is typically not implemented by the Jordan-Wigner transformation, nor is it a consequence of any other duality transformation that falls within the framework of topological holography. Along the way to this conclusion, we recover the fermionic stacking rule in terms of G-pin partition functions, give a gauge-invariant characterization of the twisted group cohomology invariant, and state a procedure for stacking gapped phases in the formalism of symmetry topological field theory.
Submitted 30 November, 2023;
originally announced November 2023.
-
Gapped boundaries of fermionic topological orders and higher central charges
Authors:
Minyoung You
Abstract:
We develop a test for the vanishing of higher central charges of a fermionic topological order, which is a necessary condition for the existence of a gapped boundary, purely in terms of the modular data of the super-modular tensor category. More precisely, we test whether a given super-MTC has $c = 0$ mod $\frac{1}{2}$, and, if so, whether the modular extension with $c =0$ mod $8$ has vanishing higher central charges. The test itself does not require an explicit computation of the modular extensions and is easily carried out. We apply this test to known examples of super-modular tensor categories. Since our test allows us to obtain information about the chiral central charge of a super-modular tensor category in terms of its modular data without direct knowledge of its modular extensions, this can also be thought of as the first step towards a fermionic analogue of the Gauss-Milgram formula.
Submitted 2 November, 2023;
originally announced November 2023.
-
Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability Planning
Authors:
Zhifeng Qian,
Mingyu You,
Hongjun Zhou,
Xuanhui Xu,
Bin He
Abstract:
Goal-Conditioned Reinforcement Learning (GCRL) can enable agents to spontaneously set diverse goals to learn a set of skills. Despite the excellent works proposed in various fields, reaching distant goals in temporally extended tasks remains a challenge for GCRL. Current works tackled this problem by leveraging planning algorithms to plan intermediate subgoals to augment GCRL. Their methods need two crucial requirements: (i) a state representation space to search valid subgoals, and (ii) a distance function to measure the reachability of subgoals. However, they struggle to scale to high-dimensional state space due to their non-compact representations. Moreover, they cannot collect high-quality training data through standard GC policies, which results in an inaccurate distance function. Both affect the efficiency and performance of planning and policy learning. In the paper, we propose a goal-conditioned RL algorithm combined with Disentanglement-based Reachability Planning (REPlan) to solve temporally extended tasks. In REPlan, a Disentangled Representation Module (DRM) is proposed to learn compact representations which disentangle robot poses and object positions from high-dimensional observations in a self-supervised manner. A simple REachability discrimination Module (REM) is also designed to determine the temporal distance of subgoals. Moreover, REM computes intrinsic bonuses to encourage the collection of novel states for training. We evaluate our REPlan in three vision-based simulation tasks and one real-world task. The experiments demonstrate that our REPlan significantly outperforms the prior state-of-the-art methods in solving temporally extended tasks.
Submitted 20 July, 2023;
originally announced July 2023.
-
Decoupling Dynamic Monocular Videos for Dynamic View Synthesis
Authors:
Meng You,
Junhui Hou
Abstract:
The challenge of dynamic view synthesis from dynamic monocular videos, i.e., synthesizing novel views for free viewpoints given a monocular video of a dynamic scene captured by a moving camera, mainly lies in accurately modeling the dynamic objects of a scene using limited 2D frames, each with a varying timestamp and viewpoint. Existing methods usually require pre-processed 2D optical flow and depth maps by off-the-shelf methods to supervise the network, making them suffer from the inaccuracy of the pre-processed supervision and the ambiguity when lifting the 2D information to 3D. In this paper, we tackle this challenge in an unsupervised fashion. Specifically, we decouple the motion of the dynamic objects into object motion and camera motion, respectively regularized by proposed unsupervised surface consistency and patch-based multi-view constraints. The former enforces the 3D geometric surfaces of moving objects to be consistent over time, while the latter regularizes their appearances to be consistent across different viewpoints. Such a fine-grained motion formulation can alleviate the learning difficulty for the network, thus enabling it to produce not only novel views with higher quality but also more accurate scene flows and depth than existing methods requiring extra supervision.
Submitted 30 May, 2024; v1 submitted 4 April, 2023;
originally announced April 2023.
-
Model-free Optimization and Experimental Validation of RIS-assisted Wireless Communications under Rich Multipath Fading
Authors:
Tianrui Chen,
Minglei You,
Yangyishi Zhang,
Gan Zheng,
Jean Baptiste Gros,
Geoffroy Lerosey,
Youssef Nasser,
Fraser Burton,
Gabriele Gradoni
Abstract:
Reconfigurable intelligent surface (RIS) devices have emerged as an effective way to control the propagation channels for enhancing the end-users' performance. However, RIS optimization involves configuring the radio frequency response of a large number of radiating elements, which is challenging in real-world applications due to high computational complexity. In this paper, a model-free cross-entropy (CE) algorithm is proposed to optimize the binary RIS configuration for improving the signal-to-noise ratio (SNR) at the receiver. One key advantage of the proposed method is that it only requires system performance indicators, e.g., the received SNR, without the need for channel models or channel state information. Both simulations and experiments are conducted to evaluate the performance of the proposed CE algorithm. This study provides an experimental demonstration of the channel hardening effect in a multi-antenna RIS-assisted wireless system under rich multipath fading.
Submitted 15 February, 2024; v1 submitted 21 February, 2023;
originally announced February 2023.
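The model-free idea in this abstract can be illustrated with a generic cross-entropy method over binary vectors. This is only a minimal sketch, not the authors' implementation: the function name `cross_entropy_binary`, its parameters and default values, and the `toy_snr` scoring function (a stand-in for a measured received SNR) are all assumptions made for illustration.

```python
import random

def cross_entropy_binary(score, n_bits, n_samples=50, n_elite=10,
                         n_iters=30, smoothing=0.7, seed=0):
    """Generic cross-entropy search over binary configurations.

    `score` is treated as a black box (e.g. a measured performance
    indicator); no channel model is used anywhere in the loop.
    """
    rng = random.Random(seed)
    probs = [0.5] * n_bits            # Bernoulli parameter per element
    best_cfg, best_val = None, float("-inf")
    for _ in range(n_iters):
        # Sample candidate configurations from the current distribution.
        samples = [[1 if rng.random() < p else 0 for p in probs]
                   for _ in range(n_samples)]
        # Rank candidates by the observed score only.
        scored = sorted(((score(s), s) for s in samples),
                        key=lambda t: t[0], reverse=True)
        if scored[0][0] > best_val:
            best_val, best_cfg = scored[0]
        # Move each Bernoulli parameter toward the elite frequency.
        elite = [s for _, s in scored[:n_elite]]
        for i in range(n_bits):
            freq = sum(s[i] for s in elite) / n_elite
            probs[i] = smoothing * freq + (1 - smoothing) * probs[i]
    return best_cfg, best_val

# Hypothetical stand-in for SNR feedback: agreement with a hidden pattern.
target = [1, 0, 1, 1, 0, 0, 1, 0]

def toy_snr(cfg):
    return sum(1 for a, b in zip(cfg, target) if a == b)

cfg, val = cross_entropy_binary(toy_snr, n_bits=len(target))
print(cfg, val)
```

The loop only ever calls `score`, which mirrors the abstract's point that the method needs system performance indicators rather than channel state information.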
-
Classification of Fermionic Topological Orders from Congruence Representations
Authors:
Gil Young Cho,
Hee-cheol Kim,
Donghae Seo,
Minyoung You
Abstract:
The fusion rules and braiding statistics of anyons in $(2+1)$D fermionic topological orders are characterized by the modular data of a super-modular category. On the other hand, the modular data of a super-modular category form a congruence representation of the $Γ_θ$ subgroup of the modular group $\mathrm{SL}_2(\mathbb{Z})$. We provide a method to classify the modular data of super-modular categories by first obtaining the congruence representations of $Γ_θ$ and then building candidate modular data out of those representations. We carry out this classification up to rank $10$. We obtain both unitary and non-unitary modular data, including all previously known unitary modular data, and also discover new classes of modular data of rank $10$. We also determine the central charges of all these modular data, without explicitly computing their modular extensions.
Submitted 7 October, 2022;
originally announced October 2022.
-
Learning A Locally Unified 3D Point Cloud for View Synthesis
Authors:
Meng You,
Mantang Guo,
Xianqiang Lyu,
Hui Liu,
Junhui Hou
Abstract:
In this paper, we explore the problem of 3D point cloud representation-based view synthesis from a set of sparse source views. To tackle this challenging problem, we propose a new deep learning-based view synthesis paradigm that learns a locally unified 3D point cloud from source views. Specifically, we first construct sub-point clouds by projecting source views to 3D space based on their depth maps. Then, we learn the locally unified 3D point cloud by adaptively fusing points at a local neighborhood defined on the union of the sub-point clouds. Besides, we also propose a 3D geometry-guided image restoration module to fill the holes and recover high-frequency details of the rendered novel views. Experimental results on three benchmark datasets demonstrate that our method can improve the average PSNR by more than 4 dB while preserving more accurate visual details, compared with state-of-the-art view synthesis methods.
Submitted 30 September, 2023; v1 submitted 12 September, 2022;
originally announced September 2022.
-
Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation
Authors:
Peining Zhang,
Junliang Guo,
Linli Xu,
Mu You,
Junming Yin
Abstract:
We consider a novel task of automatically generating text descriptions of music. Compared with other well-established text generation tasks such as image caption, the scarcity of well-paired music and text datasets makes it a much more challenging task. In this paper, we exploit the crowd-sourced music comments to construct a new dataset and propose a sequence-to-sequence model to generate text descriptions of music. More concretely, we use the dilated convolutional layer as the basic component of the encoder and a memory based recurrent neural network as the decoder. To enhance the authenticity and thematicity of generated texts, we further propose to fine-tune the model with a discriminator as well as a novel topic evaluator. To measure the quality of generated texts, we also propose two new evaluation metrics, which are more aligned with human evaluation than traditional metrics such as BLEU. Experimental results verify that our model is capable of generating fluent and meaningful comments while containing thematic and content information of the original music.
Submitted 5 September, 2022;
originally announced September 2022.
-
3D Part Assembly Generation with Instance Encoded Transformer
Authors:
Rufeng Zhang,
Tao Kong,
Weihao Wang,
Xuan Han,
Mingyu You
Abstract:
It is desirable to make robots capable of automatic assembly. Structural understanding of object parts plays a crucial role in this task yet remains relatively unexplored. In this paper, we focus on the setting of furniture assembly from a complete set of part geometries, which is essentially a 6-DoF part pose estimation problem. We propose a multi-layer transformer-based framework that involves geometric and relational reasoning between parts to update the part poses iteratively. We carefully design a unique instance encoding to solve the ambiguity between geometrically similar parts so that all parts can be distinguished. In addition to assembling from scratch, we extend our framework to a new task called in-process part assembly. Analogous to furniture maintenance, it requires robots to continue with unfinished products and assemble the remaining parts into appropriate positions. Our method achieves improvements of far more than 10% over the current state of the art in multiple metrics on the public PartNet dataset. Extensive experiments and quantitative comparisons demonstrate the effectiveness of the proposed framework.
Submitted 4 July, 2022;
originally announced July 2022.
-
A symmetry principle for gauge theories with fractons
Authors:
Yuji Hirono,
Minyoung You,
Stephen Angus,
Gil Young Cho
Abstract:
Fractonic phases are new phases of matter that host excitations with restricted mobility. We show that a certain class of gapless fractonic phases is realized as a result of spontaneous breaking of continuous higher-form symmetries whose conserved charges do not commute with spatial translations. We refer to such symmetries as nonuniform higher-form symmetries. These symmetries fall within the standard definition of higher-form symmetries in quantum field theory, and the corresponding symmetry generators are topological. Worldlines of particles are regarded as the charged objects of 1-form symmetries, and mobility restrictions can be implemented by introducing additional 1-form symmetries whose generators do not commute with spatial translations. These features are realized by effective field theories associated with spontaneously broken nonuniform 1-form symmetries. At low energies, the theories reduce to known higher-rank gauge theories such as scalar/vector charge gauge theories, and the gapless excitations in these theories are interpreted as Nambu-Goldstone modes for higher-form symmetries. Due to the nonuniformity of the symmetry, some of the modes acquire a gap, which is the higher-form analogue of the inverse Higgs mechanism of spacetime symmetries. The gauge theories have emergent nonuniform magnetic symmetries, and some of the magnetic monopoles become fractonic. We identify the 't Hooft anomalies of the nonuniform higher-form symmetries and the corresponding bulk symmetry-protected topological phases. By this method, the mobility restrictions are fully determined by the choice of the commutation relations of charges with translations. This approach allows us to view existing (gapless) fracton models such as the scalar/vector charge gauge theories and their variants from a unified perspective and enables us to engineer theories with desired mobility restrictions.
Submitted 24 November, 2023; v1 submitted 2 July, 2022;
originally announced July 2022.
-
Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning
Authors:
Zhifeng Qian,
Mingyu You,
Hongjun Zhou,
Bin He
Abstract:
Goal-conditioned reinforcement learning is a crucial yet challenging algorithm which enables agents to achieve multiple user-specified goals when learning a set of skills in a dynamic environment. However, it typically requires millions of the environmental interactions explored by agents, which is sample-inefficient. In the paper, we propose a skill learning framework DR-GRL that aims to improve the sample efficiency and policy generalization by combining the Disentangled Representation learning and Goal-conditioned visual Reinforcement Learning. In a weakly supervised manner, we propose a Spatial Transform AutoEncoder (STAE) to learn an interpretable and controllable representation in which different parts correspond to different object attributes (shape, color, position). Due to the high controllability of the representations, STAE can simply recombine and recode the representations to generate unseen goals for agents to practice themselves. The manifold structure of the learned representation maintains consistency with the physical position, which is beneficial for reward calculation. We empirically demonstrate that DR-GRL significantly outperforms the previous methods in sample efficiency and policy generalization. In addition, DR-GRL is also easy to expand to the real robot.
Submitted 28 February, 2022;
originally announced February 2022.
-
Design and Analysis of SWIPT with Safety Constraints
Authors:
Constantinos Psomas,
Minglei You,
Kai Liang,
Gan Zheng,
Ioannis Krikidis
Abstract:
Simultaneous wireless information and power transfer (SWIPT) has long been proposed as a key solution for charging and communicating with low-cost and low-power devices. However, the employment of radio frequency (RF) signals for information/power transfer needs to comply with international health and safety regulations. In this paper, we provide a complete framework for the design and analysis of far-field SWIPT under safety constraints. In particular, we deal with two RF exposure regulations, namely, the specific absorption rate (SAR) and the maximum permissible exposure (MPE). The state-of-the-art regarding SAR and MPE is outlined together with a description as to how these can be modeled in the context of communication networks. We propose a deep learning approach for the design of robust beamforming subject to specific information, energy harvesting and SAR constraints. Furthermore, we present a thorough analytical study for the performance of large-scale SWIPT systems, in terms of information and energy coverage under MPE constraints. This work provides insights with regards to the optimal SWIPT design as well as the potentials from the proper development of SWIPT systems under health and safety restrictions.
Submitted 20 November, 2021;
originally announced November 2021.
-
Digital Twins based Day-ahead Integrated Energy System Scheduling under Load and Renewable Energy Uncertainties
Authors:
Minglei You,
Qian Wang,
Hongjian Sun,
Ivan Castro,
**g Jiang
Abstract:
By constructing digital twins (DT) of an integrated energy system (IES), one can benefit from DT's predictive capabilities to improve coordination among various energy converters, hence enhancing energy efficiency, cost savings and carbon emission reduction. This paper is motivated by the fact that practical IESs suffer from multiple uncertainty sources and complicated surrounding environments. To address this problem, a novel DT-based day-ahead scheduling method is proposed. The physical IES is modelled as a multi-vector energy system in its virtual space that interacts with the physical IES to manipulate its operations. A deep neural network is trained to make statistical cost-saving scheduling by learning from both historical forecasting errors and day-ahead forecasts. Case studies of IESs show that the proposed DT-based method is able to reduce the operating cost of IES by 63.5%, compared with the existing forecast-based scheduling methods. It is also found that both electric vehicles and thermal energy storages play proactive roles in the proposed method, highlighting their importance in future energy system integration and decarbonisation.
Submitted 29 September, 2021;
originally announced September 2021.
-
Model-driven Learning for Generic MIMO Downlink Beamforming With Uplink Channel Information
Authors:
Ju** Zhang,
Minglei You,
Gan Zheng,
Ioannis Krikidis,
Liqiang Zhao
Abstract:
Accurate downlink channel information is crucial to the beamforming design, but it is difficult to obtain in practice. This paper investigates a deep learning-based optimization approach to downlink beamforming that maximizes the system sum rate when only the uplink channel information is available. Our main contribution is to propose a model-driven learning technique that exploits the structure of the optimal downlink beamforming to design an effective hybrid learning strategy with the aim of maximizing the sum rate performance. This is achieved by jointly considering the learning performance of the downlink channel, the power and the sum rate in the training stage. The proposed approach applies to generic cases in which the uplink channel information is available but its relation to the downlink channel is unknown, and it does not require an explicit downlink channel estimation. We further extend the developed technique to massive multiple-input multiple-output scenarios and achieve a distributed learning strategy for multicell systems without an inter-cell signalling overhead. Simulation results verify that our proposed method provides performance close to that of state-of-the-art numerical algorithms with perfect downlink channel information and significantly outperforms existing data-driven methods in terms of the sum rate.
Submitted 16 September, 2021;
originally announced September 2021.
-
On the vertex-degree based invariants of digraphs
Authors:
Hanyuan Deng,
Jiaxiang Yang,
Zikai Tang,
**g Yang,
Meiling You
Abstract:
Let $D=(V,A)$ be a digraph without isolated vertices. A vertex-degree based invariant $I(D)$ related to a real function $\varphi$ of $D$ is defined as a summation over all arcs, $I(D) = \frac{1}{2}\sum_{uv\in A}{\varphi(d_u^+,d_v^-)}$, where $d_u^+$ (resp. $d_u^-$) denotes the out-degree (resp. in-degree) of a vertex $u$. In this paper, we give the extremal values and extremal digraphs of $I(D)$ over all digraphs with $n$ non-isolated vertices. Applying these results, we obtain the extremal values of some vertex-degree based topological indices of digraphs, such as the Randić index, the Zagreb index, the sum-connectivity index, the $GA$ index, the $ABC$ index and the harmonic index, and the corresponding extremal digraphs.
Submitted 29 April, 2021;
originally announced April 2021.
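The definition $I(D) = \frac{1}{2}\sum_{uv\in A}\varphi(d_u^+,d_v^-)$ in the abstract above can be evaluated directly from an arc list. The following is a minimal sketch, not code from the paper: the function names `vdb_invariant` and `randic_phi` are hypothetical, arcs are assumed to be distinct ordered pairs, and the Randić-type choice $\varphi(x,y) = (xy)^{-1/2}$ is used only as an illustrative example.

```python
from collections import Counter
from math import sqrt


def vdb_invariant(arcs, phi):
    """Return I(D) = 1/2 * sum over arcs (u, v) of phi(d_u^+, d_v^-).

    `arcs` is a list of distinct ordered pairs (u, v); the name and
    signature are illustrative assumptions, not taken from the paper.
    """
    out_deg = Counter(u for u, _ in arcs)  # d_u^+ for every tail u
    in_deg = Counter(v for _, v in arcs)   # d_v^- for every head v
    return 0.5 * sum(phi(out_deg[u], in_deg[v]) for u, v in arcs)


def randic_phi(x, y):
    """Randić-type weight (x * y)^(-1/2), assumed here for illustration."""
    return 1.0 / sqrt(x * y)


# Directed 3-cycle: every out-degree and in-degree equals 1.
cycle = [(0, 1), (1, 2), (2, 0)]
print(vdb_invariant(cycle, randic_phi))  # 1.5
```

Every arc $uv$ contributes at least one to $d_u^+$ and to $d_v^-$, so the weight is never evaluated at zero.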
-
MeSIN: Multilevel Selective and Interactive Network for Medication Recommendation
Authors:
Yang An,
Liang Zhang,
Mao You,
Xueqing Tian,
Bo **,
Xiaopeng Wei
Abstract:
Recommending medications for patients using electronic health records (EHRs) is a crucial data mining task for an intelligent healthcare system. It can assist doctors in making clinical decisions more efficiently. However, the inherent complexity of the EHR data renders it a challenging task: (1) Multilevel structures: the EHR data typically contains multilevel structures which are closely related to the decision-making pathways, e.g., laboratory results lead to disease diagnoses, and then contribute to the prescribed medications; (2) Multiple sequence interactions: multiple sequences in EHR data are usually closely correlated with each other; (3) Abundant noise: many task-unrelated features or noisy information within EHR data generally result in suboptimal performance. To tackle the above challenges, we propose a multilevel selective and interactive network (MeSIN) for medication recommendation. Specifically, MeSIN is designed with three components. First, an attentional selective module (ASM) is applied to assign flexible attention scores to different medical code embeddings by their relevance to the recommended medications in every admission. Second, we incorporate a novel interactive long-short term memory network (InLSTM) to reinforce the interactions of multilevel medical sequences in EHR data with the help of the calibrated memory-augmented cell and an enhanced input gate. Finally, we employ a global selective fusion module (GSFM) to infuse the multi-sourced information embeddings into final patient representations for medication recommendation. To validate our method, extensive experiments have been conducted on a real-world clinical dataset. The results demonstrate a consistent superiority of our framework over several baselines and testify to the effectiveness of our proposed approach.
Submitted 22 April, 2021;
originally announced April 2021.
-
Diverse Knowledge Distillation for End-to-End Person Search
Authors:
Xinyu Zhang,
Xinlong Wang,
Jia-Wang Bian,
Chunhua Shen,
Mingyu You
Abstract:
Person search aims to localize and identify a specific person from a gallery of images. Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches. The former views person search as two independent tasks and achieves dominant results using separately trained person detection and re-identification (Re-ID) models. The latter performs person search in an end-to-end fashion. Although the end-to-end approaches yield higher inference efficiency, they largely lag behind their two-step counterparts in terms of accuracy. In this paper, we argue that the gap between the two kinds of methods is mainly caused by the Re-ID sub-networks of end-to-end methods. To this end, we propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck. We also design a spatial-invariant augmentation to help the model become invariant to inaccurate detection results. Experimental results on the CUHK-SYSU and PRW datasets demonstrate the superiority of our method against existing approaches -- it achieves accuracy on par with state-of-the-art two-step methods while maintaining high efficiency due to the single joint model. Code is available at: https://git.io/DKD-PersonSearch.
Submitted 21 December, 2020;
originally announced December 2020.
-
Supersymmetric boundaries of one-dimensional phases of fermions beyond symmetry-protected topological states
Authors:
Alex Turzillo,
Minyoung You
Abstract:
It has recently been demonstrated that protected supersymmetry emerges on the boundaries of one-dimensional intrinsically fermionic symmetry protected trivial (SPT) phases. Here we investigate the boundary supersymmetry of one-dimensional fermionic phases beyond SPT phases. Using the connection between Majorana edge modes and real supercharges, we compute, in terms of the bulk phase invariants, the number of protected boundary supercharges.
Submitted 19 July, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Braiding Statistics of Vortices in $2+1$d Topological Superconductors from Stacking
Authors:
Minyoung You
Abstract:
Class D topological superconductors in $2+1$ dimensions are known to have a $\mathbb{Z}_{16}$ classification in the presence of interactions, with $16$ different topological orders underlying the $16$ distinct phases. By applying the fermionic stacking law, which involves anyon condensation, on the effective Hamiltonian describing the topological interaction of vortices in the $p+ip$ superconductor, which generates the $16$ other phases, we recover the braiding coefficients of vortices for all remaining phases as well as the $\mathbb{Z}_{16}$ group law. We also apply this stacking law to the time-reversal invariant Class DIII superconductors (which can themselves be obtained from stacking two Class D superconductors) and recover their $\mathbb{Z}_2$ classification.
Submitted 31 July, 2020;
originally announced August 2020.
-
Mask Encoding for Single Shot Instance Segmentation
Authors:
Rufeng Zhang,
Zhi Tian,
Chunhua Shen,
Mingyu You,
Youliang Yan
Abstract:
To date, instance segmentation is dominated by two-stage methods, as pioneered by Mask R-CNN. In contrast, one-stage alternatives cannot compete with Mask R-CNN in mask AP, mainly due to the difficulty of compactly representing masks, making the design of one-stage methods very challenging. In this work, we propose a simple single-shot instance segmentation framework, termed mask encoding based instance segmentation (MEInst). Instead of predicting the two-dimensional mask directly, MEInst distills it into a compact and fixed-dimensional representation vector, which allows the instance segmentation task to be incorporated into one-stage bounding-box detectors and results in a simple yet efficient instance segmentation framework. The proposed one-stage MEInst achieves 36.4% in mask AP with single-model (ResNeXt-101-FPN backbone) and single-scale testing on the MS-COCO benchmark. We show that the much simpler and more flexible one-stage instance segmentation method can also achieve competitive performance. This framework can be easily adapted for other instance-level recognition tasks. Code is available at: https://git.io/AdelaiDet
Submitted 6 May, 2020; v1 submitted 25 March, 2020;
originally announced March 2020.
-
Deep Learning Enabled Optimization of Downlink Beamforming Under Per-Antenna Power Constraints: Algorithms and Experimental Demonstration
Authors:
Ju** Zhang,
Wenchao Xia,
Minglei You,
Gan Zheng,
Sangarapillai Lambotharan,
Kai-Kit Wong
Abstract:
This paper studies fast downlink beamforming algorithms using deep learning in multiuser multiple-input-single-output systems where each transmit antenna at the base station has its own power constraint. We focus on the signal-to-interference-plus-noise ratio (SINR) balancing problem, which is quasi-convex but for which no efficient solution is available. We first design a fast subgradient algorithm that can achieve a near-optimal solution with reduced complexity. We then propose a deep neural network structure to learn the optimal beamforming based on convolutional networks and exploitation of the duality of the original problem. Two strategies of learning various dual variables are investigated with different accuracies, and the corresponding recovery of the original solution is facilitated by the subgradient algorithm. We also develop a generalization method of the proposed algorithms so that they can adapt to the varying number of users and antennas without re-training. We carry out intensive numerical simulations and testbed experiments to evaluate the performance of the proposed algorithms. Results show that the proposed algorithms achieve close-to-optimal solutions in simulations with perfect channel information and outperform the alleged theoretically optimal solution in experiments, illustrating a better performance-complexity tradeoff than existing schemes.
Submitted 28 February, 2020;
originally announced February 2020.
-
Part-Guided Attention Learning for Vehicle Instance Retrieval
Authors:
Xinyu Zhang,
Rufeng Zhang,
Jiewei Cao,
Dong Gong,
Mingyu You,
Chunhua Shen
Abstract:
Vehicle instance retrieval often requires one to recognize the fine-grained visual differences between vehicles. Besides the holistic appearance of vehicles which is easily affected by the viewpoint variation and distortion, vehicle parts also provide crucial cues to differentiate near-identical vehicles. Motivated by these observations, we introduce a Part-Guided Attention Network (PGAN) to pinpoint the prominent part regions and effectively combine the global and part information for discriminative feature learning. PGAN first detects the locations of different part components and salient regions regardless of the vehicle identity, which serve as the bottom-up attention to narrow down the possible searching regions. To estimate the importance of detected parts, we propose a Part Attention Module (PAM) to adaptively locate the most discriminative regions with high-attention weights and suppress the distraction of irrelevant parts with relatively low weights. The PAM is guided by the instance retrieval loss and therefore provides top-down attention that enables attention to be calculated at the level of car parts and other salient regions. Finally, we aggregate the global appearance and part features to improve the feature performance further. The PGAN combines part-guided bottom-up and top-down attention, global and part visual features in an end-to-end framework. Extensive experiments demonstrate that the proposed method achieves new state-of-the-art vehicle instance retrieval performance on four large-scale benchmark datasets.
Submitted 26 September, 2020; v1 submitted 12 September, 2019;
originally announced September 2019.
-
Self-training with progressive augmentation for unsupervised cross-domain person re-identification
Authors:
Xinyu Zhang,
Jiewei Cao,
Chunhua Shen,
Mingyu You
Abstract:
Person re-identification (Re-ID) has achieved great improvement with deep learning and a large amount of labelled training data. However, it remains a challenging task to adapt a model trained in a source domain of labelled data to a target domain where only unlabelled data are available. In this work, we develop a self-training method with progressive augmentation framework (PAST) to promote the model performance progressively on the target dataset. Specifically, our PAST framework consists of two stages, namely, a conservative stage and a promoting stage. The conservative stage captures the local structure of target-domain data points with triplet-based loss functions, leading to improved feature representations. The promoting stage continuously optimizes the network by appending a changeable classification layer to the last layer of the model, enabling the use of global information about the data distribution. Importantly, we propose a new self-training strategy that progressively augments the model capability by adopting conservative and promoting stages alternately. Furthermore, to improve the reliability of selected triplet samples, we introduce a ranking-based triplet loss in the conservative stage, which is a label-free objective function based on the similarities between data pairs. Experiments demonstrate that the proposed method achieves state-of-the-art person Re-ID performance under the unsupervised cross-domain setting. Code is available at: https://tinyurl.com/PASTReID
Submitted 31 July, 2019;
originally announced July 2019.
-
An Alternative Data-Driven Prediction Approach Based on Real Option Theories
Authors:
Abdullah AlShelahi,
**gxing Wang,
Mingdi You,
Eunshin Byon,
Romesh Saigal
Abstract:
This paper presents a new prediction model for time series data by integrating a time-varying Geometric Brownian Motion model with a pricing mechanism used in financial engineering. Typical time series models such as Auto-Regressive Integrated Moving Average assume a linear correlation structure in time series data. When a stochastic process is highly volatile, such an assumption can be easily violated, leading to inaccurate predictions. We develop a new prediction model that can flexibly characterize a time-varying volatile process without assuming linearity. We formulate the prediction problem as an optimization problem with unequal overestimation and underestimation costs. Based on real option theories developed in finance, we solve the optimization problem and obtain a predicted value that minimizes the expected prediction cost. We evaluate the proposed approach using multiple datasets obtained from real-life applications including manufacturing, finance, and environment. The numerical results demonstrate that the proposed model shows competitive prediction capability compared with alternative approaches.
Submitted 19 April, 2019;
originally announced April 2019.
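The idea of predicting under unequal overestimation and underestimation costs can be illustrated with a simplified stand-in. The sketch below does not reproduce the paper's real-option solution or its time-varying model: it assumes a constant-parameter Geometric Brownian Motion and a piecewise-linear cost, for which the cost-minimizing forecast is the quantile of the (log-normal) future value at level c_under / (c_under + c_over). The function name `gbm_cost_aware_forecast` and all parameter names are hypothetical.

```python
from math import exp, sqrt
from statistics import NormalDist


def gbm_cost_aware_forecast(s0, mu, sigma, horizon, c_under, c_over):
    """Forecast minimizing expected piecewise-linear cost under constant-parameter GBM.

    Assumptions (illustrative, not from the paper): drift `mu` and
    volatility `sigma` are constant; underestimation costs `c_under`
    per unit and overestimation costs `c_over` per unit.
    """
    # Optimal forecast is the q-quantile of the future value.
    q = c_under / (c_under + c_over)
    z = NormalDist().inv_cdf(q)
    # Under GBM, log S_T is normal with mean log(s0) + (mu - sigma^2/2) * T
    # and standard deviation sigma * sqrt(T).
    return s0 * exp((mu - 0.5 * sigma**2) * horizon + sigma * sqrt(horizon) * z)


# Equal costs give the median; costlier underestimation pushes the forecast up.
median = gbm_cost_aware_forecast(100.0, 0.05, 0.2, 1.0, c_under=1.0, c_over=1.0)
cautious = gbm_cost_aware_forecast(100.0, 0.05, 0.2, 1.0, c_under=3.0, c_over=1.0)
print(median, cautious)
```

With equal costs the forecast reduces to the median of the log-normal distribution; raising `c_under` relative to `c_over` shifts the forecast to a higher quantile.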
-
Block Chain based Intelligent Industrial Network (DSDIN)
Authors:
Barco You,
Matthias Hub,
Mengzhe You,
Bo Xu,
Mingzhi Yu,
Ivan Uemlianin
Abstract:
The manufacturing industry was centralized in the past due to technical limitations, and factories (especially large manufacturers) gathered almost all of the resources for manufacturing, including technologies, raw materials, equipment, workers, market information, etc. However, such centralized production is costly, inefficient and inflexible, and struggles to respond to rapidly changing, diverse and personalized user needs. This paper introduces an Intelligent Industrial Network (DSDIN), which provides a fully distributed manufacturing network where everyone can participate in manufacturing thanks to decentralization and the absence of intermediate links, allowing them to quickly get the products or services they want and also to be authorized, recognized and rewarded at low cost for their efforts (such as providing creative ideas, designs or equipment, raw materials or physical strength). DSDIN is a blockchain based IoT and AI technology platform, and also an IoT based intelligent service standard. Because of the intelligent network formed by DSDIN, the manufacturing center is no longer a factory; in fact, there are no manufacturing centers. DSDIN provides a multi-participation peer-to-peer network for people and things (including raw materials, equipment, finished / semi-finished products, etc.). The information transmitted through the network is called Intelligent Service Algorithm (ISA). The user can send a process model, formula or control parameter to a device via an ISA, and every transaction in DSDIN is an intelligent service defined by ISA.
Submitted 18 September, 2018;
originally announced September 2018.
-
Free and Interacting Short-Range Entangled Phases of Fermions: Beyond the Ten-Fold Way
Authors:
Yu-An Chen,
Anton Kapustin,
Alex Turzillo,
Minyoung You
Abstract:
We extend the periodic table of phases of free fermions in the ten-fold way symmetry classes to a classification of free fermionic phases protected by an arbitrary on-site unitary symmetry $\hat G$ in an arbitrary dimension. The classification is described as a function of the real representation theory of $\hat G$ and the data of the original periodic table. We also systematically study in low dimensions the relationship between the free invariants and the invariants of short-range entangled interacting phases of fermions. Namely we determine whether a given symmetry protected phase of free fermions is destabilized by sufficiently strong interactions or it remains stable even in the presence of interactions. We also determine which interacting fermionic phases cannot be realized by free fermions. Examples of both destabilized free phases and intrinsically interacting phases are common in all dimensions.
Submitted 19 November, 2019; v1 submitted 13 September, 2018;
originally announced September 2018.
-
Integrative Density Forecast and Uncertainty Quantification of Wind Power Generation
Authors:
**gxing Wang,
Abdullah Alshelahi,
Mingdi You,
Eunshin Byon,
Romesh Saigal
Abstract:
The volatile nature of wind power generation creates challenges in achieving secure power grid operations. It is, therefore, necessary to predict wind power accurately and to quantify the uncertainty of the prediction. Wind power forecasting usually depends on wind speed prediction and the wind-to-power conversion process. However, most current wind power prediction models only consider portions of the uncertainty. This paper develops an integrative framework for predicting wind power density, considering uncertainties arising from both wind speed prediction and the wind-to-power conversion process. Specifically, we model wind speed using the inhomogeneous Geometric Brownian Motion and convert the wind speed prediction density into the wind power density in closed form. The resulting wind power density allows quantifying prediction uncertainties through prediction intervals. To forecast the power output, we minimize the expected prediction cost with (unequal) penalties on the overestimation and underestimation. We show the predictive power of the proposed approach using data from multiple operating wind farms located at different sites.
Submitted 27 September, 2020; v1 submitted 22 August, 2018;
originally announced August 2018.
-
Fermionic Matrix Product States and One-Dimensional Short-Range Entangled Phases with Anti-Unitary Symmetries
Authors:
Alex Turzillo,
Minyoung You
Abstract:
We extend the formalism of Matrix Product States (MPS) to describe one-dimensional gapped systems of fermions with both unitary and anti-unitary symmetries. Additionally, systems with orientation-reversing spatial symmetries are considered. The short-ranged entangled phases of such systems are classified by three invariants, which characterize the projective action of the symmetry on edge states. We give interpretations of these invariants as properties of states on the closed chain. The relationship between fermionic MPS systems at an RG fixed point and equivariant algebras is exploited to derive a group law for the stacking of fermionic phases protected by general fermionic symmetry groups.
Submitted 23 January, 2024; v1 submitted 29 September, 2017;
originally announced October 2017.
-
Adversarial Generation of Training Examples: Applications to Moving Vehicle License Plate Recognition
Authors:
Xinlong Wang,
Zhipeng Man,
Mingyu You,
Chunhua Shen
Abstract:
Generative Adversarial Networks (GAN) have attracted much research attention recently, leading to impressive results for natural image generation. However, to date little success has been observed in using GAN generated images for improving classification tasks. Here we attempt to explore, in the context of car license plate recognition, whether it is possible to generate synthetic training data using GAN to improve recognition accuracy. With a carefully-designed pipeline, we show that the answer is affirmative. First, a large-scale image set is generated using the generator of GAN, without manual annotation. Then, these images are fed to a deep convolutional neural network (DCNN) followed by a bidirectional recurrent neural network (BRNN) with long short-term memory (LSTM), which performs the feature learning and sequence labelling. Finally, the pre-trained model is fine-tuned on real images. Our experimental results on a few data sets demonstrate the effectiveness of using GAN images: an improvement of 7.5% over a strong baseline with moderate-sized real data being available. We show that the proposed framework achieves competitive recognition accuracy on challenging test datasets. We also leverage the depthwise separable convolution to construct a lightweight convolutional RNN, which is about half the size and 2x faster on CPU. Combining this framework and the proposed pipeline, we make progress in performing accurate recognition on mobile and embedded devices.
Submitted 10 November, 2017; v1 submitted 11 July, 2017;
originally announced July 2017.
-
Unified Framework for the Effective Rate Analysis of Wireless Communication Systems over MISO Fading Channels
Authors:
Minglei You,
Hongjian Sun,
**g Jiang,
Jiayi Zhang
Abstract:
This paper proposes a unified framework for the effective rate analysis over arbitrary correlated and not necessarily identical multiple-input single-output (MISO) fading channels, which uses a moment generating function (MGF) based approach and H transform representation. The proposed framework has the potential to simplify the cumbersome analysis procedure compared to the probability density function (PDF) based approach. Moreover, the effective rates over two specific fading scenarios are investigated, namely independent but not necessarily identically distributed (i.n.i.d.) MISO hyper Fox's H fading channels and arbitrary correlated generalized K fading channels. The exact analytical representations for these two scenarios are also presented. By substituting corresponding parameters, the effective rates in various practical fading scenarios, such as Rayleigh, Nakagami-m, Weibull/Gamma and generalized K fading channels, are readily available. In addition, asymptotic approximations are provided for the proposed H transform and MGF based approach as well as for the effective rate over i.n.i.d. MISO hyper Fox's H fading channels. Simulations under various fading scenarios are also presented, which support the validity of the proposed method.
Submitted 12 December, 2016;
originally announced December 2016.
-
Spin Topological Field Theory and Fermionic Matrix Product States
Authors:
Anton Kapustin,
Alex Turzillo,
Minyoung You
Abstract:
We study state-sum constructions of G-equivariant spin-TQFTs and their relationship to Matrix Product States. We show that in the Neveu-Schwarz, Ramond, and twisted sectors, the states of the theory are generalized Matrix Product States. We apply our results to revisit the classification of fermionic Short-Range-Entangled phases with a unitary symmetry G and determine the group law on the set of such phases. Interesting subtleties appear when the total symmetry group is a nontrivial extension of G by fermion parity.
Submitted 7 November, 2016; v1 submitted 31 October, 2016;
originally announced October 2016.
-
Topological Field Theory and Matrix Product States
Authors:
Anton Kapustin,
Alex Turzillo,
Minyoung You
Abstract:
It is believed that most (perhaps all) gapped phases of matter can be described at long distances by Topological Quantum Field Theory (TQFT). On the other hand, it has been rigorously established that in 1+1d ground states of gapped Hamiltonians can be approximated by Matrix Product States (MPS). We show that the state-sum construction of 2d TQFT naturally leads to MPS in their standard form. In the case of systems with a global symmetry $G$, this leads to a classification of gapped phases in 1+1d in terms of Morita-equivalence classes of $G$-equivariant algebras. Non-uniqueness of the MPS representation is traced to the freedom of choosing an algebra in a particular Morita class. In the case of Short-Range Entangled phases, we recover the group cohomology classification of SPT phases.
Submitted 2 November, 2019; v1 submitted 22 July, 2016;
originally announced July 2016.
-
A New approach to the construction of braided T-categories
Authors:
Daowei Lu,
Miman You
Abstract:
The aim of this paper is to construct a new braided $T$-category via the generalized Yetter-Drinfel'd modules and Drinfel'd codouble over a Hopf algebra, an approach different from that proposed by Panaite and Staic \cite{PS}. Moreover, in the finite-dimensional case, we show that this category coincides with the corepresentation of a certain coquasitriangular Turaev group algebra that we construct. Finally, we apply our theory to the case of a group algebra.
Submitted 12 February, 2017; v1 submitted 5 May, 2016;
originally announced May 2016.
-
A Note on Braided $T$-categories over Monoidal Hom-Hopf Algebras
Authors:
Miman You,
Shuanhong Wang
Abstract:
Let $Aut_{mHH}(H)$ denote the set of all automorphisms of a monoidal Hopf algebra $H$ with bijective antipode in the sense of Caenepeel and Goyvaerts \cite{CG2011}. The main aim of this paper is to provide new examples of braided $T$-categories in the sense of Turaev \cite{T2008}. For this, we first construct a monoidal Hom-Hopf $T$-coalgebra $\mathcal{MHD}(H)$ and prove that the $T$-category $Rep(\mathcal{MHD}(H))$ of representations of $\mathcal{MHD}(H)$ is isomorphic to $\mathcal{MHYD}(H)$ as braided $T$-categories, if $H$ is finite-dimensional. Then we construct a new braided $T$-category $\mathcal{ZMHYD}(H)$ over $\mathbb{Z}$, generalizing the main construction by Staic \cite{S2007}.
Submitted 24 November, 2014; v1 submitted 30 October, 2014;
originally announced October 2014.
-
Constructing New Braided $T$-categories over Monoidal Hom-Hopf Algebras
Authors:
Miman You,
Shuanhong Wang
Abstract:
Let $Aut_{mHH}(H)$ denote the set of all automorphisms of a monoidal Hopf algebra $H$ with bijective antipode in the sense of Caenepeel S. and Goyvaerts I. (Commun. Algebra 39, 2216-2240, 2011) and let $G$ be a crossed product group $Aut_{mHH}(H)\times Aut_{mHH}(H)$. The main aim of this paper is to provide further examples of braided $T$-categories in the sense of Turaev (1994, 2008). For this purpose, we first introduce a class of new categories $_{H}\mathcal{MHYD}^{H}(A, B)$ of monoidal Hom $(A, B)$-Yetter-Drinfeld modules with $A, B \in Aut_{mHH}(H)$. Then we show that the category ${\cal MHYD}(H)=\{{}_{H}\mathcal{MHYD}^{H}(A, B)\}_{(A, B)\in G}$ forms a braided $T$-category, generalizing the main construction by Panaite and Staic (Isr J Math 158:349-365, 2007).
Submitted 30 October, 2014; v1 submitted 26 May, 2014;
originally announced May 2014.
-
Beam Emittance Measurements for the Low-Energy Demonstration Accelerator Radio-Frequency Quadrupole
Authors:
M. E. Schulze,
J. D. Gilpatrick,
W. P. Lysenko,
L. J. Rybarcyk,
J. D. Schneider,
H. V. Smith, Jr.,
L. M. You
Abstract:
The Low-Energy Demonstration Accelerator (LEDA) radio-frequency quadrupole (RFQ) is a 100% duty factor (CW) linac that delivers >100 mA of H+ beam at 6.7 MeV. The 8-m-long, 350-MHz RFQ structure accelerates a dc, 75-keV, 110-mA H+ beam from the LEDA injector with >90% transmission. LEDA [1,2] consists of a 75-keV proton injector, a 6.7-MeV, 350-MHz CW RFQ with associated high-power and low-level rf systems, a short high-energy beam transport (HEBT), and a high-power (670-kW CW) beam stop. The beam emittance is inferred from wire scanner measurements of the beam profile at a single location in the HEBT. The beam profile is measured as a function of the magnetic field gradient in one of the HEBT quadrupoles. As the gradient is changed, the spot size passes through a transverse waist. Measurements are presented for peak currents between 25 and 100 mA.
Submitted 17 August, 2000; v1 submitted 15 August, 2000;
originally announced August 2000.