Search | arXiv e-print repository

Flexible Music-Conditioned Dance Generation with Style Description Prompts

Authors: Hongsong Wang, Yin Zhu, Xin Geng

Abstract: Dance plays an important role as an artistic form and expression in human culture, yet the creation of dance remains a challenging task. Most dance generation methods rely solely on music, seldom taking into consideration intrinsic attributes such as music style or genre. In this work, we introduce Flexible Dance Generation with Style Description Prompts (DGSDP), a diffusion-based framework suitable for diversified tasks of dance generation by fully leveraging the semantics of music style. The core component of this framework is Music-Conditioned Style-Aware Diffusion (MCSAD), which comprises a Transformer-based network and a music Style Modulation module. The MCSAD seamlessly integrates music conditions and style description prompts into the dance generation framework, ensuring that generated dances are consistent with the music content and style. To facilitate flexible dance generation and accommodate different tasks, a spatial-temporal masking strategy is effectively applied in the backward diffusion process. The proposed framework successfully generates realistic dance sequences that are accurately aligned with music for a variety of tasks such as long-term generation, dance in-betweening, and dance inpainting. We hope that this work has the potential to inspire dance generation and creation, with promising applications in entertainment, art, and education.

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2405.02132 [pdf, other]

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

Authors: Xuelong Geng, Tianyi Xu, Kun Wei, Bingshen Mu, Hongfei Xue, He Wang, Yangze Li, Pengcheng Guo, Yuhang Dai, Longhao Li, Mingchen Shao, Lei Xie

Abstract: Large Language Models (LLMs) have demonstrated unparalleled effectiveness in various NLP tasks, and integrating LLMs with automatic speech recognition (ASR) is becoming a mainstream paradigm. Building upon this momentum, our research delves into an in-depth examination of this paradigm on a large open-source Chinese dataset. Specifically, our research aims to evaluate the impact of various configurations of speech encoders, LLMs, and projector modules in the context of the speech foundation encoder-LLM ASR paradigm. Furthermore, we introduce a three-stage training approach, expressly developed to enhance the model's ability to align auditory and textual information. The implementation of this approach, alongside the strategic integration of ASR components, enabled us to achieve SOTA performance on the AISHELL-1, Test_Net, and Test_Meeting test sets. Our analysis presents an empirical foundation for future research in LLM-based ASR systems and offers insights into optimizing performance using Chinese datasets. We will publicly release all scripts used for data preparation, training, inference, and scoring, as well as pre-trained models and training logs to promote reproducible research.

Submitted 6 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

arXiv:2404.03915 [pdf, other]

Nonlinear Kalman Filtering based on Self-Attention Mechanism and Lattice Trajectory Piecewise Linear Approximation

Authors: Jiaming Wang, Xinyu Geng, Jun Xu

Abstract: The traditional Kalman filter (KF) is widely applied in control systems, but it relies heavily on the accuracy of the system model and noise parameters, leading to potential performance degradation when facing inaccuracies. To address this issue, introducing neural networks into the KF framework offers a data-driven solution to compensate for these inaccuracies, improving the filter's performance while maintaining interpretability. Nevertheless, existing studies mostly employ recurrent neural networks (RNNs), which fail to fully capture the dependencies among state sequences and lead to an unstable training process. In this paper, we propose a novel Kalman filtering algorithm named the attention Kalman filter (AtKF), which incorporates a self-attention network to capture the dependencies among state sequences. To address the instability in the recursive training process, a parallel pre-training strategy is devised. Specifically, this strategy involves piecewise linearizing the system via lattice trajectory piecewise linear (LTPWL) expression, and generating pre-training data through a batch estimation algorithm, which exploits the self-attention mechanism's parallel processing ability. Experimental results on a two-dimensional nonlinear system demonstrate that AtKF outperforms other filters under noise disturbances and model mismatches.

Submitted 5 April, 2024; originally announced April 2024.

Comments: 7 pages, 4 figures

arXiv:2302.08223 [pdf, other]

doi 10.1109/TWC.2023.3336059

Energy Efficient Operation of Adaptive Massive MIMO 5G HetNets

Authors: Siddarth Marwaha, Eduard A. Jorswieck, Mostafa Jassim, Thomas Kuerner, David Lopez Perez, Xinli Geng, Harvey Bao

Abstract: For energy efficient operation of massive multiple-input multiple-output (MIMO) networks, various aspects of energy efficiency maximization have been addressed, where a careful selection of the number of active antennas has shown significant gains. Moreover, switching off physical resource blocks (PRBs) and carrier shutdown save energy in low load scenarios. However, the joint optimization of spectral PRB allocation and spatial layering in a heterogeneous network has not been completely solved yet. Therefore, we study a power consumption model for a multi-cell multi-user massive MIMO 5G network, capturing the joint effects of both dimensions. We characterize the optimal resource allocation under practical constraints, i.e., limited number of available antennas, PRBs, base stations (BSs), and frequency bands. We observe a single spatial layer achieving the lowest energy consumption in very low load scenarios, whereas spatial layering is required in high load scenarios. Finally, we derive novel algorithms for energy efficient user to BS assignment and propose an adaptive algorithm for PRB assignment and power control. All results are illustrated by numerical system-level simulations, describing a realistic metropolis scenario. The results show that a higher frequency band should be used to support users with large rate requirements via spatial multiplexing and assigning each user the maximum available PRBs.

Submitted 10 October, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

Journal ref: IEEE Transactions on Wireless Communications, 13 Nov, 2023

arXiv:2108.13405 [pdf, other]

Stochastic Uncertainty Propagation in Power System Dynamics using Measure-valued Proximal Recursions

Authors: Abhishek Halder, Kenneth F. Caluya, Pegah Ojaghi, Xinbo Geng

Abstract: We present a proximal algorithm that performs a variational recursion on the space of joint probability measures to propagate the stochastic uncertainties in power system dynamics over high dimensional state space. The proposed algorithm takes advantage of the exact nonlinearity structures in the trajectory-level dynamics of the networked power systems, and is nonparametric. Lifting the dynamics to the space of probability measures allows us to design a scalable algorithm that obviates gridding the underlying high dimensional state space which is computationally prohibitive. The proximal recursion implements a generalized infinite dimensional gradient flow, and evolves probability-weighted scattered point clouds. We clarify the theoretical nuances and algorithmic details specific to the power system nonlinearities, and provide illustrative numerical examples.

Submitted 24 August, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

arXiv:2103.16424 [pdf, other]

Two-stage Robust Energy Storage Planning with Probabilistic Guarantees: A Data-driven Approach

Authors: Chao Yan, Xinbo Geng, Zhaohong Bie, Le Xie

Abstract: This paper addresses a central challenge of jointly considering shorter-term (e.g., hourly) and longer-term (e.g., yearly) uncertainties in power system planning with increasing penetration of renewable and storage resources. In conventional planning decision making, shorter-term (e.g., hourly) variations are not explicitly accounted for. However, given the deepening penetration of variable resources, it is becoming imperative to consider such shorter-term variation in the longer-term planning exercise. By leveraging the abundant amount of operational observation data, we propose a scenario-based robust planning framework that provides rigorous guarantees on the future operation risk of planning decisions considering a broad range of operational conditions, such as renewable generation fluctuations and load variations. By connecting two-stage robust optimization with the scenario approach theory, we show that with a carefully chosen number of scenarios, the operational risk level of the robust solution can be adaptive to the risk preference set by planners. The theoretical guarantees hold true for any distributions, and the proposed approach is scalable towards real-world power grids. Furthermore, the column-and-constraint generation algorithm is used to solve the two-stage robust planning problem and tighten theoretical guarantees. We substantiate this framework through a planning problem of energy storage in a power grid with deep renewable penetration. Case studies are performed on large-scale test systems (modified IEEE 118-bus system) to illustrate the theoretical bounds as well as the scalability of the proposed algorithm.

Submitted 10 September, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

arXiv:2012.14624 [pdf, other]

Deferrable Load Scheduling under Demand Charge: A Block Model-Predictive Control Approach

Authors: Lei Yang, Xinbo Geng, Xiaohong Guan, Lang Tong

Abstract: Optimal scheduling of deferrable electrical loads can reshape the aggregated load profile to achieve higher operational efficiency and reliability. This paper studies deferrable load scheduling under demand charge that imposes a penalty on the peak consumption over a billing period. Such a terminal cost poses challenges in real-time dispatch when demand forecasts are inaccurate. A block model-predictive control approach is proposed by breaking demand charge into a sequence of stage costs. The problem of charging electric vehicles is used to illustrate the efficacy of the proposed approach. Numerical examples show that the block model-predictive control outperforms benchmark methods in various settings.

Submitted 11 January, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

Comments: 10 pages, 4 plots

arXiv:2011.05193 [pdf, other]

Probabilistic Hosting Capacity Analysis via Bayesian Optimization

Authors: Xinbo Geng, Lang Tong, Anirban Bhattacharya, Bani Mallick, Le Xie

Abstract: This paper studies the probabilistic hosting capacity analysis (PHCA) problem in distribution networks considering uncertainties from distributed energy resources (DERs) and residential loads. PHCA aims to compute the hosting capacity, which is defined as the maximal level of DERs that can be securely integrated into a distribution network while satisfying operational constraints with high probability. We formulate PHCA as a chance-constrained optimization problem, and model the uncertainties from DERs and loads using historical data. Due to non-convexities and a substantial number of historical scenarios being used, PHCA is often formulated as a large-scale nonlinear optimization problem, and is thus computationally intractable to solve. To address the core computational challenges, we propose a fast and extensible framework to solve PHCA based on Bayesian Optimization (BayesOpt). Compared with state-of-the-art algorithms such as interior point and active set, numerical results show that the proposed BayesOpt approach is able to find better solutions (25% higher hosting capacity) with 70% savings in computation time on average.

Submitted 10 November, 2020; originally announced November 2020.

arXiv:2001.00692 [pdf]

FFusionCGAN: An end-to-end fusion method for few-focus images using conditional GAN in cytopathological digital slides

Authors: Xiebo Geng, Sibo Liua, Wei Han, Xu Li, Jiabo Ma, Jingya Yu, Xiuli Liu, Sahoqun Zeng, Li Chen, Shenghua Cheng

Abstract: Multi-focus image fusion technologies compress different focus depth images into an image in which most objects are in focus. However, although existing image fusion techniques, including traditional algorithms and deep learning-based algorithms, can generate high-quality fused images, they need multiple images with different focus depths in the same field of view. This criterion may not be met in some cases where time efficiency is required or the hardware is insufficient. The problem is especially prominent in large-size whole slide images. This paper focuses on the multi-focus image fusion of cytopathological digital slide images, and proposes a novel method for generating fused images from single-focus or few-focus images based on a conditional generative adversarial network (GAN). Through the adversarial learning of the generator and discriminator, the method is capable of generating fused images with clear textures and large depth of field. Combined with the characteristics of cytopathological images, this paper designs a new generator architecture combining U-Net and DenseBlock, which can effectively improve the network's receptive field and comprehensively encode image features. Meanwhile, this paper develops a semantic segmentation network that identifies the blurred regions in cytopathological images. By integrating the network into the generative model, the quality of the generated fused images is effectively improved. Our method can generate fused images from only single-focus or few-focus images, thereby avoiding the problem of collecting multiple images of different focus depths with increased time and hardware costs. Furthermore, our model is designed to learn the direct mapping of input source images to fused images without the need to manually design complex activity level measurements and fusion rules as in traditional methods.

Submitted 2 January, 2020; originally announced January 2020.

arXiv:1910.10639 [pdf, other]

Chance-constrained Unit Commitment via the Scenario Approach

Authors: Xinbo Geng, Le Xie

Abstract: Keeping the balance between supply and demand is a fundamental task in power system operational planning practices. This task becomes particularly challenging due to the deepening penetration of renewable energy resources, which induces significant uncertainty. In this paper, we propose a chance-constrained Unit Commitment (c-UC) framework to tackle challenges from uncertainties of renewables. The proposed c-UC framework seeks cost-efficient scheduling of generators while ensuring operation constraints with guaranteed probability. We show that the scenario approach can be used to solve c-UC despite the non-convexity from binary decision variables. We reveal the salient structural properties of c-UC, which could significantly reduce the sample complexity required by the scenario approach and speed up computation. Case studies are performed on a modified 118-bus system.

Submitted 21 October, 2019; originally announced October 2019.

Comments: An extended version (added an illustrative example using the 3-bus system) of the conference paper in Proceedings of the 51st North American Power Symposium (NAPS). arXiv admin note: text overlap with arXiv:1910.07672

arXiv:1910.07672 [pdf, other]

Computing Essential Sets for Convex and Non-convex Scenario Problems: Theory and Application

Authors: Xinbo Geng, Le Xie, M. Sadegh Modarresi

Abstract: The scenario approach is a general data-driven algorithm for chance-constrained optimization. It seeks the optimal solution that is feasible to a carefully chosen number of scenarios. A crucial step in the scenario approach is to compute the cardinality of essential sets, which are the smallest subsets of scenarios that determine the optimal solution. This paper addresses the challenge of efficiently identifying essential sets. For convex problems, we demonstrate that the sparsest dual solution of the scenario problem could pinpoint the essential set. For non-convex problems, we show that two simple algorithms return the essential set when the scenario problem is non-degenerate. Finally, we illustrate the theoretical results and computational algorithms on security-constrained unit commitment (SCUC) in power systems. In particular, case studies of chance-constrained SCUC are performed in the IEEE 118-bus system. Numerical results suggest that the scenario approach could be an attractive solution to practical power system applications.

Submitted 13 October, 2020; v1 submitted 16 October, 2019; originally announced October 2019.

arXiv:1907.09811 [pdf, other]

doi 10.1109/TIP.2020.2984849

NPSA: Nonorthogonal Principal Skewness Analysis

Authors: Xiurui Geng, Lei Wang

Abstract: Principal skewness analysis (PSA) has been introduced for feature extraction in hyperspectral imagery. As a third-order generalization of principal component analysis (PCA), its solution of searching for the locally maximum skewness direction is transformed into the problem of calculating the eigenpairs (the eigenvalues and the corresponding eigenvectors) of a coskewness tensor. By combining a fixed-point method with an orthogonal constraint, it can prevent the new eigenpairs from converging to the same maxima that have been determined before. However, the eigenvectors of the supersymmetric tensor are not inherently orthogonal in general, which implies that the results obtained by the search strategy used in PSA may unavoidably deviate from the actual eigenpairs. In this paper, we propose a new nonorthogonal search strategy to solve this problem and the new algorithm is named nonorthogonal principal skewness analysis (NPSA). The contribution of NPSA lies in the finding that the search space of the eigenvector to be determined can be enlarged by using the orthogonal complement of the Kronecker product of the previous one, instead of its orthogonal complement space. We give a detailed theoretical proof to illustrate why the new strategy can result in more accurate eigenpairs. In addition, after some algebraic derivations, the complexity of the presented algorithm is also greatly reduced. Experiments with both simulated data and real multi/hyperspectral imagery demonstrate its validity in feature extraction.

Submitted 23 July, 2019; originally announced July 2019.

arXiv:1606.09564 [pdf, other]

Architecture and Algorithms for Privacy Preserving Thermal Inertial Load Management by A Load Serving Entity

Authors: Abhishek Halder, Xinbo Geng, P. R. Kumar, Le Xie

Abstract: Motivated by the growing importance of demand response in modern power system operations, we propose an architecture and supporting algorithms for privacy preserving thermal inertial load management as a service provided by the load serving entity (LSE). We focus on an LSE managing a population of its customers' air conditioners, and propose a contractual model where the LSE guarantees quality of service to each customer in terms of keeping their indoor temperature trajectories within respective bands around the desired individual comfort temperatures. We show how the LSE can price the contracts differentiated by the flexibility embodied by the width of the specified bands. We address architectural questions of (i) how the LSE can strategize its energy procurement based on price and ambient temperature forecasts, (ii) how an LSE can close the real time control loop at the aggregate level while providing individual comfort guarantees to loads, without ever measuring the states of an air conditioner for privacy reasons. Control algorithms to enable our proposed architecture are given, and their efficacy is demonstrated on real data.

Submitted 29 November, 2016; v1 submitted 30 June, 2016; originally announced June 2016.

arXiv:1603.07276 [pdf, other]

Learning the LMP-Load Coupling From Data: A Support Vector Machine Based Approach

Authors: Xinbo Geng, Le Xie

Abstract: This paper investigates the fundamental coupling between loads and locational marginal prices (LMPs) in security-constrained economic dispatch (SCED). Theoretical analysis based on multi-parametric programming theory points out the unique one-to-one mapping between load and LMP vectors. Such one-to-one mapping is depicted by the concept of system pattern region (SPR) and identifying SPRs is the key to understanding the LMP-load coupling. Built upon the characteristics of SPRs, the SPR identification problem is modeled as a classification problem from a market participant's viewpoint, and a Support Vector Machine based data-driven approach is proposed. It is shown that even without the knowledge of system topology and parameters, the SPRs can be estimated by learning from historical load and price data. Visualization and illustration of the proposed data-driven approach are performed on a 3-bus system as well as the IEEE 118-bus system.

Submitted 23 March, 2016; originally announced March 2016.

Showing 1–14 of 14 results for author: Geng, X