Showing 1–21 of 21 results for author: Singh, M K

Searching in archive eess.
  1. arXiv:2406.16917  [pdf]

    eess.SP

    GreenShield: CNN-Based Real-Time Forest Monitoring and Response

    Authors: Avishek Bhattacharjee, Swarup Samanta, Jagadish Bhattacharya, Manish Kumar Singh

    Abstract: This research introduces an innovative forest monitoring system designed to detect and mitigate the threats of forest fires. The proposed system leverages Arduino-based technology integrated with state-of-the-art sensors, including DHT11 for temperature and humidity detection and Flame sensor along with GSM module for gas and smoke detection. The integration of these sensors enables real-time data…

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2406.03822  [pdf, other]

    cs.SD cs.CR eess.AS

    SilentCipher: Deep Audio Watermarking

    Authors: Mayank Kumar Singh, Naoya Takahashi, Weihsiang Liao, Yuki Mitsufuji

    Abstract: In the realm of audio watermarking, it is challenging to simultaneously encode imperceptible messages while enhancing the message capacity and robustness. Although recent advancements in deep learning-based methods bolster the message capacity and robustness over traditional methods, the encoded messages introduce audible artefacts that restricts their usage in professional settings. In this study…

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2305.15055  [pdf, other]

    cs.SD cs.AI eess.AS

    Iteratively Improving Speech Recognition and Voice Conversion

    Authors: Mayank Kumar Singh, Naoya Takahashi, Onoe Naoyuki

    Abstract: Many existing works on voice conversion (VC) tasks use automatic speech recognition (ASR) models for ensuring linguistic consistency between source and converted samples. However, for the low-data resource domains, training a high-quality ASR remains to be a challenging task. In this work, we propose a novel iterative way of improving both the ASR and VC models. We first train an ASR model which i…

    Submitted 24 May, 2023; originally announced May 2023.

  4. arXiv:2302.13838  [pdf, other]

    cs.CV cs.SD eess.AS

    Cross-modal Face- and Voice-style Transfer

    Authors: Naoya Takahashi, Mayank K. Singh, Yuki Mitsufuji

    Abstract: Image-to-image translation and voice conversion enable the generation of a new facial image and voice while maintaining some of the semantics such as a pose in an image and linguistic content in audio, respectively. They can aid in the content-creation process in many applications. However, as they are limited to the conversion within each modality, matching the impression of the generated face an…

    Submitted 1 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  5. arXiv:2302.10536  [pdf, other]

    cs.SD cs.AI eess.AS

    Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing

    Authors: Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe

    Abstract: Primary goal of an emotional voice conversion (EVC) system is to convert the emotion of a given speech signal from one style to another style without modifying the linguistic content of the signal. Most of the state-of-the-art approaches convert emotions for seen speaker-emotion combinations only. In this paper, we tackle the problem of converting the emotion of speakers whose only neutral data ar…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: Demo Samples at https://demosamplesites.github.io/EVCUP/

  6. arXiv:2301.09776  [pdf, ps, other]

    eess.IV cs.IT cs.LG cs.MM

    Differentiable bit-rate estimation for neural-based video codec enhancement

    Authors: Amir Said, Manish Kumar Singh, Reza Pourreza

    Abstract: Neural networks (NN) can improve standard video compression by pre- and post-processing the encoded video. For optimal NN training, the standard codec needs to be replaced with a codec proxy that can provide derivatives of estimated bit-rate and distortion, which are used for gradient back-propagation. Since entropy coding of standard codecs is designed to take into account non-linear dependencies…

    Submitted 23 January, 2023; originally announced January 2023.

    Journal ref: Picture Coding Symposium (PCS), San Jose, CA, USA, 2022, pp. 379-383

  7. arXiv:2210.11096  [pdf, other]

    cs.SD cs.LG eess.AS

    Robust One-Shot Singing Voice Conversion

    Authors: Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji

    Abstract: Recent progress in deep generative models has improved the quality of voice conversion in the speech domain. However, high-quality singing voice conversion (SVC) of unseen singers remains challenging due to the wider variety of musical expressions in pitch, loudness, and pronunciation. Moreover, singing voices are often recorded with reverb and accompaniment music, which make SVC even more challen…

    Submitted 6 October, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

  8. arXiv:2210.05841  [pdf, other]

    math.OC eess.SY

    Towards Optimal Primary- and Secondary-control Design for Networks with Generators and Inverters

    Authors: Manish K. Singh, D. Venkatramanan, Sairaj Dhople

    Abstract: For power grids predominantly featuring large synchronous generators (SGs), there exists a significant body of work bridging optimization and control tasks. A generic workflow in such efforts entails: characterizing the steady state of control algorithms and SG dynamics; assessing the optimality of the resulting operating point with respect to an optimal dispatch task; and prescribing control para…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Presented at the Allerton conference 2022

  9. Learning Provably Stable Local Volt/Var Controllers for Efficient Network Operation

    Authors: Zhenyi Yuan, Guido Cavraro, Manish K. Singh, Jorge Cortés

    Abstract: This paper develops a data-driven framework to synthesize local Volt/Var control strategies for distributed energy resources (DERs) in power distribution networks (DNs). Aiming to improve DN operational efficiency, as quantified by a generic optimal reactive power flow (ORPF) problem, we propose a two-stage approach. The first stage involves learning the manifold of optimal operating points determ…

    Submitted 12 June, 2024; v1 submitted 26 September, 2022; originally announced September 2022.

    Journal ref: IEEE Transactions on Power Systems, vol. 39, no. 1, pp. 2066-2079, 2024

  10. arXiv:2208.09117  [pdf, ps, other]

    eess.SY

    Learning Local Volt/Var Controllers Towards Efficient Network Operation with Stability Guarantees

    Authors: Guido Cavraro, Zhenyi Yuan, Manish K. Singh, Jorge Cortés

    Abstract: This paper considers the problem of voltage regulation in distribution networks. The primary motivation is to keep voltages within preassigned operating limits by commanding the reactive power output of distributed energy resources (DERs) deployed in the grid. We develop a framework for developing local Volt/Var control that comprises two main steps. In the first, by exploiting historical data and…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE CDC 2022

  11. arXiv:2204.08998  [pdf, other]

    math.OC eess.SY

    Optimal Power Flow Schedules with Reduced Low-Frequency Oscillations

    Authors: Manish K. Singh, Vassilis Kekatos

    Abstract: The dynamic response of power grids to small events or persistent stochastic disturbances influences their stable operation. Low-frequency inter-area oscillations are of particular concern due to insufficient damping. This paper studies the effect of the operating point on the linear time-invariant dynamics of power networks. A pertinent metric based on the frequency response of grid dynamics is p…

    Submitted 19 April, 2022; originally announced April 2022.

  12. arXiv:2203.12084  [pdf, other]

    eess.SY

    Time-domain Generalization of Kron Reduction

    Authors: Manish K. Singh, Sairaj Dhople, Florian Dorfler, Georgios B. Giannakis

    Abstract: Kron reduction is a network-reduction method that eliminates nodes with zero current injections from electrical networks operating in sinusoidal steady state. In the time domain, the state-of-the-art application of Kron reduction has been in networks with transmission lines that have constant R/L ratios. This paper considers RL networks without such restriction and puts forth a provably exact time…

    Submitted 22 March, 2022; originally announced March 2022.

  13. arXiv:2203.08253  [pdf, other]

    eess.SY math.DS

    Integrated System Models for Networks with Generators & Inverters

    Authors: D. Venkatramanan, Manish K. Singh, Olaolu Ajala, Alejandro Dominguez-Garcia, Sairaj Dhople

    Abstract: Synchronous generators and inverter-based resources are complex systems with dynamics that cut across multiple intertwined physical domains and control loops. Modeling individual generators and inverters is, in itself, a very involved activity and has attracted dedicated attention from power engineers and control theorists over the years. Control and stability challenges associated with increasing…

    Submitted 24 July, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: In proceedings of the 11th Bulk Power Systems Dynamics and Control Symposium (IREP 2022), July 25-30, 2022, Banff, Canada

    Report number: IREP2022-72

  14. arXiv:2202.07500  [pdf, other]

    eess.SP

    Fast Inverter Control by Learning the OPF Mapping using Sensitivity-Informed Gaussian Processes

    Authors: Mana Jalali, Manish K. Singh, Vassilis Kekatos, Georgios B. Giannakis, Chen-Ching Liu

    Abstract: Fast inverter control is a desideratum towards the smoother integration of renewables. Adjusting inverter injection setpoints for distributed energy resources can be an effective grid control mechanism. However, finding such setpoints optimally requires solving an optimal power flow (OPF), which can be computationally taxing in real time. This work proposes learning the mapping from grid condition…

    Submitted 30 October, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  15. arXiv:2110.05054  [pdf, other]

    cs.SD cs.CR eess.AS

    Source Mixing and Separation Robust Audio Steganography

    Authors: Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji

    Abstract: Audio steganography aims at concealing secret information in carrier audio with imperceptible modification on the carrier. Although previous works addressed the robustness of concealed message recovery against distortions introduced during transmission, they do not address the robustness against aggressive editing such as mixing of other audio sources and source separation. In this work, we propos…

    Submitted 17 February, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted to ICASSP 2022

  16. arXiv:2103.13505  [pdf, other]

    eess.SY math.OC

    Ripple-Type Control for Enhancing Resilience of Networked Physical Systems

    Authors: Manish K. Singh, Guido Cavraro, Andrey Bernstein, Vassilis Kekatos

    Abstract: Distributed control agents have been advocated as an effective means for improving the resiliency of our physical infrastructures under unexpected events. Purely local control has been shown to be insufficient, centralized optimal resource allocation approaches can be slow. In this context, we put forth a hybrid low-communication saturation-driven protocol for the coordination of control agents th…

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted for presentation at the American Control Conference (ACC) 2021

  17. arXiv:2101.06842  [pdf, other]

    cs.SD cs.LG eess.AS

    Hierarchical disentangled representation learning for singing voice conversion

    Authors: Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji

    Abstract: Conventional singing voice conversion (SVC) methods often suffer from operating in high-resolution audio owing to a high dimensionality of data. In this paper, we propose a hierarchical representation learning that enables the learning of disentangled representations with multiple resolutions independently. With the learned disentangled representations, the proposed method progressively performs S…

    Submitted 25 April, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: Accepted at IJCNN 2021

  18. arXiv:2009.03380  [pdf, other]

    eess.SY

    Chance-Constrained Optimal Distribution Network Partitioning to Enhance Grid Resilience

    Authors: Shuchismita Biswas, Manish K. Singh, Virgilio Centeno

    Abstract: This paper formulates a chance-constrained optimal distribution network partitioning (ODNP) problem addressing uncertainties in load and renewable energy generation; and presents a solution methodology using sample average approximation (SAA). The objective is to identify potential sub-networks in the existing distribution grid; that are likely to survive as self-adequate islands if supply from th…

    Submitted 7 September, 2020; originally announced September 2020.

  19. arXiv:2003.12192  [pdf, other]

    eess.SY math.OC

    Moving horizon-based optimal scheduling of EV charging: A power system-cognizant approach

    Authors: Nitasha Sahani, Manish Kumar Singh, Chen-Ching Liu

    Abstract: The rapid escalation in plug-in electric vehicles (PEVs) and their uncoordinated charging patterns pose several challenges in distribution system operation. Some of the undesirable effects include overloading of transformers, rapid voltage fluctuations, and over/under voltages. While this compromises the consumer power quality, it also puts on extra stress on the local voltage control devices. The…

    Submitted 26 March, 2020; originally announced March 2020.

    Comments: Accepted for presentation at PES-General Meeting, Montreal, 2020

  20. arXiv:1911.12928  [pdf, other]

    cs.SD cs.LG eess.AS

    Improving Voice Separation by Incorporating End-to-end Speech Recognition

    Authors: Naoya Takahashi, Mayank Kumar Singh, Sakya Basak, Parthasaarathy Sudarsanam, Sriram Ganapathy, Yuki Mitsufuji

    Abstract: Despite recent advances in voice separation methods, many challenges remain in realistic scenarios such as noisy recording and the limits of available data. In this work, we propose to explicitly incorporate the phonetic and linguistic nature of speech by taking a transfer learning approach using an end-to-end automatic speech recognition (E2EASR) system. The voice separation is conditioned on dee…

    Submitted 3 May, 2020; v1 submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted in ICASSP 2020

  21. arXiv:1910.03020  [pdf, other]

    eess.SY math.OC

    Joint Grid Topology Reconfiguration and Design of Watt-VAR Curves for DERs

    Authors: Manish K. Singh, Sina Taheri, Vassilis Kekatos, Kevin P. Schneider, Chen-Ching Liu

    Abstract: Operators can now remotely control switches and update the control settings for voltage regulators and distributed energy resources (DERs), thus unleashing the network reconfiguration opportunities to improve efficiency. Aligned to this direction, this work puts forth a comprehensive toolbox of optimization models leveraging the control capabilities of smart grid assets. We put forth detailed yet…

    Submitted 22 February, 2020; v1 submitted 7 October, 2019; originally announced October 2019.