Search | arXiv e-print repository

SepHRNet: Generating High-Resolution Crop Maps from Remote Sensing imagery using HRNet with Separable Convolution

Authors: Priyanka Goyal, Sohan Patnaik, Adway Mitra, Manjira Sinha

Abstract: The accurate map** of crop production is crucial for ensuring food security, effective resource management, and sustainable agricultural practices. One way to achieve this is by analyzing high-resolution satellite imagery. Deep Learning has been successful in analyzing images, including remote sensing imagery. However, capturing intricate crop patterns is challenging due to their complexity and… ▽ More The accurate map** of crop production is crucial for ensuring food security, effective resource management, and sustainable agricultural practices. One way to achieve this is by analyzing high-resolution satellite imagery. Deep Learning has been successful in analyzing images, including remote sensing imagery. However, capturing intricate crop patterns is challenging due to their complexity and variability. In this paper, we propose a novel Deep learning approach that integrates HRNet with Separable Convolutional layers to capture spatial patterns and Self-attention to capture temporal patterns of the data. The HRNet model acts as a backbone and extracts high-resolution features from crop images. Spatially separable convolution in the shallow layers of the HRNet model captures intricate crop patterns more effectively while reducing the computational cost. The multi-head attention mechanism captures long-term temporal dependencies from the encoded vector representation of the images. Finally, a CNN decoder generates a crop map from the aggregated representation. Adaboost is used on top of this to further improve accuracy. The proposed algorithm achieves a high classification accuracy of 97.5\% and IoU of 55.2\% in generating crop maps. We evaluate the performance of our pipeline on the Zuericrop dataset and demonstrate that our results outperform state-of-the-art models such as U-Net++, ResNet50, VGG19, InceptionV3, DenseNet, and EfficientNet. This research showcases the potential of Deep Learning for Earth Observation Systems. △ Less

Submitted 11 July, 2023; originally announced July 2023.

arXiv:2302.09521 [pdf, other]

Rank-Minimizing and Structured Model Inference

Authors: Pawan Goyal, Benjamin Peherstorfer, Peter Benner

Abstract: While extracting information from data with machine learning plays an increasingly important role, physical laws and other first principles continue to provide critical insights about systems and processes of interest in science and engineering. This work introduces a method that infers models from data with physical insights encoded in the form of structure and that minimizes the model order so t… ▽ More While extracting information from data with machine learning plays an increasingly important role, physical laws and other first principles continue to provide critical insights about systems and processes of interest in science and engineering. This work introduces a method that infers models from data with physical insights encoded in the form of structure and that minimizes the model order so that the training data are fitted well while redundant degrees of freedom without conditions and sufficient data to fix them are automatically eliminated. The models are formulated via solution matrices of specific instances of generalized Sylvester equations that enforce interpolation of the training data and relate the model order to the rank of the solution matrices. The proposed method numerically solves the Sylvester equations for minimal-rank solutions and so obtains models of low order. Numerical experiments demonstrate that the combination of structure preservation and rank minimization leads to accurate models with orders of magnitude fewer degrees of freedom than models of comparable prediction quality that are learned with structure preservation alone. △ Less

Submitted 19 February, 2023; originally announced February 2023.

arXiv:2111.12995 [pdf, other]

Learning Low-Dimensional Quadratic-Embeddings of High-Fidelity Nonlinear Dynamics using Deep Learning

Authors: Pawan Goyal, Peter Benner

Abstract: Learning dynamical models from data plays a vital role in engineering design, optimization, and predictions. Building models describing dynamics of complex processes (e.g., weather dynamics, or reactive flows) using empirical knowledge or first principles are onerous or infeasible. Moreover, these models are high-dimensional but spatially correlated. It is, however, observed that the dynamics of h… ▽ More Learning dynamical models from data plays a vital role in engineering design, optimization, and predictions. Building models describing dynamics of complex processes (e.g., weather dynamics, or reactive flows) using empirical knowledge or first principles are onerous or infeasible. Moreover, these models are high-dimensional but spatially correlated. It is, however, observed that the dynamics of high-fidelity models often evolve in low-dimensional manifolds. Furthermore, it is also known that for sufficiently smooth vector fields defining the nonlinear dynamics, a quadratic model can describe it accurately in an appropriate coordinate system, conferring to the McCormick relaxation idea in nonconvex optimization. Here, we aim at finding a low-dimensional embedding of high-fidelity dynamical data, ensuring a simple quadratic model to explain its dynamics. To that aim, this work leverages deep learning to identify low-dimensional quadratic embeddings for high-fidelity dynamical systems. Precisely, we identify the embedding of data using an autoencoder to have the desired property of the embedding. We also embed a Runge-Kutta method to avoid the time-derivative computations, which is often a challenge. We illustrate the ability of the approach by a couple of examples, arising in describing flow dynamics and the oscillatory tubular reactor model. △ Less

Submitted 25 November, 2021; originally announced November 2021.

arXiv:2107.12950 [pdf, ps, other]

A Greedy Data Collection Scheme For Linear Dynamical Systems

Authors: Karim Cherifi, Pawan Goyal, Peter Benner

Abstract: Mathematical models are essential to analyze and understand the dynamics of complex systems. Recently, data-driven methodologies have got a lot of attention which is leveraged by advancements in sensor technology. However, the quality of obtained data plays a vital role in learning a good and reliable model. Therefore, in this paper, we propose an efficient heuristic methodology to collect data bo… ▽ More Mathematical models are essential to analyze and understand the dynamics of complex systems. Recently, data-driven methodologies have got a lot of attention which is leveraged by advancements in sensor technology. However, the quality of obtained data plays a vital role in learning a good and reliable model. Therefore, in this paper, we propose an efficient heuristic methodology to collect data both in the frequency domain and time-domain, aiming at the best possible information gain from limited experimental data. The efficiency of the proposed methodology is illustrated by means of several examples, and also, its robustness in the presence of noisy data is shown. △ Less

Submitted 27 July, 2021; originally announced July 2021.

arXiv:2106.05852 [pdf]

Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights

Authors: Devaraja Adiga, Rishabh Kumar, Amrith Krishna, Preethi Jyothi, Ganesh Ramakrishnan, Pawan Goyal

Abstract: Automatic speech recognition (ASR) in Sanskrit is interesting, owing to the various linguistic peculiarities present in the language. The Sanskrit language is lexically productive, undergoes euphonic assimilation of phones at the word boundaries and exhibits variations in spelling conventions and in pronunciations. In this work, we propose the first large scale study of automatic speech recognitio… ▽ More Automatic speech recognition (ASR) in Sanskrit is interesting, owing to the various linguistic peculiarities present in the language. The Sanskrit language is lexically productive, undergoes euphonic assimilation of phones at the word boundaries and exhibits variations in spelling conventions and in pronunciations. In this work, we propose the first large scale study of automatic speech recognition (ASR) in Sanskrit, with an emphasis on the impact of unit selection in Sanskrit ASR. In this work, we release a 78 hour ASR dataset for Sanskrit, which faithfully captures several of the linguistic characteristics expressed by the language. We investigate the role of different acoustic model and language model units in ASR systems for Sanskrit. We also propose a new modelling unit, inspired by the syllable level unit selection, that captures character sequences from one vowel in the word to the next vowel. We also highlight the importance of choosing graphemic representations for Sanskrit and show the impact of this choice on word error rates (WER). Finally, we extend these insights from Sanskrit ASR for building ASR systems in two other Indic languages, Gujarati and Telugu. For both these languages, our experimental results show that the use of phonetic based graphemic representations in ASR results in performance improvements as compared to ASR systems that use native scripts. △ Less

Submitted 23 July, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

Comments: Accepted paper at the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021 Findings)

arXiv:2005.09371 [pdf, ps, other]

A Non-Intrusive Method to Inferring Linear Port-Hamiltonian Realizations using Time-Domain Data

Authors: Karim Cherifi, Pawan Goyal, Peter Benner

Abstract: Port-Hamiltonian systems have gained a lot of attention in recent years due to their inherent valuable properties in modeling and control. In this paper, we are interested in constructing linear port-Hamiltonian systems from time-domain input-output data. We discuss a non-intrusive methodology that is comprised of two main ingredients -- (a) inferring frequency response data from time-domain data… ▽ More Port-Hamiltonian systems have gained a lot of attention in recent years due to their inherent valuable properties in modeling and control. In this paper, we are interested in constructing linear port-Hamiltonian systems from time-domain input-output data. We discuss a non-intrusive methodology that is comprised of two main ingredients -- (a) inferring frequency response data from time-domain data and (b) constructing an underlying port-Hamiltonian realization using the inferred frequency response data. We illustrate the proposed methodology by means of two numerical examples and also compare it with two other system identification methods to infer the frequency response from the input-output data. △ Less

Submitted 17 November, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

MSC Class: 93A30; 93B30; 93B15; 93B20

arXiv:1911.00080 [pdf, ps, other]

Identification of Port-Hamiltonian Systems from Frequency Response Data

Authors: Peter Benner, Pawan Goyal, Paul Van Dooren

Abstract: In this paper, we study the identification problem of a passive system from tangential interpolation data. We present a simple construction approach based on the Mayo-Antoulas generalized realization theory that automatically yields a port-Hamiltonian realization for every strictly passive system with simple spectral zeros. Furthermore, we discuss the construction of a frequency-limited port-Hamil… ▽ More In this paper, we study the identification problem of a passive system from tangential interpolation data. We present a simple construction approach based on the Mayo-Antoulas generalized realization theory that automatically yields a port-Hamiltonian realization for every strictly passive system with simple spectral zeros. Furthermore, we discuss the construction of a frequency-limited port-Hamiltonian realization. We illustrate the proposed method by means of several examples. △ Less

Submitted 31 October, 2019; originally announced November 2019.

arXiv:1910.00838 [pdf, ps, other]

Data-Driven Identification of Rayleigh-Damped Second-Order Systems

Authors: Igor Pontes Duff, Pawan Goyal, Peter Benner

Abstract: In this paper, we present a data-driven approach to identify second-order systems, having internal Rayleigh dam**. This means that the dam** matrix is given as a linear combination of the mass and stiffness matrices. These systems typically appear when performing various engineering studies, e.g., vibrational and structural analysis. In an experimental setup, the frequency response of a system… ▽ More In this paper, we present a data-driven approach to identify second-order systems, having internal Rayleigh dam**. This means that the dam** matrix is given as a linear combination of the mass and stiffness matrices. These systems typically appear when performing various engineering studies, e.g., vibrational and structural analysis. In an experimental setup, the frequency response of a system can be measured via various approaches, for instance, by measuring the vibrations using an accelerometer. As a consequence, given frequency samples, the identification of the underlying system relies on rational approximation. To that aim, we propose an identification of the corresponding second-order system, extending the Loewner framework for this class of systems. The efficiency of the proposed method is demonstrated by means of various numerical benchmarks. △ Less

Submitted 2 October, 2019; originally announced October 2019.

Comments: 16 pages, 6 figures

arXiv:1901.08759 [pdf, other]

Misleading Metadata Detection on YouTube

Authors: Priyank Palod, Ayush Patwari, Sudhanshu Bahety, Saurabh Bagchi, Pawan Goyal

Abstract: YouTube is the leading social media platform for sharing videos. As a result, it is plagued with misleading content that includes staged videos presented as real footages from an incident, videos with misrepresented context and videos where audio/video content is morphed. We tackle the problem of detecting such misleading videos as a supervised classification task. We develop UCNet - a deep networ… ▽ More YouTube is the leading social media platform for sharing videos. As a result, it is plagued with misleading content that includes staged videos presented as real footages from an incident, videos with misrepresented context and videos where audio/video content is morphed. We tackle the problem of detecting such misleading videos as a supervised classification task. We develop UCNet - a deep network to detect fake videos and perform our experiments on two datasets - VAVD created by us and publicly available FVC [8]. We achieve a macro averaged F-score of 0.82 while training and testing on a 70:30 split of FVC, while the baseline model scores 0.36. We find that the proposed model generalizes well when trained on one dataset and tested on the other. △ Less

Submitted 25 January, 2019; originally announced January 2019.

Comments: Accepted at European Conference on Information Retrieval(ECIR) 2019. 7 Pages

Showing 1–9 of 9 results for author: Goyal, P