Search | arXiv e-print repository

Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limitations. To mitigate this, we propose a scalable framework called Xl-OpSumm that generates summaries incrementally. However, the existing test set, AMASUM, has only 560 reviews per product on average. Due to the lack of a test set with thousands of reviews, we created a new test set called Xl-Flipkart by gathering data from the Flipkart website and generating summaries using GPT-4. Through various automatic evaluations and extensive analysis, we evaluated the framework's efficiency on two datasets, AMASUM and Xl-Flipkart. Experimental results show that our framework, Xl-OpSumm powered by Llama-3-8B-8k, achieves an average ROUGE-1 F1 gain of 4.38% and a ROUGE-L F1 gain of 3.70% over the next best-performing model.

Submitted 16 June, 2024; originally announced June 2024.
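
The abstract above says only that summaries are generated incrementally, without giving the update rule. A minimal sketch of one way such a loop could look, assuming the reviews are consumed in fixed-size chunks and a running summary is revised after each chunk. The function `incremental_summary`, the `summarize` callable (standing in for an LLM call) and `chunk_size` are hypothetical names for illustration, not part of Xl-OpSumm.

```python
def incremental_summary(reviews, summarize, chunk_size=50):
    """Fold a long list of reviews into one summary, chunk by chunk.

    `summarize(previous_summary, chunk)` is a hypothetical callable
    (for example a wrapper around an LLM prompt) that returns an updated
    summary; `previous_summary` is None for the first chunk.
    """
    summary = None
    for start in range(0, len(reviews), chunk_size):
        chunk = reviews[start:start + chunk_size]
        # Only the running summary and the current chunk are passed on,
        # so the input size per call stays bounded by the chunk size.
        summary = summarize(summary, chunk)
    return summary
```

The point of the design is that each call sees a bounded amount of text regardless of how many reviews the product has, which is what keeps the input within a fixed context window.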

arXiv:2404.17616 [pdf, ps, other]

Behaviours of rip cosmological models in $f(Q,C)$ gravity

Authors: Amit Samaddar, S. Surendra Singh, Shah Muhammad, Euaggelos E. Zotos

Abstract: In this study, the Universe's rip cosmology theories have been provided for the $f(Q,C)$ gravity theory, where $Q$ and $C$ stand for the non-metricity scalar and boundary term. We assumed $f(Q,C)=αQ^{n}+βC$ and analyzed the nature of the physical parameters for the Little Rip, Big Rip and Pseudo Rip models. In the LR and PR models, the EoS parameter exhibits phantom characteristics but remains closely aligned with the $Λ$CDM line. After investigating the energy conditions, we recognised that our model violates the strong energy constraint. Avoiding singularity situations has been noted in all of these accelerated models. The characteristics of the jerk and snap parameters have been investigated. Our model provides an effective description of the Universe's evolutionary history and fits well with contemporary cosmic data.

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2404.05243 [pdf, other]

Product Description and QA Assisted Self-Supervised Opinion Summarization

Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) strategy that leverages information from reviews as well as additional sources for selecting one of the reviews as a pseudo-summary to enable supervised training. Our Multi-Encoder Decoder framework for Opinion Summarization (MEDOS) employs a separate encoder for each source, enabling effective selection of information while generating the summary. For evaluation, due to the unavailability of test sets with additional sources, we extend the Amazon, Oposum+, and Flipkart test sets and leverage ChatGPT to annotate summaries. Experiments across nine test sets demonstrate that the combination of our SDC approach and MEDOS model achieves on average a 14.5% improvement in ROUGE-1 F1 over the SOTA. Moreover, comparative analysis underlines the significance of incorporating additional sources for generating more informative summaries. Human evaluations further indicate that MEDOS scores relatively higher in coherence and fluency with 0.41 and 0.5 (-1 to 1) respectively, compared to existing models. To the best of our knowledge, we are the first to generate opinion summaries leveraging additional sources in a self-supervised setting.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2403.20313 [pdf, other]

Towards a turnkey approach to unbiased Monte Carlo estimation of smooth functions of expectations

Authors: Nicolas Chopin, Francesca R. Crucinio, Sumeetpal S. Singh

Abstract: Given a smooth function $f$, we develop a general approach to turn Monte Carlo samples with expectation $m$ into an unbiased estimate of $f(m)$. Specifically, we develop estimators that are based on randomly truncating the Taylor series expansion of $f$ and estimating the coefficients of the truncated series. We derive their properties and propose a strategy to set their tuning parameters -- which depend on $m$ -- automatically, with a view to making the whole approach simple to use. We develop our methods for the specific functions $f(x)=\log x$ and $f(x)=1/x$, as they arise in several statistical applications such as maximum likelihood estimation of latent variable models and Bayesian inference for un-normalised models. Detailed numerical studies are performed for a range of applications to determine how competitive and reliable the proposed approach is.

Submitted 12 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.
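
To make the idea of randomly truncating a Taylor series concrete, here is a minimal sketch for $f(x)=1/x$. It uses the geometric series $1/m = (1/c)\sum_{k\ge 0}(1-m/c)^k$ (valid when $|1-m/c|<1$), a geometric stopping time, and a fresh independent sample for each factor. This is a generic textbook-style construction, not necessarily the estimator or the automatic tuning rule proposed in the paper. The names `unbiased_reciprocal`, `c` and `r` are hypothetical, and both tuning constants are simply assumed to be given.

```python
import random


def unbiased_reciprocal(sample, c, r=0.6, rng=random):
    """Return one unbiased draw of 1/m, where m = E[sample()].

    Assumptions (illustrative, not taken from the paper):
      * `sample()` returns independent draws X with mean m;
      * `c` is an expansion point with |1 - m/c| < 1;
      * `r` in (0, 1) is the continuation probability, and
        E[(1 - X/c)**2] < r so that the variance is finite.
    """
    total = 0.0
    prod = 1.0  # product of k independent factors (1 - X_i / c)
    surv = 1.0  # P(K >= k) = r**k for the random truncation level K
    while True:
        # Term k of the series, reweighted by the chance of reaching it.
        total += prod / surv
        if rng.random() >= r:  # stop with probability 1 - r
            break
        prod *= 1.0 - sample() / c
        surv *= r
    return total / c
```

Averaging many independent calls gives a Monte Carlo estimate of $1/m$ with no truncation bias. The cost is a random number of samples per call, with expectation $r/(1-r)$.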

arXiv:2402.15473 [pdf, other]

Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of tens of thousands) to train $\varphi$. Such a large-scale annotation is justifiable when it's a one-time effort, and the reward model is universally applicable. However, human goals are subjective and depend on the task, requiring task-specific preference annotations, which can be impractical to fulfill. To address this challenge, we propose a novel approach to infuse domain knowledge into $\varphi$, which reduces the amount of preference annotation required ($21\times$), omits Alignment Tax, and provides some interpretability. We validate our approach in E-Commerce Opinion Summarization, with a significant reduction in dataset size (to just $940$ samples) while advancing the SOTA ($\sim4$ point ROUGE-L improvement, $68\%$ of times preferred by humans over SOTA). Our contributions include a novel Reward Modeling technique and two new datasets: PromptOpinSumm (supervised data for Opinion Summarization) and OpinPref (a gold-standard human preference dataset). The proposed methodology opens up avenues for efficient RLHF, making it more adaptable to applications with varying human values. We release the artifacts (Code: github.com/efficient-rlhf. PromptOpinSumm: hf.co/prompt-opin-summ. OpinPref: hf.co/opin-pref) for usage under MIT License.

Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

Comments: 19 pages, 6 figures, 21 tables

arXiv:2402.11683 [pdf, other]

One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation; however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluation datasets inhibit progress. To address this, we release the SUMMEVAL-OP dataset covering 7 dimensions related to the evaluation of opinion summaries: fluency, coherence, relevance, faithfulness, aspect coverage, sentiment consistency, and specificity. We investigate Op-I-Prompt, a dimension-independent prompt, and Op-Prompts, a dimension-dependent set of prompts for opinion summary evaluation. Experiments indicate that Op-I-Prompt emerges as a good alternative for evaluating opinion summaries, achieving an average Spearman correlation of 0.70 with humans, outperforming all previous approaches. To the best of our knowledge, we are the first to investigate LLMs as evaluators on both closed-source and open-source models in the opinion summarization domain.

Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

arXiv:2312.17572 [pdf, other]

Mixing time of the conditional backward sampling particle filter

Authors: Joona Karjalainen, Anthony Lee, Sumeetpal S. Singh, Matti Vihola

Abstract: The conditional backward sampling particle filter (CBPF) is a powerful Markov chain Monte Carlo sampler for general state space hidden Markov model smoothing. It was proposed as an improvement over the conditional particle filter, which is known to have an $O(T^2)$ computational time complexity under a general `strong' mixing assumption, where $T$ is the time horizon. We provide the first proof that the CBPF admits an $O(T \log T)$ time complexity under strong mixing, complementing strong empirical evidence of the superiority of the CBPF in practice. In particular, the CBPF's mixing time is upper bounded by $O(\log T)$, for any sufficiently large number of particles $N$ that depends only on the mixing assumptions and not $T$. We show that an $O(\log T)$ mixing time is optimal. The proof involves the analysis of a novel coupling of two CBPFs, which involves a maximal coupling of two particle systems at each time instant. The coupling is implementable, and thus can also be used to construct unbiased, finite variance, estimates of functionals which have arbitrary dependence on the latent state's path, with a total expected cost of $O(T \log T)$. We also investigate other couplings, and we show some of these alternatives have improved empirical behaviour.

Submitted 22 February, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

Comments: 30 pages, 7 figures; revised before submission to a journal

MSC Class: Primary 60J22; secondary 65C05; 65C40; 65C35; 62M05

arXiv:2309.08517 [pdf, ps, other]

On the Forgetting of Particle Filters

Authors: Joona Karjalainen, Anthony Lee, Sumeetpal S. Singh, Matti Vihola

Abstract: We study the forgetting properties of the particle filter when its state - the collection of particles - is regarded as a Markov chain. Under a strong mixing assumption on the particle filter's underlying Feynman-Kac model, we find that the particle filter is exponentially mixing, and forgets its initial state in $O(\log N)$ `time', where $N$ is the number of particles and time refers to the number of particle filter algorithm steps, each comprising a selection (or resampling) and mutation (or prediction) operation. We present an example which suggests that this rate is optimal. In contrast to our result, available results to date are extremely conservative, suggesting $O(α^N)$ time steps are needed, for some $α>1$, for the particle filter to forget its initialisation. We also study the conditional particle filter (CPF) and extend our forgetting result to this context. We establish a similar conclusion, namely, CPF is exponentially mixing and forgets its initial state in $O(\log N)$ time. To support this analysis, we establish new time-uniform $L^p$ error estimates for CPF, which can be of independent interest.

Submitted 15 September, 2023; originally announced September 2023.

Comments: 26 pages
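
The abstract above counts time in particle filter steps, each made of a selection and a mutation operation. A minimal sketch of one such step, assuming multinomial resampling and a user-supplied Markov kernel and log-potential. The names `particle_filter_step`, `mutate` and `log_potential` are hypothetical, and the code illustrates a generic bootstrap-style step rather than the specific setting analysed in the paper.

```python
import math
import random


def particle_filter_step(particles, weights, mutate, log_potential, rng):
    """One particle filter step: selection, then mutation, then reweighting.

    `mutate(x, rng)` samples from the (assumed) Markov transition kernel and
    `log_potential(x)` returns the log of the (assumed) potential function.
    """
    n = len(particles)
    # Selection (resampling): draw N ancestors with probability
    # proportional to the current weights (multinomial resampling).
    ancestors = rng.choices(particles, weights=weights, k=n)
    # Mutation (prediction): move each selected particle independently.
    new_particles = [mutate(x, rng) for x in ancestors]
    # Reweight with the potential, normalising in a numerically stable way.
    log_w = [log_potential(x) for x in new_particles]
    max_log_w = max(log_w)
    unnormalised = [math.exp(lw - max_log_w) for lw in log_w]
    total = sum(unnormalised)
    return new_particles, [w / total for w in unnormalised]
```

Viewed this way, the state of the filter after each step is the whole collection of N particles and weights. That collection is the Markov chain whose forgetting is being studied.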

arXiv:2308.10874 [pdf, other]

Analyzing Transformer Dynamics as Movement through Embedding Space

Authors: Sumeet S. Singh

Abstract: Transformer-based language models exhibit intelligent behaviors such as understanding natural language, recognizing patterns, acquiring knowledge, reasoning, planning, reflecting and using tools. This paper explores how their underlying mechanics give rise to intelligent behaviors. Towards that end, we propose framing Transformer dynamics as movement through embedding space. Examining Transformers through this perspective reveals key insights, establishing a Theory of Transformers: 1) Intelligent behaviours map to paths in Embedding Space which the Transformer random-walks through during inferencing. 2) LM training learns a probability distribution over all possible paths. `Intelligence' is learnt by assigning higher probabilities to paths representing intelligent behaviors. No learning can take place in-context; context only narrows the subset of paths sampled during decoding. 5) The Transformer is a self-mapping composition function, folding a context sequence into a context-vector such that its proximity to a token-vector reflects its co-occurrence and conditioned probability. Thus, the physical arrangement of vectors in Embedding Space determines path probabilities. 6) Context vectors are composed by aggregating features of the sequence's tokens via a process we call the encoding walk. Attention contributes a - potentially redundant - association-bias to this process. 7) This process is comprised of two principal operation types: filtering (data independent) and aggregation (data dependent). This generalization unifies Transformers with other sequence models. Building upon this foundation, we formalize a popular semantic interpretation of embeddings into a ``concept-space theory'' and find some evidence of its validity.

Submitted 14 November, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

Comments: V2. Rewrote abstract. Rewrote / re-organized the entire paper into a more formal proposition/argument/result format. To shorten main paper length: Wrote more compact text in general, moved "negative self bias" and "encoder v/s decoder walks" sections to the appendix and packed figures. Styled as TMLR

arXiv:2306.07689 [pdf, ps, other]

Stability Analysis of Cosmological models in $f(T,φ)$ Gravity

Authors: Amit Samaddar, S. Surendra Singh

Abstract: We investigate the stability conditions in $f(T,φ)$ gravity theory for two models by using dynamical system analysis. We assume the forms of $G(T)$ are $(i)$ $G(T)=αT+\frac{β}{T}$, $(ii)$ $G(T)=ζT\ln(ψT)$, where $α$, $β$, $ζ$ and $ψ$ are free parameters. We evaluate the equilibrium points for these models and examine their stability behavior. We found five stable critical points for Model I and three stable critical points for Model II. The phase plots for these systems are examined and their physical interpretation is discussed. We illustrate all the cosmological parameters such as $Ω_{m}$, $Ω_φ$, $q$ and $ω_{Tot}$ at each fixed point and compare the parameters with observational values. Further, we assume a hybrid scale factor, for which the relation between redshift and time is $t(z)=\frac{δ}{σ}W\bigg[\frac{σ}{δ}\bigg(\frac{1}{a_{1}(1+z)}\bigg)^{\frac{1}{δ}}\bigg]$. We transform all the parameters in redshift by using this equation and examine the behavior of these parameters. Our models represent the accelerating stage of the Universe. The energy conditions are examined in terms of redshift and SEC is not satisfied for the model. We also find the statefinder parameters $\{r,s\}$ in terms of $z$ and discuss the nature of the $r-s$ and $r-q$ planes. For both pairs $\{r,s\}$ and $\{r,q\}$ our models represent the $Λ$CDM model. Hence, we determine that our $f(T,φ)$ models are stable and satisfy all the observational values.

Submitted 13 June, 2023; originally announced June 2023.
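
The redshift-time relation quoted in the abstract above can be recovered in a few lines. This is a sketch under an assumption the abstract does not state explicitly: that the hybrid scale factor has the commonly used form $a(t)=a_{1}t^{δ}e^{σt}$, normalised so that $1+z=1/a(t)$. Here $W$ denotes the Lambert function, defined by $W(x)e^{W(x)}=x$.

```latex
\begin{align*}
\frac{1}{1+z} &= a_{1}\, t^{\delta} e^{\sigma t}
  && \text{(assumed hybrid scale factor)} \\
t\, e^{\sigma t/\delta} &= \left(\frac{1}{a_{1}(1+z)}\right)^{1/\delta}
  && \text{(divide by } a_{1}\text{, raise to the power } 1/\delta\text{)} \\
\frac{\sigma}{\delta}\, t\, e^{(\sigma/\delta)\, t}
  &= \frac{\sigma}{\delta}\left(\frac{1}{a_{1}(1+z)}\right)^{1/\delta}
  && \text{(multiply both sides by } \sigma/\delta\text{)} \\
t(z) &= \frac{\delta}{\sigma}\,
  W\!\left[\frac{\sigma}{\delta}\left(\frac{1}{a_{1}(1+z)}\right)^{1/\delta}\right]
  && \text{(apply } W \text{ to both sides)}
\end{align*}
```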

arXiv:2302.02999 [pdf, ps, other]

Stability analysis of cosmological models coupled minimally with scalar field in $f(Q)$ gravity

Authors: Amit Samaddar, S. Surendra Singh, Shivangi Rathore

Abstract: In this work, in the framework of dynamical system analysis, we focus on the study of the accelerated expansion of the Universe in $f(Q)$ gravity theory, where $Q$ is the non-metricity that describes the gravitational interaction. We consider the linear form of $f(Q)$ gravity, i.e. $f(Q)=-α_{1}Q-α_{2}$, where $α_{1}$ and $α_{2}$ are constants. We consider an interaction between dark matter (DM) and dark energy (DE) in $f(Q)$ gravity. To reduce the modified Friedmann equations to an autonomous system of first-order ordinary differential equations, we introduce some new dimensionless variables. The nature of the critical points is discussed by finding the eigenvalues of the Jacobian matrix. We get six critical points for the interacting DE model. We also analyze the density parameter, equation of state (EoS) parameter and deceleration parameter and draw their plots, and we conclude that for some suitable range of the parameters $λ$ and $α$, the value of the deceleration parameter is $q=-1$, which shows that the expansion of the Universe is accelerating, and the value of the EoS parameter is $ω_φ=-1$, which shows that the model is the $Λ$CDM model. Finally, we discuss the classical as well as quantum stability of the model.

Submitted 18 January, 2023; originally announced February 2023.

Comments: 23 pages, 10 figures

arXiv:2211.12580 [pdf, other]

Quasi-Newton Sequential Monte Carlo

Authors: Samuel Duffield, Sumeetpal S. Singh

Abstract: Sequential Monte Carlo samplers represent a compelling approach to posterior inference in Bayesian models, due to being parallelisable and providing an unbiased estimate of the posterior normalising constant. In this work, we significantly accelerate sequential Monte Carlo samplers by adopting the L-BFGS Hessian approximation which represents the state-of-the-art in full-batch optimisation techniques. The L-BFGS Hessian approximation has only linear complexity in the parameter dimension and requires no additional posterior or gradient evaluations. The resulting sequential Monte Carlo algorithm is adaptive, parallelisable and well-suited to high-dimensional and multi-modal settings, which we demonstrate in numerical experiments on challenging posterior distributions.

Submitted 22 November, 2022; originally announced November 2022.

arXiv:2211.07376 [pdf, ps, other]

doi 10.1140/epjc/s10052-023-11458-2

Qualitative Stability Analysis of Cosmological Parameters in $f(T,B)$ Gravity

Authors: Amit Samaddar, S. Surendra Singh

Abstract: We analyze the cosmological solutions of $f(T,B)$ gravity using dynamical system analysis, where $T$ is the torsion scalar and $B$ is the boundary term scalar. In our work, we assume two specific cosmological models. For the first model, we consider $f(T,B)=f_{0}(B^{k}+T^{m})$, where $k$ and $m$ are constants. For the second model, we consider $f(T,B)=f_{0}T B$. We generate an autonomous system of differential equations for each model by introducing new dimensionless variables. To solve this system of equations, we use dynamical system analysis. We also investigate the critical points and their nature, their stability conditions and the corresponding behavior of the expansion of the Universe. For both models, we get four critical points. The phase plots of this system are analyzed in detail and their geometrical interpretations are also studied. In both models, we evaluate density parameters such as $Ω_{r}$, $Ω_{m}$, $Ω_Λ$ and $ω_{eff}$ and the deceleration parameter $(q)$, and find the suitable range of the parameter $λ$ for stability. For the first model, we get $ω_{eff}=-0.833,-0.166$ and for the second model, we get $ω_{eff}=-\frac{1}{3}$. This shows that both models are in the quintessence phase. Further, we compare the values of the EoS parameter and deceleration parameter with the observational values.

Submitted 10 November, 2022; originally announced November 2022.

Comments: Figures=4; page=17,

arXiv:2211.05086 [pdf, ps, other]

Renyi Holographic dark energy and its behaviour in f(G) gravity

Authors: Md Khurshid Alam, S. Surendra Singh, L. Anjana Devi

Abstract: In this work, the Renyi holographic dark energy (RHDE) and its behaviour have been explored with the anisotropic and spatially homogeneous Bianchi type-I Universe in the framework of $f(G)$ gravity. We use the Hubble and Granda-Oliveros (GO) horizons as IR cutoffs. To find consistent solutions of the field equations of the models, it is assumed that the deceleration parameter is defined as a function of the Hubble parameter $H$. With reference to current cosmological data, the behaviors of the cosmological parameters relating to the dark energy model are evaluated and their physical significance is examined. It is observed that for both models, the equation of state parameter approaches $-1$ at late times. However, the RHDE model with the Hubble horizon exhibits stability from the squared sound speed, but the RHDE model with the GO horizon exhibits instability. In both models, the deceleration parameter and statefinder diagnostic confirm the accelerated expansion of the Universe and also correspond to the $Λ$CDM model at late times.

Submitted 26 October, 2022; originally announced November 2022.

arXiv:2209.03959 [pdf, ps, other]

Anisotropic Universe in f(Q) gravity with Hybrid expansion

Authors: L. Anjana Devi, S. Surendra Singh, Leishingam Kumrah, Md Khurshid Alam

Abstract: Despite having a reasonably successful account of accelerated cosmology, understanding the early evolution of the Universe has always been difficult for mankind. Our promising strategy is based on a novel class of symmetric teleparallel theories of gravity called $f(Q)$, in which the gravitational interaction is caused by the non-metricity scalar $Q$, which may help to solve some problems. We consider the locally rotationally symmetric (LRS) Bianchi type-I spacetime cosmological models and derive the equations of motion to study the early evolution of the cosmos. By assuming the Hybrid Expansion Law (HEL) for the average scale factor, we are able to determine the solutions to the field equations of Bianchi type-I spacetime. We discuss the energy density profile, the equation of state, and the skewness parameter and conclude that our models preserve anisotropic spatial geometry during the early stages of the Universe with the possibility of an anisotropic fluid present. However, as time goes on, even in the presence of an anisotropic fluid, the Universe may move towards isotropy due to inflation while the anisotropy of the fluid dims away at the same time. It is seen from the squared speed of sound that the Universe shows phantom nature at the beginning and then approaches dark energy at the present epoch. We analyze both geometrical and physical behaviors of the derived model.

Submitted 7 September, 2022; originally announced September 2022.

Comments: arXiv admin note: text overlap with arXiv:2208.08877

arXiv:2208.09597 [pdf, other]

FRW cosmology with varying cubic deceleration parameter

Authors: Leishingam Kumrah, S. Surendra Singh, L. Anjana Devi, Md Khurshid Alam

Abstract: In this work, a new law of varying deceleration parameter of third degree has been proposed. The solutions of the modified field equations have been derived under the newly proposed law of the deceleration parameter. The model exhibits the Big-bang singularity at cosmic time $t=0$ and shows a Big Rip at $t=n$; it then re-enters the phase of initial singularity at $t=2n$ and ends its cyclic behavior at $t=3n$. The evolution of the physical and dynamical parameters of the Universe has been studied and the graphical representation has also been shown. Further, the $Om(z)$ diagnostic parameter and the energy conditions have also been studied together with their graphical representations.

Submitted 19 August, 2022; originally announced August 2022.

arXiv:2206.10478 [pdf, other]

doi 10.5705/ss.202022.0210

De-biasing particle filtering for a continuous time hidden Markov model with a Cox process observation model

Authors: Ruiyang Jin, Sumeetpal S. Singh, Nicolas Chopin

Abstract: We develop a (nearly) unbiased particle filtering algorithm for a specific class of continuous-time state-space models, such that (a) the latent process $X_t$ is a linear Gaussian diffusion; and (b) the observations arise from a Poisson process with intensity $λ(X_t)$. The likelihood of the posterior probability density function of the latent process includes an intractable path integral. Our algorithm relies on Poisson estimates which approximate unbiasedly this integral. We show how we can tune these Poisson estimates to ensure that, with large probability, all but a few of the estimates generated by the algorithm are positive. Then replacing the negative estimates by zero leads to a much smaller bias than what would obtain through discretisation. We quantify the probability of negative estimates for certain special cases and show that our particle filter is effectively unbiased. We apply our method to a challenging 3D single molecule tracking example with a Born and Wolf observation model.

Submitted 30 June, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

Comments: 34 pages, 14 figures

arXiv:2205.13898 [pdf, other]

Conditional particle filters with bridge backward sampling

Authors: Santeri Karppinen, Sumeetpal S. Singh, Matti Vihola

Abstract: Conditional particle filters (CPFs) with backward/ancestor sampling are powerful methods for sampling from the posterior distribution of the latent states of a dynamic model such as a hidden Markov model. However, the performance of these methods deteriorates with models involving weakly informative observations and/or slowly mixing dynamics. Both of these complications arise when sampling finely time-discretised continuous-time path integral models, but can occur with hidden Markov models too. Multinomial resampling, which is commonly employed with CPFs, resamples excessively for weakly informative observations and thereby introduces extra variance. Furthermore, slowly mixing dynamics render the backward/ancestor sampling steps ineffective, leading to degeneracy issues. We detail two conditional resampling strategies suitable for the weakly informative regime: the so-called `killing' resampling and the systematic resampling with mean partial order. To avoid the degeneracy issues, we introduce a generalisation of the CPF with backward sampling that involves auxiliary `bridging' CPF steps that are parameterised by a blocking sequence. We present practical tuning strategies for choosing an appropriate blocking. Our experiments demonstrate that the CPF with a suitable resampling and the developed `bridge backward sampling' can lead to substantial efficiency gains in the weakly informative and slow mixing regime.

Submitted 19 June, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

arXiv:2203.16892 [pdf, ps, other]

doi 10.1140/epjc/s10052-022-10826-8

Dynamical systems of cosmological models for different possibilities of $G$ and $ρ_Λ$

Authors: Chingtham Sonia, S. Surendra Singh

Abstract: The present paper deals with the dynamics of a spatially flat Friedmann-Lemaître-Robertson-Walker (FLRW) cosmological model with a time-varying cosmological constant $Λ$, where $Λ$ evolves with the cosmic time (t) through the Hubble parameter (H). We consider that the model dynamics has a reflection symmetry $H \rightarrow -H $ with $Λ(H)$ expressed in the form of a Taylor series with respect to H. Dynamical systems for three different cases based on the possibilities of the gravitational constant G and the vacuum energy density $ρ_Λ$ have been analysed. In Case I, both G and $ρ_Λ$ are taken to be constant. We analyse the stability of the system by using the notion of spectral radius, the behavior of the perturbation along each of the axes with respect to cosmic time, and the Poincaré sphere. In Case II, we carry out a dynamical system analysis for G=constant and $ρ_Λ \neq $ constant, where we study stability by using the concept of spectral radius and the perturbation function. In Case III, we take $G \neq$ constant and $ρ_Λ \neq$ constant, where we introduce a new set of variables to set up the corresponding dynamical system. We find the fixed points of the system and analyse the stability from different directions: by analysing the behaviour of the perturbation along each of the axes, by Center Manifold Theory, and by stability at infinity using the Poincaré sphere. Phase plots and perturbation plots have been presented. We study in depth the cosmological scenario with respect to the fixed points obtained and analyse the late-time behavior of the Universe.
Our model agrees with the fact that the Universe is in the epoch of accelerated expansion. The EOS parameter $ω_{eff}$ and the total energy density $Ω_{tt}$ are also evaluated at the fixed points for each of the three cases, and these values are in agreement with the observational values in [1].

Submitted 31 March, 2022; originally announced March 2022.

Comments: 43 pages, 20 figures

MSC Class: 83F05 ACM Class: A.0

arXiv:2203.12961 [pdf, other]

Multilevel Bayesian Deep Neural Networks

Authors: Neil K. Chada, Ajay Jasra, Kody J. H. Law, Sumeetpal S. Singh

Abstract: In this article we consider Bayesian inference associated to deep neural networks (DNNs) and in particular, trace-class neural network (TNN) priors which were proposed by Sell et al. [39]. Such priors were developed as more robust alternatives to classical architectures in the context of inference problems. For this work we develop multilevel Monte Carlo (MLMC) methods for such models. MLMC is a popular variance reduction technique, with particular applications in Bayesian statistics and uncertainty quantification. We show how a particular advanced MLMC method that was introduced in [4] can be applied to Bayesian inference from DNNs and establish mathematically, that the computational cost to achieve a particular mean square error, associated to posterior expectation computation, can be reduced by several orders, versus more conventional techniques. To verify such results we provide numerous numerical experiments on model problems arising in machine learning. These include Bayesian regression, as well as Bayesian classification and reinforcement learning.

Submitted 20 July, 2022; v1 submitted 24 March, 2022; originally announced March 2022.
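
The variance reduction that the abstract above attributes to MLMC rests on a telescoping identity. As a generic sketch of standard MLMC only (not the specific advanced method of [4] that the paper applies), let $\varphi_l$ denote the quantity of interest computed at approximation level $l$, with $L$ the finest level and $N_l$ the number of samples at level $l$; these symbols are notation assumed here for illustration and do not come from the listing.

```latex
\mathbb{E}[\varphi_L]
  = \mathbb{E}[\varphi_0] + \sum_{l=1}^{L} \mathbb{E}\bigl[\varphi_l - \varphi_{l-1}\bigr],
\qquad
\widehat{\varphi}^{\mathrm{ML}}
  = \frac{1}{N_0}\sum_{i=1}^{N_0} \varphi_0^{(i)}
  + \sum_{l=1}^{L} \frac{1}{N_l}\sum_{i=1}^{N_l}
    \bigl(\varphi_l^{(i)} - \varphi_{l-1}^{(i)}\bigr).
```

When each pair $(\varphi_l^{(i)}, \varphi_{l-1}^{(i)})$ is simulated with coupled randomness, the differences have small variance at fine levels, so few samples are needed where each sample is expensive. That is the usual source of the cost reduction for a given mean square error.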

arXiv:2203.10037 [pdf, other]

On resampling schemes for particle filters with weakly informative observations

Authors: Nicolas Chopin, Sumeetpal S. Singh, Tomás Soto, Matti Vihola

Abstract: We consider particle filters with weakly informative observations (or `potentials') relative to the latent state dynamics. The particular focus of this work is on particle filters to approximate time-discretisations of continuous-time Feynman--Kac path integral models -- a scenario that naturally arises when addressing filtering and smoothing problems in continuous time -- but our findings are indicative about weakly informative settings beyond this context too. We study the performance of different resampling schemes, such as systematic resampling, SSP (Srinivasan sampling process) and stratified resampling, as the time-discretisation becomes finer and also identify their continuous-time limit, which is expressed as a suitably defined `infinitesimal generator.' By contrasting these generators, we find that (certain modifications of) systematic and SSP resampling `dominate' stratified and independent `killing' resampling in terms of their limiting overall resampling rate. The reduced intensity of resampling manifests itself in lower variance in our numerical experiment. This efficiency result, through an ordering of the resampling rate, is new to the literature. The second major contribution of this work concerns the analysis of the limiting behaviour of the entire population of particles of the particle filter as the time discretisation becomes finer.
We provide the first proof, under general conditions, that the particle approximation of the discretised continuous-time Feynman--Kac path integral models converges to a (uniformly weighted) continuous-time particle system.

Submitted 9 July, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

Comments: 36 pages, 9 figures

MSC Class: Primary 65C35; secondary 65C05; 65C60; 60J25
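
For readers unfamiliar with the schemes compared in the abstract above, here is a minimal sketch of plain systematic resampling in pure Python. It is the textbook version, not the modified variants the paper analyses, and the function name `systematic_resample` and its parameters are hypothetical names chosen for this illustration.

```python
import random
from itertools import accumulate


def systematic_resample(weights, u=None, rng=random):
    """Return ancestor indices drawn by (plain) systematic resampling.

    weights: non-negative particle weights, not necessarily normalised.
    u: optional offset in [0, 1); drawn uniformly from rng when omitted.
    """
    n = len(weights)
    total = sum(weights)
    if n == 0 or total <= 0:
        raise ValueError("need at least one particle with positive weight")
    if u is None:
        u = rng.random()
    # Cumulative distribution of the normalised weights.
    cumulative = list(accumulate(w / total for w in weights))
    cumulative[-1] = 1.0  # guard against floating-point round-off
    indices = []
    j = 0
    for i in range(n):
        # A single uniform offset u is shared by all n evenly spaced points.
        point = (i + u) / n
        while cumulative[j] < point:
            j += 1
        indices.append(j)
    return indices


if __name__ == "__main__":
    print(systematic_resample([0.1, 0.2, 0.3, 0.4], u=0.5))
```

Because all sampling points share one offset, each particle with normalised weight w is copied either floor(n*w) or ceil(n*w) times, which is why this scheme resamples less aggressively than multinomial resampling when weights are nearly uniform.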

arXiv:2112.05789 [pdf]

Interaction of Anisotropic Dark Energy with Generalized Hybrid Expansion Law

Authors: Md. Khurshid Alam, S. Surendra Singh, L. Anjana Devi

Abstract: Interaction of dark energy in the anisotropic Locally Rotationally Symmetric (LRS) Bianchi type-I metric is investigated in the context of modified f(R,T) theory of gravity, where R is the Ricci scalar and T is the trace of the stress-energy-momentum tensor. We choose the particular form of the functional f(R,T)=f_1 (R,T)+f_2 (R,T) and then find the exact solutions of the field equations by applying an inhomogeneous equation of state, p = ωρ - Λ(t), and a generalized form of hybrid expansion law. A transition from deceleration to acceleration is observed in this model. It is also observed that the Universe shows accelerated expansion at late epoch. The derived model overlaps with ΛCDM at late time, which is in agreement with present observation. Energy conditions of the derived model are also investigated. From the plot, we observe the age of the Universe t_0=13.821 Gyr for the observed H_0=70.07 km s^(-1) Mpc^(-1). The physical and geometrical behaviours of these models are also discussed.

Submitted 10 December, 2021; originally announced December 2021.

Comments: 12 pages, 10 figures. Manuscript has been accepted for publication in Advances in High Energy Physics

MSC Class: 83F05

arXiv:2110.03034 [pdf, other]

doi 10.1016/j.spl.2022.109523

Ensemble Kalman Inversion for General Likelihoods

Authors: Samuel Duffield, Sumeetpal S. Singh

Abstract: In this letter we generalise Ensemble Kalman inversion techniques to general Bayesian models where previously they were restricted to additive Gaussian likelihoods - all in the difficult setting where the likelihood can be sampled from, but its density not necessarily evaluated.

Submitted 7 June, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

Journal ref: Statistics & Probability Letters 187 (2022)

arXiv:2106.01746 [pdf, ps, other]

Limits of accuracy for parameter estimation and localisation in Single-Molecule Microscopy via sequential Monte Carlo methods

Authors: A. Marie d'Avigneau, S. S. Singh, R. J. Ober

Abstract: Assessing the quality of parameter estimates for models describing the motion of single molecules in cellular environments is an important problem in fluorescence microscopy. We consider the fundamental data model, where molecules emit photons at random times and the photons arrive at random locations on the detector according to complex point spread functions (PSFs). The random, non-Gaussian PSF of the detection process and random trajectory of the molecule make inference challenging. Moreover, the presence of other nearby molecules causes further uncertainty in the origin of the measurements, which impacts the statistical precision of estimates. We quantify the limits of accuracy of model parameter estimates and separation distance between closely spaced molecules (known as the resolution problem) by computing the Cramer-Rao lower bound (CRLB), or equivalently the inverse of the Fisher information matrix (FIM), for the variance of estimates. This fundamental CRLB is crucial, as it provides a lower bound for more practical scenarios. While analytic expressions for the FIM can be derived for static molecules, the analytical tools to evaluate it for molecules whose trajectories follow SDEs are still mostly missing. We address this by presenting a general SMC based methodology for both parameter inference and computing the desired accuracy limits for non-static molecules and a non-Gaussian fundamental detection model. For the first time, we are able to estimate the FIM for stochastically moving molecules observed through the Airy and Born & Wolf PSF.
This is achieved by estimating the score and observed information matrix via SMC. We sum up the outcome of our numerical work by summarising the qualitative behaviours for the accuracy limits as functions of e.g. collected photon count, molecule diffusion, etc. We also verify that we can recover known results from the static molecule case.

Submitted 14 September, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

Comments: 38 pages (inc. 7 pages appendix), 11 figures

MSC Class: 65C05; 92C55

arXiv:2103.14380 [pdf, ps, other]

Three loop correction in the formation of QGP droplet

Authors: M. Jena, K. K. Gupta, S. Somorendro Singh

Abstract: Quark-gluon plasma (QGP) droplet formation is re-considered with the addition of a three-loop correction to the earlier loop factors in the mean field potential. The three-loop correction increases stability in the droplet formation of QGP at different parametrization factors of the QGP fluid, and it is in better agreement with the lattice results for pressure, energy density and other thermodynamic relations. This implies that the three-loop contribution better brings out the characteristic features of the QGP fluid. It shows that increasing the loop order increases the strength of the parametrization value, which we defined earlier as a number parameter of fluid dynamics. It indicates that the model with the loop correction better explains the formation of QGP droplets in the expansion of the early universe.

Submitted 26 March, 2021; originally announced March 2021.

Comments: 7 Pages, 11 figures

arXiv:2103.09017 [pdf, other]

Gradient-Based Markov Chain Monte Carlo for Bayesian Inference With Non-Differentiable Priors

Authors: Jacob Vorstrup Goldman, Torben Sell, Sumeetpal Sidhu Singh

Abstract: The use of non-differentiable priors in Bayesian statistics has become increasingly popular, in particular in Bayesian imaging analysis. Current state of the art methods are approximate in the sense that they replace the posterior with a smooth approximation via Moreau-Yosida envelopes, and apply gradient-based discretized diffusions to sample from the resulting distribution. We characterize the error of the Moreau-Yosida approximation and propose a novel implementation using underdamped Langevin dynamics. In mission-critical cases, however, replacing the posterior with an approximation may not be a viable option. Instead, we show that Piecewise-Deterministic Markov Processes (PDMP) can be utilized for exact posterior inference from distributions satisfying almost everywhere differentiability. Furthermore, in contrast with diffusion-based methods, the suggested PDMP-based samplers place no assumptions on the prior shape, nor require access to a computationally cheap proximal operator, and consequently have a much broader scope of application. Through detailed numerical examples, including a non-differentiable circular distribution and a non-convex genomics model, we elucidate the relative strengths of these sampling methods on problems of moderate to high dimensions, underlining the benefits of PDMP-based methods when accurate sampling is decisive.

Submitted 16 March, 2021; originally announced March 2021.

Comments: Accepted for publication by the Journal of the American Statistical Association

arXiv:2103.06450 [pdf, other]

doi 10.1007/978-3-030-86334-0_4

Full Page Handwriting Recognition via Image to Sequence Extraction

Authors: Sumeet S. Singh, Sergey Karayev

Abstract: We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation. Being based on Image to Sequence architecture, it can extract text present in an image and then sequence it correctly without imposing any constraints regarding orientation, layout and size of text and non-text. Further, it can also be trained to generate auxiliary markup related to formatting, layout and content. We use character level vocabulary, thereby enabling language and terminology of any subject. The model achieves a new state of the art in paragraph level recognition on the IAM dataset. When evaluated on scans of real world handwritten free form test answers - beset with curved and slanted lines, drawings, tables, math, chemistry and other symbols - it performs better than all commercially available HTR cloud APIs. It is deployed in production as part of a commercial web application.

Submitted 26 June, 2022; v1 submitted 10 March, 2021; originally announced March 2021.

Comments: Appeared in ICDAR 2021

arXiv:2101.03079 [pdf, other]

Spatiotemporal blocking of the bouncy particle sampler for efficient inference in state space models

Authors: Jacob Vorstrup Goldman, Sumeetpal Sidhu Singh

Abstract: We propose a novel blocked version of the continuous-time bouncy particle sampler of [Bouchard-Côté et al., 2018] which is applicable to any differentiable probability density. This alternative implementation is motivated by blocked Gibbs sampling for state space models [Singh et al., 2017] and leads to significant improvement in terms of effective sample size per second, and furthermore, allows for significant parallelization of the resulting algorithm. The new algorithms are particularly efficient for latent state inference in high-dimensional state space models, where blocking in both space and time is necessary to avoid degeneracy of MCMC. The efficiency of our blocked bouncy particle sampler, in comparison with both the standard implementation of the bouncy particle sampler and the particle Gibbs algorithm of Andrieu et al. [2010], is illustrated numerically for both simulated data and a challenging real-world financial dataset.

Submitted 9 July, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

Comments: 22 pages, 5 figures

arXiv:2012.10943 [pdf, other]

Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC

Authors: Torben Sell, Sumeetpal S. Singh

Abstract: This paper introduces a new neural network based prior for real valued functions on $\mathbb R^d$ which, by construction, is more easily and cheaply scaled up in the domain dimension $d$ compared to the usual Karhunen-Loève function space prior. The new prior is a Gaussian neural network prior, where each weight and bias has an independent Gaussian prior, but with the key difference that the variances decrease in the width of the network in such a way that the resulting function is \emph{almost surely} well defined in the limit of an infinite width network. We show that in a Bayesian treatment of inferring unknown functions, the induced posterior over functions is amenable to Monte Carlo sampling using Hilbert space Markov chain Monte Carlo (MCMC) methods. This type of MCMC is popular, e.g. in the Bayesian Inverse Problems literature, because it is stable under \emph{mesh refinement}, i.e. the acceptance probability does not shrink to $0$ as more parameters of the function's prior are introduced, even \emph{ad infinitum}. In numerical examples we demonstrate these stated competitive advantages over other function space priors. We also implement examples in Bayesian Reinforcement Learning to automate tasks from data and demonstrate, for the first time, stability of MCMC to mesh refinement for these types of problems.

Submitted 8 September, 2022; v1 submitted 20 December, 2020; originally announced December 2020.

Comments: 24 pages, 21 figures

arXiv:2012.04602 [pdf, other]

doi 10.1109/TSP.2022.3141259

Online Particle Smoothing with Application to Map-matching

Authors: Samuel Duffield, Sumeetpal S. Singh

Abstract: We introduce a novel method for online smoothing in state-space models that utilises a fixed-lag approximation to overcome the well known issue of path degeneracy. Unlike classical fixed-lag techniques that only approximate certain marginals, we introduce an online resampling algorithm, called particle stitching, that converts these marginal samples into a full posterior approximation. We demonstrate the utility of our method in the context of map-matching, the task of inferring a vehicle's trajectory given a road network and noisy GPS observations. We develop a new state-space model for the difficult task of map-matching on dense, urban road networks.

Submitted 2 August, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

Journal ref: IEEE Transactions on Signal Processing 2022

arXiv:2010.08918 [pdf, ps, other]

Quark Gluon Plasma (QGP) evolution under loop corrections

Authors: K K Gupta, Agam K Jha, S. Somorendro Singh

Abstract: We review free energy evolution of QGP (Quark-gluon plasma) under zero-loop, one-loop and two-loop corrections in the mean field potential. The free energies of QGP under the comparison of zero-loop and loop corrections of the interacting potential among the quarks, anti-quarks and gluons are shown. We observe that the formation of a stable QGP droplet depends on the loop corrections with the different parametrization values of the fluid. With an increase in the parametrization value, the stability of droplet formation increases with smaller droplet size. This indicates that the formation of a QGP droplet can be characterised more significantly by the parametrization value, much like the Reynolds number in fluid dynamics. It means that there may be a different phenomenological parameter to define the stable QGP droplet when the QGP fluid is studied under loop corrections.

Submitted 18 October, 2020; originally announced October 2020.

Comments: 7 pages, 9 figures

arXiv:2006.14875 [pdf, other]

doi 10.1007/s11222-021-10048-0

Anytime Parallel Tempering

Authors: A. Marie d'Avigneau, S. S. Singh, L. M. Murray

Abstract: Developing efficient MCMC algorithms is indispensable in Bayesian inference. In parallel tempering, multiple interacting MCMC chains run to more efficiently explore the state space and improve performance. The multiple chains advance independently through local moves, and the performance enhancement steps are exchange moves, where the chains pause to exchange their current sample amongst each other. To accelerate the independent local moves, they may be performed simultaneously on multiple processors. Another problem is then encountered: depending on the MCMC implementation and inference problem, local moves can take a varying and random amount of time to complete. There may also be infrastructure-induced variations, such as competing jobs on the same processors, which arise in cloud computing. Before exchanges can occur, all chains must complete the local moves they are engaged in to avoid introducing a potentially substantial bias (Proposition 2.1). To solve this issue of randomly varying local move completion times in multi-processor parallel tempering, we adopt the Anytime Monte Carlo framework of Murray et al. (2016): we impose real-time deadlines on the parallel local moves and perform exchanges at these deadlines without any processor idling. We show our methodology for exchanges at real-time deadlines does not introduce a bias and leads to significant performance enhancements over the naïve approach of idling until every processor's local moves complete.
The methodology is then applied in an ABC setting, where an Anytime ABC parallel tempering algorithm is derived for the difficult task of estimating the parameters of a Lotka-Volterra predator-prey model, and similar efficiency enhancements are observed.

Submitted 14 September, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

Comments: 34 Pages, 10 Figures

MSC Class: 62-08; 62F15
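
The exchange moves described in the abstract above can be illustrated with a minimal sketch of the standard parallel tempering swap between adjacent chains. This is the conventional exchange step only, not the paper's Anytime variant with real-time deadlines; it assumes chain k targets the tempered density pi(x)**beta_k, and the names `swap_log_ratio`, `exchange_move`, `betas` and `log_target` are hypothetical choices for this illustration.

```python
import math
import random


def swap_log_ratio(beta_i, beta_j, logp_i, logp_j):
    """Log Metropolis ratio for swapping the states of two tempered chains.

    Assumes chain k targets pi(x)**beta_k; logp_k is log pi at chain k's state.
    """
    return (beta_i - beta_j) * (logp_j - logp_i)


def exchange_move(betas, states, log_target, rng=random):
    """Attempt one swap between a randomly chosen adjacent pair of chains.

    Mutates `states` in place and returns True if the swap was accepted.
    """
    i = rng.randrange(len(betas) - 1)
    j = i + 1
    log_ratio = swap_log_ratio(
        betas[i], betas[j], log_target(states[i]), log_target(states[j])
    )
    # 1 - random() lies in (0, 1], so the logarithm is always defined.
    if math.log(1.0 - rng.random()) < log_ratio:
        states[i], states[j] = states[j], states[i]
        return True
    return False


if __name__ == "__main__":
    chain_states = [3.0, 0.0]
    accepted = exchange_move([1.0, 0.5], chain_states, lambda x: -0.5 * x * x)
    print(accepted, chain_states)
```

In this conventional form every chain must finish its local move before `exchange_move` is called, which is the idling cost that the abstract says the Anytime framework removes.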

arXiv:1907.03538 [pdf, ps, other]

Evolution of Kaluza-Klein Like Wet Dark Fluid in $f(R,T)$ Theory of Gravitation

Authors: Koijam Manihar Singh, S. Surendra Singh, Leishingam Kumrah

Abstract: Here we study the essence of $f(R,T)$ gravitation theory in a five-dimensional Universe and see the role of dark energy in the form of wet dark fluid in such a Universe. It is found that the dark energy is not exaggerated in contributing to the accelerating expansion of the Universe, though the expansion is inherent as a result of the theory itself and due to the geometric contribution of matter. It is interesting to see that in some models there was an era before the beginning of the present era, and some of the model Universes turn out to be either oscillatory or cyclic. Some of the models are seen to go to $ΛCDM$ models in the late future as in Einstein gravitation theory, starting the evolution with a big bang. Most of the models undergo early inflation as well as late-time accelerating expansion, thus qualifying as good models for real astrophysical situations, with dark energy playing a fundamental role in these Universes.

Submitted 13 June, 2019; originally announced July 2019.

Comments: 18 pages, 15 figures

arXiv:1906.11947 [pdf]

Dynamical system perspective of cosmological models minimally coupled with scalar field

Authors: S. Surendra Singh, Chingtham Sonia

Abstract: The stability criteria for a spatially flat homogeneous and isotropic cosmological dynamical system are investigated with the interaction of a scalar field endowed with a perfect fluid. In this paper, we adopt the dynamical system perspective to study, qualitatively, the scalar field cosmology under two special cases, with and without potential. For the analysis with potential we use a simple exponential potential form, $V_{o}e^{-λφ}$. We generate, by introducing new dimensionless variables, an autonomous system of ordinary differential equations $(ASODE)$ for each case and obtain the respective fixed points. We also analyse the type, nature and stability of the fixed points and how their nature and behavior reflect on the cosmic scenarios. Throughout the work, the investigation of this model has shown us the deep connection between these theories and cosmic acceleration phenomena. The phase plots of the system at different conditions and different values of $γ$ have been analyzed in detail and their interpretations have been worked out. The perturbation plots of the dynamical system have also been studied and analyzed, which emphasize our analytical findings.

Submitted 22 January, 2021; v1 submitted 18 June, 2019; originally announced June 2019.

Comments: 14 pages, 16 figures

arXiv:1906.09145 [pdf, ps, other]

Backward Itô-Ventzell and stochastic interpolation formulae

Authors: Pierre del Moral, Sumeetpal Sidhu Singh

Abstract: We present a novel backward Itô-Ventzell formula and an extension of the Alekseev-Gröbner interpolating formula to stochastic flows. We also present some natural spectral conditions that yield direct and simple proofs of time uniform estimates of the difference between the two stochastic flows when their drift and diffusion functions are not the same, yielding what seems to be the first results of this type for this class of anticipative models. We illustrate the impact of these results in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations.

Submitted 4 May, 2021; v1 submitted 21 June, 2019; originally announced June 2019.

arXiv:1901.05926 [pdf, ps, other]

doi 10.1142/S0218301319500083

Structural properties and decay modes of Z $=$ 122, 120 and 118 superheavy nuclei

Authors: G. Saxena, M. Kumawat, S. Somorendro Singh, Mamta Aggarwal

Abstract: Structural properties and the decay modes of the superheavy elements Z $=$ 122, 120, 118 are studied in a microscopic framework. We evaluate the binding energy, one- and two-proton and neutron separation energy, shell correction and density profile of even and odd isotopes of Z $=$ 122, 120, 118 (284 $\leq$ A $\leq$ 352), which show a reasonable match with FRDM results and the available experimental data. Equilibrium shape and deformation of the superheavy region are predicted. We investigate the possible decay modes of this region, specifically $α$-decay, spontaneous fission (SF) and $β$-decay, and evaluate the probable $α$-decay chains. The phenomenon of bubble-like structure in the charge density is predicted in $^{330}$122, $^{292,328}$120 and $^{326}$118, with a significant depletion fraction of around 20-24%, which increases with increasing Coulomb energy and diminishes with increasing isospin (N$-$Z) values, exhibiting the fact that the Coulomb forces are the main driving force in the central depletion in superheavy systems.

Submitted 17 January, 2019; originally announced January 2019.

Comments: 18 pages, 7 figures

Journal ref: International Journal of Modern Physics 2019

arXiv:1811.11834 [pdf, other]

doi 10.1016/j.spa.2020.10.006

Asymptotic Analysis of Model Selection Criteria for General Hidden Markov Models

Authors: Shouto Yonekura, Alexandros Beskos, Sumeetpal S. Singh

Abstract: The paper obtains analytical results for the asymptotic properties of Model Selection Criteria -- widely used in practice -- for a general family of hidden Markov models (HMMs), thereby substantially extending the related theory beyond typical i.i.d.-like model structures and filling in an important gap in the relevant literature. In particular, we look at the Bayesian and Akaike Information Criteria (BIC and AIC) and the model evidence. In the setting of nested classes of models, we prove that BIC and the evidence are strongly consistent for HMMs (under regularity conditions), whereas AIC is not weakly consistent. Numerical experiments support our theoretical results.

Submitted 30 March, 2020; v1 submitted 28 November, 2018; originally announced November 2018.

arXiv:1806.06520 [pdf, ps, other]

Stability of Conditional Sequential Monte Carlo

Authors: Bernd Kuhlenschmidt, Sumeetpal S. Singh

Abstract: The particle Gibbs (PG) sampler is a Markov Chain Monte Carlo (MCMC) algorithm, which uses an interacting particle system to perform the Gibbs steps. Each Gibbs step consists of simulating a particle system conditioned on one particle path. It relies on a conditional Sequential Monte Carlo (cSMC) method to create the particle system. We propose a novel interpretation of the cSMC algorithm as a perturbed Sequential Monte Carlo (SMC) method and apply telescopic decompositions developed for the analysis of SMC algorithms \cite{delmoral2004} to derive a bound for the distance between the expected sampled path from cSMC and the target distribution of the MCMC algorithm. This can be used to get a uniform ergodicity result. In particular, we can show that the mixing rate of cSMC can be kept constant by increasing the number of particles linearly with the number of observations. Based on our decomposition, we also prove a central limit theorem for the cSMC Algorithm, which cannot be done using the approaches in \cite{Andrieu2013} and \cite{Lindsten2014}.

Submitted 18 June, 2018; originally announced June 2018.

arXiv:1806.05852 [pdf, other]

Coupled conditional backward sampling particle filter

Authors: Anthony Lee, Sumeetpal S. Singh, Matti Vihola

Abstract: The conditional particle filter (CPF) is a promising algorithm for general hidden Markov model smoothing. Empirical evidence suggests that the variant of CPF with backward sampling (CBPF) performs well even with long time series. Previous theoretical results have not been able to demonstrate the improvement brought by backward sampling, whereas we provide rates showing that CBPF can remain effective with a fixed number of particles independent of the time horizon. Our result is based on analysis of a new coupling of two CBPFs, the coupled conditional backward sampling particle filter (CCBPF). We show that CCBPF has good stability properties in the sense that with a fixed number of particles, the coupling time in terms of iterations increases only linearly with respect to the time horizon under a general (strong mixing) condition. The CCBPF is useful not only as a theoretical tool, but also as a practical method that allows for unbiased estimation of smoothing expectations, following the recent developments by Jacob et al. (to appear). Unbiased estimation has many advantages, such as enabling the construction of asymptotically exact confidence intervals and straightforward parallelisation.

Submitted 28 August, 2019; v1 submitted 15 June, 2018; originally announced June 2018.

Comments: 24 pages, 5 figures

MSC Class: 65C05 (Primary); 60J05; 65C35; 65C40 (secondary)

arXiv:1804.07117 [pdf, other]

On Large Lag Smoothing for Hidden Markov Models

Authors: Jeremie Houssineau, Ajay Jasra, Sumeetpal S. Singh

Abstract: In this article we consider the smoothing problem for hidden Markov models (HMM). Given a hidden Markov chain $\{X_n\}_{n\geq 0}$ and observations $\{Y_n\}_{n\geq 0}$, our objective is to compute $\mathbb{E}[\varphi(X_0,\dots,X_k)|y_{0},\dots,y_n]$ for some real-valued, integrable functional $\varphi$ and $k$ fixed, $k \ll n$ and for some realisation $(y_0,\dots,y_n)$ of $(Y_0,\dots,Y_n)$. We introduce a novel application of the multilevel Monte Carlo (MLMC) method with a coupling based on the Knothe-Rosenblatt rearrangement. We prove that this method can approximate the aforementioned quantity with a mean square error (MSE) of $\mathcal{O}(ε^2)$, for arbitrary $ε>0$, with a cost of $\mathcal{O}(ε^{-2})$. This is in contrast to the same direct Monte Carlo method, which requires a cost of $\mathcal{O}(nε^{-2})$ for the same MSE. The approach we suggest is, in general, not possible to implement, so the optimal transport methodology of \cite{span} is used, which directly approximates our strategy. We show that our theoretical improvements are achieved, even under approximation, in several numerical examples.

Submitted 19 April, 2018; originally announced April 2018.

arXiv:1803.09496 [pdf, ps, other]

On the loss of Fisher information in some multi-object tracking observation models

Authors: Jeremie Houssineau, Ajay Jasra, Sumeetpal S. Singh

Abstract: The concept of Fisher information can be useful even in cases where the probability distributions of interest are not absolutely continuous with respect to the natural reference measure on the underlying space. Practical examples where this extension is useful are provided in the context of multi-object tracking statistical models. Upon defining the Fisher information without introducing a reference measure, we provide remarkably concise proofs of the loss of Fisher information in some widely used multi-object tracking observation models.

Submitted 26 March, 2018; originally announced March 2018.

arXiv:1802.05415 [pdf, other]

Teaching Machines to Code: Neural Markup Generation with Visual Attention

Authors: Sumeet S. Singh

Abstract: We present a neural transducer model with visual attention that learns to generate LaTeX markup of a real-world math formula given its image. Applying sequence modeling and transduction techniques that have been very successful across modalities such as natural language, image, handwriting, speech and audio, we construct an image-to-markup model that learns to produce syntactically and semantically correct LaTeX markup code over 150 words long and achieves a BLEU score of 89%, improving upon the previous state of the art for the Im2Latex problem. We also demonstrate with heat-map visualization how attention helps in interpreting the model and can pinpoint (detect and localize) symbols on the image accurately despite having been trained without any bounding box data.

Submitted 15 June, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

Comments: For datasets, visualizations and ancillary material see: https://untrix.github.io/i2l . For source code go to: https://github.com/untrix/im2latex

arXiv:1711.02836 [pdf, other]

Multilevel Monte Carlo for Smoothing via Transport Methods

Authors: Jeremie Houssineau, Ajay Jasra, Sumeetpal S. Singh

Abstract: In this article we consider recursive approximations of the smoothing distribution associated to partially observed stochastic differential equations (SDEs), which are observed discretely in time. Such models appear in a wide variety of applications including econometrics, finance and engineering. This problem is notoriously challenging, as the smoother is not available analytically and hence requires numerical approximation. This usually consists of applying a time-discretization to the SDE, for instance the Euler method, and then applying a numerical (e.g. Monte Carlo) method to approximate the smoother. This has led to a vast literature on methodology for solving such problems, perhaps the most popular of which is based upon the particle filter (PF) e.g. [9]. In the context of filtering for this class of problems, it is well-known that the particle filter can be improved upon in terms of cost to achieve a given mean squared error (MSE) for estimates. This is in the sense that the computational effort can be reduced to achieve this target MSE, by using multilevel (ML) methods [12, 13, 18], via the multilevel particle filter (MLPF) [16, 20, 21]. For instance, to obtain a MSE of $\mathcal{O}(ε^2)$ for some $ε> 0$ when approximating filtering distributions associated with Euler-discretized diffusions with constant diffusion coefficients, the cost of the PF is $\mathcal{O}(ε^{-3})$ while the cost of the MLPF is $\mathcal{O}(ε^{-2}\log(ε)^2)$. In this article we consider a new approach to replace the particle filter, using transport methods in [27]. In the context of filtering, one expects that the proposed method improves upon the MLPF by yielding, under assumptions, a MSE of $\mathcal{O}(ε^2)$ for a cost of $\mathcal{O}(ε^{-2})$. This is established theoretically in an "ideal" example and numerically in numerous examples.

Submitted 14 May, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

MSC Class: 62M05; 60J60

arXiv:1708.08216 [pdf, ps, other]

Effect of two loop correction in the formation of QGP droplet

Authors: S. Somorendro Singh

Abstract: The effect of two loop correction in the formation of quark-gluon plasma (QGP) droplet is studied with the introduction of the two loop correction factor in the mean field potential. Due to the correction factor it shows stability in the droplet formation of QGP at different parametrization factors of the QGP fluid. The correction factor in the potential also shows that the gluon parameter factor shifts to a larger value from its earlier value with one loop correction in obtaining the stable droplets. The results show a decrease in the observable QGP droplets, and droplet sizes are found to be $1.5-2.0$ fm radii with the two loop correction. It indicates that there is a parameter like the Reynolds number which can control the dynamics of QGP droplet formation and the stability of the droplet in the case of droplet formation with the two loop correction factor.

Submitted 28 August, 2017; originally announced August 2017.

Comments: 7 pages, 7 figures

arXiv:1707.04371 [pdf, other]

Identification of multi-object dynamical systems: consistency and Fisher information

Authors: Jeremie Houssineau, Sumeetpal S. Singh, Ajay Jasra

Abstract: Learning the model parameters of a multi-object dynamical system from partial and perturbed observations is a challenging task. Despite recent numerical advancements in learning these parameters, theoretical guarantees are extremely scarce. In this article, we study the identifiability of these parameters and the consistency of the corresponding maximum likelihood estimate (MLE) under assumptions on the different components of the underlying multi-object system. In order to understand the impact of the various sources of observation noise on the ability to learn the model parameters, we study the asymptotic variance of the MLE through the associated Fisher information matrix. For example, we show that specific aspects of the multi-target tracking (MTT) problem such as detection failures and unknown data association lead to a loss of information which is quantified in special cases of interest.

Submitted 13 July, 2017; originally announced July 2017.

arXiv:1606.08650 [pdf, other]

doi 10.1109/TSP.2017.2733504

Approximate Smoothing and Parameter Estimation in High-Dimensional State-Space Models

Authors: Axel Finke, Sumeetpal S. Singh

Abstract: We present approximate algorithms for performing smoothing in a class of high-dimensional state-space models via sequential Monte Carlo methods ("particle filters"). In high dimensions, a prohibitively large number of Monte Carlo samples ("particles") -- growing exponentially in the dimension of the state space -- is usually required to obtain a useful smoother. Using blocking strategies as in Rebeschini and Van Handel (2015) (and earlier pioneering work on blocking), we exploit the spatial ergodicity properties of the model to circumvent this curse of dimensionality. We thus obtain approximate smoothers that can be computed recursively in time and in parallel in space. First, we show that the bias of our blocked smoother is bounded uniformly in the time horizon and in the model dimension. We then approximate the blocked smoother with particles and derive the asymptotic variance of idealised versions of our blocked particle smoother to show that variance is no longer adversely affected by the dimension of the model. Finally, we employ our method to successfully perform maximum-likelihood estimation via stochastic gradient-ascent and stochastic expectation--maximisation algorithms in a 100-dimensional state-space model.

Submitted 20 September, 2017; v1 submitted 28 June, 2016; originally announced June 2016.

Comments: Includes supplementary materials

Journal ref: IEEE Transactions on Signal Processing, 65(22), 5982-5994, 2017

arXiv:1603.05522 [pdf, ps, other]

Tracking multiple moving objects in images using Markov Chain Monte Carlo

Authors: Lan Jiang, Sumeetpal S. Singh

Abstract: A new Bayesian state and parameter learning algorithm for multiple target tracking (MTT) models with image observations is proposed. Specifically, a Markov chain Monte Carlo algorithm is designed to sample from the posterior distribution of the unknown number of targets, their birth and death times, states and model parameters, which constitutes the complete solution to the tracking problem. The conventional approach is to pre-process the images to extract point observations and then perform tracking. We model the image generation process directly to avoid potential loss of information when extracting point observations. Numerical examples show that our algorithm has improved tracking performance over commonly used techniques, for both synthetic examples and real fluorescent microscopy data, especially in the case of dim targets with overlapping illuminated regions.

Submitted 17 March, 2016; originally announced March 2016.

arXiv:1509.08362 [pdf, ps, other]

Blocking Strategies and Stability of Particle Gibbs Samplers

Authors: Sumeetpal S. Singh, Fredrik Lindsten, Eric Moulines

Abstract: Sampling from the conditional (or posterior) probability distribution of the latent states of a Hidden Markov Model, given the realization of the observed process, is a non-trivial problem in the context of Markov Chain Monte Carlo. To do this Andrieu et al. (2010) constructed a Markov kernel which leaves this conditional distribution invariant using a Particle Filter. From a practitioner's point of view, this Markov kernel attempts to mimic the act of sampling all the latent state variables as one block from the posterior distribution but for models where exact simulation is not possible. There are some recent theoretical results that establish the uniform ergodicity of this Markov kernel and that the mixing rate does not diminish provided the number of particles grows at least linearly with the number of latent states in the posterior. This gives rise to a cost, per application of the kernel, that is quadratic in the number of latent states which could be prohibitive for long observation sequences. We seek to answer an obvious but important question: is there a different implementation with a cost per-iteration that grows linearly with the number of latent states, but which is still stable in the sense that its mixing rate does not deteriorate? We address this problem using blocking strategies, which are easily parallelizable, and prove stability of the resulting sampler.

Submitted 28 September, 2015; originally announced September 2015.

arXiv:1509.05986 [pdf, ps, other]

Scaling in topological properties of brain networks

Authors: Soibam Shyamchand Singh, Khundrakpam Budhachandra Singh, Romana Ishrat, B. Indrajit Sharma, R. K. Brojen Singh

Abstract: The organization in brain networks shows highly modular features with weak inter-modular interaction. The topology of the networks involves emergence of modules and sub-modules at different levels of constitution governed by fractal laws. The modular organization, in terms of modular mass, inter-modular, and intra-modular interaction, also obeys fractal nature. The parameters which characterize topological properties of brain networks follow one parameter scaling theory in all levels of network structure, which reveals the self-similar rules governing the network structure. The calculated fractal dimensions of brain networks of different species are found to decrease when one goes from lower to higher level species, which implies a more ordered and self-organized topography in higher level species. The sparsely distributed hubs in brain networks may be the most influential nodes, but their absence may not cause network breakdown, and centrality parameters characterizing them also follow a one parameter scaling law, indicating self-similar roles of these hubs at different levels of organization in brain networks.

Submitted 20 September, 2015; originally announced September 2015.

arXiv:1505.06356 [pdf, other]

Particle ancestor sampling for near-degenerate or intractable state transition models

Authors: Fredrik Lindsten, Pete Bunch, Sumeetpal S. Singh, Thomas B. Schön

Abstract: We consider Bayesian inference in sequential latent variable models in general, and in nonlinear state space models in particular (i.e., state smoothing). We work with sequential Monte Carlo (SMC) algorithms, which provide a powerful inference framework for addressing this problem. However, for certain challenging and common model classes the state-of-the-art algorithms still struggle. The work is motivated in particular by two such model classes: (i) models where the state transition kernel is (nearly) degenerate, i.e. (nearly) concentrated on a low-dimensional manifold, and (ii) models where point-wise evaluation of the state transition density is intractable. Both types of models arise in many applications of interest, including tracking, epidemiology, and econometrics. The difficulty with these types of models is that they essentially rule out forward-backward-based methods, which are known to be of great practical importance, not least to construct computationally efficient particle Markov chain Monte Carlo (PMCMC) algorithms. To alleviate this, we propose a "particle rejuvenation" technique to enable the use of the forward-backward strategy for (nearly) degenerate models and, by extension, for intractable models. We derive the proposed method specifically within the context of PMCMC, but we emphasise that it is applicable to any forward-backward-based Monte Carlo method.

Submitted 23 May, 2015; originally announced May 2015.

Showing 1–50 of 76 results for author: Singh, S S