-
Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM
Authors:
Sri Raghava Muddu,
Rupasai Rangaraju,
Tejpalsingh Siledar,
Swaroop Nath,
Pushpak Bhattacharyya,
Swaprava Nath,
Suman Banerjee,
Amey Patil,
Muthusamy Chelliah,
Sudhanshu Shekhar Singh,
Nikesh Garera
Abstract:
Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi…
▽ More
Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limitations. To mitigate, we propose a scalable framework called Xl-OpSumm that generates summaries incrementally. However, the existing test set, AMASUM has only 560 reviews per product on average. Due to the lack of a test set with thousands of reviews, we created a new test set called Xl-Flipkart by gathering data from the Flipkart website and generating summaries using GPT-4. Through various automatic evaluations and extensive analysis, we evaluated the framework's efficiency on two datasets, AMASUM and Xl-Flipkart. Experimental results show that our framework, Xl-OpSumm powered by Llama-3-8B-8k, achieves an average ROUGE-1 F1 gain of 4.38% and a ROUGE-L F1 gain of 3.70% over the next best-performing model.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Behaviours of rip cosmological models in $f(Q,C)$ gravity
Authors:
Amit Samaddar,
S. Surendra Singh,
Shah Muhammad,
Euaggelos E. Zotos
Abstract:
In this study, the Universe's rip cosmology theories have been provided for the $f(Q,C)$ gravity theory, where $Q$ and $C$ stand for the non-metricity scalar and boundary term. We assumed $f(Q,C)=αQ^{n}+βC$ and analyzed the nature of the physical parameters for the Little Rip, Big Rip and Pseudo Rip models. In the LR and PR models, the EoS parameter exhibits phantom characteristics but remains clo…
▽ More
In this study, the Universe's rip cosmology theories have been provided for the $f(Q,C)$ gravity theory, where $Q$ and $C$ stand for the non-metricity scalar and boundary term. We assumed $f(Q,C)=αQ^{n}+βC$ and analyzed the nature of the physical parameters for the Little Rip, Big Rip and Pseudo Rip models. In the LR and PR models, the EoS parameter exhibits phantom characteristics but remains closely aligned with the $Λ$CDM line. After investigating the energy conditions, we recognised that our model violates the strong energy constraint. Avoiding singularity situations has been noted in all of these accelerated models. The characteristics of the jerk and snap parameters have been investigated. Our model provides an effective description of the Universe's evolutionary history and fits well with contemporary cosmic data.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Product Description and QA Assisted Self-Supervised Opinion Summarization
Authors:
Tejpalsingh Siledar,
Rupasai Rangaraju,
Sankara Sri Raghava Ravindra Muddu,
Suman Banerjee,
Amey Patil,
Sudhanshu Shekhar Singh,
Muthusamy Chelliah,
Nikesh Garera,
Swaprava Nath,
Pushpak Bhattacharyya
Abstract:
In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s…
▽ More
In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) strategy that leverages information from reviews as well as additional sources for selecting one of the reviews as a pseudo-summary to enable supervised training. Our Multi-Encoder Decoder framework for Opinion Summarization (MEDOS) employs a separate encoder for each source, enabling effective selection of information while generating the summary. For evaluation, due to the unavailability of test sets with additional sources, we extend the Amazon, Oposum+, and Flipkart test sets and leverage ChatGPT to annotate summaries. Experiments across nine test sets demonstrate that the combination of our SDC approach and MEDOS model achieves on average a 14.5% improvement in ROUGE-1 F1 over the SOTA. Moreover, comparative analysis underlines the significance of incorporating additional sources for generating more informative summaries. Human evaluations further indicate that MEDOS scores relatively higher in coherence and fluency with 0.41 and 0.5 (-1 to 1) respectively, compared to existing models. To the best of our knowledge, we are the first to generate opinion summaries leveraging additional sources in a self-supervised setting.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Towards a turnkey approach to unbiased Monte Carlo estimation of smooth functions of expectations
Authors:
Nicolas Chopin,
Francesca R. Crucinio,
Sumeetpal S. Singh
Abstract:
Given a smooth function $f$, we develop a general approach to turn Monte Carlo samples with expectation $m$ into an unbiased estimate of $f(m)$. Specifically, we develop estimators that are based on randomly truncating the Taylor series expansion of $f$ and estimating the coefficients of the truncated series. We derive their properties and propose a strategy to set their tuning parameters -- which…
▽ More
Given a smooth function $f$, we develop a general approach to turn Monte Carlo samples with expectation $m$ into an unbiased estimate of $f(m)$. Specifically, we develop estimators that are based on randomly truncating the Taylor series expansion of $f$ and estimating the coefficients of the truncated series. We derive their properties and propose a strategy to set their tuning parameters -- which depend on $m$ -- automatically, with a view to make the whole approach simple to use. We develop our methods for the specific functions $f(x)=\log x$ and $f(x)=1/x$, as they arise in several statistical applications such as maximum likelihood estimation of latent variable models and Bayesian inference for un-normalised models. Detailed numerical studies are performed for a range of applications to determine how competitive and reliable the proposed approach is.
△ Less
Submitted 12 April, 2024; v1 submitted 29 March, 2024;
originally announced March 2024.
-
Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization
Authors:
Swaroop Nath,
Tejpalsingh Siledar,
Sankara Sri Raghava Ravindra Muddu,
Rupasai Rangaraju,
Harshad Khadilkar,
Pushpak Bhattacharyya,
Suman Banerjee,
Amey Patil,
Sudhanshu Shekhar Singh,
Muthusamy Chelliah,
Nikesh Garera
Abstract:
Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten…
▽ More
Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of tens of thousands) to train $\varphi$. Such a large-scale annotation is justifiable when it's a one-time effort, and the reward model is universally applicable. However, human goals are subjective and depend on the task, requiring task-specific preference annotations, which can be impractical to fulfill. To address this challenge, we propose a novel approach to infuse domain knowledge into $\varphi$, which reduces the amount of preference annotation required ($21\times$), omits Alignment Tax, and provides some interpretability. We validate our approach in E-Commerce Opinion Summarization, with a significant reduction in dataset size (to just $940$ samples) while advancing the SOTA ($\sim4$ point ROUGE-L improvement, $68\%$ of times preferred by humans over SOTA). Our contributions include a novel Reward Modeling technique and two new datasets: PromptOpinSumm (supervised data for Opinion Summarization) and OpinPref (a gold-standard human preference dataset). The proposed methodology opens up avenues for efficient RLHF, making it more adaptable to applications with varying human values. We release the artifacts (Code: github.com/efficient-rlhf. PromptOpinSumm: hf.co/prompt-opin-summ. OpinPref: hf.co/opin-pref) for usage under MIT License.
△ Less
Submitted 18 April, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation
Authors:
Tejpalsingh Siledar,
Swaroop Nath,
Sankara Sri Raghava Ravindra Muddu,
Rupasai Rangaraju,
Swaprava Nath,
Pushpak Bhattacharyya,
Suman Banerjee,
Amey Patil,
Sudhanshu Shekhar Singh,
Muthusamy Chelliah,
Nikesh Garera
Abstract:
Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat…
▽ More
Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluation datasets inhibit progress. To address this, we release the SUMMEVAL-OP dataset covering 7 dimensions related to the evaluation of opinion summaries: fluency, coherence, relevance, faithfulness, aspect coverage, sentiment consistency, and specificity. We investigate Op-I-Prompt a dimension-independent prompt, and Op-Prompts, a dimension-dependent set of prompts for opinion summary evaluation. Experiments indicate that Op-I-Prompt emerges as a good alternative for evaluating opinion summaries achieving an average Spearman correlation of 0.70 with humans, outperforming all previous approaches. To the best of our knowledge, we are the first to investigate LLMs as evaluators on both closed-source and open-source models in the opinion summarization domain.
△ Less
Submitted 9 June, 2024; v1 submitted 18 February, 2024;
originally announced February 2024.
-
Mixing time of the conditional backward sampling particle filter
Authors:
Joona Karjalainen,
Anthony Lee,
Sumeetpal S. Singh,
Matti Vihola
Abstract:
The conditional backward sampling particle filter (CBPF) is a powerful Markov chain Monte Carlo sampler for general state space hidden Markov model smoothing. It was proposed as an improvement over the conditional particle filter, which is known to have an $O(T^2)$ computational time complexity under a general `strong' mixing assumption, where $T$ is the time horizon. We provide the first proof th…
▽ More
The conditional backward sampling particle filter (CBPF) is a powerful Markov chain Monte Carlo sampler for general state space hidden Markov model smoothing. It was proposed as an improvement over the conditional particle filter, which is known to have an $O(T^2)$ computational time complexity under a general `strong' mixing assumption, where $T$ is the time horizon. We provide the first proof that the CBPF admits an $O(T \log T)$ time complexity under strong mixing, complementing strong empirical evidence of the superiority of the CBPF in practice. In particular, the CBPF's mixing time is upper bounded by $O(\log T)$, for any sufficiently large number of particles $N$ that depends only on the mixing assumptions and not $T$. We show that an $O(\log T)$ mixing time is optimal. The proof involves the analysis of a novel coupling of two CBPFs, which involves a maximal coupling of two particle systems at each time instant. The coupling is implementable, and thus can also be used to construct unbiased, finite variance, estimates of functionals which have arbitrary dependence on the latent state's path, with a total expected cost of $O(T \log T)$. We also investigate other couplings, and we show some of these alternatives have improved empirical behaviour.
△ Less
Submitted 22 February, 2024; v1 submitted 29 December, 2023;
originally announced December 2023.
-
On the Forgetting of Particle Filters
Authors:
Joona Karjalainen,
Anthony Lee,
Sumeetpal S. Singh,
Matti Vihola
Abstract:
We study the forgetting properties of the particle filter when its state - the collection of particles - is regarded as a Markov chain. Under a strong mixing assumption on the particle filter's underlying Feynman-Kac model, we find that the particle filter is exponentially mixing, and forgets its initial state in $O(\log N )$ `time', where $N$ is the number of particles and time refers to the numb…
▽ More
We study the forgetting properties of the particle filter when its state - the collection of particles - is regarded as a Markov chain. Under a strong mixing assumption on the particle filter's underlying Feynman-Kac model, we find that the particle filter is exponentially mixing, and forgets its initial state in $O(\log N )$ `time', where $N$ is the number of particles and time refers to the number of particle filter algorithm steps, each comprising a selection (or resampling) and mutation (or prediction) operation. We present an example which suggests that this rate is optimal. In contrast to our result, available results to-date are extremely conservative, suggesting $O(α^N)$ time steps are needed, for some $α>1$, for the particle filter to forget its initialisation. We also study the conditional particle filter (CPF) and extend our forgetting result to this context. We establish a similar conclusion, namely, CPF is exponentially mixing and forgets its initial state in $O(\log N )$ time. To support this analysis, we establish new time-uniform $L^p$ error estimates for CPF, which can be of independent interest.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
Analyzing Transformer Dynamics as Movement through Embedding Space
Authors:
Sumeet S. Singh
Abstract:
Transformer based language models exhibit intelligent behaviors such as understanding natural language, recognizing patterns, acquiring knowledge, reasoning, planning, reflecting and using tools. This paper explores how their underlying mechanics give rise to intelligent behaviors. Towards that end, we propose framing Transformer dynamics as movement through embedding space. Examining Transformers…
▽ More
Transformer based language models exhibit intelligent behaviors such as understanding natural language, recognizing patterns, acquiring knowledge, reasoning, planning, reflecting and using tools. This paper explores how their underlying mechanics give rise to intelligent behaviors. Towards that end, we propose framing Transformer dynamics as movement through embedding space. Examining Transformers through this perspective reveals key insights, establishing a Theory of Transformers: 1) Intelligent behaviours map to paths in Embedding Space which, the Transformer random-walks through during inferencing. 2) LM training learns a probability distribution over all possible paths. `Intelligence' is learnt by assigning higher probabilities to paths representing intelligent behaviors. No learning can take place in-context; context only narrows the subset of paths sampled during decoding. 5) The Transformer is a self-map** composition function, folding a context sequence into a context-vector such that it's proximity to a token-vector reflects its co-occurrence and conditioned probability. Thus, the physical arrangement of vectors in Embedding Space determines path probabilities. 6) Context vectors are composed by aggregating features of the sequence's tokens via a process we call the encoding walk. Attention contributes a - potentially redundant - association-bias to this process. 7) This process is comprised of two principal operation types: filtering (data independent) and aggregation (data dependent). This generalization unifies Transformers with other sequence models. Building upon this foundation, we formalize a popular semantic interpretation of embeddings into a ``concept-space theory'' and find some evidence of it's validity.
△ Less
Submitted 14 November, 2023; v1 submitted 21 August, 2023;
originally announced August 2023.
-
Stability Analysis of Cosmological models in $f(T,φ)$ Gravity
Authors:
Amit Samaddar,
S. Surendra Singh
Abstract:
We investigated the stability condition in $f(T,φ)$ gravity theory for considering two models by using dynamical system. We assume the forms of $G(T)$ are $(i)$ $G(T)$ = $αT+\fracβ{T}$, $(ii)$ $G(T)$ = $ζT$ ln$(ψT)$, where $α$, $β$, $ζ$ and $ψ$ be the free parameters. We evaluated the equilibrium points for these models and examine the stability behavior. We found five stable critical points for M…
▽ More
We investigated the stability condition in $f(T,φ)$ gravity theory for considering two models by using dynamical system. We assume the forms of $G(T)$ are $(i)$ $G(T)$ = $αT+\fracβ{T}$, $(ii)$ $G(T)$ = $ζT$ ln$(ψT)$, where $α$, $β$, $ζ$ and $ψ$ be the free parameters. We evaluated the equilibrium points for these models and examine the stability behavior. We found five stable critical points for Model I and three stable critical points for Model II. The phase plots for these systems are examined and discussed the physical interpretation. We illustrate all the cosmological parameters such as $Ω_{m}$, $Ω_φ$, $q$ and $ω_{Tot}$ at each fixed points and compare the parameters with observational values. Further, we assume hybrid scale factor and the equation of redshift and time is $t(z)=\fracδσW\bigg[\fracσδ\bigg(\frac{1}{a_{1}(1+z)}\bigg)^{\frac{1}δ}\bigg]$. We transform all the parameters in redshift by using this equation and examine the behavior of these parameters. Our models represent the accelerating stage of the Universe. The energy conditions are examined in terms of redshift and SEC is not satisfied for the model. We also find the statefinder parameters $\{r,s\}$ in terms of z and discuss the nature of $r-s$ and $r-q$ plane. For both pairs $\{r,s\}$ and $\{r,q\}$ our models represent the $Λ$CDM model. Hence, we determine that our $f(T,φ)$ models are stable and it satisfies all the observational values.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Stability analysis of cosmological models coupled minimally with scalar field in $f(Q)$ gravity
Authors:
Amit Samaddar,
S. Surendra Singh,
Shivangi Rathore
Abstract:
In this work, in the framework of dynamical system analysis, we focus on the study of the accelerated expansion of the Universe of $f(Q)$ gravity theory where $Q$ be the non-metricity that describes the gravitational interaction. We consider the linear form of $f(Q)$ gravity i.e. $f(Q)=-α_{1}Q-α_{2}$ where $α_{1}$ and $α_{2}$ are constants. We consider an interaction between dark matter (DM) and d…
▽ More
In this work, in the framework of dynamical system analysis, we focus on the study of the accelerated expansion of the Universe of $f(Q)$ gravity theory where $Q$ be the non-metricity that describes the gravitational interaction. We consider the linear form of $f(Q)$ gravity i.e. $f(Q)=-α_{1}Q-α_{2}$ where $α_{1}$ and $α_{2}$ are constants. We consider an interaction between dark matter (DM) and dark energy (DE) in $f(Q)$ gravity. To reduce the modified Friedmann equations to an autonomous system of first-order ordinary differential equations, we introduce some dimensionless new variables. The nature of the critical points are discussed by finding the eigenvalues of the Jacobian matrix. We get six critical points for interacting DE model. We also analyze the density parameter, equation of state (EoS) parameter and deceleration parameter and draw their plots and we conclude that for some suitable range of the parameters $λ$ and $α$, the value of the deceleration parameter is $q=-1$ which shows that the expansion of Universe is accelerating and the value of EoS parameter is $ω_φ=-1$ which shows that the model is $Λ$CDM model. Finally, we discussed the classical as well as quantum stability of the model.
△ Less
Submitted 18 January, 2023;
originally announced February 2023.
-
Quasi-Newton Sequential Monte Carlo
Authors:
Samuel Duffield,
Sumeetpal S. Singh
Abstract:
Sequential Monte Carlo samplers represent a compelling approach to posterior inference in Bayesian models, due to being parallelisable and providing an unbiased estimate of the posterior normalising constant. In this work, we significantly accelerate sequential Monte Carlo samplers by adopting the L-BFGS Hessian approximation which represents the state-of-the-art in full-batch optimisation techniq…
▽ More
Sequential Monte Carlo samplers represent a compelling approach to posterior inference in Bayesian models, due to being parallelisable and providing an unbiased estimate of the posterior normalising constant. In this work, we significantly accelerate sequential Monte Carlo samplers by adopting the L-BFGS Hessian approximation which represents the state-of-the-art in full-batch optimisation techniques. The L-BFGS Hessian approximation has only linear complexity in the parameter dimension and requires no additional posterior or gradient evaluations. The resulting sequential Monte Carlo algorithm is adaptive, parallelisable and well-suited to high-dimensional and multi-modal settings, which we demonstrate in numerical experiments on challenging posterior distributions.
△ Less
Submitted 22 November, 2022;
originally announced November 2022.
-
Qualitative Stability Analysis of Cosmological Parameters in $f(T,B)$ Gravity
Authors:
Amit Samaddar,
S. Surendra Singh
Abstract:
We analyze the cosmological solutions of $f(T,B)$ gravity using dynamical system analysis where $T$ is the torsion scalar and $B$ be the boundary term scalar. In our work, we assume two specific cosmological models. For first model, we consider $ f(T,B)=f_{0}(B^{k}+T^{m})$, where $k$ and $m$ are constants. For second model, we consider $f(T,B)=f_{0}T B$. We generate an autonomous system of differe…
▽ More
We analyze the cosmological solutions of $f(T,B)$ gravity using dynamical system analysis where $T$ is the torsion scalar and $B$ be the boundary term scalar. In our work, we assume two specific cosmological models. For first model, we consider $ f(T,B)=f_{0}(B^{k}+T^{m})$, where $k$ and $m$ are constants. For second model, we consider $f(T,B)=f_{0}T B$. We generate an autonomous system of differential equations for each models by introducing new dimensionless variables. To solve this system of equations, we use dynamical system analysis. We also investigate the critical points and their natures, stability conditions and their behaviors of Universe expansion. For both models, we get four critical points. The phase plots of this system are analyzed in detail and study their geometrical interpretations also. In both model, we evaluated density parameters such as $Ω_{r}$, $Ω_{m}$, $Ω_Λ$ and $ω_{eff}$ and deceleration parameter $(q)$ and find their suitable range of the parameter $λ$ for stability. For first model, we get $ω_{eff}=-0.833,-0.166$ and for second model, we get $ω_{eff}=-\frac{1}{3}$. This shows that both the models are in quintessence phase. Further, we compare the values of EoS parameter and deceleration parameter with the observational values.
△ Less
Submitted 10 November, 2022;
originally announced November 2022.
-
Renyi Holographic dark energy and its behaviour in f(G) gravity
Authors:
Md Khurshid Alam,
S. Surendra Singh,
L. Anjana Devi
Abstract:
In this work, the Renyi holographic dark energy (RHDE)and its behaviour has been explored with the anisotropic and spatially homogeneous Bianchi type-I Universe in the framework of $f(G)$ gravity. We use IR cutoff as the Hubble and Granda-Oliveros (GO) horizons. To find a consistent solutions of the field equations of the models, it is assumed that the deceleration parameter is defined in terms of…
▽ More
In this work, the Renyi holographic dark energy (RHDE)and its behaviour has been explored with the anisotropic and spatially homogeneous Bianchi type-I Universe in the framework of $f(G)$ gravity. We use IR cutoff as the Hubble and Granda-Oliveros (GO) horizons. To find a consistent solutions of the field equations of the models, it is assumed that the deceleration parameter is defined in terms of function of Hubble parameter $H$. With reference to current cosmological data, the behaviors of the cosmological parameters relating to the dark energy model are evaluated and their physical significance is examined. It is observed that for both the models, the equation of state parameter approaches to $-1$ at late times. However, the RHDE model with the Hubble horizon exhibits stability from the squared sound speed, but the RHDE model with the GO horizon exhibits instability. In both the models, deceleration parameter and statefinder diagnostic confirm the accelerated expansion of the Universe and also correspond to the $Λ$CDM model at late times.
△ Less
Submitted 26 October, 2022;
originally announced November 2022.
-
Anisotropic Universe in f(Q) gravity with Hybrid expansion
Authors:
L. Anjana Devi,
S. Surendra Singh,
Leishingam Kumrah,
Md Khurshid Alam
Abstract:
Despite having a reasonably successful account of accelerated cosmology, understanding the early evolution of Universe has always been difficult for mankind. Our promising strategy is based on a novel class of symmetric teleparallel theories of gravity called $f(Q)$, in which the gravitational interaction is caused by the non-metricity scalar $Q$, which may help to solve some problems. We consider…
▽ More
Despite having a reasonably successful account of accelerated cosmology, understanding the early evolution of Universe has always been difficult for mankind. Our promising strategy is based on a novel class of symmetric teleparallel theories of gravity called $f(Q)$, in which the gravitational interaction is caused by the non-metricity scalar $Q$, which may help to solve some problems. We consider the locally rotationally symmetric (LRS) Bianchi type-I spacetime cosmological models and derive the motion of equations to study the early evolution of the cosmos. By assuming the Hybrid Expansion Law (HEL) for the average scale factor, we are able to determine the solutions to the field equations of Bianchi type-I spacetime. We discuss the energy density profile, the equation of state, and the skewness parameter and conclude that our models preserve anisotropic spatial geometry during the early stages of the Universe with the possibility of an anisotropic fluid present. However, as time goes on, even in the presence of an anisotropic fluid, the Universe may move towards isotropy due to inflation while the anisotropy of the fluid dims away at the same time. It is seen from the squared speed of sound that Universe shows phantom nature at the beginning then approaches to dark energy at present epoch. We analyze both geometrical and physical behaviors of the derived model.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
FRW cosmology with varying cubic deceleration parameter
Authors:
Leishingam Kumrah,
S. Surendra Singh,
L. Anjana Devi,
Md Khurshid Alam
Abstract:
In this work a new law of varying deceleration parameter of third degree have been proposed. The solutions of the modified field equations have been derived under the newly proposed law of the deceleration parameter. Model exhibits the Big-bang singularity at cosmic time ($t=0$) and shows Big Rip at ($t=n$) then it re-enter the phase of initial singularity at $t=2n$ and ends its cyclic behavior at…
▽ More
In this work a new law of varying deceleration parameter of third degree have been proposed. The solutions of the modified field equations have been derived under the newly proposed law of the deceleration parameter. Model exhibits the Big-bang singularity at cosmic time ($t=0$) and shows Big Rip at ($t=n$) then it re-enter the phase of initial singularity at $t=2n$ and ends its cyclic behavior at $t=3n$. The evolution of the physical and dynamical parameters of the Universe have been studied and the graphical representation has also been shown. Further $Om(z)$ diagnostic parameter and the energy conditions have also been studied together with their graphical representations.
△ Less
Submitted 19 August, 2022;
originally announced August 2022.
-
De-biasing particle filtering for a continuous time hidden Markov model with a Cox process observation model
Authors:
Ruiyang **,
Sumeetpal S. Singh,
Nicolas Chopin
Abstract:
We develop a (nearly) unbiased particle filtering algorithm for a specific class of continuous-time state-space models, such that (a) the latent process $X_t$ is a linear Gaussian diffusion; and (b) the observations arise from a Poisson process with intensity $λ(X_t)$. The likelihood of the posterior probability density function of the latent process includes an intractable path integral. Our algo…
▽ More
We develop a (nearly) unbiased particle filtering algorithm for a specific class of continuous-time state-space models, such that (a) the latent process $X_t$ is a linear Gaussian diffusion; and (b) the observations arise from a Poisson process with intensity $λ(X_t)$. The likelihood of the posterior probability density function of the latent process includes an intractable path integral. Our algorithm relies on Poisson estimates which approximate unbiasedly this integral. We show how we can tune these Poisson estimates to ensure that, with large probability, all but a few of the estimates generated by the algorithm are positive. Then replacing the negative estimates by zero leads to a much smaller bias than what would obtain through discretisation. We quantify the probability of negative estimates for certain special cases and show that our particle filter is effectively unbiased. We apply our method to a challenging 3D single molecule tracking example with a Born and Wolf observation model.
△ Less
Submitted 30 June, 2022; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Conditional particle filters with bridge backward sampling
Authors:
Santeri Karppinen,
Sumeetpal S. Singh,
Matti Vihola
Abstract:
Conditional particle filters (CPFs) with backward/ancestor sampling are powerful methods for sampling from the posterior distribution of the latent states of a dynamic model such as a hidden Markov model. However, the performance of these methods deteriorates with models involving weakly informative observations and/or slowly mixing dynamics. Both of these complications arise when sampling finely…
▽ More
Conditional particle filters (CPFs) with backward/ancestor sampling are powerful methods for sampling from the posterior distribution of the latent states of a dynamic model such as a hidden Markov model. However, the performance of these methods deteriorates with models involving weakly informative observations and/or slowly mixing dynamics. Both of these complications arise when sampling finely time-discretised continuous-time path integral models, but can occur with hidden Markov models too. Multinomial resampling, which is commonly employed with CPFs, resamples excessively for weakly informative observations and thereby introduces extra variance. Furthermore, slowly mixing dynamics render the backward/ancestor sampling steps ineffective, leading to degeneracy issues. We detail two conditional resampling strategies suitable for the weakly informative regime: the so-called `killing' resampling and the systematic resampling with mean partial order. To avoid the degeneracy issues, we introduce a generalisation of the CPF with backward sampling that involves auxiliary `bridging' CPF steps that are parameterised by a blocking sequence. We present practical tuning strategies for choosing an appropriate blocking. Our experiments demonstrate that the CPF with a suitable resampling and the developed `bridge backward sampling' can lead to substantial efficiency gains in the weakly informative and slow mixing regime.
△ Less
Submitted 19 June, 2023; v1 submitted 27 May, 2022;
originally announced May 2022.
-
Dynamical systems of cosmological models for different possibilities of $G$ and $ρ_Λ$
Authors:
Chingtham Sonia,
S. Surendra Singh
Abstract:
The present paper deals with the dynamics of spatially flat Friedmann-Lemaitre-Robertson-Walker (FLRW) cosmological model with a time varying cosmological constant $Λ$ where $Λ$ evolves with the cosmic time (t) through the Hubble parameter (H). We consider that the model dynamics has a reflection symmetry $H \rightarrow -H $ with $Λ(H)$ expressed in the form of Taylor series with respect to H. Dyn…
▽ More
The present paper deals with the dynamics of spatially flat Friedmann-Lemaitre-Robertson-Walker (FLRW) cosmological model with a time varying cosmological constant $Λ$ where $Λ$ evolves with the cosmic time (t) through the Hubble parameter (H). We consider that the model dynamics has a reflection symmetry $H \rightarrow -H $ with $Λ(H)$ expressed in the form of Taylor series with respect to H. Dynamical systems for three different cases based on the possibilities of gravitational constant G and the vacuum energy density $ρ_Λ$ have been analysed. In Case I, both G and $ρ_Λ$ are taken to be constant. We analyse stability of the system by using the notion of spectral radius, behavior of perturbation along each of the axis with respect to cosmic time and Poincare sphere. In Case II, we have dynamical system analysis for G=constant and $ρ_Λ \neq $ constant where we study stability by using the concept of spectral radius and perturbation function. In Case III, we take $G \neq$ constant and $ρ_Λ \neq$ constant where we introduce a new set of variables to set up the corresponding dynamical system. We find out the fixed points of the system and analyse the stability from different directions: by analysing behaviour of the perturbation along each of the axis, Center Manifold Theory and stability at infinity using Poincare sphere respectively. Phase plots and perturbation plots have been presented. We deeply study the cosmological scenario with respect to the fixed points obtained and analyse the late time behavior of the Universe. Our model agrees with the fact that the Universe is in the epoch of accelerated expansion. The EOS parameter $ω_{eff}$, total energy density $Ω_{tt}$ are also evaluated at the fixed points for each of the three cases and these values are in agreement with the observational values in [1].
△ Less
Submitted 31 March, 2022;
originally announced March 2022.
-
Multilevel Bayesian Deep Neural Networks
Authors:
Neil K. Chada,
Ajay Jasra,
Kody J. H. Law,
Sumeetpal S. Singh
Abstract:
In this article we consider Bayesian inference associated to deep neural networks (DNNs) and in particular, trace-class neural network (TNN) priors which were proposed by Sell et al. [39]. Such priors were developed as more robust alternatives to classical architectures in the context of inference problems. For this work we develop multilevel Monte Carlo (MLMC) methods for such models. MLMC is a p…
▽ More
In this article we consider Bayesian inference associated to deep neural networks (DNNs) and in particular, trace-class neural network (TNN) priors which were proposed by Sell et al. [39]. Such priors were developed as more robust alternatives to classical architectures in the context of inference problems. For this work we develop multilevel Monte Carlo (MLMC) methods for such models. MLMC is a popular variance reduction technique, with particular applications in Bayesian statistics and uncertainty quantification. We show how a particular advanced MLMC method that was introduced in [4] can be applied to Bayesian inference from DNNs and establish mathematically, that the computational cost to achieve a particular mean square error, associated to posterior expectation computation, can be reduced by several orders, versus more conventional techniques. To verify such results we provide numerous numerical experiments on model problems arising in machine learning. These include Bayesian regression, as well as Bayesian classification and reinforcement learning.
△ Less
Submitted 20 July, 2022; v1 submitted 24 March, 2022;
originally announced March 2022.
-
On resampling schemes for particle filters with weakly informative observations
Authors:
Nicolas Chopin,
Sumeetpal S. Singh,
Tomás Soto,
Matti Vihola
Abstract:
We consider particle filters with weakly informative observations (or `potentials') relative to the latent state dynamics. The particular focus of this work is on particle filters to approximate time-discretisations of continuous-time Feynman--Kac path integral models -- a scenario that naturally arises when addressing filtering and smoothing problems in continuous time -- but our findings are ind…
▽ More
We consider particle filters with weakly informative observations (or `potentials') relative to the latent state dynamics. The particular focus of this work is on particle filters to approximate time-discretisations of continuous-time Feynman--Kac path integral models -- a scenario that naturally arises when addressing filtering and smoothing problems in continuous time -- but our findings are indicative about weakly informative settings beyond this context too. We study the performance of different resampling schemes, such as systematic resampling, SSP (Srinivasan sampling process) and stratified resampling, as the time-discretisation becomes finer and also identify their continuous-time limit, which is expressed as a suitably defined `infinitesimal generator.' By contrasting these generators, we find that (certain modifications of) systematic and SSP resampling `dominate' stratified and independent `killing' resampling in terms of their limiting overall resampling rate. The reduced intensity of resampling manifests itself in lower variance in our numerical experiment. This efficiency result, through an ordering of the resampling rate, is new to the literature. The second major contribution of this work concerns the analysis of the limiting behaviour of the entire population of particles of the particle filter as the time discretisation becomes finer. We provide the first proof, under general conditions, that the particle approximation of the discretised continuous-time Feynman--Kac path integral models converges to a (uniformly weighted) continuous-time particle system.
△ Less
Submitted 9 July, 2022; v1 submitted 18 March, 2022;
originally announced March 2022.
-
Interaction of Anisotropic Dark Energy with Generalized Hybrid Expansion Law
Authors:
Md. Khurshid Alam,
S. Surendra Singh,
L. Anjana Devi
Abstract:
Interaction of dark energy in the anisotropic Locally Rotationally Symmetric (LRS) Bianchi type-I metric is investigated in the context of modified f(R,T) theory of gravity, where R is the Ricci scalar and T is the trace of stress energy momentum tensor. We choose the particular form of the functional f(R,T)=f_1 (R,T)+f_2 (R,T) then we find the exact solutions of the field equations by applying in…
▽ More
Interaction of dark energy in the anisotropic Locally Rotationally Symmetric (LRS) Bianchi type-I metric is investigated in the context of modified f(R,T) theory of gravity, where R is the Ricci scalar and T is the trace of stress energy momentum tensor. We choose the particular form of the functional f(R,T)=f_1 (R,T)+f_2 (R,T) then we find the exact solutions of the field equations by applying inhomogeneous equation of state, p= ω\r{ho}-Λ(t) and a generalized form of hybrid expansion law. Transition of deceleration to acceleration is observed in this model. It is also observed that the Universe shows accelerated expansion at late epoch. The derived model overlaps with ΛCDM at late time which is in agreement with present observation. Energy conditions of the derived model are also investigated. From the plot, we observe the age of Universe t_0=13.821 Gyr for the observed H_0=70.07Kms^(-1) Mpc^(-1). The physical and geometrical behaviours of these models are also discussed.
△ Less
Submitted 10 December, 2021;
originally announced December 2021.
-
Ensemble Kalman Inversion for General Likelihoods
Authors:
Samuel Duffield,
Sumeetpal S. Singh
Abstract:
In this letter we generalise Ensemble Kalman inversion techniques to general Bayesian models where previously they were restricted to additive Gaussian likelihoods - all in the difficult setting where the likelihood can be sampled from, but its density not necessarily evaluated.
In this letter we generalise Ensemble Kalman inversion techniques to general Bayesian models where previously they were restricted to additive Gaussian likelihoods - all in the difficult setting where the likelihood can be sampled from, but its density not necessarily evaluated.
△ Less
Submitted 7 June, 2022; v1 submitted 6 October, 2021;
originally announced October 2021.
-
Limits of accuracy for parameter estimation and localisation in Single-Molecule Microscopy via sequential Monte Carlo methods
Authors:
A. Marie d'Avigneau,
S. S. Singh,
R. J. Ober
Abstract:
Assessing the quality of parameter estimates for models describing the motion of single molecules in cellular environments is an important problem in fluorescence microscopy. We consider the fundamental data model, where molecules emit photons at random times and the photons arrive at random locations on the detector according to complex point spread functions (PSFs). The random, non-Gaussian PSF…
▽ More
Assessing the quality of parameter estimates for models describing the motion of single molecules in cellular environments is an important problem in fluorescence microscopy. We consider the fundamental data model, where molecules emit photons at random times and the photons arrive at random locations on the detector according to complex point spread functions (PSFs). The random, non-Gaussian PSF of the detection process and random trajectory of the molecule make inference challenging. Moreover, the presence of other nearby molecules causes further uncertainty in the origin of the measurements, which impacts the statistical precision of estimates. We quantify the limits of accuracy of model parameter estimates and separation distance between closely spaced molecules (known as the resolution problem) by computing the Cramer-Rao lower bound (CRLB), or equivalently the inverse of the Fisher information matrix (FIM), for the variance of estimates. This fundamental CRLB is crucial, as it provides a lower bound for more practical scenarios. While analytic expressions for the FIM can be derived for static molecules, the analytical tools to evaluate it for molecules whose trajectories follow SDEs are still mostly missing. We address this by presenting a general SMC based methodology for both parameter inference and computing the desired accuracy limits for non-static molecules and a non-Gaussian fundamental detection model. For the first time, we are able to estimate the FIM for stochastically moving molecules observed through the Airy and Born & Wolf PSF. This is achieved by estimating the score and observed information matrix via SMC. We sum up the outcome of our numerical work by summarising the qualitative behaviours for the accuracy limits as functions of e.g. collected photon count, molecule diffusion, etc. We also verify that we can recover known results from the static molecule case.
△ Less
Submitted 14 September, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
Three loop correction in the formation of QGP droplet
Authors:
M. Jena,
K. K. Gupta,
S. Somorendro Singh
Abstract:
Quark-gluon plasma (QGP) droplet formation is re-considered with the addition of three loop correction to the earlier loop factors in the mean field potential. The correction of the three loop factor increases stability in the droplet formations of QGP at different parametrization factors of the QGP fluid and it is in better agreement in comparison to the lattice results of pressure, energy densit…
▽ More
Quark-gluon plasma (QGP) droplet formation is re-considered with the addition of three loop correction to the earlier loop factors in the mean field potential. The correction of the three loop factor increases stability in the droplet formations of QGP at different parametrization factors of the QGP fluid and it is in better agreement in comparison to the lattice results of pressure, energy density and other thermodynamic relations. This implies that the contribution of the three loop enhances in showing the characteristic features of the QGP fluid. It shows that increasing the loop increased the strength of parametrization value which we defined earlier as a number parameter of fluid dynamics. It indicates that the model with the loop correction boosts in explaining about the formation of QGP droplet in the expansion of early universe
△ Less
Submitted 26 March, 2021;
originally announced March 2021.
-
Gradient-Based Markov Chain Monte Carlo for Bayesian Inference With Non-Differentiable Priors
Authors:
Jacob Vorstrup Goldman,
Torben Sell,
Sumeetpal Sidhu Singh
Abstract:
The use of non-differentiable priors in Bayesian statistics has become increasingly popular, in particular in Bayesian imaging analysis. Current state of the art methods are approximate in the sense that they replace the posterior with a smooth approximation via Moreau-Yosida envelopes, and apply gradient-based discretized diffusions to sample from the resulting distribution. We characterize the e…
▽ More
The use of non-differentiable priors in Bayesian statistics has become increasingly popular, in particular in Bayesian imaging analysis. Current state of the art methods are approximate in the sense that they replace the posterior with a smooth approximation via Moreau-Yosida envelopes, and apply gradient-based discretized diffusions to sample from the resulting distribution. We characterize the error of the Moreau-Yosida approximation and propose a novel implementation using underdamped Langevin dynamics. In misson-critical cases, however, replacing the posterior with an approximation may not be a viable option. Instead, we show that Piecewise-Deterministic Markov Processes (PDMP) can be utilized for exact posterior inference from distributions satisfying almost everywhere differentiability. Furthermore, in contrast with diffusion-based methods, the suggested PDMP-based samplers place no assumptions on the prior shape, nor require access to a computationally cheap proximal operator, and consequently have a much broader scope of application. Through detailed numerical examples, including a non-differentiable circular distribution and a non-convex genomics model, we elucidate the relative strengths of these sampling methods on problems of moderate to high dimensions, underlining the benefits of PDMP-based methods when accurate sampling is decisive.
△ Less
Submitted 16 March, 2021;
originally announced March 2021.
-
Full Page Handwriting Recognition via Image to Sequence Extraction
Authors:
Sumeet S. Singh,
Sergey Karayev
Abstract:
We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation. Being based on Image to Sequence architecture, it can extract text present in an image and then sequence it correctly without imposing any constraints regarding orientation, layout and size of text and non-tex…
▽ More
We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation. Being based on Image to Sequence architecture, it can extract text present in an image and then sequence it correctly without imposing any constraints regarding orientation, layout and size of text and non-text. Further, it can also be trained to generate auxiliary markup related to formatting, layout and content. We use character level vocabulary, thereby enabling language and terminology of any subject. The model achieves a new state-of-art in paragraph level recognition on the IAM dataset. When evaluated on scans of real world handwritten free form test answers - beset with curved and slanted lines, drawings, tables, math, chemistry and other symbols - it performs better than all commercially available HTR cloud APIs. It is deployed in production as part of a commercial web application.
△ Less
Submitted 26 June, 2022; v1 submitted 10 March, 2021;
originally announced March 2021.
-
Spatiotemporal blocking of the bouncy particle sampler for efficient inference in state space models
Authors:
Jacob Vorstrup Goldman,
Sumeetpal Sidhu Singh
Abstract:
We propose a novel blocked version of the continuous-time bouncy particle sampler of [Bouchard-Côté et al., 2018] which is applicable to any differentiable probability density. This alternative implementation is motivated by blocked Gibbs sampling for state space models [Singh et al., 2017] and leads to significant improvement in terms of effective sample size per second, and furthermore, allows f…
▽ More
We propose a novel blocked version of the continuous-time bouncy particle sampler of [Bouchard-Côté et al., 2018] which is applicable to any differentiable probability density. This alternative implementation is motivated by blocked Gibbs sampling for state space models [Singh et al., 2017] and leads to significant improvement in terms of effective sample size per second, and furthermore, allows for significant parallelization of the resulting algorithm. The new algorithms are particularly efficient for latent state inference in high-dimensional state space models, where blocking in both space and time is necessary to avoid degeneracy of MCMC. The efficiency of our blocked bouncy particle sampler, in comparison with both the standard implementation of the bouncy particle sampler and the particle Gibbs algorithm of Andrieu et al. [2010], is illustrated numerically for both simulated data and a challenging real-world financial dataset.
△ Less
Submitted 9 July, 2021; v1 submitted 8 January, 2021;
originally announced January 2021.
-
Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC
Authors:
Torben Sell,
Sumeetpal S. Singh
Abstract:
This paper introduces a new neural network based prior for real valued functions on $\mathbb R^d$ which, by construction, is more easily and cheaply scaled up in the domain dimension $d$ compared to the usual Karhunen-Loève function space prior. The new prior is a Gaussian neural network prior, where each weight and bias has an independent Gaussian prior, but with the key difference that the varia…
▽ More
This paper introduces a new neural network based prior for real valued functions on $\mathbb R^d$ which, by construction, is more easily and cheaply scaled up in the domain dimension $d$ compared to the usual Karhunen-Loève function space prior. The new prior is a Gaussian neural network prior, where each weight and bias has an independent Gaussian prior, but with the key difference that the variances decrease in the width of the network in such a way that the resulting function is \emph{almost surely} well defined in the limit of an infinite width network. We show that in a Bayesian treatment of inferring unknown functions, the induced posterior over functions is amenable to Monte Carlo sampling using Hilbert space Markov chain Monte Carlo (MCMC) methods. This type of MCMC is popular, e.g. in the Bayesian Inverse Problems literature, because it is stable under \emph{mesh refinement}, i.e. the acceptance probability does not shrink to $0$ as more parameters of the function's prior are introduced, even \emph{ad infinitum}. In numerical examples we demonstrate these stated competitive advantages over other function space priors. We also implement examples in Bayesian Reinforcement Learning to automate tasks from data and demonstrate, for the first time, stability of MCMC to mesh refinement for these type of problems.
△ Less
Submitted 8 September, 2022; v1 submitted 20 December, 2020;
originally announced December 2020.
-
Online Particle Smoothing with Application to Map-matching
Authors:
Samuel Duffield,
Sumeetpal S. Singh
Abstract:
We introduce a novel method for online smoothing in state-space models that utilises a fixed-lag approximation to overcome the well known issue of path degeneracy. Unlike classical fixed-lag techniques that only approximate certain marginals, we introduce an online resampling algorithm, called particle stitching, that converts these marginal samples into a full posterior approximation. We demonstr…
▽ More
We introduce a novel method for online smoothing in state-space models that utilises a fixed-lag approximation to overcome the well known issue of path degeneracy. Unlike classical fixed-lag techniques that only approximate certain marginals, we introduce an online resampling algorithm, called particle stitching, that converts these marginal samples into a full posterior approximation. We demonstrate the utility of our method in the context of map-matching, the task of inferring a vehicle's trajectory given a road network and noisy GPS observations. We develop a new state-space model for the difficult task of map-matching on dense, urban road networks.
△ Less
Submitted 2 August, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Quark Gluon Plasma (QGP) evolution under loop corrections
Authors:
K K Gupta,
Agam K Jha,
S. Somorendro Singh
Abstract:
We review free energy evolution of QGP (Quark-gluon plasma) under zero-loop, one loop and two loop corrections in the mean field potential. The free energies of QGP under the comparison of zero-loop and loop corrections of the interacting potential among the quarks, anti-quarks and gluons are shown. We observe that the formation of stable QGP droplet is dependent on the loop corrections with the d…
▽ More
We review free energy evolution of QGP (Quark-gluon plasma) under zero-loop, one loop and two loop corrections in the mean field potential. The free energies of QGP under the comparison of zero-loop and loop corrections of the interacting potential among the quarks, anti-quarks and gluons are shown. We observe that the formation of stable QGP droplet is dependent on the loop corrections with the different parametrization values of fluid. With the increase in the parametrization value, stability of droplet formation increases with smaller size of droplet. This indicates that the formation of QGP droplet can be signified more importantly by the parametrization value like the Reynold number in fluid dynamics. It means that there may be different phenomenological parameter to define the stable QGP droplet when QGP fluid is studied under loop corrections.
△ Less
Submitted 18 October, 2020;
originally announced October 2020.
-
Anytime Parallel Tempering
Authors:
A. Marie d'Avigneau,
S. S. Singh,
L. M. Murray
Abstract:
Develo** efficient MCMC algorithms is indispensable in Bayesian inference. In parallel tempering, multiple interacting MCMC chains run to more efficiently explore the state space and improve performance. The multiple chains advance independently through local moves, and the performance enhancement steps are exchange moves, where the chains pause to exchange their current sample amongst each othe…
▽ More
Develo** efficient MCMC algorithms is indispensable in Bayesian inference. In parallel tempering, multiple interacting MCMC chains run to more efficiently explore the state space and improve performance. The multiple chains advance independently through local moves, and the performance enhancement steps are exchange moves, where the chains pause to exchange their current sample amongst each other. To accelerate the independent local moves, they may be performed simultaneously on multiple processors. Another problem is then encountered: depending on the MCMC implementation and inference problem, local moves can take a varying and random amount of time to complete. There may also be infrastructure-induced variations, such as competing jobs on the same processors, which arises in cloud computing. Before exchanges can occur, all chains must complete the local moves they are engaged in to avoid introducing a potentially substantial bias (Proposition 2.1). To solve this issue of randomly varying local move completion times in multi-processor parallel tempering, we adopt the Anytime Monte Carlo framework of Murray et al. (2016): we impose real-time deadlines on the parallel local moves and perform exchanges at these deadlines without any processor idling. We show our methodology for exchanges at real-time deadlines does not introduce a bias and leads to significant performance enhancements over the naïve approach of idling until every processor's local moves complete. The methodology is then applied in an ABC setting, where an Anytime ABC parallel tempering algorithm is derived for the difficult task of estimating the parameters of a Lotka-Volterra predator-prey model, and similar efficiency enhancements are observed.
△ Less
Submitted 14 September, 2021; v1 submitted 26 June, 2020;
originally announced June 2020.
-
Evolution of Kaluza-Klein Like Wet Dark Fluid in $f(R,T)$ Theory of Gravitation
Authors:
Koijam Manihar Singh,
S. Surendra Singh,
Leishingam Kumrah
Abstract:
Here we study the essence of $f(R,T)$ gravitation theory in five dimensional Universe and see the role of dark energy in the form of wet dark fluid in such a Universe. It is found that the dark energy is not exaggerated in contributing to the accelerating expansion of the Universe though the expansion is inherent as a result of the theory itself and due to the geometric contribution of matter. It…
▽ More
Here we study the essence of $f(R,T)$ gravitation theory in five dimensional Universe and see the role of dark energy in the form of wet dark fluid in such a Universe. It is found that the dark energy is not exaggerated in contributing to the accelerating expansion of the Universe though the expansion is inherent as a result of the theory itself and due to the geometric contribution of matter. It is interesting to see that in some model it is found that there was some era before the beginning of the present era, and some of the model Universe came out to be either oscillatory or cyclic. Some of the models are seen to go to $ΛCDM$ models in late future as in Einstein gravitation theory, starting the evolution with a big bang. Most of the models undergo early inflation as well as late time accelerating expansion thus defining as good models for real astrophysical situations, with dark energy playing fundamental role in these Universe.
△ Less
Submitted 13 June, 2019;
originally announced July 2019.
-
Dynamical system perspective of cosmological models minimally coupled with scalar field
Authors:
S. Surendra Singh,
Chingtham Sonia
Abstract:
The stability criteria for spatially flat homogeneous and isotropic cosmological dynamical system is investigated with the interaction of a scalar field endowed with a perfect fluid.In this paper, we depict the dynamical system perspective to study, qualitatively, the scalar field cosmology under two special cases, with and without potential. For analysis with potential we use simple exponential p…
▽ More
The stability criteria for spatially flat homogeneous and isotropic cosmological dynamical system is investigated with the interaction of a scalar field endowed with a perfect fluid.In this paper, we depict the dynamical system perspective to study, qualitatively, the scalar field cosmology under two special cases, with and without potential. For analysis with potential we use simple exponential potential form, $V_{o}e^{-λφ}$. We generate, by introducing new dimensionless variables, an autonomous system of ordinary differential equations $(ASODE)$ for each case and obtain respective fixed points. We also analyse the type of fixed points, nature and stability of the fixed points and how their nature and behavior reflect towards the cosmic scenarios. Throughout the whole work, the investigation of this model has shown us the deep connection between these theories and cosmic acceleration phenomena. The phase plots of the system at different conditions and different values of $γ$ have been analyzed in detail and their interpretations have been worked out.The perturbation plots of the dynamical system have also been studied and analyzed which emphasize our analytical findings.
△ Less
Submitted 22 January, 2021; v1 submitted 18 June, 2019;
originally announced June 2019.
-
Backward It{ô}-Ventzell and stochastic interpolation formulae
Authors:
Pierre del Moral,
Sumeetpal Sidhu Singh
Abstract:
We present a novel backward It{ô}-Ventzell formula and an extension of the Aleeksev-Gröbner interpolating formula to stochastic flows. We also present some natural spectral conditions that yield direct and simple proofs of time uniform estimates of the difference between the two stochastic flows when their drift and diffusion functions are not the same, yielding what seems to be the first results…
▽ More
We present a novel backward It{ô}-Ventzell formula and an extension of the Aleeksev-Gröbner interpolating formula to stochastic flows. We also present some natural spectral conditions that yield direct and simple proofs of time uniform estimates of the difference between the two stochastic flows when their drift and diffusion functions are not the same, yielding what seems to be the first results of this type for this class of anticipative models. We illustrate the impact of these results in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations
△ Less
Submitted 4 May, 2021; v1 submitted 21 June, 2019;
originally announced June 2019.
-
Structural properties and decay modes of Z $=$ 122, 120 and 118 superheavy nuclei
Authors:
G. Saxena,
M. Kumawat,
S. Somorendro Singh,
Mamta Aggarwal
Abstract:
Structural properties and the decay modes of the superheavy elements Z $=$ 122, 120, 118 are studied in a microscopic framework. We evaluate the binding energy, one- and two- proton and neutron separation energy, shell correction and density profile of even and odd isotopes of Z $=$ 122, 120, 118 (284 $\leq$ A $\leq$ 352) which show a reasonable match with FRDM results and the available experiment…
▽ More
Structural properties and the decay modes of the superheavy elements Z $=$ 122, 120, 118 are studied in a microscopic framework. We evaluate the binding energy, one- and two- proton and neutron separation energy, shell correction and density profile of even and odd isotopes of Z $=$ 122, 120, 118 (284 $\leq$ A $\leq$ 352) which show a reasonable match with FRDM results and the available experimental data. Equillibrium shape and deformation of the superheavy region are predicted. We investigate the possible decay modes of this region specifically $α$-decay, spontaneous fission (SF) and the $β$-decay and evaluate the probable $α$-decay chains. The phenomena of bubble like structure in the charge density is predicted in $^{330}$122, $^{292,328}$120 and $^{326}$118 with significant depletion fraction around 20-24$\%$ which increases with increasing Coulomb energy and diminishes with increasing isospin (N$-$Z) values exhibiting the fact that the coloumb forces are the main driving force in the central depletion in superheavy systems.
△ Less
Submitted 17 January, 2019;
originally announced January 2019.
-
Asymptotic Analysis of Model Selection Criteria for General Hidden Markov Models
Authors:
Shouto Yonekura,
Alexandros Beskos,
Sumeetpal S. Singh
Abstract:
The paper obtains analytical results for the asymptotic properties of Model Selection Criteria -- widely used in practice -- for a general family of hidden Markov models (HMMs), thereby substantially extending the related theory beyond typical i.i.d.-like model structures and filling in an important gap in the relevant literature. In particular, we look at the Bayesian and Akaike Information Crite…
▽ More
The paper obtains analytical results for the asymptotic properties of Model Selection Criteria -- widely used in practice -- for a general family of hidden Markov models (HMMs), thereby substantially extending the related theory beyond typical i.i.d.-like model structures and filling in an important gap in the relevant literature. In particular, we look at the Bayesian and Akaike Information Criteria (BIC and AIC) and the model evidence. In the setting of nested classes of models, we prove that BIC and the evidence are strongly consistent for HMMs (under regularity conditions), whereas AIC is not weakly consistent. Numerical experiments support our theoretical results.
△ Less
Submitted 30 March, 2020; v1 submitted 28 November, 2018;
originally announced November 2018.
-
Stability of Conditional Sequential Monte Carlo
Authors:
Bernd Kuhlenschmidt,
Sumeetpal S. Singh
Abstract:
The particle Gibbs (PG) sampler is a Markov Chain Monte Carlo (MCMC) algorithm, which uses an interacting particle system to perform the Gibbs steps. Each Gibbs step consists of simulating a particle system conditioned on one particle path. It relies on a conditional Sequential Monte Carlo (cSMC) method to create the particle system. We propose a novel interpretation of the cSMC algorithm as a per…
▽ More
The particle Gibbs (PG) sampler is a Markov Chain Monte Carlo (MCMC) algorithm, which uses an interacting particle system to perform the Gibbs steps. Each Gibbs step consists of simulating a particle system conditioned on one particle path. It relies on a conditional Sequential Monte Carlo (cSMC) method to create the particle system. We propose a novel interpretation of the cSMC algorithm as a perturbed Sequential Monte Carlo (SMC) method and apply telescopic decompositions developed for the analysis of SMC algorithms \cite{delmoral2004} to derive a bound for the distance between the expected sampled path from cSMC and the target distribution of the MCMC algorithm. This can be used to get a uniform ergodicity result. In particular, we can show that the mixing rate of cSMC can be kept constant by increasing the number of particles linearly with the number of observations. Based on our decomposition, we also prove a central limit theorem for the cSMC Algorithm, which cannot be done using the approaches in \cite{Andrieu2013} and \cite{Lindsten2014}.
△ Less
Submitted 18 June, 2018;
originally announced June 2018.
-
Coupled conditional backward sampling particle filter
Authors:
Anthony Lee,
Sumeetpal S. Singh,
Matti Vihola
Abstract:
The conditional particle filter (CPF) is a promising algorithm for general hidden Markov model smoothing. Empirical evidence suggests that the variant of CPF with backward sampling (CBPF) performs well even with long time series. Previous theoretical results have not been able to demonstrate the improvement brought by backward sampling, whereas we provide rates showing that CBPF can remain effecti…
▽ More
The conditional particle filter (CPF) is a promising algorithm for general hidden Markov model smoothing. Empirical evidence suggests that the variant of CPF with backward sampling (CBPF) performs well even with long time series. Previous theoretical results have not been able to demonstrate the improvement brought by backward sampling, whereas we provide rates showing that CBPF can remain effective with a fixed number of particles independent of the time horizon. Our result is based on analysis of a new coupling of two CBPFs, the coupled conditional backward sampling particle filter (CCBPF). We show that CCBPF has good stability properties in the sense that with fixed number of particles, the coupling time in terms of iterations increases only linearly with respect to the time horizon under a general (strong mixing) condition. The CCBPF is useful not only as a theoretical tool, but also as a practical method that allows for unbiased estimation of smoothing expectations, following the recent developments by Jacob et al. (to appear). Unbiased estimation has many advantages, such as enabling the construction of asymptotically exact confidence intervals and straightforward parallelisation.
△ Less
Submitted 28 August, 2019; v1 submitted 15 June, 2018;
originally announced June 2018.
-
On Large Lag Smoothing for Hidden Markov Models
Authors:
Jeremie Houssineau,
Ajay Jasra,
Sumeetpal S. Singh
Abstract:
In this article we consider the smoothing problem for hidden Markov models (HMM). Given a hidden Markov chain $\{X_n\}_{n\geq 0}$ and observations $\{Y_n\}_{n\geq 0}$, our objective is to compute $\mathbb{E}[\varphi(X_0,\dots,X_k)|y_{0},\dots,y_n]$ for some real-valued, integrable functional $\varphi$ and $k$ fixed, $k \ll n$ and for some realisation $(y_0,\dots,y_n)$ of $(Y_0,\dots,Y_n)$. We intr…
▽ More
In this article we consider the smoothing problem for hidden Markov models (HMM). Given a hidden Markov chain $\{X_n\}_{n\geq 0}$ and observations $\{Y_n\}_{n\geq 0}$, our objective is to compute $\mathbb{E}[\varphi(X_0,\dots,X_k)|y_{0},\dots,y_n]$ for some real-valued, integrable functional $\varphi$ and $k$ fixed, $k \ll n$ and for some realisation $(y_0,\dots,y_n)$ of $(Y_0,\dots,Y_n)$. We introduce a novel application of the multilevel Monte Carlo (MLMC) method with a coupling based on the Knothe-Rosenblatt rearrangement. We prove that this method can approximate the afore-mentioned quantity with a mean square error (MSE) of $\mathcal{O}(ε^2)$, for arbitrary $ε>0$ with a cost of $\mathcal{O}(ε^{-2})$. This is in contrast to the same direct Monte Carlo method, which requires a cost of $\mathcal{O}(nε^{-2})$ for the same MSE. The approach we suggest is, in general, not possible to implement, so the optimal transport methodology of \cite{span} is used, which directly approximates our strategy. We show that our theoretical improvements are achieved, even under approximation, in several numerical examples.
△ Less
Submitted 19 April, 2018;
originally announced April 2018.
-
On the loss of Fisher information in some multi-object tracking observation models
Authors:
Jeremie Houssineau,
Ajay Jasra,
Sumeetpal S. Singh
Abstract:
The concept of Fisher information can be useful even in cases where the probability distributions of interest are not absolutely continuous with respect to the natural reference measure on the underlying space. Practical examples where this extension is useful are provided in the context of multi-object tracking statistical models. Upon defining the Fisher information without introducing a referen…
▽ More
The concept of Fisher information can be useful even in cases where the probability distributions of interest are not absolutely continuous with respect to the natural reference measure on the underlying space. Practical examples where this extension is useful are provided in the context of multi-object tracking statistical models. Upon defining the Fisher information without introducing a reference measure, we provide remarkably concise proofs of the loss of Fisher information in some widely used multi-object tracking observation models.
△ Less
Submitted 26 March, 2018;
originally announced March 2018.
-
Teaching Machines to Code: Neural Markup Generation with Visual Attention
Authors:
Sumeet S. Singh
Abstract:
We present a neural transducer model with visual attention that learns to generate LaTeX markup of a real-world math formula given its image. Applying sequence modeling and transduction techniques that have been very successful across modalities such as natural language, image, handwriting, speech and audio; we construct an image-to-markup model that learns to produce syntactically and semanticall…
▽ More
We present a neural transducer model with visual attention that learns to generate LaTeX markup of a real-world math formula given its image. Applying sequence modeling and transduction techniques that have been very successful across modalities such as natural language, image, handwriting, speech and audio; we construct an image-to-markup model that learns to produce syntactically and semantically correct LaTeX markup code over 150 words long and achieves a BLEU score of 89%; improving upon the previous state-of-art for the Im2Latex problem. We also demonstrate with heat-map visualization how attention helps in interpreting the model and can pinpoint (detect and localize) symbols on the image accurately despite having been trained without any bounding box data.
△ Less
Submitted 15 June, 2018; v1 submitted 15 February, 2018;
originally announced February 2018.
-
Multilevel Monte Carlo for Smoothing via Transport Methods
Authors:
Jeremie Houssineau,
Ajay Jasra,
Sumeetpal S. Singh
Abstract:
In this article we consider recursive approximations of the smoothing distribution associated to partially observed stochastic differential equations (SDEs), which are observed discretely in time. Such models appear in a wide variety of applications including econometrics, finance and engineering. This problem is notoriously challenging, as the smoother is not available analytically and hence requ…
▽ More
In this article we consider recursive approximations of the smoothing distribution associated to partially observed stochastic differential equations (SDEs), which are observed discretely in time. Such models appear in a wide variety of applications including econometrics, finance and engineering. This problem is notoriously challenging, as the smoother is not available analytically and hence require numerical approximation. This usually consists by applying a time-discretization to the SDE, for instance the Euler method, and then applying a numerical (e.g. Monte Carlo) method to approximate the smoother. This has lead to a vast literature on methodology for solving such problems, perhaps the most popular of which is based upon the particle filter (PF) e.g. [9]. In the context of filtering for this class of problems, it is well-known that the particle filter can be improved upon in terms of cost to achieve a given mean squared error (MSE) for estimates. This in the sense that the computational effort can be reduced to achieve this target MSE, by using multilevel (ML) methods [12, 13, 18], via the multilevel particle filter (MLPF) [16, 20, 21]. For instance, to obtain a MSE of $\mathcal{O}(ε^2)$ for some $ε> 0$ when approximating filtering distributions associated with Euler-discretized diffusions with constant diffusion coefficients, the cost of the PF is $\mathcal{O}(ε^{-3})$ while the cost of the MLPF is $\mathcal{O}(ε^{-2}\log(ε)^2)$. In this article we consider a new approach to replace the particle filter, using transport methods in [27]. In the context of filtering, one expects that the proposed method improves upon the MLPF by yielding, under assumptions, a MSE of $\mathcal{O}(ε^2)$ for a cost of $\mathcal{O}(ε^{-2})$. This is established theoretically in an "ideal" example and numerically in numerous examples.
△ Less
Submitted 14 May, 2018; v1 submitted 8 November, 2017;
originally announced November 2017.
-
Effect of two loop correction in the formation of QGP droplet
Authors:
S. Somorendro Singh
Abstract:
The effect of two loop correction in the formation of quark-gluon plasma (QGP) droplet is studied with the introduction of the two loop correction factor in the mean field potential. Due to the correction factor it shows stability in the droplet formation of QGP indicating at different parametrization factors of the QGP fluid. The correction factor in the potential also shows gluon parameter facto…
▽ More
The effect of two loop correction in the formation of quark-gluon plasma (QGP) droplet is studied with the introduction of the two loop correction factor in the mean field potential. Due to the correction factor it shows stability in the droplet formation of QGP indicating at different parametrization factors of the QGP fluid. The correction factor in the potential also shows gluon parameter factor shifts to a larger value from its earlier value of gluon factor of one loop correction in obtaining the stable droplets. The results show decreasing in the observable QGP droplets and droplet sizes are found to be $1.5-2.0$ fm radii with the two loop correction. It indicates that there is parameter like Reynold's number which can control the dynamics of QGP droplet formation and the stability of droplet in the case of droplet formation with the two loop correction factor.
△ Less
Submitted 28 August, 2017;
originally announced August 2017.
-
Identification of multi-object dynamical systems: consistency and Fisher information
Authors:
Jeremie Houssineau,
Sumeetpal S. Singh,
Ajay Jasra
Abstract:
Learning the model parameters of a multi-object dynamical system from partial and perturbed observations is a challenging task. Despite recent numerical advancements in learning these parameters, theoretical guarantees are extremely scarce. In this article, we study the identifiability of these parameters and the consistency of the corresponding maximum likelihood estimate (MLE) under assumptions…
▽ More
Learning the model parameters of a multi-object dynamical system from partial and perturbed observations is a challenging task. Despite recent numerical advancements in learning these parameters, theoretical guarantees are extremely scarce. In this article, we study the identifiability of these parameters and the consistency of the corresponding maximum likelihood estimate (MLE) under assumptions on the different components of the underlying multi-object system. In order to understand the impact of the various sources of observation noise on the ability to learn the model parameters, we study the asymptotic variance of the MLE through the associated Fisher information matrix. For example, we show that specific aspects of the multi-target tracking (MTT) problem such as detection failures and unknown data association lead to a loss of information which is quantified in special cases of interest.
△ Less
Submitted 13 July, 2017;
originally announced July 2017.
-
Approximate Smoothing and Parameter Estimation in High-Dimensional State-Space Models
Authors:
Axel Finke,
Sumeetpal S. Singh
Abstract:
We present approximate algorithms for performing smoothing in a class of high-dimensional state-space models via sequential Monte Carlo methods ("particle filters"). In high dimensions, a prohibitively large number of Monte Carlo samples ("particles") -- growing exponentially in the dimension of the state space -- is usually required to obtain a useful smoother. Using blocking strategies as in Reb…
▽ More
We present approximate algorithms for performing smoothing in a class of high-dimensional state-space models via sequential Monte Carlo methods ("particle filters"). In high dimensions, a prohibitively large number of Monte Carlo samples ("particles") -- growing exponentially in the dimension of the state space -- is usually required to obtain a useful smoother. Using blocking strategies as in Rebeschini and Van Handel (2015) (and earlier pioneering work on blocking), we exploit the spatial ergodicity properties of the model to circumvent this curse of dimensionality. We thus obtain approximate smoothers that can be computed recursively in time and in parallel in space. First, we show that the bias of our blocked smoother is bounded uniformly in the time horizon and in the model dimension. We then approximate the blocked smoother with particles and derive the asymptotic variance of idealised versions of our blocked particle smoother to show that variance is no longer adversely effected by the dimension of the model. Finally, we employ our method to successfully perform maximum-likelihood estimation via stochastic gradient-ascent and stochastic expectation--maximisation algorithms in a 100-dimensional state-space model.
△ Less
Submitted 20 September, 2017; v1 submitted 28 June, 2016;
originally announced June 2016.
-
Tracking multiple moving objects in images using Markov Chain Monte Carlo
Authors:
Lan Jiang,
Sumeetpal S. Singh
Abstract:
A new Bayesian state and parameter learning algorithm for multiple target tracking (MTT) models with image observations is proposed. Specifically, a Markov chain Monte Carlo algorithm is designed to sample from the posterior distribution of the unknown number of targets, their birth and death times, states and model parameters, which constitutes the complete solution to the tracking problem. The c…
▽ More
A new Bayesian state and parameter learning algorithm for multiple target tracking (MTT) models with image observations is proposed. Specifically, a Markov chain Monte Carlo algorithm is designed to sample from the posterior distribution of the unknown number of targets, their birth and death times, states and model parameters, which constitutes the complete solution to the tracking problem. The conventional approach is to pre-process the images to extract point observations and then perform tracking. We model the image generation process directly to avoid potential loss of information when extracting point observations. Numerical examples show that our algorithm has improved tracking performance over commonly used techniques, for both synthetic examples and real florescent microscopy data, especially in the case of dim targets with overlap** illuminated regions.
△ Less
Submitted 17 March, 2016;
originally announced March 2016.
-
Blocking Strategies and Stability of Particle Gibbs Samplers
Authors:
Sumeetpal S. Singh,
Fredrik Lindsten,
Eric Moulines
Abstract:
Sampling from the conditional (or posterior) probability distribution of the latent states of a Hidden Markov Model, given the realization of the observed process, is a non-trivial problem in the context of Markov Chain Monte Carlo. To do this Andrieu et al. (2010) constructed a Markov kernel which leaves this conditional distribution invariant using a Particle Filter. From a practitioner's point…
▽ More
Sampling from the conditional (or posterior) probability distribution of the latent states of a Hidden Markov Model, given the realization of the observed process, is a non-trivial problem in the context of Markov Chain Monte Carlo. To do this Andrieu et al. (2010) constructed a Markov kernel which leaves this conditional distribution invariant using a Particle Filter. From a practitioner's point of view, this Markov kernel attempts to mimic the act of sampling all the latent state variables as one block from the posterior distribution but for models where exact simulation is not possible. There are some recent theoretical results that establish the uniform ergodicity of this Markov kernel and that the mixing rate does not diminish provided the number of particles grows at least linearly with the number of latent states in the posterior. This gives rise to a cost, per application of the kernel, that is quadratic in the number of latent states which could be prohibitive for long observation sequences. We seek to answer an obvious but important question: is there a different implementation with a cost per-iteration that grows linearly with the number of latent states, but which is still stable in the sense that its mixing rate does not deteriorate? We address this problem using blocking strategies, which are easily parallelizable, and prove stability of the resulting sampler.
△ Less
Submitted 28 September, 2015;
originally announced September 2015.
-
Scaling in topological properties of brain networks
Authors:
Soibam Shyamchand Singh,
Khundrakpam Budhachandra Singh,
Romana Ishrat,
B. Indrajit Sharma,
R. K. Brojen Singh
Abstract:
The organization in brain networks shows highly modular features with weak inter-modular interaction. The topology of the networks involves emergence of modules and sub-modules at different levels of constitution governed by fractal laws. The modular organization, in terms of modular mass, inter-modular, and intra-modular interaction, also obeys fractal nature. The parameters which characterize to…
▽ More
The organization in brain networks shows highly modular features with weak inter-modular interaction. The topology of the networks involves emergence of modules and sub-modules at different levels of constitution governed by fractal laws. The modular organization, in terms of modular mass, inter-modular, and intra-modular interaction, also obeys fractal nature. The parameters which characterize topological properties of brain networks follow one parameter scaling theory in all levels of network structure which reveals the self-similar rules governing the network structure. The calculated fractal dimensions of brain networks of different species are found to decrease when one goes from lower to higher level species which implicates the more ordered and self-organized topography at higher level species. The sparsely distributed hubs in brain networks may be most influencing nodes but their absence may not cause network breakdown, and centrality parameters characterizing them also follow one parameter scaling law indicating self-similar roles of these hubs at different levels of organization in brain networks.
△ Less
Submitted 20 September, 2015;
originally announced September 2015.
-
Particle ancestor sampling for near-degenerate or intractable state transition models
Authors:
Fredrik Lindsten,
Pete Bunch,
Sumeetpal S. Singh,
Thomas B. Schön
Abstract:
We consider Bayesian inference in sequential latent variable models in general, and in nonlinear state space models in particular (i.e., state smoothing). We work with sequential Monte Carlo (SMC) algorithms, which provide a powerful inference framework for addressing this problem. However, for certain challenging and common model classes the state-of-the-art algorithms still struggle. The work is…
▽ More
We consider Bayesian inference in sequential latent variable models in general, and in nonlinear state space models in particular (i.e., state smoothing). We work with sequential Monte Carlo (SMC) algorithms, which provide a powerful inference framework for addressing this problem. However, for certain challenging and common model classes the state-of-the-art algorithms still struggle. The work is motivated in particular by two such model classes: (i) models where the state transition kernel is (nearly) degenerate, i.e. (nearly) concentrated on a low-dimensional manifold, and (ii) models where point-wise evaluation of the state transition density is intractable. Both types of models arise in many applications of interest, including tracking, epidemiology, and econometrics. The difficulties with these types of models is that they essentially rule out forward-backward-based methods, which are known to be of great practical importance, not least to construct computationally efficient particle Markov chain Monte Carlo (PMCMC) algorithms. To alleviate this, we propose a "particle rejuvenation" technique to enable the use of the forward-backward strategy for (nearly) degenerate models and, by extension, for intractable models. We derive the proposed method specifically within the context of PMCMC, but we emphasise that it is applicable to any forward-backward-based Monte Carlo method.
△ Less
Submitted 23 May, 2015;
originally announced May 2015.