-
Lexical Simplification using multi level and modular approach
Authors:
Nikita Katyal,
Pawan Kumar Rajpoot
Abstract:
Text Simplification is an ongoing problem in Natural Language Processing, solution to which has varied implications. In conjunction with the TSAR-2022 Workshop @EMNLP2022 Lexical Simplification is the process of reducing the lexical complexity of a text by replacing difficult words with easier to read (or understand) expressions while preserving the original information and meaning. This paper exp…
▽ More
Text Simplification is an ongoing problem in Natural Language Processing, solution to which has varied implications. In conjunction with the TSAR-2022 Workshop @EMNLP2022 Lexical Simplification is the process of reducing the lexical complexity of a text by replacing difficult words with easier to read (or understand) expressions while preserving the original information and meaning. This paper explains the work done by our team "teamPN" for English sub task. We created a modular pipeline which combines modern day transformers based models with traditional NLP methods like paraphrasing and verb sense disambiguation. We created a multi level and modular pipeline where the target text is treated according to its semantics(Part of Speech Tag). Pipeline is multi level as we utilize multiple source models to find potential candidates for replacement, It is modular as we can switch the source models and their weight-age in the final re-ranking.
△ Less
Submitted 3 February, 2023;
originally announced February 2023.
-
Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding
Authors:
Leonid Boytsov,
David Akinpelu,
Tianyi Lin,
Fangwei Gao,
Yutian Zhao,
Jeffrey Huang,
Nipun Katyal,
Eric Nyberg
Abstract:
We evaluated 20+ Transformer models for ranking of long documents (including recent LongP models trained with FlashAttention) and compared them with a simple FirstP baseline, which applies the same model to the truncated input (at most 512 tokens). We used MS MARCO Documents v1 as a primary training set and evaluated both the zero-shot transferred and fine-tuned models.
On MS MARCO, TREC DLs, an…
▽ More
We evaluated 20+ Transformer models for ranking of long documents (including recent LongP models trained with FlashAttention) and compared them with a simple FirstP baseline, which applies the same model to the truncated input (at most 512 tokens). We used MS MARCO Documents v1 as a primary training set and evaluated both the zero-shot transferred and fine-tuned models.
On MS MARCO, TREC DLs, and Robust04 no long-document model outperformed FirstP by more than 5% in NDCG and MRR (when averaged over all test sets). We conjectured this was not due to models' inability to process long context, but due to a positional bias of relevant passages, whose distribution was skewed towards the beginning of documents. We found direct evidence of this bias in some test sets, which motivated us to create MS MARCO FarRelevant (based on MS MARCO Passages) where the relevant passages were not present among the first 512 tokens.
Unlike standard collections where we saw both little benefit from incorporating longer contexts and limited variability in model performance (within a few %), experiments on MS MARCO FarRelevant uncovered dramatic differences among models. The FirstP models performed roughly at the random-baseline level in both zero-shot and fine-tuning scenarios. Simple aggregation models including MaxP and PARADE Attention had good zero-shot accuracy, but benefited little from fine-tuning. Most other models had poor zero-shot performance (sometimes at a random baseline level), but outstripped MaxP by as much as 13-28% after fine-tuning. Thus, the positional bias not only diminishes benefits of processing longer document contexts, but also leads to model overfitting to positional bias and performing poorly in a zero-shot setting when the distribution of relevant passages changes substantially. We make our software and data available.
△ Less
Submitted 16 June, 2024; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Planetary interior and habitability of exoplanets: Recent developments
Authors:
Nisha Katyal
Abstract:
This article deals with the most recent developments in the field of exoplanetary science connecting the interior of the planets with their habitability. In this issue, I have specified the importance of interior dynamics and briefly reviewed some of the main factors by which interior of a planet can effect the habitability of extra-solar planets.
This article deals with the most recent developments in the field of exoplanetary science connecting the interior of the planets with their habitability. In this issue, I have specified the importance of interior dynamics and briefly reviewed some of the main factors by which interior of a planet can effect the habitability of extra-solar planets.
△ Less
Submitted 29 April, 2021;
originally announced April 2021.
-
Effect of mantle oxidation state and escape upon the evolution of Earth's magma ocean atmosphere
Authors:
Nisha Katyal,
Gianluigi Ortenzi,
John Lee Grenfell,
Lena Noack,
Frank Sohl,
Mareike Godolt,
Antonio García Muñoz,
Franz Schreier,
Fabian Wunderlich,
Heike Rauer
Abstract:
The magma ocean period was a critical phase determining how Earth atmosphere developed into habitability. However there are major uncertainties in the role of key processes such as outgassing from the planetary interior and escape of species to space that play a major role in determining the atmosphere of early Earth. We investigate the influence of outgassing of various species and escape of H…
▽ More
The magma ocean period was a critical phase determining how Earth atmosphere developed into habitability. However there are major uncertainties in the role of key processes such as outgassing from the planetary interior and escape of species to space that play a major role in determining the atmosphere of early Earth. We investigate the influence of outgassing of various species and escape of H$_2$ for different mantle redox states upon the composition and evolution of the atmosphere for the magma ocean period. We include an important new atmosphere-interior coupling mechanism namely the redox evolution of the mantle which strongly affects the outgassing of species. We simulate the volatile outgassing and chemical speciation at the surface for various redox states of the mantle by employing a C-H-O based chemical speciation model combined with an interior outgassing model. We then apply a line-by-line radiative transfer model to study the remote appearance of the planet in terms of the infrared emission and transmission. Finally, we use a parameterized diffusion-limited and XUV energy-driven atmospheric escape model to calculate the loss of H$_2$ to space. We have simulated the thermal emission and transmission spectra for reduced or oxidized atmospheres present during the magma ocean period of Earth. Reduced or thin atmospheres consisting of H$_2$ in abundance emit more radiation to space and have larger effective height as compared to oxidized or thick atmospheres which are abundant in H$_2$O and CO$_2$. We obtain the outgassing rates of H2 from the mantle into the atmosphere to be a factor of ten times larger than the rates of diffusion-limited escape to space. Our work presents useful insight into the development of Earth atmosphere during the magma ocean period as well as input to guide future studies discussing exoplanetary interior compositions.
△ Less
Submitted 5 November, 2020; v1 submitted 30 September, 2020;
originally announced September 2020.
-
Coarsening Dynamics in the Vicsek Model of Active Matter
Authors:
Nisha Katyal,
Supravat Dey,
Dibyendu Das,
Sanjay Puri
Abstract:
We study the flocking model introduced by Vicsek in the "coarsening" regime. At standard self-propulsion speeds, we find two distinct growth laws for the coupled density and velocity fields. The characteristic length scale of the density domains grows as $L_ρ(t) \sim t^{θ_ρ}$ (with $θ_ρ\simeq 0.25$), while the velocity length scale grows much faster, $viz.$, $L_{v}(t) \sim t^{θ_v}$ (with…
▽ More
We study the flocking model introduced by Vicsek in the "coarsening" regime. At standard self-propulsion speeds, we find two distinct growth laws for the coupled density and velocity fields. The characteristic length scale of the density domains grows as $L_ρ(t) \sim t^{θ_ρ}$ (with $θ_ρ\simeq 0.25$), while the velocity length scale grows much faster, $viz.$, $L_{v}(t) \sim t^{θ_v}$ (with $θ_v \simeq 0.83$). The spatial fluctuations in the density and velocity fields are studied by calculating the two-point correlation function and the structure factor, which show deviations from the well-known Porod's law. This is a natural consequence of scattering from irregular morphologies that dynamically arise in the system. At large values of the scaled wave-vector, the scaled structure factors for the density and velocity fields decay with powers $-2.6$ and $-1.52$, respectively.
△ Less
Submitted 6 December, 2019;
originally announced December 2019.
-
What factors affect the duration and outgassing of the terrestrial magma ocean?
Authors:
Athanasia Nikolaou,
Nisha Katyal,
Nicola Tosi,
Mareike Godolt,
John Lee Grenfell,
Heike Rauer
Abstract:
The magma ocean (MO) is a crucial stage in the build-up of terrestrial planets. Its solidification and the accompanying outgassing of volatiles set the conditions for important processes occurring later or even simultaneously, such as solid-state mantle convection and atmospheric escape. To constrain the duration of a global-scale Earth MO we have built and applied a 1D interior model coupled alte…
▽ More
The magma ocean (MO) is a crucial stage in the build-up of terrestrial planets. Its solidification and the accompanying outgassing of volatiles set the conditions for important processes occurring later or even simultaneously, such as solid-state mantle convection and atmospheric escape. To constrain the duration of a global-scale Earth MO we have built and applied a 1D interior model coupled alternatively with a grey H2O/CO2 atmosphere or with a pure H2O atmosphere treated with a line-by-line model described in a companion paper by Katyal et al. (2019). We study in detail the effects of several factors affecting the MO lifetime, such as the initial abundance of H2O and CO2, the convection regime, the viscosity, the mantle melting temperature, and the longwave radiation absorption from the atmosphere. In this specifically multi-variable system we assess the impact of each factor with respect to a reference setting commonly assumed in the literature. We find that the MO stage can last from a few thousand to several million years. By coupling the interior model with the line-by-line atmosphere model, we identify the conditions that determine whether the planet experiences a transient magma ocean or it ceases to cool and maintains a continuous magma ocean. We find a dependence of this distinction simultaneously on the mass of the outgassed H2O atmosphere and on the MO surface melting temperature. We discuss their combined impact on the MO's lifetime in addition to the known dependence on albedo, orbital distance and stellar luminosity and we note observational degeneracies that arise thereby for target exoplanets.
△ Less
Submitted 18 March, 2019;
originally announced March 2019.
-
Evolution and Spectral Response of a Steam Atmosphere for Early Earth with a coupled climate-interior model
Authors:
Nisha Katyal,
Athanasia Nikolaou,
Mareike Godolt,
John Lee Grenfell,
Nicola Tosi,
Franz Schreier,
Heike Rauer
Abstract:
The evolution of Earth's early atmosphere and the emergence of habitable conditions on our planet are intricately coupled with the development and duration of the magma ocean phase during the early Hadean period (4 to 4.5 Ga). In this paper, we deal with the evolution of the steam atmosphere during the magma ocean period. We obtain the outgoing longwave radiation using a line-by-line radiative tra…
▽ More
The evolution of Earth's early atmosphere and the emergence of habitable conditions on our planet are intricately coupled with the development and duration of the magma ocean phase during the early Hadean period (4 to 4.5 Ga). In this paper, we deal with the evolution of the steam atmosphere during the magma ocean period. We obtain the outgoing longwave radiation using a line-by-line radiative transfer code GARLIC. Our study suggests that an atmosphere consisting of pure H$_{2}$O, built as a result of outgassing extends the magma ocean lifetime to several million years. The thermal emission as a function of solidification timescale of magma ocean is shown. We study the effect of thermal dissociation of H$_{2}$O at higher temperatures by applying atmospheric chemical equilibrium which results in the formation of H$_{2}$ and O$_{2}$ during the early phase of the magma ocean. A 1-6\% reduction in the OLR is seen. We also obtain the effective height of the atmosphere by calculating the transmission spectra for the whole duration of the magma ocean. An atmosphere of depth ~100 km is seen for pure water atmospheres. The effect of thermal dissociation on the effective height of the atmosphere is also shown. Due to the difference in the absorption behavior at different altitudes, the spectral features of H$_{2}$ and O$_{2}$ are seen at different altitudes of the atmosphere. Therefore, these species along with H$_{2}$O have a significant contribution to the transmission spectra and could be useful for placing observational constraints upon magma ocean exoplanets.
△ Less
Submitted 11 March, 2019;
originally announced March 2019.
-
Robustness of the Fractal Regime for the Multiple-Scattering Structure Factor
Authors:
Nisha Katyal,
Robert Botet,
Sanjay Puri
Abstract:
In the single-scattering theory of electromagnetic radiation, the {\it fractal regime} is a definite range in the photon momentum-transfer $q$, which is characterized by the scaling-law behavior of the structure factor: $S(q) \propto 1/q^{d_f}$. This allows a straightforward estimation of the fractal dimension $d_f$ of aggregates in {\it Small-Angle X-ray Scattering} (SAXS) experiments. However, t…
▽ More
In the single-scattering theory of electromagnetic radiation, the {\it fractal regime} is a definite range in the photon momentum-transfer $q$, which is characterized by the scaling-law behavior of the structure factor: $S(q) \propto 1/q^{d_f}$. This allows a straightforward estimation of the fractal dimension $d_f$ of aggregates in {\it Small-Angle X-ray Scattering} (SAXS) experiments. However, this behavior is not commonly studied in optical scattering experiments because of the lack of information on its domain of validity. In the present work, we propose a definition of the multiple-scattering structure factor, which naturally generalizes the single-scattering function $S(q)$. We show that the mean-field theory of electromagnetic scattering provides an explicit condition to interpret the significance of multiple scattering. In this paper, we investigate and discuss electromagnetic scattering by three classes of fractal aggregates. The results obtained from the TMatrix method show that the fractal scaling range is divided into two domains: 1) a genuine fractal regime, which is robust; 2) a possible anomalous scaling regime, $S(q) \propto 1/q^δ$, with exponent $δ$ independent of $d_f$, and related to the way the scattering mechanism uses the local morphology of the scatterer. The recognition, and an analysis, of the latter domain is of importance because it may result in significant reduction of the fractal regime, and brings into question the proper mechanism in the build-up of multiple-scattering.
△ Less
Submitted 8 March, 2016;
originally announced March 2016.
-
Fractal Signatures in Analogs of Interplanetary Dust Particles
Authors:
Nisha Katyal,
Varsha Banerjee,
Sanjay Puri
Abstract:
Interplanetary dust particles (IDPs) are an important constituent of the earth's stratosphere, interstellar and interplanetary medium, cometary comae and tails, etc. Their physical and optical characteristics are significantly influenced by the morphology of silicate aggregates which form the core in IDPs. In this paper we reinterpret scattering data from laboratory analogs of cosmic silicate aggr…
▽ More
Interplanetary dust particles (IDPs) are an important constituent of the earth's stratosphere, interstellar and interplanetary medium, cometary comae and tails, etc. Their physical and optical characteristics are significantly influenced by the morphology of silicate aggregates which form the core in IDPs. In this paper we reinterpret scattering data from laboratory analogs of cosmic silicate aggregates created by Volten et al. \cite{volten2007}, to extract their morphological features. By evaluating the structure factor, we find that the aggregates are mass fractals with a mass fractal dimension $d_{m} \simeq 1.75$. The same fractal dimension also characterizes clusters obtained from {\it diffusion limited aggregation} (DLA). This suggests that the analogs are formed by an irreversible aggregation of stochastically-transported silicate particles
△ Less
Submitted 28 February, 2014;
originally announced February 2014.
-
Interstellar Dust models towards some IUE stars
Authors:
Nisha Katyal,
Ranjan Gupta,
D B Vaidya
Abstract:
We study the extinction properties of the composite dust grains, consisting of host silicate spheroids and graphite as inclusions, using discrete dipole approximation (DDA). We calculate the extinction cross sections of the composite grains in the ultraviolet spectral region, 1200Å-3200Åand study the variation in extinction as a function of the volume fraction of the inclusions. We compare the mod…
▽ More
We study the extinction properties of the composite dust grains, consisting of host silicate spheroids and graphite as inclusions, using discrete dipole approximation (DDA). We calculate the extinction cross sections of the composite grains in the ultraviolet spectral region, 1200Å-3200Åand study the variation in extinction as a function of the volume fraction of the inclusions. We compare the model extinction curves with the observed interstellar extinction curves obtained from the data given by the International Ultraviolet Explorer (IUE) satellite. Our results for the composite grains show a distinct variation in the extinction efficiencies with the variation in the volume fraction of the inclusions. In particular, it is found that the wavelength of peak absorption at `2175Å' shifts towards the longer wavelength with the variation in the volume fraction of inclusions. We find that the composite grain models with the axial ratios viz. 1.33 and 2.0 fit the observed extinction reasonably well with a grain size distribution, a = 0.005-0.250$μm$. Moreover, our results of the composite grains clearly indicate that the inhomogeneity in the grain structure, composition and the surrounding media modifies the extinction properties of the grains.
△ Less
Submitted 2 October, 2013;
originally announced October 2013.
-
Interstellar Grains: Effect of Inclusions on Extinction
Authors:
Nisha Katyal,
Ranjan Gupta,
D. B. Vaidya
Abstract:
A composite dust grain model which simultaneously explains the observed interstellar extinction, polarization, IR emission and the abundance constraints, is required. We present a composite grain model, which is made up of a host silicate oblate spheroid and graphite inclusions. The interstellar extinction curve is evaluated in the spectral region 3.4-0.1$μm$ using the extinction efficiencies of t…
▽ More
A composite dust grain model which simultaneously explains the observed interstellar extinction, polarization, IR emission and the abundance constraints, is required. We present a composite grain model, which is made up of a host silicate oblate spheroid and graphite inclusions. The interstellar extinction curve is evaluated in the spectral region 3.4-0.1$μm$ using the extinction efficiencies of the composite spheroidal grains for three axial ratios. Extinction curves are computed using the discrete dipole approximation (DDA). The model curves are subsequently compared with the average observed interstellar extinction curve and with an extinction curve derived from the IUE catalogue data.
△ Less
Submitted 21 June, 2011;
originally announced June 2011.