-
Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks
Authors:
Avinash Anand,
Mohit Gupta,
Kritarth Prasad,
Navya Singla,
Sanjana Sanjeev,
Jatin Kumar,
Adarsh Raj Shivam,
Rajiv Ratn Shah
Abstract:
The rapid progress of natural language processing (NLP) systems and the expansion of large language models (LLMs) have opened up numerous opportunities in education and instructional methods. These advancements offer the potential for tailored learning experiences and immediate feedback, all delivered through accessible and cost-effective services. One notable application area is solving mathematical problems. Mathematical problem-solving requires not only the ability to decipher complex problem statements but also the skill to perform precise arithmetic calculations at each step of the problem-solving process. However, the evaluation of the arithmetic capabilities of large language models has received relatively little attention. In response, we introduce an extensive mathematics dataset called "MathQuest" sourced from the 11th and 12th standard Mathematics NCERT textbooks. This dataset encompasses mathematical challenges of varying complexity and covers a wide range of mathematical concepts. Using this dataset, we conduct fine-tuning experiments with three prominent LLMs: LLaMA-2, WizardMath, and MAmmoTH. These fine-tuned models serve as benchmarks for evaluating performance on our dataset. Our experiments reveal that, among the three models, MAmmoTH-13B is the most proficient at solving the presented mathematical problems. Consequently, MAmmoTH-13B establishes itself as a robust and dependable benchmark for addressing NCERT mathematics problems.
Submitted 19 April, 2024;
originally announced April 2024.
-
Accelerating Universe with binary mixture of bulk viscous fluid and dark energy
Authors:
Nishant Singla,
M. K. Gupta,
Anil Kumar Yadav,
G. K. Goswami
Abstract:
In this paper, we propose a model of an accelerating Universe with a binary mixture of bulk viscous fluid and dark energy, and probe the model parameters: the present value of the Hubble constant $H_{0}$, the equation of state parameter of dark energy $ω_{de}$, and the density parameter of dark energy $(Ω_{de})_{0}$, using recent OHD as well as the joint Pantheon compilation of SN Ia data and OHD. Using the cosmic chronometric technique, we obtain $H_{0} = 69.80 \pm 1.64~km~s^{-1}Mpc^{-1}$ and $70.0258 \pm 1.72~km~s^{-1}Mpc^{-1}$ by constraining the derived model with recent OHD and with the joint Pantheon compilation of SN Ia data and OHD, respectively. The age of the Universe in the derived model is estimated as $t_{0} = 13.82 \pm 0.33\; Gyrs$. We also observe that the derived model represents a transitioning Universe with transition redshift $z_{t} = 0.7286$. We constrain the present value of the jerk parameter as $j_{0} = 0.969 \pm 0.0075$ with joint OHD and Pantheon data. From this analysis, we observe that the model of the Universe presented in this paper shows a marginal departure from the $Λ$CDM model.
Submitted 6 July, 2021;
originally announced July 2021.
-
Accelerating model of flat universe in $f(R,T)$ gravity
Authors:
Nishant Singla,
Mukesh Kumar Gupta,
Anil Kumar Yadav
Abstract:
The $f(R,T)$ theory of gravitation is an extended theory of gravitation in which the gravitational action contains both the Ricci scalar $R$ and the trace of the energy-momentum tensor $T$; hence cosmological models based on $f(R,T)$ gravity are capable of describing the late-time acceleration of the present universe. In this paper, we investigate an accelerating model of a flat universe with a linearly varying deceleration parameter (LVDP). We apply the linearly time-varying law for the deceleration parameter, which generates a model of a universe transitioning from an early decelerating phase to the current accelerating phase. We carry out the state-finder and Om(z) analyses and find that the LVDP model is consistent with astrophysical observations. We also discuss in detail the violation of the energy-momentum conservation law in $f(R,T)$ gravity and the dynamical behavior of the model.
Submitted 4 December, 2019; v1 submitted 16 September, 2019;
originally announced September 2019.
-
Deep learning enabled multi-wavelength spatial coherence microscope for the classification of malaria-infected stages with limited labelled data size
Authors:
Neeru Singla,
Vishal Srivastava
Abstract:
Malaria is a life-threatening mosquito-borne blood disease, so early detection is crucial. The conventional detection method is microscopic examination of Giemsa-stained blood smears, which requires a highly trained technician. Automated classification of the different stages of malaria remains a challenging task, with particularly poor sensitivity in detecting the early trophozoite and late trophozoite or schizont stages when labelled data are limited. This study aims to develop a fast, robust, and fully automated system for classifying the different stages of malaria with limited data, using pre-trained convolutional neural networks (CNNs) as classifiers and multiple wavelengths to increase the sample size. We also compare our customized CNN with other well-known CNNs and show that our network achieves comparable performance with less computational time. We believe that our proposed method can be applied to other biological datasets with limited labels.
Submitted 14 March, 2019;
originally announced March 2019.
-
Minimum Mean-Square-Error Equalization using Priors for Two-Dimensional Intersymbol Interference
Authors:
N. Singla,
J. A. O'Sullivan
Abstract:
Joint equalization and decoding schemes are described for two-dimensional intersymbol interference (ISI) channels. Equalization is performed using the minimum mean-square-error (MMSE) criterion. Low-density parity-check codes are used for error correction. The MMSE schemes are the extension of those proposed by Tuechler et al. (2002) for one-dimensional ISI channels. Extrinsic information transfer charts, density evolution, and bit-error rate versus signal-to-noise ratio curves are used to study the performance of the schemes.
Submitted 4 September, 2005;
originally announced September 2005.
-
Joint Equalization and Decoding for Nonlinear Two-Dimensional Intersymbol Interference Channels
Authors:
N. Singla,
J. A. O'Sullivan
Abstract:
An algorithm that performs joint equalization and decoding for channels with nonlinear two-dimensional intersymbol interference is presented. The algorithm performs sum-product message-passing on a factor graph that represents the underlying system. The two-dimensional optical storage (TwoDOS) technology is an example of a system with nonlinear two-dimensional intersymbol interference. Simulations for the nonlinear channel model of TwoDOS show significant improvement in performance over uncoded performance. Noise tolerance thresholds for the TwoDOS channel computed using density evolution are also presented.
Submitted 4 September, 2005;
originally announced September 2005.
-
Joint Equalization and Decoding for Nonlinear Two-Dimensional Intersymbol Interference Channels with Application to Optical Storage
Authors:
N. Singla,
J. A. O'Sullivan
Abstract:
An algorithm that performs joint equalization and decoding for nonlinear two-dimensional intersymbol interference channels is presented. The algorithm performs sum-product message-passing on a factor graph that represents the underlying system. The two-dimensional optical storage (TWODOS) technology is an example of a system with nonlinear two-dimensional intersymbol interference. Simulations for the nonlinear channel model of TWODOS show significant improvement in performance over uncoded performance. Noise tolerance thresholds for the algorithm for the TWODOS channel, computed using density evolution, are also presented and accurately predict the limiting performance of the algorithm as the codeword length increases.
Submitted 4 September, 2005;
originally announced September 2005.