Search | arXiv e-print repository

Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks

Authors: Avinash Anand, Mohit Gupta, Kritarth Prasad, Navya Singla, Sanjana Sanjeev, Jatin Kumar, Adarsh Raj Shivam, Rajiv Ratn Shah

Abstract: The rapid progress in the field of natural language processing (NLP) systems and the expansion of large language models (LLMs) have opened up numerous opportunities in the field of education and instructional methods. These advancements offer the potential for tailored learning experiences and immediate feedback, all delivered through accessible and cost-effective services. One notable application… ▽ More The rapid progress in the field of natural language processing (NLP) systems and the expansion of large language models (LLMs) have opened up numerous opportunities in the field of education and instructional methods. These advancements offer the potential for tailored learning experiences and immediate feedback, all delivered through accessible and cost-effective services. One notable application area for this technological advancement is in the realm of solving mathematical problems. Mathematical problem-solving not only requires the ability to decipher complex problem statements but also the skill to perform precise arithmetic calculations at each step of the problem-solving process. However, the evaluation of the arithmetic capabilities of large language models remains an area that has received relatively little attention. In response, we introduce an extensive mathematics dataset called "MathQuest" sourced from the 11th and 12th standard Mathematics NCERT textbooks. This dataset encompasses mathematical challenges of varying complexity and covers a wide range of mathematical concepts. Utilizing this dataset, we conduct fine-tuning experiments with three prominent LLMs: LLaMA-2, WizardMath, and MAmmoTH. These fine-tuned models serve as benchmarks for evaluating their performance on our dataset. Our experiments reveal that among the three models, MAmmoTH-13B emerges as the most proficient, achieving the highest level of competence in solving the presented mathematical problems. Consequently, MAmmoTH-13B establishes itself as a robust and dependable benchmark for addressing NCERT mathematics problems. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: 10 pages, 3 figures, NeurIPS 2023 Workshop on Generative AI for Education (GAIED)

Journal ref: NeurIPS 2023 Workshop on Generative AI for Education (GAIED)

arXiv:2107.02536 [pdf, other]

doi 10.1142/S0217751X21501487

Accelerating Universe with binary mixture of bulk viscous fluid and dark energy

Authors: Nishant Singla, M. K. Gupta, Anil Kumar Yadav, G. K. Goswami

Abstract: In this paper, we have proposed a model of accelerating Universe with binary mixture of bulk viscous fluid and dark energy. and probed the model parameters: present values of Hubble's constant $H_{0}$, Equation of state paper of dark energy $ω_{de}$ and density parameter of dark energy $(Ω_{de})_{0}$ with recent OHD as well as joint Pantheon compilation of SN Ia data and OHD. Using cosmic chronome… ▽ More In this paper, we have proposed a model of accelerating Universe with binary mixture of bulk viscous fluid and dark energy. and probed the model parameters: present values of Hubble's constant $H_{0}$, Equation of state paper of dark energy $ω_{de}$ and density parameter of dark energy $(Ω_{de})_{0}$ with recent OHD as well as joint Pantheon compilation of SN Ia data and OHD. Using cosmic chronometric technique, we obtain $H_{0} = 69.80 \pm 1.64~km~s^{-1}Mpc^{-1}$ and $70.0258 \pm 1.72~km~s^{-1}Mpc^{-1}$ by restricting our derived model with recent OHD and joint Pantheon compilation SN Ia data and OHD respectively. The age of the Universe in derived model is estimated as $t_{0} = 13.82 \pm 0.33\; Gyrs$. Also, we observe that derived model represents a model of transitioning Universe with transition redshift $z_{t} = 0.7286$. We have constrained the present value of jerk parameter as $j_{0} = 0.969 \pm 0.0075$ with joint OHD and Pantheon data. From this analysis, we observed that the model of the Universe, presented in this paper shows a marginal departure from $Λ$CDM model. △ Less

Submitted 6 July, 2021; originally announced July 2021.

Comments: 10 Pages, 7 Figures, Accepted in Int. J. Mod. Phys. A

Journal ref: International Journal of Modern Physics A 36, 2150148 (2021)

arXiv:1909.08985 [pdf, other]

doi 10.1134/S0202289320020103

Accelerating model of flat universe in $f(R,T)$ gravity

Authors: Nishant Singla, Mukesh Kumar Gupta, Anil Kumar Yadav

Abstract: The $f(R,T)$ theory of gravitation is an extended theory of gravitation in which the gravitational action contains both the Ricci scalar $R$ and the trace of energy momentum tensor $T$ and hence the cosmological models based on $f(R,T)$ gravity are eligible to describing late time acceleration of present universe. In this paper, we investigate an accelerating model of flat universe with linearly v… ▽ More The $f(R,T)$ theory of gravitation is an extended theory of gravitation in which the gravitational action contains both the Ricci scalar $R$ and the trace of energy momentum tensor $T$ and hence the cosmological models based on $f(R,T)$ gravity are eligible to describing late time acceleration of present universe. In this paper, we investigate an accelerating model of flat universe with linearly varying deceleration parameter (LVDP). We apply the linearly time varying law for deceleration parameters that generates a model of transitioning universe from early decelerating phase to current accelerating phase. We carry out the state-finder and Om(z) analysis, and obtain that LVDP model have consistency with astrophysical observations. We also discuss profoundly the violation of energy-momentum conservation law in $f(R,T)$ gravity and dynamical behavior of the model. △ Less

Submitted 4 December, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

Comments: 11 Pages and 11 Figure panels

Journal ref: Gravitation and Cosmology 26 (2), 144 (2020)

arXiv:1903.06056 [pdf]

doi 10.1016/j.optlastec.2020.106335

Deep learning enabled multi-wavelength spatial coherence microscope for the classification of malaria-infected stages with limited labelled data size

Authors: Neeru Singla, Vishal Srivastava

Abstract: Malaria is a life-threatening mosquito-borne blood disease, hence early detection is very crucial for health. The conventional method for the detection is a microscopic examination of Giemsa-stained blood smears, which needs a highly trained skilled technician. Automated classifications of different stages of malaria still a challenging task, especially having poor sensitivity in detecting the ear… ▽ More Malaria is a life-threatening mosquito-borne blood disease, hence early detection is very crucial for health. The conventional method for the detection is a microscopic examination of Giemsa-stained blood smears, which needs a highly trained skilled technician. Automated classifications of different stages of malaria still a challenging task, especially having poor sensitivity in detecting the early trophozoite and late trophozoite or schizont stage with limited labelled datasize. The study aims to develop a fast, robust and fully automated system for the classification of different stages of malaria with limited data size by using the pre-trained convolutional neural networks (CNNs) as a classifier and multi-wavelength to increase the sample size. We also compare our customized CNN with other well-known CNNs and shows that our network have a comparable performance with less computational time. We believe that our proposed method can be applied to other limited labelled biological datasets. △ Less

Submitted 14 March, 2019; originally announced March 2019.

arXiv:cs/0509010 [pdf, ps, other]

doi 10.1109/ISIT.2004.1365168

Minimum Mean-Square-Error Equalization using Priors for Two-Dimensional Intersymbol Interference

Authors: N. Singla, J. A. O'Sullivan

Abstract: Joint equalization and decoding schemes are described for two-dimensional intersymbol interference (ISI) channels. Equalization is performed using the minimum mean-square-error (MMSE) criterion. Low-density parity-check codes are used for error correction. The MMSE schemes are the extension of those proposed by Tuechler et al. (2002) for one-dimensional ISI channels. Extrinsic information transf… ▽ More Joint equalization and decoding schemes are described for two-dimensional intersymbol interference (ISI) channels. Equalization is performed using the minimum mean-square-error (MMSE) criterion. Low-density parity-check codes are used for error correction. The MMSE schemes are the extension of those proposed by Tuechler et al. (2002) for one-dimensional ISI channels. Extrinsic information transfer charts, density evolution, and bit-error rate versus signal-to-noise ratio curves are used to study the performance of the schemes. △ Less

Submitted 4 September, 2005; originally announced September 2005.

Comments: 12 pages, 4 figures, submitted to IEEE Transactions on Communications

arXiv:cs/0509009 [pdf, ps, other]

Joint Equalization and Decoding for Nonlinear Two-Dimensional Intersymbol Interference Channels

Authors: N. Singla, J. A. O'Sullivan

Abstract: An algorithm that performs joint equalization and decoding for channels with nonlinear two-dimensional intersymbol interference is presented. The algorithm performs sum-product message-passing on a factor graph that represents the underlying system. The two-dimensional optical storage (TwoDOS) technology is an example of a system with nonlinear two-dimensional intersymbol interference. Simulatio… ▽ More An algorithm that performs joint equalization and decoding for channels with nonlinear two-dimensional intersymbol interference is presented. The algorithm performs sum-product message-passing on a factor graph that represents the underlying system. The two-dimensional optical storage (TwoDOS) technology is an example of a system with nonlinear two-dimensional intersymbol interference. Simulations for the nonlinear channel model of TwoDOS show significant improvement in performance over uncoded performance. Noise tolerance thresholds for the TwoDOS channel computed using density evolution are also presented. △ Less

Submitted 4 September, 2005; originally announced September 2005.

Comments: 5 pages, 3 figures, 2005 International Symposium on Information Theory

arXiv:cs/0509008 [pdf, ps, other]

Joint Equalization and Decoding for Nonlinear Two-Dimensional Intersymbol Interference Channels with Application to Optical Storage

Authors: N. Singla, J. A. O'Sullivan

Abstract: An algorithm that performs joint equalization and decoding for nonlinear two-dimensional intersymbol interference channels is presented. The algorithm performs sum-product message-passing on a factor graph that represents the underlying system. The two-dimensional optical storage (TWODOS) technology is an example of a system with nonlinear two-dimensional intersymbol interference. Simulations fo… ▽ More An algorithm that performs joint equalization and decoding for nonlinear two-dimensional intersymbol interference channels is presented. The algorithm performs sum-product message-passing on a factor graph that represents the underlying system. The two-dimensional optical storage (TWODOS) technology is an example of a system with nonlinear two-dimensional intersymbol interference. Simulations for the nonlinear channel model of TWODOS show significant improvement in performance over uncoded performance. Noise tolerance thresholds for the algorithm for the TWODOS channel, computed using density evolution, are also presented and accurately predict the limiting performance of the algorithm as the codeword length increases. △ Less

Submitted 4 September, 2005; originally announced September 2005.

Comments: 12 pages, 4 figures, submitted to IEEE Transactions on Communications

Showing 1–7 of 7 results for author: Singla, N