Skip to main content

Showing 1–43 of 43 results for author: Dandapat, S

.
  1. arXiv:2405.06551  [pdf, other

    cs.CL cs.SI

    ADSumm: Annotated Ground-truth Summary Datasets for Disaster Tweet Summarization

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: Online social media platforms, such as Twitter, provide valuable information during disaster events. Existing tweet disaster summarization approaches provide a summary of these events to aid government agencies, humanitarian organizations, etc., to ensure effective disaster response. In the literature, there are two types of approaches for disaster summarization, namely, supervised and unsupervise… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  2. arXiv:2405.06541  [pdf, other

    cs.CL cs.SI

    ATSumm: Auxiliary information enhanced approach for abstractive disaster Tweet Summarization with sparse training data

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: The abundance of situational information on Twitter poses a challenge for users to manually discern vital and relevant information during disasters. A concise and human-interpretable overview of this information helps decision-makers in implementing efficient and quick disaster response. Existing abstractive summarization approaches can be categorized as sentence-based or key-phrase-based approach… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  3. arXiv:2402.03472  [pdf, other

    astro-ph.HE astro-ph.GA gr-qc

    Efficient prescription to search for linear gravitational wave memory from hyperbolic black hole encounters and its application to the NANOGrav 12.5-year dataset

    Authors: Subhajit Dandapat, Abhimanyu Susobhanan, Lankeswar Dey, A. Gopakumar, Paul T. Baker, Philippe Jetzer

    Abstract: Burst with memory events are potential transient gravitational wave sources for the maturing pulsar timing array (PTA) efforts. We provide a computationally efficient prescription to model pulsar timing residuals induced by supermassive black hole pairs in general relativistic hyperbolic trajectories employing a Keplerian-type parametric solution. Injection studies have been pursued on the resulti… ▽ More

    Submitted 16 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 20 pages, 11 figures

    Journal ref: Phys. Rev. D 109, 103018, 2024

  4. arXiv:2401.06810  [pdf, other

    cs.AI

    TONE: A 3-Tiered ONtology for Emotion analysis

    Authors: Srishti Gupta, Piyush Kumar Garg, Sourav Kumar Dandapat

    Abstract: Emotions have played an important part in many sectors, including psychology, medicine, mental health, computer science, and so on, and categorizing them has proven extremely useful in separating one emotion from another. Emotions can be classified using the following two methods: (1) The supervised method's efficiency is strongly dependent on the size and domain of the data collected. A categoriz… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  5. Low-frequency pulse-jitter measurement with the uGMRT I : PSR J0437$-$4715

    Authors: Tomonosuke Kikunaga, Shinnosuke Hisano, Neelam Dhanda Batra, Shantanu Desai, Bhal Chandra Joshi, Manjari Bagchi, T. Prabu, Keitaro Takahashi, Swetha Arumugam, Adarsh Bathula, Subhajit Dandapat, Debabrata Deb, Churchil Dwivedi, Yashwant Gupta, Shebin Jose Jacob, Fazal Kareem, Nobleson K, Pragna Mamidipaka, Avinash Kumar Paladi, Arul Pandian B, Prerna Rana, Jaikhomba Singha, Aman Srivastava, Mayuresh Surnis, Pratik Tarafdar

    Abstract: High-precision pulsar timing observations are limited in their accuracy by the jitter noise that appears in the arrival time of pulses. Therefore, it is important to systematically characterise the amplitude of the jitter noise and its variation with frequency. In this paper, we provide jitter measurements from low-frequency wideband observations of PSR J0437$-$4715 using data obtained as part of… ▽ More

    Submitted 18 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 13 pages, 12 figures, 3 tables, accepted for Publication of the Astronomical Society of Australia

  6. arXiv:2309.16765  [pdf, other

    astro-ph.IM

    Using low-frequency scatter-broadening measurements for precision estimates of dispersion measures

    Authors: Jaikhomba Singha, Bhal Chandra Joshi, M. A. Krishnakumar, Fazal Kareem, Adarsh Bathula, Churchil Dwivedi, Shebin Jose Jacob, Shantanu Desai, Pratik Tarafdar, P. Arumugam, Swetha Arumugam, Manjari Bagchi, Neelam Dhanda Batra, Subhajit Dandapat, Debabrata Deb, Jyotijwal Debnath, A Gopakumar, Yashwant Gupta, Shinnosuke Hisano, Ryo Kato, Tomonosuke Kikunaga, Piyush Marmat, K. Nobleson, Avinash K. Paladi, Arul Pandian B. , et al. (6 additional authors not shown)

    Abstract: A pulsar's pulse profile gets broadened at low frequencies due to dispersion along the line of sight or due to multi-path propagation. The dynamic nature of the interstellar medium makes both of these effects time-dependent and introduces slowly varying time delays in the measured times-of-arrival similar to those introduced by passing gravitational waves. In this article, we present a new method… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 9 pages, 8 figures, Submitted to MNRAS

  7. arXiv:2309.00693  [pdf, other

    astro-ph.HE gr-qc

    Comparing recent PTA results on the nanohertz stochastic gravitational wave background

    Authors: The International Pulsar Timing Array Collaboration, G. Agazie, J. Antoniadis, A. Anumarlapudi, A. M. Archibald, P. Arumugam, S. Arumugam, Z. Arzoumanian, J. Askew, S. Babak, M. Bagchi, M. Bailes, A. -S. Bak Nielsen, P. T. Baker, C. G. Bassa, A. Bathula, B. Bécsy, A. Berthereau, N. D. R. Bhat, L. Blecha, M. Bonetti, E. Bortolas, A. Brazier, P. R. Brook, M. Burgay , et al. (220 additional authors not shown)

    Abstract: The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTA… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 21 pages, 9 figures, submitted to ApJ

  8. arXiv:2306.16227  [pdf, other

    astro-ph.CO astro-ph.GA gr-qc

    The second data release from the European Pulsar Timing Array: IV. Implications for massive black holes, dark matter and the early Universe

    Authors: J. Antoniadis, P. Arumugam, S. Arumugam, P. Auclair, S. Babak, M. Bagchi, A. -S. Bak Nielsen, E. Barausse, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, C. Caprini, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, M. Crisostomi, S. Dandapat, D. Deb , et al. (89 additional authors not shown)

    Abstract: The European Pulsar Timing Array (EPTA) and Indian Pulsar Timing Array (InPTA) collaborations have measured a low-frequency common signal in the combination of their second and first data releases respectively, with the correlation properties of a gravitational wave background (GWB). Such signal may have its origin in a number of physical processes including a cosmic population of inspiralling sup… ▽ More

    Submitted 15 May, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 30 pages, 23 figures, replaced to match the version published in Astronomy & Astrophysics, note the change in the numbering order in the series (now paper IV)

  9. arXiv:2306.16226  [pdf, other

    astro-ph.HE astro-ph.CO astro-ph.GA gr-qc

    The second data release from the European Pulsar Timing Array V. Search for continuous gravitational wave signals

    Authors: J. Antoniadis, P. Arumugam, S. Arumugam, S. Babak, M. Bagchi, A. S. Bak Nielsen, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, S. Dandapat, D. Deb, S. Desai, G. Desvignes, N. Dhanda-Batra, C. Dwivedi , et al. (75 additional authors not shown)

    Abstract: We present the results of a search for continuous gravitational wave signals (CGWs) in the second data release (DR2) of the European Pulsar Timing Array (EPTA) collaboration. The most significant candidate event from this search has a gravitational wave frequency of 4-5 nHz. Such a signal could be generated by a supermassive black hole binary (SMBHB) in the local Universe. We present the results o… ▽ More

    Submitted 25 June, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 13 figures, 15 pages, accepted

  10. arXiv:2306.16225  [pdf, other

    astro-ph.HE astro-ph.IM

    The second data release from the European Pulsar Timing Array II. Customised pulsar noise models for spatially correlated gravitational waves

    Authors: J. Antoniadis, P. Arumugam, S. Arumugam, S. Babak, M. Bagchi, A. S. Bak Nielsen, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, S. Dandapat, D. Deb, S. Desai, G. Desvignes, N. Dhanda-Batra, C. Dwivedi , et al. (73 additional authors not shown)

    Abstract: The nanohertz gravitational wave background (GWB) is expected to be an aggregate signal of an ensemble of gravitational waves emitted predominantly by a large population of coalescing supermassive black hole binaries in the centres of merging galaxies. Pulsar timing arrays, ensembles of extremely stable pulsars, are the most precise experiments capable of detecting this background. However, the su… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 20 pages, 6 figures, 9 tables

    Journal ref: A&A 678, A49 (2023)

  11. arXiv:2306.16214  [pdf, other

    astro-ph.HE astro-ph.CO astro-ph.GA

    The second data release from the European Pulsar Timing Array III. Search for gravitational wave signals

    Authors: J. Antoniadis, P. Arumugam, S. Arumugam, S. Babak, M. Bagchi, A. -S. Bak Nielsen, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, S. Dandapat, D. Deb, S. Desai, G. Desvignes, N. Dhanda-Batra, C. Dwivedi , et al. (73 additional authors not shown)

    Abstract: We present the results of the search for an isotropic stochastic gravitational wave background (GWB) at nanohertz frequencies using the second data release of the European Pulsar Timing Array (EPTA) for 25 millisecond pulsars and a combination with the first data release of the Indian Pulsar Timing Array (InPTA). We analysed (i) the full 24.7-year EPTA data set, (ii) its 10.3-year subset based on… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 21 pages, 14 figures, 4 appendix figures, accepted for publication in A&A

    Journal ref: A&A 678, A50 (2023)

  12. Gravitational Waves from Black-Hole Encounters: Prospects for Ground- and Galaxy-Based Observatories

    Authors: Subhajit Dandapat, Michael Ebersold, Abhimanyu Susobhanan, Prerna Rana, Achamveedu Gopakumar, Shubhanshu Tiwari, Maria Haney, Hyung Mok Lee, Neel Kolhe

    Abstract: Close hyperbolic encounters of black holes (BHs) generate certain Burst With Memory (BWM) events in the frequency windows of the operational, planned, and proposed gravitational wave (GW) observatories. We present detailed explorations of the detectable parameter space of such events that are relevant for the LIGO-Virgo-KAGRA and the International Pulsar Timing Array (IPTA) consortia. The underlyi… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 19 pages, 11 figures, accepted for publication in Phys. Rev. D

  13. arXiv:2305.11592  [pdf, ps, other

    cs.CL cs.SI

    IKDSumm: Incorporating Key-phrases into BERT for extractive Disaster Tweet Summarization

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Srishti Gupta, Sourav Kumar Dandapat

    Abstract: Online social media platforms, such as Twitter, are one of the most valuable sources of information during disaster events. Therefore, humanitarian organizations, government agencies, and volunteers rely on a summary of this information, i.e., tweets, for effective disaster management. Although there are several existing supervised and unsupervised approaches for automated tweet summary approaches… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  14. arXiv:2305.11536  [pdf, other

    cs.CL cs.SI

    PORTRAIT: a hybrid aPproach tO cReate extractive ground-TRuth summAry for dIsaster evenT

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: Disaster summarization approaches provide an overview of the important information posted during disaster events on social media platforms, such as, Twitter. However, the type of information posted significantly varies across disasters depending on several factors like the location, type, severity, etc. Verification of the effectiveness of disaster summarization approaches still suffer due to the… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  15. arXiv:2304.13072  [pdf, other

    astro-ph.IM astro-ph.HE

    Multi-band Extension of the Wideband Timing Technique

    Authors: Avinash Kumar Paladi, Churchil Dwivedi, Prerna Rana, Nobleson K, Abhimanyu Susobhanan, Bhal Chandra Joshi, Pratik Tarafdar, Debabrata Deb, Swetha Arumugam, A Gopakumar, M A Krishnakumar, Neelam Dhanda Batra, Jyotijwal Debnath, Fazal Kareem, Paramasivan Arumugam, Manjari Bagchi, Adarsh Bathula, Subhajit Dandapat, Shantanu Desai, Yashwant Gupta, Shinnosuke Hisano, Divyansh Kharbanda, Tomonosuke Kikunaga, Neel Kolhe, Yogesh Maan , et al. (5 additional authors not shown)

    Abstract: The wideband timing technique enables the high-precision simultaneous estimation of pulsar Times of Arrival (ToAs) and Dispersion Measures (DMs) while effectively modeling frequency-dependent profile evolution. We present two novel independent methods that extend the standard wideband technique to handle simultaneous multi-band pulsar data incorporating profile evolution over a larger frequency sp… ▽ More

    Submitted 8 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Published in MNRAS

  16. arXiv:2303.12105  [pdf, other

    astro-ph.HE astro-ph.IM

    Noise analysis of the Indian Pulsar Timing Array data release I

    Authors: Aman Srivastava, Shantanu Desai, Neel Kolhe, Mayuresh Surnis, Bhal Chandra Joshi, Abhimanyu Susobhanan, Aurélien Chalumeau, Shinnosuke Hisano, Nobleson K., Swetha Arumugam, Divyansh Kharbanda, Jaikhomba Singha, Pratik Tarafdar, P Arumugam, Manjari Bagchi, Adarsh Bathula, Subhajit Dandapat, Lankeswar Dey, Churchil Dwivedi, Raghav Girgaonkar, A. Gopakumar, Yashwant Gupta, Tomonosuke Kikunaga, M. A. Krishnakumar, Kuo Liu , et al. (6 additional authors not shown)

    Abstract: The Indian Pulsar Timing Array (InPTA) collaboration has recently made its first official data release (DR1) for a sample of 14 pulsars using 3.5 years of uGMRT observations. We present the results of single-pulsar noise analysis for each of these 14 pulsars using the InPTA DR1. For this purpose, we consider white noise, achromatic red noise, dispersion measure (DM) variations, and scattering vari… ▽ More

    Submitted 16 June, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in PRD, 30 pages, 17 figures, 4 tables

  17. arXiv:2303.02357  [pdf, other

    cs.CL cs.AI

    DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer

    Authors: Shanu Kumar, Abbaraju Soujanya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Zero-shot cross-lingual transfer is promising, however has been shown to be sub-optimal, with inferior transfer performance across low-resource languages. In this work, we envision languages as domains for improving zero-shot transfer by jointly reducing the feature incongruity between the source and the target language and increasing the generalization capabilities of pre-trained multilingual tra… ▽ More

    Submitted 4 March, 2023; originally announced March 2023.

    Comments: Accepted at EACL 2023

  18. arXiv:2210.15184  [pdf, other

    cs.CL

    Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Develo** Lightweight Low-Resource MT Models

    Authors: Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali

    Abstract: Leveraging shared learning through Massively Multilingual Models, state-of-the-art machine translation models are often able to adapt to the paucity of data for low-resource languages. However, this performance comes at the cost of significantly bloated models which are not practically deployable. Knowledge Distillation is one popular technique to develop competitive, lightweight models: In this w… ▽ More

    Submitted 9 November, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 16 Pages, 7 Figures, Accepted to WMT 2022 (Research Track)

  19. arXiv:2210.12265  [pdf, other

    cs.CL

    On the Calibration of Massively Multilingual Language Models

    Authors: Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury

    Abstract: Massively Multilingual Language Models (MMLMs) have recently gained popularity due to their surprising effectiveness in cross-lingual transfer. While there has been much work in evaluating these models for their performance on a variety of tasks and languages, little attention has been paid on how well calibrated these models are with respect to the confidence in their predictions. We first invest… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  20. arXiv:2207.06461  [pdf, other

    astro-ph.HE astro-ph.IM astro-ph.SR

    Nanohertz Gravitational Wave Astronomy during the SKA Era: An InPTA perspective

    Authors: Bhal Chandra Joshi, Achamveedu Gopakumar, Arul Pandian, Thiagaraj Prabu, Lankeswar Dey, Manjari Bagchi, Shantanu Desai, Pratik Tarafdar, Prerna Rana, Yogesh Maan, Neelam Dhanda Batra, Raghav Girgaonkar, Nikita Agarwal, Paramasivan Arumugam, Sarmistha Banik, Avishek Basu, Adarsh Bathula, Subhajit Dandapat, Yashwant Gupta, Shinnosuke Hisano, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Neel Kolhe, M. A. Krishnakumar , et al. (12 additional authors not shown)

    Abstract: Decades long monitoring of millisecond pulsars, which exhibit highly stable rotational periods, in pulsar timing array experiments is on the threshold of discovering nanohertz stochastic gravitational wave background. This paper describes the Indian Pulsar timing array (InPTA) experiment, which employs the upgraded Giant Metrewave Radio Telescope (uGMRT) for timing an ensemble of millisecond pulsa… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted for publication in Journal of Astronomy and Astrophysics for Special Issue on Indian Participation in the SKA (Editors : Abhirup Datta, Nirupam Roy, Preeti Kharb and Tirthankar Roy Choudhury)

  21. arXiv:2206.15010  [pdf, other

    cs.CL

    "Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer

    Authors: Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

    Abstract: Few-shot transfer often shows substantial gain over zero-shot transfer~\cite{lauscher2020zero}, which is a practically useful trade-off between fully supervised and unsupervised learning approaches for multilingual pretrained model-based systems. This paper explores various strategies for selecting data for annotation that can result in a better few-shot transfer. The proposed approaches rely on m… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: NAACL 2022

  22. arXiv:2206.09289  [pdf, other

    astro-ph.IM astro-ph.HE

    The Indian Pulsar Timing Array: First data release

    Authors: Pratik Tarafdar, Nobleson K., Prerna Rana, Jaikhomba Singha, M. A. Krishnakumar, Bhal Chandra Joshi, Avinash Kumar Paladi, Neel Kolhe, Neelam Dhanda Batra, Nikita Agarwal, Adarsh Bathula, Subhajit Dandapat, Shantanu Desai, Lankeswar Dey, Shinnosuke Hisano, Prathamesh Ingale, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Piyush Marmat, B. Arul Pandian, T. Prabu, Aman Srivastava, Mayuresh Surnis, Sai Chaitanya Susarla , et al. (13 additional authors not shown)

    Abstract: We present the pulse arrival times and high-precision dispersion measure estimates for 14 millisecond pulsars observed simultaneously in the 300-500 MHz and 1260-1460 MHz frequency bands using the upgraded Giant Metrewave Radio Telescope (uGMRT). The data spans over a baseline of 3.5 years (2018-2021), and is the first official data release made available by the Indian Pulsar Timing Array collabor… ▽ More

    Submitted 25 October, 2022; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: 23 pages, 21 figures, 3 tables. Published in PASA

    Journal ref: Publications of the Astronomical Society of Australia, Volume 39, 2022, e053

  23. arXiv:2205.06356  [pdf, other

    cs.CL

    Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages

    Authors: Kabir Ahuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Although recent Massively Multilingual Language Models (MMLMs) like mBERT and XLMR support around 100 languages, most existing multilingual NLP benchmarks provide evaluation data in only a handful of these languages with little linguistic diversity. We argue that this makes the existing practices in multilingual evaluation unreliable and does not provide a full picture of the performance of MMLMs… ▽ More

    Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: NLP Power! Workshop, ACL 2022

  24. arXiv:2205.06350  [pdf, other

    cs.CL

    On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

    Authors: Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat

    Abstract: Borrowing ideas from {\em Production functions} in micro-economics, in this paper we introduce a framework to systematically evaluate the performance and cost trade-offs between machine-translated and manually-created labelled data for task-specific fine-tuning of massively multilingual language models. We illustrate the effectiveness of our framework through a case-study on the TyDIQA-GoldP datas… ▽ More

    Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  25. arXiv:2205.06130  [pdf, other

    cs.CL

    Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models

    Authors: Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

    Abstract: Massively Multilingual Transformer based Language Models have been observed to be surprisingly effective on zero-shot transfer across languages, though the performance varies from language to language depending on the pivot language(s) used for fine-tuning. In this work, we build upon some of the existing techniques for predicting the zero-shot performance on a task, by modeling it as a multi-task… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  26. arXiv:2203.12865  [pdf, other

    cs.CL cs.LG

    Multilingual CheckList: Generation and Evaluation

    Authors: Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Multilingual evaluation benchmarks usually contain limited high-resource languages and do not test models for specific linguistic capabilities. CheckList is a template-based evaluation approach that tests models for specific capabilities. The CheckList template creation process requires native speakers, posing a challenge in scaling to hundreds of languages. In this work, we explore multiple appro… ▽ More

    Submitted 11 October, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of AACL-IJCNLP 2022

  27. arXiv:2203.01188  [pdf, ps, other

    cs.SI

    EnDSUM: Entropy and Diversity based Disaster Tweet Summarization

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: The huge amount of information shared in Twitter during disaster events are utilized by government agencies and humanitarian organizations to ensure quick crisis response and provide situational updates. However, the huge number of tweets posted makes manual identification of the relevant tweets impossible. To address the information overload, there is a need to automatically generate summary of a… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

  28. arXiv:2201.07472  [pdf, other

    cs.SI

    Detecting Stance in Tweets : A Signed Network based Approach

    Authors: Roshni Chakraborty, Maitry Bhavsar, Sourav Kumar Dandapat, Joydeep Chandra

    Abstract: Identifying user stance related to a political event has several applications, like determination of individual stance, sha** of public opinion, identifying popularity of government measures and many others. The huge volume of political discussions on social media platforms, like, Twitter, provide opportunities in develo** automated mechanisms to identify individual stance and subsequently, sc… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

  29. arXiv:2201.06545  [pdf, ps, other

    cs.SI

    OntoDSumm : Ontology based Tweet Summarization for Disaster Events

    Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

    Abstract: The huge popularity of social media platforms like Twitter attracts a large fraction of users to share real-time information and short situational messages during disasters. A summary of these tweets is required by the government organizations, agencies, and volunteers for efficient and quick disaster response. However, the huge influx of tweets makes it difficult to manually get a precise overvie… ▽ More

    Submitted 19 November, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

    ACM Class: H.0

  30. arXiv:2112.06908  [pdf, other

    astro-ph.IM astro-ph.HE

    Low-frequency wideband timing of InPTA pulsars observed with the uGMRT

    Authors: K Nobleson, Nikita Agarwal, Raghav Girgaonkar, Arul Pandian, Bhal Chandra Joshi, M A Krishnakumar, Abhimanyu Susobhanan, Shantanu Desai, T Prabu, Adarsh Bathula, Timothy T Pennucci, Sarmistha Banik, Manjari Bagchi, Neelam Dhanda Batra, Arpita Choudhary, Subhajit Dandapat, Lankeswar Dey, Yashwant Gupta, Shinnosuke Hisano, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Neel Kolhe, Yogesh Maan, Piyush Marmat , et al. (7 additional authors not shown)

    Abstract: High-precision measurements of the pulsar dispersion measure (DM) are possible using telescopes with low-frequency wideband receivers. We present an initial study of the application of the wideband timing technique, which can simultaneously measure the pulsar times of arrival (ToAs) and DMs, for a set of five pulsars observed with the upgraded Giant Metrewave Radio Telescope (uGMRT) as part of the… ▽ More

    Submitted 23 February, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in MNRAS

  31. Third order post-Newtonian gravitational radiation from two-body scattering: Instantaneous energy and angular momentum radiation

    Authors: Gihyuk Cho, Subhajit Dandapat, Achamveedu Gopakumar

    Abstract: We compute the third post-Newtonian (3PN) accurate instantaneous contributions to the radiated gravitational wave (GW) energy and angular momentum arising from the hyperbolic passages of non-spinning compact objects. The present computations employ 3PN-accurate instantaneous contributions to the far-zone energy and angular momentum fluxes and the 3PN-accurate Keplerian type parametric solution for… ▽ More

    Submitted 20 April, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Journal ref: Phys. Rev. D 105, 084018 (2022)

  32. arXiv:2110.08875  [pdf, other

    cs.CL cs.LG

    Predicting the Performance of Multilingual NLP Models

    Authors: Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury

    Abstract: Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages. The languages that these models are evaluated on, however, are very few in number, and it is unlikely that evaluation datasets will cover all the languages that these models support. Potential solutions to the costly problem of dataset creation are to translate datasets to new languages or use te… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

  33. arXiv:2109.07140  [pdf, ps, other

    cs.CL

    On the Universality of Deep Contextual Language Models

    Authors: Shaily Bhatt, Poonam Goyal, Sandipan Dandapat, Monojit Choudhury, Sunayana Sitaram

    Abstract: Deep Contextual Language Models (LMs) like ELMO, BERT, and their successors dominate the landscape of Natural Language Processing due to their ability to scale across multiple tasks rapidly by pre-training a single model, followed by task-specific fine-tuning. Furthermore, multilingual versions of such models like XLM-R and mBERT have given promising results in zero-shot cross-lingual transfer, po… ▽ More

    Submitted 18 December, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 9 pages

  34. Evidence for profile changes in PSR J1713+0747 using the uGMRT

    Authors: Jaikhomba Singha, Mayuresh P Surnis, Bhal Chandra Joshi, Pratik Tarafdar, Prerna Rana, Abhimanyu Susobhanan, Raghav Girgaonkar, Neel Kolhe, Nikita Agarwal, Shantanu Desai, T Prabu, Adarsh Bathula, Subhajit Dandapat, Lankeswar Dey, Shinnosuke Hisano, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Piyush Marmat, Sai Chaitanya Susarla, Manjari Bagchi, Neelam Dhanda Batra, Arpita Choudhury, A Gopakumar, Yashwant Gupta , et al. (7 additional authors not shown)

    Abstract: PSR J1713+0747 is one of the most precisely timed pulsars in the international pulsar timing array experiment. This pulsar showed an abrupt profile shape change between April 16, 2021 (MJD 59320) and April 17, 2021 (MJD 59321). In this paper, we report the results from multi-frequency observations of this pulsar carried out with the upgraded Giant Metrewave Radio Telescope (uGMRT) before and after… ▽ More

    Submitted 16 August, 2021; v1 submitted 9 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in MNRAS-Letters

  35. arXiv:2105.10427  [pdf, other

    cs.AR

    Prefetcher-based DRAM Architecture

    Authors: Saurabh Jaiswal, Shailendra Kumar Gupta, Soumya Soubhagya Dandapat

    Abstract: Advancement in Processor technology has made it easy to handle data-intensive workloads, but limiting main memory advances has created performance bottlenecks. In DRAM, there have been improvements in DRAM access latency as well as reduction in cost-per-bit with the increase in cell density. But still DRAM data transfer rate lags behind the processing speed of the current generation processors. As… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

  36. A Large-Scale Study of the Twitter Follower Network to Characterize the Spread of Prescription Drug Abuse Tweets

    Authors: Ryan Sequeira, Avijit Gayen, Niloy Ganguly, Sourav Kumar Dandapat, Joydeep Chandra

    Abstract: In this article, we perform a large-scale study of the Twitter follower network, involving around 0.42 million users who justify DA, to characterize the spreading of DA tweets across the network. Our observations reveal the existence of a very large giant component involving 99% of these users with dense local connectivity that facilitates the spreading of such messages. We further identify active… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: 13 pages, 9 figures, and accepted by IEEE Transactions on Computational Social Systems

    Journal ref: IEEE Transactions on Computational Social Systems, vol. 6, no. 6, pp. 1232-1244, Dec. 2019

  37. arXiv:2004.12376  [pdf, other

    cs.CL

    GLUECoS : An Evaluation Benchmark for Code-Switched NLP

    Authors: Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury

    Abstract: Code-switching is the use of more than one language in the same conversation or utterance. Recently, multilingual contextual embedding models, trained on multiple monolingual corpora, have shown promising results on cross-lingual and multilingual tasks. We present an evaluation benchmark, GLUECoS, for code-switched languages, that spans several NLP tasks in English-Hindi and English-Spanish. Speci… ▽ More

    Submitted 14 May, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: To appear at ACL 2020

  38. arXiv:2004.05051  [pdf, other

    cs.CL

    A New Dataset for Natural Language Inference from Code-mixed Conversations

    Authors: Simran Khanuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

    Abstract: Natural Language Inference (NLI) is the task of inferring the logical relationship, typically entailment or contradiction, between a premise and hypothesis. Code-mixing is the use of more than one language in the same conversation or utterance, and is prevalent in multilingual communities all over the world. In this paper, we present the first dataset for code-mixed NLI, in which both the premises… ▽ More

    Submitted 13 April, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: To appear in CALCS, LREC 2020

  39. arXiv:2002.02631  [pdf, other

    cs.CL cs.AI

    Translating Web Search Queries into Natural Language Questions

    Authors: Adarsh Kumar, Sandipan Dandapat, Sushil Chordia

    Abstract: Users often query a search engine with a specific question in mind and often these queries are keywords or sub-sentential fragments. For example, if the users want to know the answer for "What's the capital of USA", they will most probably query "capital of USA" or "USA capital" or some keyword-based variation of this. For example, for the user entered query "capital of USA", the most probable que… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: Eleventh International Conference on Language Resources and Evaluation, LREC 2018

  40. arXiv:1901.09334  [pdf

    cs.SI cs.IR cs.LG

    Predicting Tomorrow's Headline using Today's Twitter Deliberations

    Authors: Roshni Chakraborty, Abhijeet Kharat, Apalak Khatua, Sourav Kumar Dandapat, Joydeep Chandra

    Abstract: Predicting the popularity of news article is a challenging task. Existing literature mostly focused on article contents and polarity to predict popularity. However, existing research has not considered the users' preference towards a particular article. Understanding users' preference is an important aspect for predicting the popularity of news articles. Hence, we consider the social media data, f… ▽ More

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: This paper was accepted in CIKM Workshop on News Recommendation and Analytics (INRA), 2018, Turin, Italy

  41. arXiv:1601.06608  [pdf, other

    cs.CV

    An Unsupervised Method for Detection and Validation of The Optic Disc and The Fovea

    Authors: Mrinal Haloi, Samarendra Dandapat, Rohit Sinha

    Abstract: In this work, we have presented a novel method for detection of retinal image features, the optic disc and the fovea, from colour fundus photographs of dilated eyes for Computer-aided Diagnosis(CAD) system. A saliency map based method was used to detect the optic disc followed by an unsupervised probabilistic Latent Semantic Analysis for detection validation. The validation concept is based on dis… ▽ More

    Submitted 25 January, 2016; originally announced January 2016.

    MSC Class: 68T45

  42. arXiv:1505.00737  [pdf, other

    cs.CV

    A Gaussian Scale Space Approach For Exudates Detection, Classification And Severity Prediction

    Authors: Mrinal Haloi, Samarendra Dandapat, Rohit Sinha

    Abstract: In the context of Computer Aided Diagnosis system for diabetic retinopathy, we present a novel method for detection of exudates and their classification for disease severity prediction. The method is based on Gaussian scale space based interest map and mathematical morphology. It makes use of support vector machine for classification and location information of the optic disc and the macula region… ▽ More

    Submitted 4 May, 2015; originally announced May 2015.

    Comments: Accepted in ICIP 2015, Quebec city, Canada

    MSC Class: 68T45

  43. arXiv:0809.2231  [pdf, ps, other

    physics.flu-dyn

    Circular hydraulic jump in generalized-Newtonian fluids

    Authors: Ashutosh Rai, B. S. Dandapat, Swarup Poria

    Abstract: We carry out an analytical study of laminar circular hydraulic jumps, in generalized-Newtonian fluids obeying the two-parametric power-law model of Ostwald-de Waele. Under the boundary-layer approximation we obtained exact expressions determining the flow, an implicit relation for the jump radius is derived. Corresponding results for Newtonian fluids can be retrieved as a limiting case for the f… ▽ More

    Submitted 22 September, 2008; v1 submitted 12 September, 2008; originally announced September 2008.

    Comments: 4 pages, 3 figures, added references, corrected typos