Search | arXiv e-print repository

ADSumm: Annotated Ground-truth Summary Datasets for Disaster Tweet Summarization

Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

Abstract: Online social media platforms, such as Twitter, provide valuable information during disaster events. Existing tweet disaster summarization approaches provide a summary of these events to aid government agencies, humanitarian organizations, etc., to ensure effective disaster response. In the literature, there are two types of approaches for disaster summarization, namely, supervised and unsupervise… ▽ More Online social media platforms, such as Twitter, provide valuable information during disaster events. Existing tweet disaster summarization approaches provide a summary of these events to aid government agencies, humanitarian organizations, etc., to ensure effective disaster response. In the literature, there are two types of approaches for disaster summarization, namely, supervised and unsupervised approaches. Although supervised approaches are typically more effective, they necessitate a sizable number of disaster event summaries for testing and training. However, there is a lack of good number of disaster summary datasets for training and evaluation. This motivates us to add more datasets to make supervised learning approaches more efficient. In this paper, we present ADSumm, which adds annotated ground-truth summaries for eight disaster events which consist of both natural and man-made disaster events belonging to seven different countries. Our experimental analysis shows that the newly added datasets improve the performance of the supervised summarization approaches by 8-28% in terms of ROUGE-N F1-score. Moreover, in newly annotated dataset, we have added a category label for each input tweet which helps to ensure good coverage from different categories in summary. Additionally, we have added two other features relevance label and key-phrase, which provide information about the quality of a tweet and explanation about the inclusion of the tweet into summary, respectively. For ground-truth summary creation, we provide the annotation procedure adapted in detail, which has not been described in existing literature. Experimental analysis shows the quality of ground-truth summary is very good with Coverage, Relevance and Diversity. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2405.06541 [pdf, other]

ATSumm: Auxiliary information enhanced approach for abstractive disaster Tweet Summarization with sparse training data

Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

Abstract: The abundance of situational information on Twitter poses a challenge for users to manually discern vital and relevant information during disasters. A concise and human-interpretable overview of this information helps decision-makers in implementing efficient and quick disaster response. Existing abstractive summarization approaches can be categorized as sentence-based or key-phrase-based approach… ▽ More The abundance of situational information on Twitter poses a challenge for users to manually discern vital and relevant information during disasters. A concise and human-interpretable overview of this information helps decision-makers in implementing efficient and quick disaster response. Existing abstractive summarization approaches can be categorized as sentence-based or key-phrase-based approaches. This paper focuses on sentence-based approach, which is typically implemented as a dual-phase procedure in literature. The initial phase, known as the extractive phase, involves identifying the most relevant tweets. The subsequent phase, referred to as the abstractive phase, entails generating a more human-interpretable summary. In this study, we adopt the methodology from prior research for the extractive phase. For the abstractive phase of summarization, most existing approaches employ deep learning-based frameworks, which can either be pre-trained or require training from scratch. However, to achieve the appropriate level of performance, it is imperative to have substantial training data for both methods, which is not readily available. This work presents an Abstractive Tweet Summarizer (ATSumm) that effectively addresses the issue of data sparsity by using auxiliary information. We introduced the Auxiliary Pointer Generator Network (AuxPGN) model, which utilizes a unique attention mechanism called Key-phrase attention. This attention mechanism incorporates auxiliary information in the form of key-phrases and their corresponding importance scores from the input tweets. We evaluate the proposed approach by comparing it with 10 state-of-the-art approaches across 13 disaster datasets. The evaluation results indicate that ATSumm achieves superior performance compared to state-of-the-art approaches, with improvement of 4-80% in ROUGE-N F1-score. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2402.03472 [pdf, other]

doi 10.1103/PhysRevD.109.103018

Efficient prescription to search for linear gravitational wave memory from hyperbolic black hole encounters and its application to the NANOGrav 12.5-year dataset

Authors: Subhajit Dandapat, Abhimanyu Susobhanan, Lankeswar Dey, A. Gopakumar, Paul T. Baker, Philippe Jetzer

Abstract: Burst with memory events are potential transient gravitational wave sources for the maturing pulsar timing array (PTA) efforts. We provide a computationally efficient prescription to model pulsar timing residuals induced by supermassive black hole pairs in general relativistic hyperbolic trajectories employing a Keplerian-type parametric solution. Injection studies have been pursued on the resulti… ▽ More Burst with memory events are potential transient gravitational wave sources for the maturing pulsar timing array (PTA) efforts. We provide a computationally efficient prescription to model pulsar timing residuals induced by supermassive black hole pairs in general relativistic hyperbolic trajectories employing a Keplerian-type parametric solution. Injection studies have been pursued on the resulting bursts with linear GW memory (LGWM) events with simulated datasets to test the performance of our pipeline, followed by its application to the publicly available NANOGrav 12.5-year (NG12.5) dataset. Given the absence of any evidence of LGWM events within the real NG12.5 dataset, we impose $95\%$ upper limits on the PTA signal amplitude as a function of the sky location of the source and certain characteristic frequency ($n$) of the signal. The upper limits are computed using a signal model that takes into account the presence of intrinsic timing noise specific to each pulsar, as well as a common, spatially uncorrelated red noise, alongside the LGWM signal. Our investigations reveal that the $95\%$ upper limits on LGWM amplitude, marginalized over all other parameters, is 3.48 $\pm 0.51 \ μ$s for $n>3.16$ nHz. This effort should be relevant for constraining both burst and memory events in the upcoming International Pulsar Timing Array data releases. △ Less

Submitted 16 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

Comments: 20 pages, 11 figures

Journal ref: Phys. Rev. D 109, 103018, 2024

arXiv:2401.06810 [pdf, other]

TONE: A 3-Tiered ONtology for Emotion analysis

Authors: Srishti Gupta, Piyush Kumar Garg, Sourav Kumar Dandapat

Abstract: Emotions have played an important part in many sectors, including psychology, medicine, mental health, computer science, and so on, and categorizing them has proven extremely useful in separating one emotion from another. Emotions can be classified using the following two methods: (1) The supervised method's efficiency is strongly dependent on the size and domain of the data collected. A categoriz… ▽ More Emotions have played an important part in many sectors, including psychology, medicine, mental health, computer science, and so on, and categorizing them has proven extremely useful in separating one emotion from another. Emotions can be classified using the following two methods: (1) The supervised method's efficiency is strongly dependent on the size and domain of the data collected. A categorization established using relevant data from one domain may not work well in another. (2) An unsupervised method that uses either domain expertise or a knowledge base of emotion types already exists. Though this second approach provides a suitable and generic categorization of emotions and is cost-effective, the literature doesn't possess a publicly available knowledge base that can be directly applied to any emotion categorization-related task. This pushes us to create a knowledge base that can be used for emotion classification across domains, and ontology is often used for this purpose. In this study, we provide TONE, an emotion-based ontology that effectively creates an emotional hierarchy based on Dr. Gerrod Parrot's group of emotions. In addition to ontology development, we introduce a semi-automated vocabulary construction process to generate a detailed collection of terms for emotions at each tier of the hierarchy. We also demonstrate automated methods for establishing three sorts of dependencies in order to develop linkages between different emotions. Our human and automatic evaluation results show the ontology's quality. Furthermore, we describe three distinct use cases that demonstrate the applicability of our ontology. △ Less

Submitted 10 January, 2024; originally announced January 2024.

arXiv:2312.01875 [pdf, other]

doi 10.1017/pasa.2024.30

Low-frequency pulse-jitter measurement with the uGMRT I : PSR J0437$-$4715

Authors: Tomonosuke Kikunaga, Shinnosuke Hisano, Neelam Dhanda Batra, Shantanu Desai, Bhal Chandra Joshi, Manjari Bagchi, T. Prabu, Keitaro Takahashi, Swetha Arumugam, Adarsh Bathula, Subhajit Dandapat, Debabrata Deb, Churchil Dwivedi, Yashwant Gupta, Shebin Jose Jacob, Fazal Kareem, Nobleson K, Pragna Mamidipaka, Avinash Kumar Paladi, Arul Pandian B, Prerna Rana, Jaikhomba Singha, Aman Srivastava, Mayuresh Surnis, Pratik Tarafdar

Abstract: High-precision pulsar timing observations are limited in their accuracy by the jitter noise that appears in the arrival time of pulses. Therefore, it is important to systematically characterise the amplitude of the jitter noise and its variation with frequency. In this paper, we provide jitter measurements from low-frequency wideband observations of PSR J0437$-$4715 using data obtained as part of… ▽ More High-precision pulsar timing observations are limited in their accuracy by the jitter noise that appears in the arrival time of pulses. Therefore, it is important to systematically characterise the amplitude of the jitter noise and its variation with frequency. In this paper, we provide jitter measurements from low-frequency wideband observations of PSR J0437$-$4715 using data obtained as part of the Indian Pulsar Timing Array experiment. We were able to detect jitter in both the 300 - 500 MHz and 1260 - 1460 MHz observations of the upgraded Giant Metrewave Radio Telescope (uGMRT). The former is the first jitter measurement for this pulsar below 700 MHz, and the latter is in good agreement with results from previous studies. In addition, at 300 - 500 MHz, we investigated the frequency dependence of the jitter by calculating the jitter for each sub-banded arrival time of pulses. We found that the jitter amplitude increases with frequency. This trend is opposite as compared to previous studies, indicating that there is a turnover at intermediate frequencies. It will be possible to investigate this in more detail with uGMRT observations at 550 - 750 MHz and future high sensitive wideband observations from next generation telescopes, such as the Square Kilometre Array. We also explored the effect of jitter on the high precision dispersion measure (DM) measurements derived from short duration observations. We find that even though the DM precision will be better at lower frequencies due to the smaller amplitude of jitter noise, it will limit the DM precision for high signal-to-noise observations, which are of short durations. This limitation can be overcome by integrating for a long enough duration optimised for a given pulsar. △ Less

Submitted 18 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

Comments: 13 pages, 12 figures, 3 tables, accepted for Publication of the Astronomical Society of Australia

arXiv:2309.16765 [pdf, other]

Using low-frequency scatter-broadening measurements for precision estimates of dispersion measures

Authors: Jaikhomba Singha, Bhal Chandra Joshi, M. A. Krishnakumar, Fazal Kareem, Adarsh Bathula, Churchil Dwivedi, Shebin Jose Jacob, Shantanu Desai, Pratik Tarafdar, P. Arumugam, Swetha Arumugam, Manjari Bagchi, Neelam Dhanda Batra, Subhajit Dandapat, Debabrata Deb, Jyotijwal Debnath, A Gopakumar, Yashwant Gupta, Shinnosuke Hisano, Ryo Kato, Tomonosuke Kikunaga, Piyush Marmat, K. Nobleson, Avinash K. Paladi, Arul Pandian B. , et al. (6 additional authors not shown)

Abstract: A pulsar's pulse profile gets broadened at low frequencies due to dispersion along the line of sight or due to multi-path propagation. The dynamic nature of the interstellar medium makes both of these effects time-dependent and introduces slowly varying time delays in the measured times-of-arrival similar to those introduced by passing gravitational waves. In this article, we present a new method… ▽ More A pulsar's pulse profile gets broadened at low frequencies due to dispersion along the line of sight or due to multi-path propagation. The dynamic nature of the interstellar medium makes both of these effects time-dependent and introduces slowly varying time delays in the measured times-of-arrival similar to those introduced by passing gravitational waves. In this article, we present a new method to correct for such delays by obtaining unbiased dispersion measure (DM) measurements by using low-frequency estimates of the scattering parameters. We evaluate this method by comparing the obtained DM estimates with those, where scatter-broadening is ignored using simulated data. A bias is seen in the estimated DMs for simulated data with pulse-broadening with a larger variability for a data set with a variable frequency scaling index, $α$, as compared to that assuming a Kolmogorov turbulence. Application of the proposed method removes this bias robustly for data with band averaged signal-to-noise ratio larger than 100. We report, for the first time, the measurements of the scatter-broadening time and $α$ from analysis of PSR J1643$-$1224, observed with upgraded Giant Metrewave Radio Telescope as part of the Indian Pulsar Timing Array experiment. These scattering parameters were found to vary with epoch and $α$ was different from that expected for Kolmogorov turbulence. Finally, we present the DM time-series after application of the new technique to PSR J1643$-$1224. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: 9 pages, 8 figures, Submitted to MNRAS

arXiv:2309.00693 [pdf, other]

Comparing recent PTA results on the nanohertz stochastic gravitational wave background

Authors: The International Pulsar Timing Array Collaboration, G. Agazie, J. Antoniadis, A. Anumarlapudi, A. M. Archibald, P. Arumugam, S. Arumugam, Z. Arzoumanian, J. Askew, S. Babak, M. Bagchi, M. Bailes, A. -S. Bak Nielsen, P. T. Baker, C. G. Bassa, A. Bathula, B. Bécsy, A. Berthereau, N. D. R. Bhat, L. Blecha, M. Bonetti, E. Bortolas, A. Brazier, P. R. Brook, M. Burgay , et al. (220 additional authors not shown)

Abstract: The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTA… ▽ More The Australian, Chinese, European, Indian, and North American pulsar timing array (PTA) collaborations recently reported, at varying levels, evidence for the presence of a nanohertz gravitational wave background (GWB). Given that each PTA made different choices in modeling their data, we perform a comparison of the GWB and individual pulsar noise parameters across the results reported from the PTAs that constitute the International Pulsar Timing Array (IPTA). We show that despite making different modeling choices, there is no significant difference in the GWB parameters that are measured by the different PTAs, agreeing within $1σ$. The pulsar noise parameters are also consistent between different PTAs for the majority of the pulsars included in these analyses. We bridge the differences in modeling choices by adopting a standardized noise model for all pulsars and PTAs, finding that under this model there is a reduction in the tension in the pulsar noise parameters. As part of this reanalysis, we "extended" each PTA's data set by adding extra pulsars that were not timed by that PTA. Under these extensions, we find better constraints on the GWB amplitude and a higher signal-to-noise ratio for the Hellings and Downs correlations. These extensions serve as a prelude to the benefits offered by a full combination of data across all pulsars in the IPTA, i.e., the IPTA's Data Release 3, which will involve not just adding in additional pulsars, but also including data from all three PTAs where any given pulsar is timed by more than as single PTA. △ Less

Submitted 1 September, 2023; originally announced September 2023.

Comments: 21 pages, 9 figures, submitted to ApJ

arXiv:2306.16227 [pdf, other]

The second data release from the European Pulsar Timing Array: IV. Implications for massive black holes, dark matter and the early Universe

Authors: J. Antoniadis, P. Arumugam, S. Arumugam, P. Auclair, S. Babak, M. Bagchi, A. -S. Bak Nielsen, E. Barausse, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, C. Caprini, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, M. Crisostomi, S. Dandapat, D. Deb , et al. (89 additional authors not shown)

Abstract: The European Pulsar Timing Array (EPTA) and Indian Pulsar Timing Array (InPTA) collaborations have measured a low-frequency common signal in the combination of their second and first data releases respectively, with the correlation properties of a gravitational wave background (GWB). Such signal may have its origin in a number of physical processes including a cosmic population of inspiralling sup… ▽ More The European Pulsar Timing Array (EPTA) and Indian Pulsar Timing Array (InPTA) collaborations have measured a low-frequency common signal in the combination of their second and first data releases respectively, with the correlation properties of a gravitational wave background (GWB). Such signal may have its origin in a number of physical processes including a cosmic population of inspiralling supermassive black hole binaries (SMBHBs); inflation, phase transitions, cosmic strings and tensor mode generation by non-linear evolution of scalar perturbations in the early Universe; oscillations of the Galactic potential in the presence of ultra-light dark matter (ULDM). At the current stage of emerging evidence, it is impossible to discriminate among the different origins. Therefore, in this paper, we consider each process separately, and investigate the implications of the signal under the hypothesis that it is generated by that specific process. We find that the signal is consistent with a cosmic population of inspiralling SMBHBs, and its relatively high amplitude can be used to place constraints on binary merger timescales and the SMBH-host galaxy scaling relations. If this origin is confirmed, this is the first direct evidence that SMBHBs merge in nature, adding an important observational piece to the puzzle of structure formation and galaxy evolution. As for early Universe processes, the measurement would place tight constraints on the cosmic string tension and on the level of turbulence developed by first-order phase transitions. Other processes would require non-standard scenarios, such as a blue-tilted inflationary spectrum or an excess in the primordial spectrum of scalar perturbations at large wavenumbers. Finally, a ULDM origin of the detected signal is disfavoured, which leads to direct constraints on the abundance of ULDM in our Galaxy. △ Less

Submitted 15 May, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: 30 pages, 23 figures, replaced to match the version published in Astronomy & Astrophysics, note the change in the numbering order in the series (now paper IV)

arXiv:2306.16226 [pdf, other]

The second data release from the European Pulsar Timing Array V. Search for continuous gravitational wave signals

Authors: J. Antoniadis, P. Arumugam, S. Arumugam, S. Babak, M. Bagchi, A. S. Bak Nielsen, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, S. Dandapat, D. Deb, S. Desai, G. Desvignes, N. Dhanda-Batra, C. Dwivedi , et al. (75 additional authors not shown)

Abstract: We present the results of a search for continuous gravitational wave signals (CGWs) in the second data release (DR2) of the European Pulsar Timing Array (EPTA) collaboration. The most significant candidate event from this search has a gravitational wave frequency of 4-5 nHz. Such a signal could be generated by a supermassive black hole binary (SMBHB) in the local Universe. We present the results o… ▽ More We present the results of a search for continuous gravitational wave signals (CGWs) in the second data release (DR2) of the European Pulsar Timing Array (EPTA) collaboration. The most significant candidate event from this search has a gravitational wave frequency of 4-5 nHz. Such a signal could be generated by a supermassive black hole binary (SMBHB) in the local Universe. We present the results of a follow-up analysis of this candidate using both Bayesian and frequentist methods. The Bayesian analysis gives a Bayes factor of 4 in favor of the presence of the CGW over a common uncorrelated noise process, while the frequentist analysis estimates the p-value of the candidate to be 1%, also assuming the presence of common uncorrelated red noise. However, comparing a model that includes both a CGW and a gravitational wave background (GWB) to a GWB only, the Bayes factor in favour of the CGW model is only 0.7. Therefore, we cannot conclusively determine the origin of the observed feature, but we cannot rule it out as a CGW source. We present results of simulations that demonstrate that data containing a weak gravitational wave background can be misinterpreted as data including a CGW and vice versa, providing two plausible explanations of the EPTA DR2 data. Further investigations combining data from all PTA collaborations will be needed to reveal the true origin of this feature. △ Less

Submitted 25 June, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: 13 figures, 15 pages, accepted

arXiv:2306.16225 [pdf, other]

doi 10.1051/0004-6361/202346842

The second data release from the European Pulsar Timing Array II. Customised pulsar noise models for spatially correlated gravitational waves

Authors: J. Antoniadis, P. Arumugam, S. Arumugam, S. Babak, M. Bagchi, A. S. Bak Nielsen, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, S. Dandapat, D. Deb, S. Desai, G. Desvignes, N. Dhanda-Batra, C. Dwivedi , et al. (73 additional authors not shown)

Abstract: The nanohertz gravitational wave background (GWB) is expected to be an aggregate signal of an ensemble of gravitational waves emitted predominantly by a large population of coalescing supermassive black hole binaries in the centres of merging galaxies. Pulsar timing arrays, ensembles of extremely stable pulsars, are the most precise experiments capable of detecting this background. However, the su… ▽ More The nanohertz gravitational wave background (GWB) is expected to be an aggregate signal of an ensemble of gravitational waves emitted predominantly by a large population of coalescing supermassive black hole binaries in the centres of merging galaxies. Pulsar timing arrays, ensembles of extremely stable pulsars, are the most precise experiments capable of detecting this background. However, the subtle imprints that the GWB induces on pulsar timing data are obscured by many sources of noise. These must be carefully characterized to increase the sensitivity to the GWB. In this paper, we present a novel technique to estimate the optimal number of frequency coefficients for modelling achromatic and chromatic noise and perform model selection. We also incorporate a new model to fit for scattering variations in the pulsar timing package temponest and created realistic simulations of the European Pulsar Timing Array (EPTA) datasets that allowed us to test the efficacy of our noise modelling algorithms. We present an in-depth analysis of the noise properties of 25 millisecond pulsars (MSPs) that form the second data release (DR2) of the EPTA and investigate the effect of incorporating low-frequency data from the Indian PTA collaboration. We use enterprise and temponest packages to compare noise models with those reported with the EPTA DR1. We find that, while in some pulsars we can successfully disentangle chromatic from achromatic noise owing to the wider frequency coverage in DR2, in others the noise models evolve in a more complicated way. We also find evidence of long-term scattering variations in PSR J1600$-$3053. Through our simulations, we identify intrinsic biases in our current noise analysis techniques and discuss their effect on GWB searches. The results presented here directly help improve sensitivity to the GWB and are already being used as part of global PTA efforts. △ Less

Submitted 28 June, 2023; originally announced June 2023.

Comments: 20 pages, 6 figures, 9 tables

Journal ref: A&A 678, A49 (2023)

arXiv:2306.16214 [pdf, other]

doi 10.1051/0004-6361/202346844

The second data release from the European Pulsar Timing Array III. Search for gravitational wave signals

Authors: J. Antoniadis, P. Arumugam, S. Arumugam, S. Babak, M. Bagchi, A. -S. Bak Nielsen, C. G. Bassa, A. Bathula, A. Berthereau, M. Bonetti, E. Bortolas, P. R. Brook, M. Burgay, R. N. Caballero, A. Chalumeau, D. J. Champion, S. Chanlaridis, S. Chen, I. Cognard, S. Dandapat, D. Deb, S. Desai, G. Desvignes, N. Dhanda-Batra, C. Dwivedi , et al. (73 additional authors not shown)

Abstract: We present the results of the search for an isotropic stochastic gravitational wave background (GWB) at nanohertz frequencies using the second data release of the European Pulsar Timing Array (EPTA) for 25 millisecond pulsars and a combination with the first data release of the Indian Pulsar Timing Array (InPTA). We analysed (i) the full 24.7-year EPTA data set, (ii) its 10.3-year subset based on… ▽ More We present the results of the search for an isotropic stochastic gravitational wave background (GWB) at nanohertz frequencies using the second data release of the European Pulsar Timing Array (EPTA) for 25 millisecond pulsars and a combination with the first data release of the Indian Pulsar Timing Array (InPTA). We analysed (i) the full 24.7-year EPTA data set, (ii) its 10.3-year subset based on modern observing systems, (iii) the combination of the full data set with the first data release of the InPTA for ten commonly timed millisecond pulsars, and (iv) the combination of the 10.3-year subset with the InPTA data. These combinations allowed us to probe the contributions of instrumental noise and interstellar propagation effects. With the full data set, we find marginal evidence for a GWB, with a Bayes factor of four and a false alarm probability of $4\%$. With the 10.3-year subset, we report evidence for a GWB, with a Bayes factor of $60$ and a false alarm probability of about $0.1\%$ ($\gtrsim 3σ$ significance). The addition of the InPTA data yields results that are broadly consistent with the EPTA-only data sets, with the benefit of better noise modelling. Analyses were performed with different data processing pipelines to test the consistency of the results from independent software packages. The inferred spectrum from the latest EPTA data from new generation observing systems is rather uncertain and in mild tension with the common signal measured in the full data set. However, if the spectral index is fixed at 13/3, the two data sets give a similar amplitude of ($2.5\pm0.7)\times10^{-15}$ at a reference frequency of $1\,{\rm yr}^{-1}$. By continuing our detection efforts as part of the International Pulsar Timing Array (IPTA), we expect to be able to improve the measurement of spatial correlations and better characterise this signal in the coming years. △ Less

Submitted 28 June, 2023; originally announced June 2023.

Comments: 21 pages, 14 figures, 4 appendix figures, accepted for publication in A&A

Journal ref: A&A 678, A50 (2023)

arXiv:2305.19318 [pdf, other]

doi 10.1103/PhysRevD.108.024013

Gravitational Waves from Black-Hole Encounters: Prospects for Ground- and Galaxy-Based Observatories

Authors: Subhajit Dandapat, Michael Ebersold, Abhimanyu Susobhanan, Prerna Rana, Achamveedu Gopakumar, Shubhanshu Tiwari, Maria Haney, Hyung Mok Lee, Neel Kolhe

Abstract: Close hyperbolic encounters of black holes (BHs) generate certain Burst With Memory (BWM) events in the frequency windows of the operational, planned, and proposed gravitational wave (GW) observatories. We present detailed explorations of the detectable parameter space of such events that are relevant for the LIGO-Virgo-KAGRA and the International Pulsar Timing Array (IPTA) consortia. The underlyi… ▽ More Close hyperbolic encounters of black holes (BHs) generate certain Burst With Memory (BWM) events in the frequency windows of the operational, planned, and proposed gravitational wave (GW) observatories. We present detailed explorations of the detectable parameter space of such events that are relevant for the LIGO-Virgo-KAGRA and the International Pulsar Timing Array (IPTA) consortia. The underlying temporally evolving GW polarization states are adapted from Cho et al. [Phys. Rev. D 98, 024039 (2018)] and therefore incorporate general relativistic effects up to the third post-Newtonian order. Further, we provide a prescription to ensure the validity of our waveform family while describing close encounters. Preliminary investigations reveal that optimally placed BWM events should be visible to megaparsec distances for the existing ground-based observatories. In contrast, maturing IPTA datasets should be able to provide constraints on the occurrences of such hyperbolic encounters of supermassive BHs to gigaparsec distances. △ Less

Submitted 30 May, 2023; originally announced May 2023.

Comments: 19 pages, 11 figures, accepted for publication in Phys. Rev. D

arXiv:2305.11592 [pdf, ps, other]

IKDSumm: Incorporating Key-phrases into BERT for extractive Disaster Tweet Summarization

Authors: Piyush Kumar Garg, Roshni Chakraborty, Srishti Gupta, Sourav Kumar Dandapat

Abstract: Online social media platforms, such as Twitter, are one of the most valuable sources of information during disaster events. Therefore, humanitarian organizations, government agencies, and volunteers rely on a summary of this information, i.e., tweets, for effective disaster management. Although there are several existing supervised and unsupervised approaches for automated tweet summary approaches… ▽ More Online social media platforms, such as Twitter, are one of the most valuable sources of information during disaster events. Therefore, humanitarian organizations, government agencies, and volunteers rely on a summary of this information, i.e., tweets, for effective disaster management. Although there are several existing supervised and unsupervised approaches for automated tweet summary approaches, these approaches either require extensive labeled information or do not incorporate specific domain knowledge of disasters. Additionally, the most recent approaches to disaster summarization have proposed BERT-based models to enhance the summary quality. However, for further improved performance, we introduce the utilization of domain-specific knowledge without any human efforts to understand the importance (salience) of a tweet which further aids in summary creation and improves summary quality. In this paper, we propose a disaster-specific tweet summarization framework, IKDSumm, which initially identifies the crucial and important information from each tweet related to a disaster through key-phrases of that tweet. We identify these key-phrases by utilizing the domain knowledge (using existing ontology) of disasters without any human intervention. Further, we utilize these key-phrases to automatically generate a summary of the tweets. Therefore, given tweets related to a disaster, IKDSumm ensures fulfillment of the summarization key objectives, such as information coverage, relevance, and diversity in summary without any human intervention. We evaluate the performance of IKDSumm with 8 state-of-the-art techniques on 12 disaster datasets. The evaluation results show that IKDSumm outperforms existing techniques by approximately 2-79% in terms of ROUGE-N F1-score. △ Less

Submitted 19 May, 2023; originally announced May 2023.

arXiv:2305.11536 [pdf, other]

PORTRAIT: a hybrid aPproach tO cReate extractive ground-TRuth summAry for dIsaster evenT

Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

Abstract: Disaster summarization approaches provide an overview of the important information posted during disaster events on social media platforms, such as, Twitter. However, the type of information posted significantly varies across disasters depending on several factors like the location, type, severity, etc. Verification of the effectiveness of disaster summarization approaches still suffer due to the… ▽ More Disaster summarization approaches provide an overview of the important information posted during disaster events on social media platforms, such as, Twitter. However, the type of information posted significantly varies across disasters depending on several factors like the location, type, severity, etc. Verification of the effectiveness of disaster summarization approaches still suffer due to the lack of availability of good spectrum of datasets along with the ground-truth summary. Existing approaches for ground-truth summary generation (ground-truth for extractive summarization) relies on the wisdom and intuition of the annotators. Annotators are provided with a complete set of input tweets from which a subset of tweets is selected by the annotators for the summary. This process requires immense human effort and significant time. Additionally, this intuition-based selection of the tweets might lead to a high variance in summaries generated across annotators. Therefore, to handle these challenges, we propose a hybrid (semi-automated) approach (PORTRAIT) where we partly automate the ground-truth summary generation procedure. This approach reduces the effort and time of the annotators while ensuring the quality of the created ground-truth summary. We validate the effectiveness of PORTRAIT on 5 disaster events through quantitative and qualitative comparisons of ground-truth summaries generated by existing intuitive approaches, a semi-automated approach, and PORTRAIT. We prepare and release the ground-truth summaries for 5 disaster events which consist of both natural and man-made disaster events belonging to 4 different countries. Finally, we provide a study about the performance of various state-of-the-art summarization approaches on the ground-truth summaries generated by PORTRAIT using ROUGE-N F1-scores. △ Less

Submitted 19 May, 2023; originally announced May 2023.

arXiv:2304.13072 [pdf, other]

doi 10.1093/mnras/stad3122

Multi-band Extension of the Wideband Timing Technique

Authors: Avinash Kumar Paladi, Churchil Dwivedi, Prerna Rana, Nobleson K, Abhimanyu Susobhanan, Bhal Chandra Joshi, Pratik Tarafdar, Debabrata Deb, Swetha Arumugam, A Gopakumar, M A Krishnakumar, Neelam Dhanda Batra, Jyotijwal Debnath, Fazal Kareem, Paramasivan Arumugam, Manjari Bagchi, Adarsh Bathula, Subhajit Dandapat, Shantanu Desai, Yashwant Gupta, Shinnosuke Hisano, Divyansh Kharbanda, Tomonosuke Kikunaga, Neel Kolhe, Yogesh Maan , et al. (5 additional authors not shown)

Abstract: The wideband timing technique enables the high-precision simultaneous estimation of pulsar Times of Arrival (ToAs) and Dispersion Measures (DMs) while effectively modeling frequency-dependent profile evolution. We present two novel independent methods that extend the standard wideband technique to handle simultaneous multi-band pulsar data incorporating profile evolution over a larger frequency sp… ▽ More The wideband timing technique enables the high-precision simultaneous estimation of pulsar Times of Arrival (ToAs) and Dispersion Measures (DMs) while effectively modeling frequency-dependent profile evolution. We present two novel independent methods that extend the standard wideband technique to handle simultaneous multi-band pulsar data incorporating profile evolution over a larger frequency span to estimate DMs and ToAs with enhanced precision. We implement the wideband likelihood using the libstempo python interface to perform wideband timing in the tempo2 framework. We present the application of these techniques to the dataset of fourteen millisecond pulsars observed simultaneously in Band 3 (300 - 500 MHz) and Band 5 (1260 - 1460 MHz) of the upgraded Giant Metrewave Radio Telescope (uGMRT) with a large band gap of 760 MHz as a part of the Indian Pulsar Timing Array (InPTA) campaign. We achieve increased ToA and DM precision and sub-microsecond root mean square post-fit timing residuals by combining simultaneous multi-band pulsar observations done in non-contiguous bands for the first time using our novel techniques. △ Less

Submitted 8 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Published in MNRAS

arXiv:2303.12105 [pdf, other]

doi 10.1103/PhysRevD.108.023008

Noise analysis of the Indian Pulsar Timing Array data release I

Authors: Aman Srivastava, Shantanu Desai, Neel Kolhe, Mayuresh Surnis, Bhal Chandra Joshi, Abhimanyu Susobhanan, Aurélien Chalumeau, Shinnosuke Hisano, Nobleson K., Swetha Arumugam, Divyansh Kharbanda, Jaikhomba Singha, Pratik Tarafdar, P Arumugam, Manjari Bagchi, Adarsh Bathula, Subhajit Dandapat, Lankeswar Dey, Churchil Dwivedi, Raghav Girgaonkar, A. Gopakumar, Yashwant Gupta, Tomonosuke Kikunaga, M. A. Krishnakumar, Kuo Liu , et al. (6 additional authors not shown)

Abstract: The Indian Pulsar Timing Array (InPTA) collaboration has recently made its first official data release (DR1) for a sample of 14 pulsars using 3.5 years of uGMRT observations. We present the results of single-pulsar noise analysis for each of these 14 pulsars using the InPTA DR1. For this purpose, we consider white noise, achromatic red noise, dispersion measure (DM) variations, and scattering vari… ▽ More The Indian Pulsar Timing Array (InPTA) collaboration has recently made its first official data release (DR1) for a sample of 14 pulsars using 3.5 years of uGMRT observations. We present the results of single-pulsar noise analysis for each of these 14 pulsars using the InPTA DR1. For this purpose, we consider white noise, achromatic red noise, dispersion measure (DM) variations, and scattering variations in our analysis. We apply Bayesian model selection to obtain the preferred noise models among these for each pulsar. For PSR J1600$-$3053, we find no evidence of DM and scattering variations, while for PSR J1909$-$3744, we find no significant scattering variations. Properties vary dramatically among pulsars. For example, we find a strong chromatic noise with chromatic index $\sim$ 2.9 for PSR J1939+2134, indicating the possibility of a scattering index that doesn't agree with that expected for a Kolmogorov scattering medium consistent with similar results for millisecond pulsars in past studies. Despite the relatively short time baseline, the noise models broadly agree with the other PTAs and provide, at the same time, well-constrained DM and scattering variations. △ Less

Submitted 16 June, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

Comments: Accepted for publication in PRD, 30 pages, 17 figures, 4 tables

arXiv:2303.02357 [pdf, other]

DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer

Authors: Shanu Kumar, Abbaraju Soujanya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

Abstract: Zero-shot cross-lingual transfer is promising, however has been shown to be sub-optimal, with inferior transfer performance across low-resource languages. In this work, we envision languages as domains for improving zero-shot transfer by jointly reducing the feature incongruity between the source and the target language and increasing the generalization capabilities of pre-trained multilingual tra… ▽ More Zero-shot cross-lingual transfer is promising, however has been shown to be sub-optimal, with inferior transfer performance across low-resource languages. In this work, we envision languages as domains for improving zero-shot transfer by jointly reducing the feature incongruity between the source and the target language and increasing the generalization capabilities of pre-trained multilingual transformers. We show that our approach, DiTTO, significantly outperforms the standard zero-shot fine-tuning method on multiple datasets across all languages using solely unlabeled instances in the target language. Empirical results show that jointly reducing feature incongruity for multiple target languages is vital for successful cross-lingual transfer. Moreover, our model enables better cross-lingual transfer than standard fine-tuning methods, even in the few-shot setting. △ Less

Submitted 4 March, 2023; originally announced March 2023.

Comments: Accepted at EACL 2023

arXiv:2210.15184 [pdf, other]

Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Develo** Lightweight Low-Resource MT Models

Authors: Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali

Abstract: Leveraging shared learning through Massively Multilingual Models, state-of-the-art machine translation models are often able to adapt to the paucity of data for low-resource languages. However, this performance comes at the cost of significantly bloated models which are not practically deployable. Knowledge Distillation is one popular technique to develop competitive, lightweight models: In this w… ▽ More Leveraging shared learning through Massively Multilingual Models, state-of-the-art machine translation models are often able to adapt to the paucity of data for low-resource languages. However, this performance comes at the cost of significantly bloated models which are not practically deployable. Knowledge Distillation is one popular technique to develop competitive, lightweight models: In this work, we first evaluate its use to compress MT models focusing on languages with extremely limited training data. Through our analysis across 8 languages, we find that the variance in the performance of the distilled models due to their dependence on priors including the amount of synthetic data used for distillation, the student architecture, training hyperparameters and confidence of the teacher models, makes distillation a brittle compression mechanism. To mitigate this, we explore the use of post-training quantization for the compression of these models. Here, we find that while distillation provides gains across some low-resource languages, quantization provides more consistent performance trends for the entire range of languages, especially the lowest-resource languages in our target set. △ Less

Submitted 9 November, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

Comments: 16 Pages, 7 Figures, Accepted to WMT 2022 (Research Track)

arXiv:2210.12265 [pdf, other]

On the Calibration of Massively Multilingual Language Models

Authors: Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury

Abstract: Massively Multilingual Language Models (MMLMs) have recently gained popularity due to their surprising effectiveness in cross-lingual transfer. While there has been much work in evaluating these models for their performance on a variety of tasks and languages, little attention has been paid on how well calibrated these models are with respect to the confidence in their predictions. We first invest… ▽ More Massively Multilingual Language Models (MMLMs) have recently gained popularity due to their surprising effectiveness in cross-lingual transfer. While there has been much work in evaluating these models for their performance on a variety of tasks and languages, little attention has been paid on how well calibrated these models are with respect to the confidence in their predictions. We first investigate the calibration of MMLMs in the zero-shot setting and observe a clear case of miscalibration in low-resource languages or those which are typologically diverse from English. Next, we empirically show that calibration methods like temperature scaling and label smoothing do reasonably well towards improving calibration in the zero-shot scenario. We also find that few-shot examples in the language can further help reduce the calibration errors, often substantially. Overall, our work contributes towards building more reliable multilingual models by highlighting the issue of their miscalibration, understanding what language and model specific factors influence it, and pointing out the strategies to improve the same. △ Less

Submitted 21 October, 2022; originally announced October 2022.

Comments: EMNLP 2022

arXiv:2207.06461 [pdf, other]

doi 10.1007/s12036-022-09869-w

Nanohertz Gravitational Wave Astronomy during the SKA Era: An InPTA perspective

Authors: Bhal Chandra Joshi, Achamveedu Gopakumar, Arul Pandian, Thiagaraj Prabu, Lankeswar Dey, Manjari Bagchi, Shantanu Desai, Pratik Tarafdar, Prerna Rana, Yogesh Maan, Neelam Dhanda Batra, Raghav Girgaonkar, Nikita Agarwal, Paramasivan Arumugam, Sarmistha Banik, Avishek Basu, Adarsh Bathula, Subhajit Dandapat, Yashwant Gupta, Shinnosuke Hisano, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Neel Kolhe, M. A. Krishnakumar , et al. (12 additional authors not shown)

Abstract: Decades long monitoring of millisecond pulsars, which exhibit highly stable rotational periods, in pulsar timing array experiments is on the threshold of discovering nanohertz stochastic gravitational wave background. This paper describes the Indian Pulsar timing array (InPTA) experiment, which employs the upgraded Giant Metrewave Radio Telescope (uGMRT) for timing an ensemble of millisecond pulsa… ▽ More Decades long monitoring of millisecond pulsars, which exhibit highly stable rotational periods, in pulsar timing array experiments is on the threshold of discovering nanohertz stochastic gravitational wave background. This paper describes the Indian Pulsar timing array (InPTA) experiment, which employs the upgraded Giant Metrewave Radio Telescope (uGMRT) for timing an ensemble of millisecond pulsars for this purpose. We highlight InPTA's observation strategies and analysis methods, which are relevant for a future PTA experiment with the more sensitive Square Kilometer Array (SKA) telescope. We show that the unique multi-sub-array multi-band wide-bandwidth frequency coverage of the InPTA provides Dispersion Measure estimates with unprecedented precision for PTA pulsars, e.g., ~ 2 x 10{-5} pc-cm{-3} for PSR J1909-3744. Configuring the SKA-low and SKA-mid as two and four sub-arrays respectively, it is shown that comparable precision is achievable, using observation strategies similar to those pursued by the InPTA, for a larger sample of 62 pulsars requiring about 26 and 7 hours per epoch for the SKA-mid and the SKA-low telescopes respectively. We also review the ongoing efforts to develop PTA-relevant general relativistic constructs that will be required to search for nanohertz gravitational waves from isolated super-massive black hole binary systems like blazar OJ 287. These efforts should be relevant to pursue persistent multi-messenger gravitational wave astronomy during the forthcoming era of the SKA telescope, the Thirty Meter Telescope, and the next-generation Event Horizon Telescope. △ Less

Submitted 13 July, 2022; originally announced July 2022.

Comments: Accepted for publication in Journal of Astronomy and Astrophysics for Special Issue on Indian Participation in the SKA (Editors : Abhirup Datta, Nirupam Roy, Preeti Kharb and Tirthankar Roy Choudhury)

arXiv:2206.15010 [pdf, other]

"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer

Authors: Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

Abstract: Few-shot transfer often shows substantial gain over zero-shot transfer~\cite{lauscher2020zero}, which is a practically useful trade-off between fully supervised and unsupervised learning approaches for multilingual pretrained model-based systems. This paper explores various strategies for selecting data for annotation that can result in a better few-shot transfer. The proposed approaches rely on m… ▽ More Few-shot transfer often shows substantial gain over zero-shot transfer~\cite{lauscher2020zero}, which is a practically useful trade-off between fully supervised and unsupervised learning approaches for multilingual pretrained model-based systems. This paper explores various strategies for selecting data for annotation that can result in a better few-shot transfer. The proposed approaches rely on multiple measures such as data entropy using $n$-gram language model, predictive entropy, and gradient embedding. We propose a loss embedding method for sequence labeling tasks, which induces diversity and uncertainty sampling similar to gradient embedding. The proposed data selection strategies are evaluated and compared for POS tagging, NER, and NLI tasks for up to 20 languages. Our experiments show that the gradient and loss embedding-based strategies consistently outperform random data selection baselines, with gains varying with the initial performance of the zero-shot transfer. Furthermore, the proposed method shows similar trends in improvement even when the model is fine-tuned using a lower proportion of the original task-specific labeled training data for zero-shot transfer. △ Less

Submitted 30 June, 2022; originally announced June 2022.

Comments: NAACL 2022

arXiv:2206.09289 [pdf, other]

doi 10.1017/pasa.2022.46

The Indian Pulsar Timing Array: First data release

Authors: Pratik Tarafdar, Nobleson K., Prerna Rana, Jaikhomba Singha, M. A. Krishnakumar, Bhal Chandra Joshi, Avinash Kumar Paladi, Neel Kolhe, Neelam Dhanda Batra, Nikita Agarwal, Adarsh Bathula, Subhajit Dandapat, Shantanu Desai, Lankeswar Dey, Shinnosuke Hisano, Prathamesh Ingale, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Piyush Marmat, B. Arul Pandian, T. Prabu, Aman Srivastava, Mayuresh Surnis, Sai Chaitanya Susarla , et al. (13 additional authors not shown)

Abstract: We present the pulse arrival times and high-precision dispersion measure estimates for 14 millisecond pulsars observed simultaneously in the 300-500 MHz and 1260-1460 MHz frequency bands using the upgraded Giant Metrewave Radio Telescope (uGMRT). The data spans over a baseline of 3.5 years (2018-2021), and is the first official data release made available by the Indian Pulsar Timing Array collabor… ▽ More We present the pulse arrival times and high-precision dispersion measure estimates for 14 millisecond pulsars observed simultaneously in the 300-500 MHz and 1260-1460 MHz frequency bands using the upgraded Giant Metrewave Radio Telescope (uGMRT). The data spans over a baseline of 3.5 years (2018-2021), and is the first official data release made available by the Indian Pulsar Timing Array collaboration. This data release presents a unique opportunity for investigating the interstellar medium effects at low radio frequencies and their impact on the timing precision of pulsar timing array experiments. In addition to the dispersion measure time series and pulse arrival times obtained using both narrowband and wideband timing techniques, we also present the dispersion measure structure function analysis for selected pulsars. Our ongoing investigations regarding the frequency dependence of dispersion measures have been discussed. Based on the preliminary analysis for five millisecond pulsars, we do not find any conclusive evidence of chromaticity in dispersion measures. Data from regular simultaneous two-frequency observations are presented for the first time in this work. This distinctive feature leads us to the highest precision dispersion measure estimates obtained so far for a subset of our sample. Simultaneous multi-band uGMRT observations in Band 3 and Band 5 are crucial for high-precision dispersion measure estimation and for the prospect of expanding the overall frequency coverage upon the combination of data from the various Pulsar Timing Array consortia in the near future. Parts of the data presented in this work are expected to be incorporated into the upcoming third data release of the International Pulsar Timing Array. △ Less

Submitted 25 October, 2022; v1 submitted 18 June, 2022; originally announced June 2022.

Comments: 23 pages, 21 figures, 3 tables. Published in PASA

Journal ref: Publications of the Astronomical Society of Australia, Volume 39, 2022, e053

arXiv:2205.06356 [pdf, other]

Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages

Authors: Kabir Ahuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

Abstract: Although recent Massively Multilingual Language Models (MMLMs) like mBERT and XLMR support around 100 languages, most existing multilingual NLP benchmarks provide evaluation data in only a handful of these languages with little linguistic diversity. We argue that this makes the existing practices in multilingual evaluation unreliable and does not provide a full picture of the performance of MMLMs… ▽ More Although recent Massively Multilingual Language Models (MMLMs) like mBERT and XLMR support around 100 languages, most existing multilingual NLP benchmarks provide evaluation data in only a handful of these languages with little linguistic diversity. We argue that this makes the existing practices in multilingual evaluation unreliable and does not provide a full picture of the performance of MMLMs across the linguistic landscape. We propose that the recent work done in Performance Prediction for NLP tasks can serve as a potential solution in fixing benchmarking in Multilingual NLP by utilizing features related to data and language typology to estimate the performance of an MMLM on different languages. We compare performance prediction with translating test data with a case study on four different multilingual datasets, and observe that these methods can provide reliable estimates of the performance that are often on-par with the translation based approaches, without the need for any additional translation as well as evaluation costs. △ Less

Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

Comments: NLP Power! Workshop, ACL 2022

arXiv:2205.06350 [pdf, other]

On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

Authors: Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat

Abstract: Borrowing ideas from {\em Production functions} in micro-economics, in this paper we introduce a framework to systematically evaluate the performance and cost trade-offs between machine-translated and manually-created labelled data for task-specific fine-tuning of massively multilingual language models. We illustrate the effectiveness of our framework through a case-study on the TyDIQA-GoldP datas… ▽ More Borrowing ideas from {\em Production functions} in micro-economics, in this paper we introduce a framework to systematically evaluate the performance and cost trade-offs between machine-translated and manually-created labelled data for task-specific fine-tuning of massively multilingual language models. We illustrate the effectiveness of our framework through a case-study on the TyDIQA-GoldP dataset. One of the interesting conclusions of the study is that if the cost of machine translation is greater than zero, the optimal performance at least cost is always achieved with at least some or only manually-created data. To our knowledge, this is the first attempt towards extending the concept of production functions to study data collection strategies for training multilingual models, and can serve as a valuable tool for other similar cost vs data trade-offs in NLP. △ Less

Submitted 14 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

Comments: NAACL 2022

arXiv:2205.06130 [pdf, other]

Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models

Authors: Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury

Abstract: Massively Multilingual Transformer based Language Models have been observed to be surprisingly effective on zero-shot transfer across languages, though the performance varies from language to language depending on the pivot language(s) used for fine-tuning. In this work, we build upon some of the existing techniques for predicting the zero-shot performance on a task, by modeling it as a multi-task… ▽ More Massively Multilingual Transformer based Language Models have been observed to be surprisingly effective on zero-shot transfer across languages, though the performance varies from language to language depending on the pivot language(s) used for fine-tuning. In this work, we build upon some of the existing techniques for predicting the zero-shot performance on a task, by modeling it as a multi-task learning problem. We jointly train predictive models for different tasks which helps us build more accurate predictors for tasks where we have test data in very few languages to measure the actual performance of the model. Our approach also lends us the ability to perform a much more robust feature selection and identify a common set of features that influence zero-shot performance across a variety of tasks. △ Less

Submitted 12 May, 2022; originally announced May 2022.

Comments: ACL 2022

arXiv:2203.12865 [pdf, other]

Multilingual CheckList: Generation and Evaluation

Authors: Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

Abstract: Multilingual evaluation benchmarks usually contain limited high-resource languages and do not test models for specific linguistic capabilities. CheckList is a template-based evaluation approach that tests models for specific capabilities. The CheckList template creation process requires native speakers, posing a challenge in scaling to hundreds of languages. In this work, we explore multiple appro… ▽ More Multilingual evaluation benchmarks usually contain limited high-resource languages and do not test models for specific linguistic capabilities. CheckList is a template-based evaluation approach that tests models for specific capabilities. The CheckList template creation process requires native speakers, posing a challenge in scaling to hundreds of languages. In this work, we explore multiple approaches to generate Multilingual CheckLists. We device an algorithm - Template Extraction Algorithm (TEA) for automatically extracting target language CheckList templates from machine translated instances of a source language templates. We compare the TEA CheckLists with CheckLists created with different levels of human intervention. We further introduce metrics along the dimensions of cost, diversity, utility, and correctness to compare the CheckLists. We thoroughly analyze different approaches to creating CheckLists in Hindi. Furthermore, we experiment with 9 more different languages. We find that TEA followed by human verification is ideal for scaling Checklist-based evaluation to multiple languages while TEA gives a good estimates of model performance. △ Less

Submitted 11 October, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

Comments: Accepted to Findings of AACL-IJCNLP 2022

arXiv:2203.01188 [pdf, ps, other]

EnDSUM: Entropy and Diversity based Disaster Tweet Summarization

Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

Abstract: The huge amount of information shared in Twitter during disaster events are utilized by government agencies and humanitarian organizations to ensure quick crisis response and provide situational updates. However, the huge number of tweets posted makes manual identification of the relevant tweets impossible. To address the information overload, there is a need to automatically generate summary of a… ▽ More The huge amount of information shared in Twitter during disaster events are utilized by government agencies and humanitarian organizations to ensure quick crisis response and provide situational updates. However, the huge number of tweets posted makes manual identification of the relevant tweets impossible. To address the information overload, there is a need to automatically generate summary of all the tweets which can highlight the important aspects of the disaster. In this paper, we propose an entropy and diversity based summarizer, termed as EnDSUM, specifically for disaster tweet summarization. Our comprehensive analysis on 6 datasets indicates the effectiveness of EnDSUM and additionally, highlights the scope of improvement of EnDSUM. △ Less

Submitted 2 March, 2022; originally announced March 2022.

arXiv:2201.07472 [pdf, other]

Detecting Stance in Tweets : A Signed Network based Approach

Authors: Roshni Chakraborty, Maitry Bhavsar, Sourav Kumar Dandapat, Joydeep Chandra

Abstract: Identifying user stance related to a political event has several applications, like determination of individual stance, sha** of public opinion, identifying popularity of government measures and many others. The huge volume of political discussions on social media platforms, like, Twitter, provide opportunities in develo** automated mechanisms to identify individual stance and subsequently, sc… ▽ More Identifying user stance related to a political event has several applications, like determination of individual stance, sha** of public opinion, identifying popularity of government measures and many others. The huge volume of political discussions on social media platforms, like, Twitter, provide opportunities in develo** automated mechanisms to identify individual stance and subsequently, scale to a large volume of users. However, issues like short text and huge variance in the vocabulary of the tweets make such exercise enormously difficult. Existing stance detection algorithms require either event specific training data or annotated twitter handles and therefore, are difficult to adapt to new events. In this paper, we propose a sign network based framework that use external information sources, like news articles to create a signed network of relevant entities with respect to a news event and subsequently use the same to detect stance of any tweet towards the event. Validation on 5,000 tweets related to 10 events indicates that the proposed approach can ensure over 6.5% increase in average F1 score compared to the existing stance detection approaches. △ Less

Submitted 19 January, 2022; originally announced January 2022.

arXiv:2201.06545 [pdf, ps, other]

OntoDSumm : Ontology based Tweet Summarization for Disaster Events

Authors: Piyush Kumar Garg, Roshni Chakraborty, Sourav Kumar Dandapat

Abstract: The huge popularity of social media platforms like Twitter attracts a large fraction of users to share real-time information and short situational messages during disasters. A summary of these tweets is required by the government organizations, agencies, and volunteers for efficient and quick disaster response. However, the huge influx of tweets makes it difficult to manually get a precise overvie… ▽ More The huge popularity of social media platforms like Twitter attracts a large fraction of users to share real-time information and short situational messages during disasters. A summary of these tweets is required by the government organizations, agencies, and volunteers for efficient and quick disaster response. However, the huge influx of tweets makes it difficult to manually get a precise overview of ongoing events. To handle this challenge, several tweet summarization approaches have been proposed. In most of the existing literature, tweet summarization is broken into a two-step process where in the first step, it categorizes tweets, and in the second step, it chooses representative tweets from each category. There are both supervised as well as unsupervised approaches found in literature to solve the problem of first step. Supervised approaches requires huge amount of labelled data which incurs cost as well as time. On the other hand, unsupervised approaches could not clusters tweet properly due to the overlap** keywords, vocabulary size, lack of understanding of semantic meaning etc. While, for the second step of summarization, existing approaches applied different ranking methods where those ranking methods are very generic which fail to compute proper importance of a tweet respect to a disaster. Both the problems can be handled far better with proper domain knowledge. In this paper, we exploited already existing domain knowledge by the means of ontology in both the steps and proposed a novel disaster summarization method OntoDSumm. We evaluate this proposed method with 4 state-of-the-art methods using 10 disaster datasets. Evaluation results reveal that OntoDSumm outperforms existing methods by approximately 2-66% in terms of ROUGE-1 F1 score. △ Less

Submitted 19 November, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

ACM Class: H.0

arXiv:2112.06908 [pdf, other]

doi 10.1093/mnras/stac532

Low-frequency wideband timing of InPTA pulsars observed with the uGMRT

Authors: K Nobleson, Nikita Agarwal, Raghav Girgaonkar, Arul Pandian, Bhal Chandra Joshi, M A Krishnakumar, Abhimanyu Susobhanan, Shantanu Desai, T Prabu, Adarsh Bathula, Timothy T Pennucci, Sarmistha Banik, Manjari Bagchi, Neelam Dhanda Batra, Arpita Choudhary, Subhajit Dandapat, Lankeswar Dey, Yashwant Gupta, Shinnosuke Hisano, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Neel Kolhe, Yogesh Maan, Piyush Marmat , et al. (7 additional authors not shown)

Abstract: High-precision measurements of the pulsar dispersion measure (DM) are possible using telescopes with low-frequency wideband receivers. We present an initial study of the application of the wideband timing technique, which can simultaneously measure the pulsar times of arrival (ToAs) and DMs, for a set of five pulsars observed with the upgraded Giant Metrewave Radio Telescope (uGMRT) as part of the… ▽ More High-precision measurements of the pulsar dispersion measure (DM) are possible using telescopes with low-frequency wideband receivers. We present an initial study of the application of the wideband timing technique, which can simultaneously measure the pulsar times of arrival (ToAs) and DMs, for a set of five pulsars observed with the upgraded Giant Metrewave Radio Telescope (uGMRT) as part of the Indian Pulsar Timing Array (InPTA) campaign. We have used the observations with the 300-500 MHz band of the uGMRT for this purpose. We obtain high precision in DM measurements with precisions of the order 10^{-6}cm^{-3}pc. The ToAs obtained have sub-μs precision and the root-mean-square of the post-fit ToA residuals are in the sub-μs range. We find that the uncertainties in the DMs and ToAs obtained with this wideband technique, applied to low-frequency data, are consistent with the results obtained with traditional pulsar timing techniques and comparable to high-frequency results from other PTAs. This work opens up an interesting possibility of using low-frequency wideband observations for precision pulsar timing and gravitational wave detection with similar precision as high-frequency observations used conventionally. △ Less

Submitted 23 February, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

Comments: Accepted for publication in MNRAS

arXiv:2111.00818 [pdf, other]

doi 10.1103/PhysRevD.105.084018

Third order post-Newtonian gravitational radiation from two-body scattering: Instantaneous energy and angular momentum radiation

Authors: Gihyuk Cho, Subhajit Dandapat, Achamveedu Gopakumar

Abstract: We compute the third post-Newtonian (3PN) accurate instantaneous contributions to the radiated gravitational wave (GW) energy and angular momentum arising from the hyperbolic passages of non-spinning compact objects. The present computations employ 3PN-accurate instantaneous contributions to the far-zone energy and angular momentum fluxes and the 3PN-accurate Keplerian type parametric solution for… ▽ More We compute the third post-Newtonian (3PN) accurate instantaneous contributions to the radiated gravitational wave (GW) energy and angular momentum arising from the hyperbolic passages of non-spinning compact objects. The present computations employ 3PN-accurate instantaneous contributions to the far-zone energy and angular momentum fluxes and the 3PN-accurate Keplerian type parametric solution for compact binaries in hyperbolic orbits. △ Less

Submitted 20 April, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

Journal ref: Phys. Rev. D 105, 084018 (2022)

arXiv:2110.08875 [pdf, other]

Predicting the Performance of Multilingual NLP Models

Authors: Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury

Abstract: Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages. The languages that these models are evaluated on, however, are very few in number, and it is unlikely that evaluation datasets will cover all the languages that these models support. Potential solutions to the costly problem of dataset creation are to translate datasets to new languages or use te… ▽ More Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages. The languages that these models are evaluated on, however, are very few in number, and it is unlikely that evaluation datasets will cover all the languages that these models support. Potential solutions to the costly problem of dataset creation are to translate datasets to new languages or use template-filling based techniques for creation. This paper proposes an alternate solution for evaluating a model across languages which make use of the existing performance scores of the model on languages that a particular task has test sets for. We train a predictor on these performance scores and use this predictor to predict the model's performance in different evaluation settings. Our results show that our method is effective in filling the gaps in the evaluation for an existing set of languages, but might require additional improvements if we want it to generalize to unseen languages. △ Less

Submitted 17 October, 2021; originally announced October 2021.

arXiv:2109.07140 [pdf, ps, other]

On the Universality of Deep Contextual Language Models

Authors: Shaily Bhatt, Poonam Goyal, Sandipan Dandapat, Monojit Choudhury, Sunayana Sitaram

Abstract: Deep Contextual Language Models (LMs) like ELMO, BERT, and their successors dominate the landscape of Natural Language Processing due to their ability to scale across multiple tasks rapidly by pre-training a single model, followed by task-specific fine-tuning. Furthermore, multilingual versions of such models like XLM-R and mBERT have given promising results in zero-shot cross-lingual transfer, po… ▽ More Deep Contextual Language Models (LMs) like ELMO, BERT, and their successors dominate the landscape of Natural Language Processing due to their ability to scale across multiple tasks rapidly by pre-training a single model, followed by task-specific fine-tuning. Furthermore, multilingual versions of such models like XLM-R and mBERT have given promising results in zero-shot cross-lingual transfer, potentially enabling NLP applications in many under-served and under-resourced languages. Due to this initial success, pre-trained models are being used as `Universal Language Models' as the starting point across diverse tasks, domains, and languages. This work explores the notion of `Universality' by identifying seven dimensions across which a universal model should be able to scale, that is, perform equally well or reasonably well, to be useful across diverse settings. We outline the current theoretical and empirical results that support model performance across these dimensions, along with extensions that may help address some of their current limitations. Through this survey, we lay the foundation for understanding the capabilities and limitations of massive contextual language models and help discern research gaps and directions for future work to make these LMs inclusive and fair to diverse applications, users, and linguistic phenomena. △ Less

Submitted 18 December, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

Comments: 9 pages

arXiv:2107.04607 [pdf, other]

doi 10.1093/mnrasl/slab098

Evidence for profile changes in PSR J1713+0747 using the uGMRT

Authors: Jaikhomba Singha, Mayuresh P Surnis, Bhal Chandra Joshi, Pratik Tarafdar, Prerna Rana, Abhimanyu Susobhanan, Raghav Girgaonkar, Neel Kolhe, Nikita Agarwal, Shantanu Desai, T Prabu, Adarsh Bathula, Subhajit Dandapat, Lankeswar Dey, Shinnosuke Hisano, Ryo Kato, Divyansh Kharbanda, Tomonosuke Kikunaga, Piyush Marmat, Sai Chaitanya Susarla, Manjari Bagchi, Neelam Dhanda Batra, Arpita Choudhury, A Gopakumar, Yashwant Gupta , et al. (7 additional authors not shown)

Abstract: PSR J1713+0747 is one of the most precisely timed pulsars in the international pulsar timing array experiment. This pulsar showed an abrupt profile shape change between April 16, 2021 (MJD 59320) and April 17, 2021 (MJD 59321). In this paper, we report the results from multi-frequency observations of this pulsar carried out with the upgraded Giant Metrewave Radio Telescope (uGMRT) before and after… ▽ More PSR J1713+0747 is one of the most precisely timed pulsars in the international pulsar timing array experiment. This pulsar showed an abrupt profile shape change between April 16, 2021 (MJD 59320) and April 17, 2021 (MJD 59321). In this paper, we report the results from multi-frequency observations of this pulsar carried out with the upgraded Giant Metrewave Radio Telescope (uGMRT) before and after the event. We demonstrate the profile change seen in Band 5 (1260 MHz - 1460 MHz) and Band 3 (300 MHz - 500 MHz). The timing analysis of this pulsar shows a disturbance accompanying this profile change followed by a recovery with a timescale of $\sim 159$ days. Our data suggest that a model with chromatic index as a free parameter is preferred over models with combinations of achromaticity with DM bump or scattering bump. We determine the frequency dependence to be $\simν^{+1.34}$. △ Less

Submitted 16 August, 2021; v1 submitted 9 July, 2021; originally announced July 2021.

Comments: Accepted for publication in MNRAS-Letters

arXiv:2105.10427 [pdf, other]

Prefetcher-based DRAM Architecture

Authors: Saurabh Jaiswal, Shailendra Kumar Gupta, Soumya Soubhagya Dandapat

Abstract: Advancement in Processor technology has made it easy to handle data-intensive workloads, but limiting main memory advances has created performance bottlenecks. In DRAM, there have been improvements in DRAM access latency as well as reduction in cost-per-bit with the increase in cell density. But still DRAM data transfer rate lags behind the processing speed of the current generation processors. As… ▽ More Advancement in Processor technology has made it easy to handle data-intensive workloads, but limiting main memory advances has created performance bottlenecks. In DRAM, there have been improvements in DRAM access latency as well as reduction in cost-per-bit with the increase in cell density. But still DRAM data transfer rate lags behind the processing speed of the current generation processors. As Memory advancements based on hardware have been progressing at a slower pace, to cope up with High-end Processors, Architectural level advancements such as Prediction techniques, Replacement policies, etc are the major subject. In the recent field of research, Data prediction is a sought out topic as correct prediction can boost performance by decreasing the amount of excess memory access by predicting data beforehand using data access trends and behaviors. Though prediction techniques have been implemented at most of the Computer Architecture, We propose implementing data prediction in DRAM level architectures like TL-DRAM and CROW. Both of these method distributes the DRAM into different parts which contain a smaller section which is faster and larger section which contains the bulk of data but is comparatively slower. We wish to use data prediction in between these sections of memory to have predicted data transferred to the faster sections to improve the overall performance by reducing the memory access time. △ Less

Submitted 21 May, 2021; originally announced May 2021.

arXiv:2102.08661 [pdf, ps, other]

doi 10.1109/TCSS.2019.2943238

A Large-Scale Study of the Twitter Follower Network to Characterize the Spread of Prescription Drug Abuse Tweets

Authors: Ryan Sequeira, Avijit Gayen, Niloy Ganguly, Sourav Kumar Dandapat, Joydeep Chandra

Abstract: In this article, we perform a large-scale study of the Twitter follower network, involving around 0.42 million users who justify DA, to characterize the spreading of DA tweets across the network. Our observations reveal the existence of a very large giant component involving 99% of these users with dense local connectivity that facilitates the spreading of such messages. We further identify active… ▽ More In this article, we perform a large-scale study of the Twitter follower network, involving around 0.42 million users who justify DA, to characterize the spreading of DA tweets across the network. Our observations reveal the existence of a very large giant component involving 99% of these users with dense local connectivity that facilitates the spreading of such messages. We further identify active cascades over the network and observe that the cascades of DA tweets get spread over a long distance through the engagement of several closely connected groups of users. Moreover, our observations also reveal a collective phenomenon, involving a large set of active fringe nodes (with a small number of follower and following) along with a small set of well-connected nonfringe nodes that work together toward such spread, thus potentially complicating the process of arresting such cascades. Furthermore, we discovered that the engagement of the users with respect to certain drugs, such as Vicodin, Percocet, and OxyContin, that were observed to be most mentioned in Twitter is instantaneous. On the other hand, for drugs, such as Lortab, that found lesser mentions, the engagement probability becomes high with increasing exposure to such tweets, thereby indicating that drug abusers engaged on Twitter remain vulnerable to adopting newer drugs, aggravating the problem further. △ Less

Submitted 17 February, 2021; originally announced February 2021.

Comments: 13 pages, 9 figures, and accepted by IEEE Transactions on Computational Social Systems

Journal ref: IEEE Transactions on Computational Social Systems, vol. 6, no. 6, pp. 1232-1244, Dec. 2019

arXiv:2004.12376 [pdf, other]

GLUECoS : An Evaluation Benchmark for Code-Switched NLP

Authors: Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury

Abstract: Code-switching is the use of more than one language in the same conversation or utterance. Recently, multilingual contextual embedding models, trained on multiple monolingual corpora, have shown promising results on cross-lingual and multilingual tasks. We present an evaluation benchmark, GLUECoS, for code-switched languages, that spans several NLP tasks in English-Hindi and English-Spanish. Speci… ▽ More Code-switching is the use of more than one language in the same conversation or utterance. Recently, multilingual contextual embedding models, trained on multiple monolingual corpora, have shown promising results on cross-lingual and multilingual tasks. We present an evaluation benchmark, GLUECoS, for code-switched languages, that spans several NLP tasks in English-Hindi and English-Spanish. Specifically, our evaluation benchmark includes Language Identification from text, POS tagging, Named Entity Recognition, Sentiment Analysis, Question Answering and a new task for code-switching, Natural Language Inference. We present results on all these tasks using cross-lingual word embedding models and multilingual models. In addition, we fine-tune multilingual models on artificially generated code-switched data. Although multilingual models perform significantly better than cross-lingual models, our results show that in most tasks, across both language pairs, multilingual models fine-tuned on code-switched data perform best, showing that multilingual models can be further optimized for code-switching tasks. △ Less

Submitted 14 May, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

Comments: To appear at ACL 2020

arXiv:2004.05051 [pdf, other]

A New Dataset for Natural Language Inference from Code-mixed Conversations

Authors: Simran Khanuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

Abstract: Natural Language Inference (NLI) is the task of inferring the logical relationship, typically entailment or contradiction, between a premise and hypothesis. Code-mixing is the use of more than one language in the same conversation or utterance, and is prevalent in multilingual communities all over the world. In this paper, we present the first dataset for code-mixed NLI, in which both the premises… ▽ More Natural Language Inference (NLI) is the task of inferring the logical relationship, typically entailment or contradiction, between a premise and hypothesis. Code-mixing is the use of more than one language in the same conversation or utterance, and is prevalent in multilingual communities all over the world. In this paper, we present the first dataset for code-mixed NLI, in which both the premises and hypotheses are in code-mixed Hindi-English. We use data from Hindi movies (Bollywood) as premises, and crowd-source hypotheses from Hindi-English bilinguals. We conduct a pilot annotation study and describe the final annotation protocol based on observations from the pilot. Currently, the data collected consists of 400 premises in the form of code-mixed conversation snippets and 2240 code-mixed hypotheses. We conduct an extensive analysis to infer the linguistic phenomena commonly observed in the dataset obtained. We evaluate the dataset using a standard mBERT-based pipeline for NLI and report results. △ Less

Submitted 13 April, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

Comments: To appear in CALCS, LREC 2020

arXiv:2002.02631 [pdf, other]

Translating Web Search Queries into Natural Language Questions

Authors: Adarsh Kumar, Sandipan Dandapat, Sushil Chordia

Abstract: Users often query a search engine with a specific question in mind and often these queries are keywords or sub-sentential fragments. For example, if the users want to know the answer for "What's the capital of USA", they will most probably query "capital of USA" or "USA capital" or some keyword-based variation of this. For example, for the user entered query "capital of USA", the most probable que… ▽ More Users often query a search engine with a specific question in mind and often these queries are keywords or sub-sentential fragments. For example, if the users want to know the answer for "What's the capital of USA", they will most probably query "capital of USA" or "USA capital" or some keyword-based variation of this. For example, for the user entered query "capital of USA", the most probable question intent is "What's the capital of USA?". In this paper, we are proposing a method to generate well-formed natural language question from a given keyword-based query, which has the same question intent as the query. Conversion of keyword-based web query into a well-formed question has lots of applications, with some of them being in search engines, Community Question Answering (CQA) website and bots communication. We found a synergy between query-to-question problem with standard machine translation(MT) task. We have used both Statistical MT (SMT) and Neural MT (NMT) models to generate the questions from the query. We have observed that MT models perform well in terms of both automatic and human evaluation. △ Less

Submitted 7 February, 2020; originally announced February 2020.

Comments: Eleventh International Conference on Language Resources and Evaluation, LREC 2018

arXiv:1901.09334 [pdf]

Predicting Tomorrow's Headline using Today's Twitter Deliberations

Authors: Roshni Chakraborty, Abhijeet Kharat, Apalak Khatua, Sourav Kumar Dandapat, Joydeep Chandra

Abstract: Predicting the popularity of news article is a challenging task. Existing literature mostly focused on article contents and polarity to predict popularity. However, existing research has not considered the users' preference towards a particular article. Understanding users' preference is an important aspect for predicting the popularity of news articles. Hence, we consider the social media data, f… ▽ More Predicting the popularity of news article is a challenging task. Existing literature mostly focused on article contents and polarity to predict popularity. However, existing research has not considered the users' preference towards a particular article. Understanding users' preference is an important aspect for predicting the popularity of news articles. Hence, we consider the social media data, from the Twitter platform, to address this research gap. In our proposed model, we have considered the users' involvement as well as the users' reaction towards an article to predict the popularity of the article. In short, we are predicting tomorrow's headline by probing today's Twitter discussion. We have considered 300 political news article from the New York Post, and our proposed approach has outperformed other baseline models. △ Less

Submitted 27 January, 2019; originally announced January 2019.

Comments: This paper was accepted in CIKM Workshop on News Recommendation and Analytics (INRA), 2018, Turin, Italy

arXiv:1601.06608 [pdf, other]

An Unsupervised Method for Detection and Validation of The Optic Disc and The Fovea

Authors: Mrinal Haloi, Samarendra Dandapat, Rohit Sinha

Abstract: In this work, we have presented a novel method for detection of retinal image features, the optic disc and the fovea, from colour fundus photographs of dilated eyes for Computer-aided Diagnosis(CAD) system. A saliency map based method was used to detect the optic disc followed by an unsupervised probabilistic Latent Semantic Analysis for detection validation. The validation concept is based on dis… ▽ More In this work, we have presented a novel method for detection of retinal image features, the optic disc and the fovea, from colour fundus photographs of dilated eyes for Computer-aided Diagnosis(CAD) system. A saliency map based method was used to detect the optic disc followed by an unsupervised probabilistic Latent Semantic Analysis for detection validation. The validation concept is based on distinct vessels structures in the optic disc. By using the clinical information of standard location of the fovea with respect to the optic disc, the macula region is estimated. Accuracy of 100\% detection is achieved for the optic disc and the macula on MESSIDOR and DIARETDB1 and 98.8\% detection accuracy on STARE dataset. △ Less

Submitted 25 January, 2016; originally announced January 2016.

MSC Class: 68T45

arXiv:1505.00737 [pdf, other]

A Gaussian Scale Space Approach For Exudates Detection, Classification And Severity Prediction

Authors: Mrinal Haloi, Samarendra Dandapat, Rohit Sinha

Abstract: In the context of Computer Aided Diagnosis system for diabetic retinopathy, we present a novel method for detection of exudates and their classification for disease severity prediction. The method is based on Gaussian scale space based interest map and mathematical morphology. It makes use of support vector machine for classification and location information of the optic disc and the macula region… ▽ More In the context of Computer Aided Diagnosis system for diabetic retinopathy, we present a novel method for detection of exudates and their classification for disease severity prediction. The method is based on Gaussian scale space based interest map and mathematical morphology. It makes use of support vector machine for classification and location information of the optic disc and the macula region for severity prediction. It can efficiently handle luminance variation and it is suitable for varied sized exudates. The method has been probed in publicly available DIARETDB1V2 and e-ophthaEX databases. For exudate detection the proposed method achieved a sensitivity of 96.54% and prediction of 98.35% in DIARETDB1V2 database. △ Less

Submitted 4 May, 2015; originally announced May 2015.

Comments: Accepted in ICIP 2015, Quebec city, Canada

MSC Class: 68T45

arXiv:0809.2231 [pdf, ps, other]

Circular hydraulic jump in generalized-Newtonian fluids

Authors: Ashutosh Rai, B. S. Dandapat, Swarup Poria

Abstract: We carry out an analytical study of laminar circular hydraulic jumps, in generalized-Newtonian fluids obeying the two-parametric power-law model of Ostwald-de Waele. Under the boundary-layer approximation we obtained exact expressions determining the flow, an implicit relation for the jump radius is derived. Corresponding results for Newtonian fluids can be retrieved as a limiting case for the f… ▽ More We carry out an analytical study of laminar circular hydraulic jumps, in generalized-Newtonian fluids obeying the two-parametric power-law model of Ostwald-de Waele. Under the boundary-layer approximation we obtained exact expressions determining the flow, an implicit relation for the jump radius is derived. Corresponding results for Newtonian fluids can be retrieved as a limiting case for the flow behavior index n=1, predictions are made for fluids deviating from Newtonian behavior. △ Less

Submitted 22 September, 2008; v1 submitted 12 September, 2008; originally announced September 2008.

Comments: 4 pages, 3 figures, added references, corrected typos

Showing 1–43 of 43 results for author: Dandapat, S