Showing 1–18 of 18 results for author: Goswami, N

  1. arXiv:2403.13015  [pdf, other]

    eess.IV cs.LG

    HyperVQ: MLR-based Vector Quantization in Hyperbolic Space

    Authors: Nabarun Goswami, Yusuke Mukuta, Tatsuya Harada

    Abstract: The success of models operating on tokenized data has led to an increased demand for effective tokenization methods, particularly when applied to vision or auditory tasks, which inherently involve non-discrete data. One of the most popular tokenization methods is Vector Quantization (VQ), a key component of several recent state-of-the-art methods across various domains. Typically, a VQ Variational…

    Submitted 17 March, 2024; originally announced March 2024.

  2. arXiv:2401.10005  [pdf, other]

    cs.CV cs.CL

    Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

    Authors: Kohei Uehara, Nabarun Goswami, Hanqin Wang, Toshiaki Baba, Kohtaro Tanaka, Tomohiro Hashimoto, Kai Wang, Rei Ito, Takagi Naoya, Ryo Umagami, Yingyi Wen, Tanachai Anakewat, Tatsuya Harada

    Abstract: The increasing demand for intelligent systems capable of interpreting and reasoning about visual content requires the development of Large Multi-Modal Models (LMMs) that are not only accurate but also have explicit reasoning capabilities. This paper presents a novel approach to imbue an LMM with the ability to conduct explicit reasoning based on visual content and textual instructions. We introduc…

    Submitted 18 January, 2024; originally announced January 2024.

  3. arXiv:2308.06979  [pdf, other]

    eess.AS cs.SD

    The Sound Demixing Challenge 2023 – Music Demixing Track

    Authors: Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco Martínez-Ramírez, Weihsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada Mohanty, Roman Solovyev, Alexander Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, et al. (2 additional authors not shown)

    Abstract: This paper summarizes the music demixing (MDX) track of the Sound Demixing Challenge (SDX'23). We provide a summary of the challenge setup and introduce the task of robust music source separation (MSS), i.e., training MSS models in the presence of errors in the training data. We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce t…

    Submitted 19 April, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: Published in Transactions of the International Society for Music Information Retrieval (https://transactions.ismir.net/articles/10.5334/tismir.171)

    Journal ref: Transactions of the International Society for Music Information Retrieval, 7(1), pp.63-84, 2024

  4. arXiv:2212.13711  [pdf]

    physics.ao-ph physics.geo-ph

    Climate Change and Potential Demise of the Indian Deserts

    Authors: P. V. Rajesh, B. N. Goswami

    Abstract: In contrast to the wet gets wetter and dry gets drier paradigm, here, using observations and climate model simulations, we show that the mean rainfall over the semi-arid northwest parts of India and Pakistan has increased by 10 to 50 percent during 1901 to 2015 and is expected to increase by 50 to 200 percent under moderate greenhouse gas (GHG) scenarios, e.g., SSP2 4.5. The GHG forcing primarily d…

    Submitted 28 December, 2022; originally announced December 2022.

  5. arXiv:2207.06011  [pdf, other]

    eess.AS cs.SD

    SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate

    Authors: Nabarun Goswami, Tatsuya Harada

    Abstract: The mapping of text to speech (TTS) is non-deterministic, letters may be pronounced differently based on context, or phonemes can vary depending on various physiological and stylistic factors like gender, age, accent, emotions, etc. Neural speaker embeddings, trained to identify or verify speakers are typically used to represent and transfer such characteristics from reference speech to synthesize…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted to Interspeech 2022. Visit https://naba89.github.io/SATTS-demo/ for a demo

  6. arXiv:2201.09010  [pdf]

    physics.optics cond-mat.mtrl-sci

    Enhancement of optical properties and dielectric nature of Sm$^{3+}$-doped Na$_2$O-ZnO-TeO$_2$ Glass materials

    Authors: Jyotindra Nath Mirdda, Subhadipta Mukhopadhyay, Kriti Ranjan Sahu, Makhanlal Nanda Goswami

    Abstract: Samarium doped Na$_2$O-ZnO-TeO$_2$ (NZT) glasses were prepared by the melt quenching method. The glass-forming ability and glass stability of prepared glass was estimated by Hruby parameter using Differential Thermal Analysis (DTA) and Thermo-gravimetric Analysis (TGA). The study of FTIR spectra and X-ray diffraction described the ionic nature and the amorphous pattern of glass respectively. The a…

    Submitted 22 January, 2022; originally announced January 2022.

  7. arXiv:2106.05543  [pdf]

    physics.ao-ph

    High ENSO-based 18-month lead Potential Predictability of Indian Summer Monsoon Rainfall

    Authors: Devabrat Sharma, Santu Das, B. N. Goswami

    Abstract: Scientific basis for long-lead seasonal prediction of Indian summer monsoon rainfall (ISMR) critical for water resource and crop strategy planning is lacking. Using a new predictor discovery method, here we show that the depth of 20 degree isotherm (D20) is least influenced by atmospheric noise and that the 18-month lead forecasts of ISMR have high potential skill (r = 0.86). The high potential pr…

    Submitted 11 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 44 pages, 6 figures, 11 extended figures, 1 table

  8. arXiv:2106.04545  [pdf]

    cond-mat.mtrl-sci

    Optical and electrical properties of Nd3+doped Na2O-ZnO-TeO2 Material

    Authors: J. N. Mirdda, S. Mukhopadhyay, K. R. Sahu, M. N. Goswami

    Abstract: Neodymium doped Na2O-ZnO-TeO2 (NZT) glasses were prepared by the conventional melt quenching technique. DTA and TG were used to confirmation of glass preparation through the glass transition temperature at 447°C for the glass system. The analysis of FTIR spectra and X-ray diffraction described the nature of the samples were ionic and amorphous respectively. The optical bandgap energy was estimated…

    Submitted 8 June, 2021; originally announced June 2021.

  9. arXiv:2104.04772  [pdf]

    physics.optics

    Diffraction as scattering under the Born approximation

    Authors: Neha Goswami, Gabriel Popescu

    Abstract: Light diffraction at an aperture is a basic problem that has generated a tremendous amount of interest in optics. Some of the most significant diffraction results are the Fresnel-Kirchhoff and Rayleigh-Sommerfeld formulas. These theories are based on solving the wave equation using Green's theorem and result in slightly different expressions depending on the particular boundary conditions employed…

    Submitted 10 April, 2021; originally announced April 2021.

  10. arXiv:2011.02368  [pdf]

    cs.DC cs.AR cs.GR

    An Empirical-cum-Statistical Approach to Power-Performance Characterization of Concurrent GPU Kernels

    Authors: Nilanjan Goswami, Amer Qouneh, Chao Li, Tao Li

    Abstract: Growing deployment of power and energy efficient throughput accelerators (GPU) in data centers demands enhancement of power-performance co-optimization capabilities of GPUs. Realization of exascale computing using accelerators requires further improvements in power efficiency. With hardwired kernel concurrency enablement in accelerators, inter- and intra-workload simultaneous kernels computation p…

    Submitted 4 November, 2020; v1 submitted 4 November, 2020; originally announced November 2020.

  11. arXiv:2004.08888  [pdf]

    physics.ao-ph

    Electrical Route to Realising Intensity Simulation of Heavy Rain Events in Tropics

    Authors: Dipjyoti Mudiar, Anupam Hazra, S. D. Pawar, Rama Krishna Karumuri, Mahen Konwar, Subrata Mukherjee, M. K. Srivastava, B. N. Goswami

    Abstract: In the backdrop of a revolution in weather prediction by Numerical Weather Prediction (NWP) models, quantitative prediction of intensity of heavy rainfall events and associated disasters has remained a challenge. Encouraged by compelling evidence of electrical influences on cloud/rain microphysical processes, here we propose a hypothesis that modification of raindrop size distribution (RDSD) towar…

    Submitted 19 April, 2020; originally announced April 2020.

  12. arXiv:1911.10013  [pdf, other]

    physics.ao-ph physics.flu-dyn

    Role of the North Atlantic in Indian Monsoon Droughts

    Authors: Pritam Borah, V. Venugopal, Jai Sukhatme, Pranesh Muddebihal, B. N. Goswami

    Abstract: The forecast of Indian monsoon droughts has been predicated on the notion of a season-long rainfall deficit linked to warm anomalies in the equatorial Pacific. Here, we show that in fact nearly half of all droughts over the past century were sub-seasonal, and characterized by an abrupt decline in late-season rainfall. Furthermore, the potential driver of this class of droughts is a coherent cold a…

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: 29 pages, 13 figures, 3 tables (Submitted July 2019)

  13. arXiv:1904.03065  [pdf, other]

    cs.SD eess.AS

    Recursive speech separation for unknown number of speakers

    Authors: Naoya Takahashi, Sudarsanam Parthasaarathy, Nabarun Goswami, Yuki Mitsufuji

    Abstract: In this paper we propose a method of single-channel speaker-independent multi-speaker speech separation for an unknown number of speakers. As opposed to previous works, in which the number of speakers is assumed to be known in advance and speech separation models are specific for the number of speakers, our proposed method can be applied to cases with different numbers of speakers using a single m…

    Submitted 1 September, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Interspeech 2019 (oral)

  14. Unraveling the Mystery of Indian Summer Monsoon Prediction: Improved Estimate of Predictability Limit

    Authors: Subodh Kumar Saha, Anupam Hazra, Samir Pokhrel, Hemantkumar S. Chaudhari, K. Sujith, Archana Rai, Hasibur Rahaman, B. N. Goswami

    Abstract: Large socio-economic impact of the Indian Summer Monsoon (ISM) extremes motivated numerous attempts at its long range prediction over the past century. However, a rather estimated low potential predictability limit (PPL) of seasonal prediction of the ISM, contributed significantly by 'internal' interannual variability was considered insurmountable. Here we show that the 'internal' variability cont…

    Submitted 4 September, 2018; originally announced September 2018.

  15. arXiv:1805.02410  [pdf, other]

    cs.SD eess.AS

    MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation

    Authors: Naoya Takahashi, Nabarun Goswami, Yuki Mitsufuji

    Abstract: Deep neural networks have become an indispensable technique for audio source separation (ASS). It was recently reported that a variant of CNN architecture called MMDenseNet was successfully employed to solve the ASS problem of estimating source amplitudes, and state-of-the-art results were obtained for DSD100 dataset. To further enhance MMDenseNet, here we propose a novel architecture that integra…

    Submitted 29 May, 2018; v1 submitted 7 May, 2018; originally announced May 2018.

  16. arXiv:1709.02606  [pdf]

    physics.ao-ph

    Discovery of a Phenomenological Dynamical Model for Predicting the El Niño-Southern Oscillation

    Authors: Shivsai Ajit Dixit, B N Goswami

    Abstract: The skill of the statistical as well as physics-based coupled climate models in predicting the El Niño-Southern Oscillation (ENSO) is limited by their inability to represent the observed ENSO nonlinearity. A promising alternative, namely a deterministic nonlinear dynamical model derived from an observed ENSO timeseries, however, has remained elusive. Here we discover such a phenomenological nonlin…

    Submitted 8 September, 2017; originally announced September 2017.

    Comments: Page 1-17 main text, Page 18 onwards Supplementary Information/Material

  17. arXiv:1705.04111  [pdf, other]

    cs.DM

    Critical Graphs for Minimum Vertex Cover

    Authors: Andreas Jakoby, Naveen Kumar Goswami, Eik List, Stefan Lucks

    Abstract: In the context of the chromatic-number problem, a critical graph is an instance where the deletion of any element would decrease the graph's chromatic number. Such instances have shown to be interesting objects of study for deepen the understanding of the optimization problem. This work introduces critical graphs in context of Minimum Vertex Cover. We demonstrate their potential for the generati…

    Submitted 12 July, 2017; v1 submitted 11 May, 2017; originally announced May 2017.

    ACM Class: F.2.2; G.2.2

  18. Radiation Environment In Earth-Moon Space: Results From RADOM Experiment Onboard Chandrayaan-1

    Authors: S. V. Vadawale, J. N. Goswami, T. P. Dachev, B. T. Tomov, V. Girish

    Abstract: The Radiation Monitor (RADOM) payload is a miniature dosimeter-spectrometer onboard Chandrayaan-1 mission for monitoring the local radiation environment in near-Earth space and in lunar space. RADOM measured the total absorbed dose and spectrum of the deposited energy from high energy particles in near-Earth space, en-route and in lunar orbit. RADOM was the first experiment to be switched on soon…

    Submitted 9 December, 2010; originally announced December 2010.

    Comments: Accepted for publication in Advances in Geosciences