-
The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments
Authors:
Shareef Babu Kalluri,
Prachi Singh,
Pratik Roy Chowdhuri,
Apoorva Kulkarni,
Shikha Baghel,
Pradyoth Hegde,
Swapnil Sontakke,
Deepak K T,
S. R. Mahadeva Prasanna,
Deepu Vijayasenan,
Sriram Ganapathy
Abstract:
The DIarization of SPeaker and LAnguage in Conversational Environments (DISPLACE) 2024 challenge is the second in the series of DISPLACE challenges, which involves tasks of speaker diarization (SD) and language diarization (LD) on a challenging multilingual conversational speech dataset. In the DISPLACE 2024 challenge, we also introduced the task of automatic speech recognition (ASR) on this datas…
▽ More
The DIarization of SPeaker and LAnguage in Conversational Environments (DISPLACE) 2024 challenge is the second in the series of DISPLACE challenges, which involves tasks of speaker diarization (SD) and language diarization (LD) on a challenging multilingual conversational speech dataset. In the DISPLACE 2024 challenge, we also introduced the task of automatic speech recognition (ASR) on this dataset. The dataset containing 158 hours of speech, consisting of both supervised and unsupervised mono-channel far-field recordings, was released for LD and SD tracks. Further, 12 hours of close-field mono-channel recordings were provided for the ASR track conducted on 5 Indian languages. The details of the dataset, baseline systems and the leader board results are highlighted in this paper. We have also compared our baseline models and the team's performances on evaluation data of DISPLACE-2023 to emphasize the advancements made in this second version of the challenge.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Thread Detection and Response Generation using Transformers with Prompt Optimisation
Authors:
Kevin Joshua T,
Arnav Agarwal,
Shriya Sanjay,
Yash Sarda,
John Sahaya Rani Alex,
Saurav Gupta,
Sushant Kumar,
Vishwanath Kamath
Abstract:
Conversational systems are crucial for human-computer interaction, managing complex dialogues by identifying threads and prioritising responses. This is especially vital in multi-party conversations, where precise identification of threads and strategic response prioritisation ensure efficient dialogue management. To address these challenges an end-to-end model that identifies threads and prioriti…
▽ More
Conversational systems are crucial for human-computer interaction, managing complex dialogues by identifying threads and prioritising responses. This is especially vital in multi-party conversations, where precise identification of threads and strategic response prioritisation ensure efficient dialogue management. To address these challenges an end-to-end model that identifies threads and prioritises their response generation based on the importance was developed, involving a systematic decomposition of the problem into discrete components - thread detection, prioritisation, and performance optimisation which was meticulously analysed and optimised. These refined components seamlessly integrate into a unified framework, in conversational systems. Llama2 7b is used due to its high level of generalisation but the system can be updated with any open source Large Language Model(LLM). The computational capabilities of the Llama2 model was augmented by using fine tuning methods and strategic prompting techniques to optimise the model's performance, reducing computational time and increasing the accuracy of the model. The model achieves up to 10x speed improvement, while generating more coherent results compared to existing models.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
Fe and Mg Isotope compositions Indicate a Hybrid Mantle Source for Young Chang'E 5 Mare Basalts
Authors:
Jiang Y.,
Kang J. T.,
Liao S. Y.,
Elardo S. M.,
Zong K. Q.,
Wang S. J.,
Nie C.,
Li P. Y.,
Yin Z. J.,
Huang F.,
Hsu W. B
Abstract:
The Chang'E 5 (CE-5) samples represent the youngest mare basalt ever known and provide an access into the late lunar evolution. Recent studies have revealed that CE-5 basalts are the most evolved lunar basalt, yet controversy remains over the nature of their mantle sources. Here we combine Fe and Mg isotope analyses with a comprehensive study of petrology and mineralogy on two CE-5 basalt clasts.…
▽ More
The Chang'E 5 (CE-5) samples represent the youngest mare basalt ever known and provide an access into the late lunar evolution. Recent studies have revealed that CE-5 basalts are the most evolved lunar basalt, yet controversy remains over the nature of their mantle sources. Here we combine Fe and Mg isotope analyses with a comprehensive study of petrology and mineralogy on two CE-5 basalt clasts. These two clasts have a very low Mg# (~29) and show similar Mg isotope compositions with Apollo low-Ti mare basalts as well as intermediate TiO2 and Fe isotope compositions between low-Ti and high-Ti mare basalts. Fractional crystallization or evaporation during impact cannot produce such geochemical signatures which otherwise indicate a hybrid mantle source that incorporates both early- and late-stage lunar magma ocean (LMO) cumulates. Such a hybrid mantle source would be also compatible with the KREEP-like REE pattern of CE-5 basalts. Overall, our new Fe-Mg isotope data highlight the role of late LMO cumulate for the generation of young lunar volcanism.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
mm-Wave Radar Hand Shape Classification Using Deformable Transformers
Authors:
Athmanarayanan Lakshmi Narayanan,
Asma Beevi K. T,
Haoyang Wu,
**gyi Ma,
W. Margaret Huang
Abstract:
A novel, real-time, mm-Wave radar-based static hand shape classification algorithm and implementation are proposed. The method finds several applications in low cost and privacy sensitive touchless control technology using 60 Ghz radar as the sensor input. As opposed to prior Range-Doppler image based 2D classification solutions, our method converts raw radar data to 3D sparse cartesian point clou…
▽ More
A novel, real-time, mm-Wave radar-based static hand shape classification algorithm and implementation are proposed. The method finds several applications in low cost and privacy sensitive touchless control technology using 60 Ghz radar as the sensor input. As opposed to prior Range-Doppler image based 2D classification solutions, our method converts raw radar data to 3D sparse cartesian point clouds.The demonstrated 3D radar neural network model using deformable transformers significantly surpasses the performance results set by prior methods which either utilize custom signal processing or apply generic convolutional techniques on Range-Doppler FFT images. Experiments are performed on an internally collected dataset using an off-the-shelf radar sensor.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Fast-Image2Point: Towards Real-Time Point Cloud Reconstruction of a Single Image using 3D Supervision
Authors:
AmirHossein Zamani,
Amir G. Aghdam,
Kamran Ghaffari T
Abstract:
A key question in the problem of 3D reconstruction is how to train a machine or a robot to model 3D objects. Many tasks like navigation in real-time systems such as autonomous vehicles directly depend on this problem. These systems usually have limited computational power. Despite considerable progress in 3D reconstruction systems in recent years, applying them to real-time systems such as navigat…
▽ More
A key question in the problem of 3D reconstruction is how to train a machine or a robot to model 3D objects. Many tasks like navigation in real-time systems such as autonomous vehicles directly depend on this problem. These systems usually have limited computational power. Despite considerable progress in 3D reconstruction systems in recent years, applying them to real-time systems such as navigation systems in autonomous vehicles is still challenging due to the high complexity and computational demand of the existing methods. This study addresses current problems in reconstructing objects displayed in a single-view image in a faster (real-time) fashion. To this end, a simple yet powerful deep neural framework is developed. The proposed framework consists of two components: the feature extractor module and the 3D generator module. We use point cloud representation for the output of our reconstruction module. The ShapeNet dataset is utilized to compare the method with the existing results in terms of computation time and accuracy. Simulations demonstrate the superior performance of the proposed method.
Index Terms-Real-time 3D reconstruction, single-view reconstruction, supervised learning, deep neural network
△ Less
Submitted 20 September, 2022;
originally announced September 2022.
-
A Bose Horn Antenna Radio Telescope (BHARAT) design for 21 cm hydrogen line experiments for radio astronomy teaching
Authors:
Ashish A. Mhaske,
Joydeep Bagchi,
Bhal Chandra Joshi,
Joe Jacob,
Paul K. T
Abstract:
We have designed a low-cost radio telescope system named the Bose Horn Antenna Radio Telescope (BHARAT) to detect the 21 cm hydrogen line emission from our Galaxy. The system is being used at the Radio Physics Laboratory (RPL), Inter-University Centre for Astronomy and Astrophysics (IUCAA), India, for laboratory sessions and training students and teachers. It is also a part of the laboratory curri…
▽ More
We have designed a low-cost radio telescope system named the Bose Horn Antenna Radio Telescope (BHARAT) to detect the 21 cm hydrogen line emission from our Galaxy. The system is being used at the Radio Physics Laboratory (RPL), Inter-University Centre for Astronomy and Astrophysics (IUCAA), India, for laboratory sessions and training students and teachers. It is also a part of the laboratory curriculum at several universities and colleges. Here, we present the design of a highly efficient, easy to build, and cost-effective dual-mode conical horn used as a radio telescope and describe the calibration procedure. We also present some model observation data acquired using the telescope for facilitating easy incorporation of this experiment in the laboratory curriculum of undergraduate or post-graduate programs. We have named the antenna after Acharya Jagadish Chandra Bose, honoring a pioneer in radio-wave science and an outstanding teacher, who inspired several world renowned scientists.
△ Less
Submitted 11 August, 2022;
originally announced August 2022.
-
Search for the decay $B_s^0 \to η^\prime K_S^0$
Authors:
Belle Collaboration,
T. Pang,
V. Savinov,
I. Adachi,
H. Aihara,
D. M. Asner,
H. Atmacan,
V. Aulchenko,
T. Aushev,
R. Ayad,
V. Babu,
P. Behera,
K. Belous,
M. Bessner,
V. Bhardwaj,
B. Bhuyan,
T. Bilka,
A. Bobrov,
D. Bodrov,
G. Bonvicini,
J. Borah,
A. Bozek,
M. Br ačko,
P. Branchini,
T. E. Browder
, et al. (184 additional authors not shown)
Abstract:
We report the results of the first search for the decay $B_s^0 \to η^\prime K_S^0$ using $121.4\,{\rm fb}^{-1}$ of data collected at the $Υ(5S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider. We observe no signal and set a 90\% confidence-level upper limit of $8.16 \times 10^{-6}$ on the $B_s^0 \to η^\prime K_S^0$ branching fraction.
We report the results of the first search for the decay $B_s^0 \to η^\prime K_S^0$ using $121.4\,{\rm fb}^{-1}$ of data collected at the $Υ(5S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+ e^-$ collider. We observe no signal and set a 90\% confidence-level upper limit of $8.16 \times 10^{-6}$ on the $B_s^0 \to η^\prime K_S^0$ branching fraction.
△ Less
Submitted 5 January, 2022;
originally announced January 2022.
-
Deep network for rolling shutter rectification
Authors:
Praveen K,
Lokesh Kumar T,
A. N. Rajagopalan
Abstract:
CMOS sensors employ row-wise acquisition mechanism while imaging a scene, which can result in undesired motion artifacts known as rolling shutter (RS) distortions in the captured image. Existing single image RS rectification methods attempt to account for these distortions by either using algorithms tailored for specific class of scenes which warrants information of intrinsic camera parameters or…
▽ More
CMOS sensors employ row-wise acquisition mechanism while imaging a scene, which can result in undesired motion artifacts known as rolling shutter (RS) distortions in the captured image. Existing single image RS rectification methods attempt to account for these distortions by either using algorithms tailored for specific class of scenes which warrants information of intrinsic camera parameters or a learning-based framework with known ground truth motion parameters. In this paper, we propose an end-to-end deep neural network for the challenging task of single image RS rectification. Our network consists of a motion block, a trajectory module, a row block, an RS rectification module and an RS regeneration module (which is used only during training). The motion block predicts camera pose for every row of the input RS distorted image while the trajectory module fits estimated motion parameters to a third-order polynomial. The row block predicts the camera motion that must be associated with every pixel in the target i.e, RS rectified image. Finally, the RS rectification module uses motion trajectory and the output of row block to warp the input RS image to arrive at a distortionfree image. For faster convergence during training, we additionally use an RS regeneration module which compares the input RS image with the ground truth image distorted by estimated motion parameters. The end-to-end formulation in our model does not constrain the estimated motion to ground-truth motion parameters, thereby successfully rectifying the RS images with complex real-life camera motion. Experiments on synthetic and real datasets reveal that our network outperforms prior art both qualitatively and quantitatively.
△ Less
Submitted 12 December, 2021;
originally announced December 2021.
-
End-to-End Optimized Arrhythmia Detection Pipeline using Machine Learning for Ultra-Edge Devices
Authors:
Sideshwar J B,
Sachin Krishan T,
Vishal Nagarajan,
Shanthakumar S,
Vineeth Vijayaraghavan
Abstract:
Atrial fibrillation (AF) is the most prevalent cardiac arrhythmia worldwide, with 2% of the population affected. It is associated with an increased risk of strokes, heart failure and other heart-related complications. Monitoring at-risk individuals and detecting asymptomatic AF could result in considerable public health benefits, as individuals with asymptomatic AF could take preventive measures w…
▽ More
Atrial fibrillation (AF) is the most prevalent cardiac arrhythmia worldwide, with 2% of the population affected. It is associated with an increased risk of strokes, heart failure and other heart-related complications. Monitoring at-risk individuals and detecting asymptomatic AF could result in considerable public health benefits, as individuals with asymptomatic AF could take preventive measures with lifestyle changes. With increasing affordability to wearables, personalized health care is becoming more accessible. These personalized healthcare solutions require accurate classification of bio-signals while being computationally inexpensive. By making inferences on-device, we avoid issues inherent to cloud-based systems such as latency and network connection dependency. We propose an efficient pipeline for real-time Atrial Fibrillation Detection with high accuracy that can be deployed in ultra-edge devices. The feature engineering employed in this research catered to optimizing the resource-efficient classifier used in the proposed pipeline, which was able to outperform the best performing standard ML model by $10^5\times$ in terms of memory footprint with a mere trade-off of 2% classification accuracy. We also obtain higher accuracy of approximately 6% while consuming 403$\times$ lesser memory and being 5.2$\times$ faster compared to the previous state-of-the-art (SoA) embedded implementation.
△ Less
Submitted 23 November, 2021;
originally announced November 2021.
-
Space Photometry with BRITE-Constellation
Authors:
Weiss W. W,
Zwintz K.,
Kuschnig R.,
Handler G.,
Moffat A. F. J.,
Baade D.,
Bowman D. M.,
Granzer T.,
Kallinger T.,
Koudelka O. F.,
Lovekin C. C.,
Neiner C.,
Pablo H.,
Pigulski A.,
Popowicz A.,
Ramiaramanantsoa T.,
Rucinski S. M.,
Strassmeier K. G.,
Wade G. A
Abstract:
BRITE-Constellation is devoted to high-precision optical photometric monitoring of bright stars, distributed all over the Milky Way, in red and/or blue passbands. Photometry from space avoids the turbulent and absorbing terrestrial atmosphere and allows for very long and continuous observing runs with high time resolution and thus provides the data necessary for understanding various processes ins…
▽ More
BRITE-Constellation is devoted to high-precision optical photometric monitoring of bright stars, distributed all over the Milky Way, in red and/or blue passbands. Photometry from space avoids the turbulent and absorbing terrestrial atmosphere and allows for very long and continuous observing runs with high time resolution and thus provides the data necessary for understanding various processes inside stars (e.g., asteroseismology) and in their immediate environment. While the first astronomical observations from space focused on the spectral regions not accessible from the ground it soon became obvious around 1970 that avoiding the turbulent terrestrial atmosphere significantly improved the accuracy of photometry and satellites explicitly dedicated to high-quality photometry were launched. A perfect example is BRITE-Constellation, which is the result of a very successful cooperation between Austria, Canada and Poland. Research highlights for targets distributed nearly over the entire HRD are presented, but focus primarily on massive and hot stars.
△ Less
Submitted 24 June, 2021;
originally announced June 2021.
-
DICODerma: A practical approach for metadata management of images in dermatology
Authors:
Bell Raj Eapen,
Feroze Kaliyadan,
Ashique Karalikkattil T
Abstract:
Clinical images are vital for diagnosing and monitoring skin diseases, and their importance has increased with the growing popularity of machine learning. Lack of standards has stifled innovation in dermatological imaging, unlike other image-intensive specialties such as radiology. We investigate the meta-requirements for utilizing the popular DICOM standard for metadata management of images in de…
▽ More
Clinical images are vital for diagnosing and monitoring skin diseases, and their importance has increased with the growing popularity of machine learning. Lack of standards has stifled innovation in dermatological imaging, unlike other image-intensive specialties such as radiology. We investigate the meta-requirements for utilizing the popular DICOM standard for metadata management of images in dermatology. We propose practical design solutions and provide open-source tools to integrate dermatologists' workflow with enterprise imaging systems. Using the tool, dermatologists can tag, search, organize and convert clinical images to the DICOM format. We believe that our less disruptive approach will improve the adoption of standards in the specialty.
△ Less
Submitted 17 February, 2021;
originally announced February 2021.
-
Compositional distributions and evolutionary processes for the near-Earth object population: Results from the MIT-Hawaii Near-Earth Object Spectroscopic Survey (MITHNEOS
Authors:
Binzel R. P.,
DeMeo F. E.,
Turtelboom E. V.,
Bus S. J.,
Tokunaga A.,
Burbine T. H.,
Lantz C.,
Polishook D.,
Carry B.,
Morbidelli A.,
Birlan M.,
Vernazza P.,
Moskovitz N.,
Slivan S. M.,
Thomas C. A.,
Rivkin A. S.,
Hicks M. D.,
Dunn T.,
Reddy V.,
Sanchez J. A.,
Granvik M.,
Kohout T
Abstract:
We report measured spectral properties for more than 1000 NEOs, representing>5% of the currently discovered population. Thermal flux detected below 2.5 μm allows us to make albedo estimates for nearly 50 objects, including two comets. Additional spectral data are reported for more than 350 Mars-crossing asteroids. Most of these measurements were achieved through a collaboration between researchers…
▽ More
We report measured spectral properties for more than 1000 NEOs, representing>5% of the currently discovered population. Thermal flux detected below 2.5 μm allows us to make albedo estimates for nearly 50 objects, including two comets. Additional spectral data are reported for more than 350 Mars-crossing asteroids. Most of these measurements were achieved through a collaboration between researchers at the Massachusetts Institute of Technology and the University of Hawaii, with full cooperation of the NASA Infrared Telescope Facility (IRTF) on Mauna Kea. We call this project the MIT-Hawaii Near-Earth Object Spectroscopic Survey (MITHNEOS; myth-neos).
△ Less
Submitted 10 April, 2020;
originally announced April 2020.
-
High-resolution Resonance Spin-flip Raman Spectroscopy of Pairs of Manganese Ions in CdTe
Authors:
Cherbunin R. V.,
Litviak V. M.,
Ryzhov I. I.,
Koudinov A. V.,
Elsässer S.,
Knapp A.,
Kiessling T.,
Geurts J.,
Chusnutdinow S.,
Wojtowicz T.,
Karczewski G
Abstract:
We report the observation of tens of minor lines of the combinational spin-flip Raman scattering in a CdTe:Mn quantum well by means of the high-resolution optical spectroscopy. Classification of this manifold leads to four characteristic values of energy, that correspond to four different types of pair clusters of Mn ions: the nearest, second, third etc. neighbors. All the four energies show up in…
▽ More
We report the observation of tens of minor lines of the combinational spin-flip Raman scattering in a CdTe:Mn quantum well by means of the high-resolution optical spectroscopy. Classification of this manifold leads to four characteristic values of energy, that correspond to four different types of pair clusters of Mn ions: the nearest, second, third etc. neighbors. All the four energies show up in a single experiment with a very high precision, providing experimental grounds for a deeper understanding of the d-d exchange interactions in a diluted magnetic semiconductor and demonstrating the capacity of the employed method. The major (nearest-neighbor) exchange constant J_1 = 6.15 K was found to consent with its previously reported value. Other detected characteristic energies are as follows: J_{(2)} = 1.80 K, J_{(3)} = 1.39 K, J_{(4)} = 0.81 K.
△ Less
Submitted 16 March, 2020; v1 submitted 4 March, 2019;
originally announced March 2019.
-
Excitation mechanism of OI lines in Herbig Ae/Be stars
Authors:
Blesson Mathew,
P. Manoj,
Mayank Narang,
D. P. K. Banerjee,
Pratheeksha Nayak,
S. Muneer,
S. Vig,
Pramod Kumar S.,
Paul K. T.,
G. Maheswar
Abstract:
We have investigated the role of a few prominent excitation mechanisms viz. collisional excitation, recombination, continuum fluorescence and Lyman beta fluorescence on the OI line spectra in Herbig Ae/Be stars. The aim is to understand which of them is the central mechanism that explains the observed OI line strengths. The study is based on an analysis of the observed optical spectra of 62 Herbig…
▽ More
We have investigated the role of a few prominent excitation mechanisms viz. collisional excitation, recombination, continuum fluorescence and Lyman beta fluorescence on the OI line spectra in Herbig Ae/Be stars. The aim is to understand which of them is the central mechanism that explains the observed OI line strengths. The study is based on an analysis of the observed optical spectra of 62 Herbig Ae/Be stars and near-infrared spectra of 17 Herbig Ae/Be stars. The strong correlation observed between the line fluxes of OI $λ$8446 and OI $λ$11287, as well as a high positive correlation between the line strengths of OI $λ$8446 and H$α$ suggest that Lyman beta fluorescence is the dominant excitation mechanism for the formation of OI emission lines in Herbig Ae/Be stars. Further, from an analysis of the emission line fluxes of OI $λλ$7774, 8446, and comparing the line ratios with those predicted by theoretical models, we assessed the contribution of collisional excitation in the formation of OI emission lines.
△ Less
Submitted 6 March, 2018;
originally announced March 2018.
-
Everyday Radio Telescope
Authors:
Pranshu Mandal,
Devansh Agarwal,
Pratik Kumar,
Anjali Yelikar,
Kanchan Soni,
Vineeth Krishna T
Abstract:
We have developed an affordable, portable college level radio telescope for amateur radio astronomy which can be used to provide hands-on experience with the fundamentals of a radio telescope and an insight into the realm of radio astronomy. With our set-up one can measure brightness temperature and flux of the Sun at 11.2 GHz and calculate the beam width of the antenna. The set-up uses commercial…
▽ More
We have developed an affordable, portable college level radio telescope for amateur radio astronomy which can be used to provide hands-on experience with the fundamentals of a radio telescope and an insight into the realm of radio astronomy. With our set-up one can measure brightness temperature and flux of the Sun at 11.2 GHz and calculate the beam width of the antenna. The set-up uses commercially available satellite television receiving system and parabolic dish antenna. We report the detection of point sources like Saturn and extended sources like the galactic arm of the Milky way. We have also developed python pipeline, which are available for free download, for data acquisition and visualization.
△ Less
Submitted 12 January, 2016;
originally announced January 2016.