Skip to main content

Showing 1–8 of 8 results for author: Munasinghe, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.02831  [pdf, other

    cs.CY physics.geo-ph

    Detection of Seismic Infrasonic Elephant Rumbles Using Spectrogram-Based Machine Learning

    Authors: A. M. J. V. Costa, C. S. Pallikkonda, H. H. R. Hiroshan, G. R. U. Y. Gamlath, S. R. Munasinghe, C. U. S. Edussooriya

    Abstract: This paper presents an effective method of identifying elephant rumbles in infrasonic seismic signals. The design and implementation of electronic circuitry to amplify, filter, and digitize the seismic signals captured through geophones are presented. A collection of seismic infrasonic elephant rumbles was collected at a free-ranging area of an elephant orphanage in Sri Lanka. The seismic rumbles… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 8 pages, 7 figures, journal

  2. arXiv:2311.16125  [pdf, other

    cs.CV cs.LG

    Vision-Based Incoming Traffic Estimator Using Deep Neural Network on General Purpose Embedded Hardware

    Authors: K. G. Zoysa, S. R. Munasinghe

    Abstract: Traffic management is a serious problem in many cities around the world. Even the suburban areas are now experiencing regular traffic congestion. Inappropriate traffic control wastes fuel, time, and the productivity of nations. Though traffic signals are used to improve traffic flow, they often cause problems due to inappropriate or obsolete timing that does not tally with the actual traffic inten… ▽ More

    Submitted 28 October, 2023; originally announced November 2023.

    Comments: 6 pages, 11 figures, journal

  3. arXiv:2311.13435  [pdf, other

    cs.CV cs.AI

    PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

    Authors: Shehan Munasinghe, Rusiru Thushara, Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Mubarak Shah, Fahad Khan

    Abstract: Extending image-based Large Multimodal Models (LMMs) to videos is challenging due to the inherent complexity of video data. The recent approaches extending image-based LMMs to videos either lack the grounding capabilities (e.g., VideoChat, Video-ChatGPT, Video-LLaMA) or do not utilize the audio-signals for better video understanding (e.g., Video-ChatGPT). Addressing these gaps, we propose PG-Video… ▽ More

    Submitted 13 December, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Technical Report

  4. arXiv:2310.16812  [pdf, other

    cs.RO

    Accurate Crop Spraying with RTK and Machine Learning on an Autonomous Field Robot

    Authors: W. M. T. D. Wijesundara, T. D. Wanigathunga, M. N. C. Waas, R. T. Hithanadura, S. R. Munasinghe

    Abstract: The agriculture sector requires a lot of labor and resources. Hence, farmers are constantly being pressed for technology and automation to be cost-effective. In this context, autonomous robots can play a very important role in carrying out agricultural tasks such as spraying, sowing, inspection, and even harvesting. This paper presents one such autonomous robot that is able to identify plants and… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 7 pages, 12 figures, Journal

  5. arXiv:2308.13833  [pdf, other

    cs.NI eess.SP

    A Cognitive Network Architecture for Vehicle-to-Network (V2N) Communications over Smart Meters for URLLC

    Authors: Shoaib Ahmed, Sayonto Khan, Kumudu S. Munasinghe, Md. Farhad Hossain

    Abstract: With the rapid advancement of smart city infrastructure, vehicle-to-network (V2N) communication has emerged as a crucial technology to enable intelligent transportation systems (ITS). The investigation of new methods to improve V2N communications is sparked by the growing need for high-speed and dependable communications in vehicular networks. To achieve ultra-reliable low latency communication (U… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 12 pages, 19 figures, IEEE format

  6. arXiv:2209.00062  [pdf, other

    cs.CV

    Class-Aware Attention for Multimodal Trajectory Prediction

    Authors: Bimsara Pathiraja, Shehan Munasinghe, Malshan Ranawella, Maleesha De Silva, Ranga Rodrigo, Peshala Jayasekara

    Abstract: Predicting the possible future trajectories of the surrounding dynamic agents is an essential requirement in autonomous driving. These trajectories mainly depend on the surrounding static environment, as well as the past movements of those dynamic agents. Furthermore, the multimodal nature of agent intentions makes the trajectory prediction problem more challenging. All of the existing models cons… ▽ More

    Submitted 31 August, 2022; originally announced September 2022.

  7. Text-to-Face Generation with StyleGAN2

    Authors: D. M. A. Ayanthi, Sarasi Munasinghe

    Abstract: Synthesizing images from text descriptions has become an active research area with the advent of Generative Adversarial Networks. The main goal here is to generate photo-realistic images that are aligned with the input descriptions. Text-to-Face generation (T2F) is a sub-domain of Text-to-Image generation (T2I) that is more challenging due to the complexity and variation of facial attributes. It h… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: 16 pages, 5 figures, for conference, https://aircconline.com/csit/papers/vol12/csit120805.pdf

    Journal ref: David C. Wyld et al. (Eds): FCST, CMIT, SE, SIPM, SAIM, SNLP - 2022pp. 49-64, 2022. CS & IT - CSCP 2022 May 21~22, 2022, Zurich, Switzerland

  8. arXiv:2102.01728  [pdf, other

    eess.SP cs.LG

    A Novel Transfer Learning-Based Approach for Screening Pre-existing Heart Diseases Using Synchronized ECG Signals and Heart Sounds

    Authors: Ramith Hettiarachchi, Udith Haputhanthri, Kithmini Herath, Hasindu Kariyawasam, Shehan Munasinghe, Kithmin Wickramasinghe, Duminda Samarasinghe, Anjula De Silva, Chamira U. S. Edussooriya

    Abstract: Diagnosing pre-existing heart diseases early in life is important as it helps prevent complications such as pulmonary hypertension, heart rhythm problems, blood clots, heart failure and sudden cardiac arrest. To identify such diseases, phonocardiogram (PCG) and electrocardiogram (ECG) waveforms convey important information. Therefore, effectively using these two modalities of data has the potentia… ▽ More

    Submitted 14 February, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: Paper accepted to IEEE International Symposium on Circuits and Systems (ISCAS) 2021