Showing 1–34 of 34 results for author: Mohan, R

Searching in archive cs.
  1. arXiv:2405.17731  [pdf, other]

    cs.DB

    Evaluating NoSQL Databases for OLAP Workloads: A Benchmarking Study of MongoDB, Redis, Kudu and ArangoDB

    Authors: Rishi Kesav Mohan, Risheek Rakshit Sukumar Kanmani, Krishna Anandan Ganesan, Nisha Ramasubramanian

    Abstract: In the era of big data, conventional RDBMS models have become impractical for handling colossal workloads. Consequently, NoSQL databases have emerged as the preferred storage solutions for executing processing-intensive Online Analytical Processing (OLAP) tasks. Within the realm of NoSQL databases, various classifications exist based on their data storage mechanisms, making it challenging to selec…

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2311.07761  [pdf, other]

    cs.CV cs.AI cs.RO

    Amodal Optical Flow

    Authors: Maximilian Luz, Rohit Mohan, Ahmed Rida Sekkat, Oliver Sawade, Elmar Matthes, Thomas Brox, Abhinav Valada

    Abstract: Optical flow estimation is very challenging in situations with transparent or occluded objects. In this work, we address these challenges at the task level by introducing Amodal Optical Flow, which integrates optical flow with amodal perception. Instead of only representing the visible regions, we define amodal optical flow as a multi-layered pixel-level motion field that encompasses both visible…

    Submitted 7 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  3. arXiv:2310.11797  [pdf, other]

    cs.CV

    Panoptic Out-of-Distribution Segmentation

    Authors: Rohit Mohan, Kiran Kumaraswamy, Juana Valeria Hurtado, Kürsat Petek, Abhinav Valada

    Abstract: Deep learning has led to remarkable strides in scene understanding with panoptic segmentation emerging as a key holistic scene interpretation task. However, the performance of panoptic segmentation is severely impacted in the presence of out-of-distribution (OOD) objects i.e. categories of objects that deviate from the training distribution. To overcome this limitation, we propose Panoptic Out-of…

    Submitted 18 October, 2023; originally announced October 2023.

  4. arXiv:2309.06547  [pdf, other]

    cs.CV

    AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving

    Authors: Ahmed Rida Sekkat, Rohit Mohan, Oliver Sawade, Elmar Matthes, Abhinav Valada

    Abstract: Unlike humans, who can effortlessly estimate the entirety of objects even when partially occluded, modern computer vision algorithms still find this aspect extremely challenging. Leveraging this amodal perception for autonomous driving remains largely untapped due to the lack of suitable datasets. The curation of these datasets is primarily hindered by significant annotation costs and mitigating a…

    Submitted 11 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  5. arXiv:2308.03193  [pdf, other]

    cs.CV

    Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities

    Authors: Rohit Mohan, José Arce, Sassan Mokhtar, Daniele Cattaneo, Abhinav Valada

    Abstract: Safety and efficiency are paramount in healthcare facilities where the lives of patients are at stake. Despite the adoption of robots to assist medical staff in challenging tasks such as complex surgeries, human expertise is still indispensable. The next generation of autonomous healthcare robots hinges on their capacity to perceive and understand their complex and frenetic environments. While dee…

    Submitted 6 August, 2023; originally announced August 2023.

  6. arXiv:2303.09446  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG

    Controllable Prosody Generation With Partial Inputs

    Authors: Dan Andrei Iliescu, Devang Savita Ram Mohan, Tian Huey Teh, Zack Hodari

    Abstract: We address the problem of human-in-the-loop control for generating prosody in the context of text-to-speech synthesis. Controlling prosody is challenging because existing generative models lack an efficient interface through which users can modify the output quickly and precisely. To solve this, we introduce a novel framework whereby the user provides partial inputs and the generative model genera…

    Submitted 15 April, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 5 pages

  7. arXiv:2301.01849  [pdf, other]

    cs.LG stat.ME stat.ML

    NODAGS-Flow: Nonlinear Cyclic Causal Structure Learning

    Authors: Muralikrishnna G. Sethuraman, Romain Lopez, Rahul Mohan, Faramarz Fekri, Tommaso Biancalani, Jan-Christian Hütter

    Abstract: Learning causal relationships between variables is a well-studied problem in statistics, with many important applications in science. However, modeling real-world systems remains challenging, as most existing algorithms assume that the underlying causal graph is acyclic. While this is a convenient framework for developing theoretical developments about causal reasoning and inference, the underlying…

    Submitted 4 January, 2023; originally announced January 2023.

  8. arXiv:2205.14637  [pdf, other]

    cs.CV cs.AI cs.RO

    Perceiving the Invisible: Proposal-Free Amodal Panoptic Segmentation

    Authors: Rohit Mohan, Abhinav Valada

    Abstract: Amodal panoptic segmentation aims to connect the perception of the world to its cognitive understanding. It entails simultaneously predicting the semantic labels of visible scene regions and the entire shape of traffic participant instances, including regions that may be occluded. In this work, we formulate a proposal-free framework that tackles this task as a multi-label and multi-class problem b…

    Submitted 29 May, 2022; originally announced May 2022.

  9. arXiv:2202.11542  [pdf, other]

    cs.CV cs.AI cs.RO

    Amodal Panoptic Segmentation

    Authors: Rohit Mohan, Abhinav Valada

    Abstract: Humans have the remarkable ability to perceive objects as a whole, even when parts of them are occluded. This ability of amodal perception forms the basis of our perceptual and cognitive understanding of our world. To enable robots to reason with this capability, we formulate and propose a novel task that we name amodal panoptic segmentation. The goal of this task is to simultaneously predict the…

    Submitted 23 February, 2022; originally announced February 2022.

  10. arXiv:2112.05210  [pdf, other]

    cs.CV cs.LG cs.RO

    7th AI Driving Olympics: 1st Place Report for Panoptic Tracking

    Authors: Rohit Mohan, Abhinav Valada

    Abstract: In this technical report, we describe our EfficientLPT architecture that won the panoptic tracking challenge in the 7th AI Driving Olympics at NeurIPS 2021. Our architecture builds upon the top-down EfficientLPS panoptic segmentation approach. EfficientLPT consists of a shared backbone with a modified EfficientNet-B5 model comprising the proximity convolution module as the encoder followed by the…

    Submitted 9 December, 2021; originally announced December 2021.

  11. arXiv:2109.03805  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking

    Authors: Whye Kit Fong, Rohit Mohan, Juana Valeria Hurtado, Lubing Zhou, Holger Caesar, Oscar Beijbom, Abhinav Valada

    Abstract: Panoptic scene understanding and tracking of dynamic agents are essential for robots and automated vehicles to navigate in urban environments. As LiDARs provide accurate illumination-independent geometric depictions of the scene, performing these tasks using LiDAR point clouds provides reliable predictions. However, existing datasets lack diversity in the type of urban scenes and have a limited nu…

    Submitted 23 December, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: The benchmark is available at https://www.nuscenes.org

  12. arXiv:2106.08352  [pdf, other]

    eess.AS cs.LG cs.SD

    Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis

    Authors: Devang S Ram Mohan, Vivian Hu, Tian Huey Teh, Alexandra Torresquintero, Christopher G. R. Wallis, Marlene Staib, Lorenzo Foglianti, Jiameng Gao, Simon King

    Abstract: Text does not fully specify the spoken form, so text-to-speech models must be able to learn from speech data that vary in ways not explained by the corresponding text. One way to reduce the amount of unexplained variation in training data is to provide acoustic information as an additional learning signal. When generating speech, modifying this acoustic information enables multiple distinct rendit…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: To be published in Interspeech 2021. 5 pages, 4 figures

  13. arXiv:2105.14219  [pdf, other]

    cs.NI cs.AI cs.LG

    Machine Learning for Performance Prediction of Channel Bonding in Next-Generation IEEE 802.11 WLANs

    Authors: Francesc Wilhelmi, David Góez, Paola Soto, Ramon Vallés, Mohammad Alfaifi, Abdulrahman Algunayah, Jorge Martin-Pérez, Luigi Girletti, Rajasekar Mohan, K Venkat Ramnan, Boris Bellalta

    Abstract: With the advent of Artificial Intelligence (AI)-empowered communications, industry, academia, and standardization organizations are progressing on the definition of mechanisms and procedures to address the increasing complexity of future 5G and beyond communications. In this context, the International Telecommunication Union (ITU) organized the first AI for 5G Challenge to bring industry and acade…

    Submitted 29 May, 2021; originally announced May 2021.

  14. arXiv:2102.08009  [pdf, other]

    cs.CV cs.LG cs.RO

    EfficientLPS: Efficient LiDAR Panoptic Segmentation

    Authors: Kshitij Sirohi, Rohit Mohan, Daniel Büscher, Wolfram Burgard, Abhinav Valada

    Abstract: Panoptic segmentation of point clouds is a crucial task that enables autonomous vehicles to comprehend their vicinity using their highly accurate and reliable LiDAR sensors. Existing top-down approaches tackle this problem by either combining independent task-specific networks or translating methods from the image domain ignoring the intricacies of LiDAR data and thus often resulting in sub-optima…

    Submitted 4 November, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: Ranked #1 on SemanticKITTI and nuScenes panoptic segmentation benchmarks

    Journal ref: IEEE Transactions on Robotics (T-RO), 2021

  15. arXiv:2008.10112  [pdf, other]

    cs.CV cs.LG cs.RO

    Robust Vision Challenge 2020 -- 1st Place Report for Panoptic Segmentation

    Authors: Rohit Mohan, Abhinav Valada

    Abstract: In this technical report, we present key details of our winning panoptic segmentation architecture EffPS_b1bs4_RVC. Our network is a lightweight version of our state-of-the-art EfficientPS architecture that consists of our proposed shared backbone with a modified EfficientNet-B5 model as the encoder, followed by the 2-way FPN to learn semantically rich multi-scale features. It consists of two task…

    Submitted 23 August, 2020; originally announced August 2020.

  16. Phonological Features for 0-shot Multilingual Speech Synthesis

    Authors: Marlene Staib, Tian Huey Teh, Alexandra Torresquintero, Devang S Ram Mohan, Lorenzo Foglianti, Raphael Lenain, Jiameng Gao

    Abstract: Code-switching---the intra-utterance use of multiple languages---is prevalent across the world. Within text-to-speech (TTS), multilingual models have been found to enable code-switching. By modifying the linguistic input to sequence-to-sequence TTS, we show that code-switching is possible for languages unseen during training, even within monolingual models. We use a small set of phonological featu…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: 5 pages, to be presented at INTERSPEECH 2020

  17. arXiv:2008.03096  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning

    Authors: Devang S Ram Mohan, Raphael Lenain, Lorenzo Foglianti, Tian Huey Teh, Marlene Staib, Alexandra Torresquintero, Jiameng Gao

    Abstract: Modern approaches to text to speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation. Interleaving the action of reading a character with that of synthesising audio reduces this latency. However, the order of this sequence of interleaved actions v…

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: To be published in Interspeech 2020. 5 pages, 4 figures

  18. arXiv:2004.08189  [pdf, other]

    cs.CV cs.LG cs.RO

    MOPT: Multi-Object Panoptic Tracking

    Authors: Juana Valeria Hurtado, Rohit Mohan, Wolfram Burgard, Abhinav Valada

    Abstract: Comprehensive understanding of dynamic scenes is a critical prerequisite for intelligent robots to autonomously operate in their environment. Research in this domain, which encompasses diverse perception problems, has primarily been focused on addressing specific tasks individually rather than modeling the ability to understand dynamic scenes holistically. In this paper, we introduce a novel perce…

    Submitted 27 May, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: Code & models are available at http://rl.uni-freiburg.de/research/panoptictracking

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshop on Scalability in Autonomous Driving, 2020

  19. arXiv:2004.02307  [pdf, other]

    cs.CV cs.LG cs.RO

    EfficientPS: Efficient Panoptic Segmentation

    Authors: Rohit Mohan, Abhinav Valada

    Abstract: Understanding the scene in which an autonomous robot operates is critical for its competent functioning. Such scene comprehension necessitates recognizing instances of traffic participants along with general scene semantics which can be effectively addressed by the panoptic segmentation task. In this paper, we introduce the Efficient Panoptic Segmentation (EfficientPS) architecture that consists o…

    Submitted 1 February, 2021; v1 submitted 5 April, 2020; originally announced April 2020.

    Comments: Ranked #1 on Cityscapes panoptic segmentation benchmark, ranked #2 among the published methods on Cityscapes semantic segmentation benchmark, and ranked #2 among the published methods on Cityscapes instance segmentation benchmark. Demo, code and models are available at https://rl.uni-freiburg.de/research/panoptic

    Journal ref: International Journal of Computer Vision (IJCV), vol. 129, no. 5, pp. 1551-1579, 2021

  20. Vision-Based Autonomous UAV Navigation and Landing for Urban Search and Rescue

    Authors: Mayank Mittal, Rohit Mohan, Wolfram Burgard, Abhinav Valada

    Abstract: Unmanned Aerial Vehicles (UAVs) equipped with bioradars are a life-saving technology that can enable identification of survivors under collapsed buildings in the aftermath of natural disasters such as earthquakes or gas explosions. However, these UAVs have to be able to autonomously navigate in disaster struck environments and land on debris piles in order to accurately locate the survivors. This…

    Submitted 3 September, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted for publication in the proceedings of the International Symposium on Robotics Research (ISRR) 2019

  21. arXiv:1903.01804  [pdf, other]

    cs.RO cs.CV cs.LG

    Robot Localization in Floor Plans Using a Room Layout Edge Extraction Network

    Authors: Federico Boniardi, Abhinav Valada, Rohit Mohan, Tim Caselitz, Wolfram Burgard

    Abstract: Indoor localization is one of the crucial enablers for deployment of service robots. Although several successful techniques for indoor localization have been proposed, the majority of them relies on maps generated from data gathered with the same sensor modality used for localization. Typically, tedious labor by experts is needed to acquire this data, thus limiting the readiness of the system as w…

    Submitted 12 July, 2019; v1 submitted 5 March, 2019; originally announced March 2019.

    Comments: Accepted for IROS 2019

  22. Self-Supervised Model Adaptation for Multimodal Semantic Segmentation

    Authors: Abhinav Valada, Rohit Mohan, Wolfram Burgard

    Abstract: Learning to reliably perceive and understand the scene is an integral enabler for robots to operate in the real-world. This problem is inherently challenging due to the multitude of object types as well as appearance changes caused by varying illumination and weather conditions. Leveraging complementary modalities can enable learning of semantically richer representations that are resilient to suc…

    Submitted 8 July, 2019; v1 submitted 11 August, 2018; originally announced August 2018.

    Comments: A Live demo is available at http://deepscene.cs.uni-freiburg.de and the code as well as the models are available at https://github.com/DeepSceneSeg

    Journal ref: International Journal of Computer Vision (IJCV), Special Issue: Deep Learning for Robotic Vision, vol. 128, no. 5, pp. 1239-1285, 2019

  23. arXiv:1610.02828  [pdf, ps, other]

    cs.AI cs.DL cs.LG

    Ranking academic institutions on potential paper acceptance in upcoming conferences

    Authors: Jobin Wilson, Ram Mohan, Muhammad Arif, Santanu Chaudhury, Brejesh Lall

    Abstract: The crux of the problem in KDD Cup 2016 involves developing data mining techniques to rank research institutions based on publications. Rank importance of research institutions is derived from predictions on the number of full research papers that would potentially get accepted in upcoming top-tier conferences, utilizing public information on the web. This paper describes our solution to KDD Cup…

    Submitted 10 October, 2016; originally announced October 2016.

    Comments: KDD 2016, KDD Cup 2016, Appeared in the KDD Cup Workshop 2016, https://kddcup2016.azurewebsites.net/Workshop

  24. arXiv:1502.02215  [pdf, other]

    cs.LG cs.CY cs.SE

    Real World Applications of Machine Learning Techniques over Large Mobile Subscriber Datasets

    Authors: Jobin Wilson, Chitharanj Kachappilly, Rakesh Mohan, Prateek Kapadia, Arun Soman, Santanu Chaudhury

    Abstract: Communication Service Providers (CSPs) are in a unique position to utilize their vast transactional data assets generated from interactions of subscribers with network elements as well as with other subscribers. CSPs could leverage their data assets for a gamut of applications such as service personalization, predictive offer management, loyalty management, revenue forecasting, network capacity plan…

    Submitted 8 February, 2015; originally announced February 2015.

    Comments: SE4ML: Software Engineering for Machine Learning (NIPS 2014 Workshop) https://sites.google.com/site/software4ml/accepted-papers

  25. arXiv:1411.4101  [pdf, other]

    stat.ML cs.CV cs.LG

    Deep Deconvolutional Networks for Scene Parsing

    Authors: Rahul Mohan

    Abstract: Scene parsing is an important and challenging problem in computer vision. It requires labeling each pixel in an image with the category it belongs to. Traditionally, it has been approached with hand-engineered features from color information in images. Recently convolutional neural networks (CNNs), which automatically learn hierarchies of features, have achieved record performance on the tas…

    Submitted 14 November, 2014; originally announced November 2014.

  26. Network Analysis and Application Control Software based on Client-Server Architecture

    Authors: Ramya Mohan

    Abstract: This paper outlines a comprehensive model to increase system efficiency, preserve network bandwidth, monitor incoming and outgoing packets, ensure the security of confidential files and reduce power wastage in an organization. This model illustrates the use and potential application of a Network Analysis Tool (NAT) in a multi-computer set-up of any scale. The model is designed to run in the backgr…

    Submitted 18 April, 2013; originally announced April 2013.

    Journal ref: International Journal of Computer Applications 68(12):34-39, April 2013

  27. arXiv:cs/0605049  [pdf]

    cs.DM

    On fractionally linear functions over a finite field

    Authors: V. M. Siddlenikov, R. N. Mohan, Moon Ho Lee

    Abstract: In this note, by considering fractionally linear functions over a finite field and consequently developing an abstract sequence, we study some of its properties.

    Submitted 11 May, 2006; originally announced May 2006.

  28. arXiv:cs/0605045  [pdf]

    cs.DM

    On Orthogonalities in Matrices

    Authors: R. N. Mohan

    Abstract: In this paper we have discussed different possible orthogonalities in matrices, namely orthogonal, quasi-orthogonal, semi-orthogonal and non-orthogonal matrices including completely positive matrices, while giving some of their constructions besides studying some of their properties.

    Submitted 9 May, 2006; originally announced May 2006.

  29. arXiv:cs/0604067  [pdf]

    cs.DM

    Certain t-partite graphs

    Authors: R. N. Mohan, Moon Ho Lee, Subhash Pokrel

    Abstract: By making use of the generalized concept of orthogonality in Latin squares, certain t-partite graphs have been constructed, and a suggestion for a network system and some applications have been made.

    Submitted 10 May, 2006; v1 submitted 18 April, 2006; originally announced April 2006.

  30. arXiv:cs/0604057  [pdf]

    cs.IT

    A New Fault-Tolerant M-network and its Analysis

    Authors: R. N. Mohan, P. T. Kulkarni

    Abstract: This paper introduces a new class of efficient interconnection networks called M-graphs for large multi-processor systems. The concept of M-matrix and M-graph is an extension of Mn-matrices and Mn-graphs. We analyze these M-graphs regarding their suitability for large multi-processor systems. An (p,N) M-graph consists of N nodes, where p is the degree of each node. The topology is found to be ha…

    Submitted 12 April, 2006; originally announced April 2006.

  31. arXiv:cs/0604050  [pdf]

    cs.DM

    On Hadamard Conjecture

    Authors: R. N. Mohan

    Abstract: This note gives an overview of the state of the art of the well-known Hadamard conjecture, which is more than a century old, and reports that it has now been established by using the methods given in the two papers by Mohan et al. [6,7].

    Submitted 11 April, 2006; originally announced April 2006.

  32. arXiv:cs/0604044  [pdf]

    cs.DM

    A new M-matrix of Type III, its properties and applications

    Authors: R. N. Mohan, Moon Ho Lee, Ram Paudal

    Abstract: Some binary matrices like (1,-1) and (1,0) were studied by many authors like Cohn, Wang, Ehlich and Ehlich and Zeller, and Mohan, Kageyama, Lee, and Gao. A recent paper by Mohan et al. considered the M-matrices of Type I and II by studying some of their properties and applications. The present paper discusses the M-matrices of Type III and studies their properties and applications.…

    Submitted 11 April, 2006; originally announced April 2006.

    Comments: 12 pages, one figure

  33. arXiv:cs/0604041  [pdf]

    cs.DM

    On Orthogonality of Latin Squares

    Authors: R. N. Mohan, Moon Ho Lee, Subash Pokreal

    Abstract: An arrangement of s elements in s rows and s columns, such that no element is repeated in any row or column, is called a Latin square of order s. If two Latin squares of the same order are superimposed one on the other and each ordered pair occurs once and only once in the resultant array, then they are called orthogonal Latin squares. A frequency square is an n×n matrix, such that…

    Submitted 5 June, 2006; v1 submitted 10 April, 2006; originally announced April 2006.

    Comments: 29 pages

  34. arXiv:cs/0604035  [pdf]

    cs.DM

    Certain new M-matrices and their properties and applications

    Authors: R. N. Mohan, Sanpei Kageyama, Moon Ho Lee, Gao Yang

    Abstract: The Mn-matrix was defined by Mohan [20], in which he has shown a method of constructing (1,-1)-matrices and studied some of their properties. The (1,-1)-matrices were constructed and studied by Cohn [5], Wang [33], Ehrlich [8] and Ehrlich and Zeller [9]. But in this paper, while giving some resemblances of this matrix with Hadamard matrix, and by naming it as M-matrix, we show how to construct part…

    Submitted 9 April, 2006; originally announced April 2006.

    Comments: 21 pages, 3 figures