Skip to main content

Showing 1–27 of 27 results for author: Ghosal, R

.
  1. arXiv:2406.19716  [pdf, other

    stat.ME stat.AP

    Functional Time Transformation Model with Applications to Digital Health

    Authors: Rahul Ghosal, Marcos Matabuena, Sujit K. Ghosh

    Abstract: The advent of wearable and sensor technologies now leads to functional predictors which are intrinsically infinite dimensional. While the existing approaches for functional data and survival outcomes lean on the well-established Cox model, the proportional hazard (PH) assumption might not always be suitable in real-world applications. Motivated by physiological signals encountered in digital medic… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.03087  [pdf, other

    cs.IT cs.CV cs.LG

    Lossless Image Compression Using Multi-level Dictionaries: Binary Images

    Authors: Samar Agnihotri, Renu Rameshan, Ritwik Ghosal

    Abstract: Lossless image compression is required in various applications to reduce storage or transmission costs of images, while requiring the reconstructed images to have zero information loss compared to the original. Existing lossless image compression methods either have simple design but poor compression performance, or complex design, better performance, but with no performance guarantees. In our end… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures, and 5 tables

  3. arXiv:2405.13970  [pdf, other

    stat.ME

    Conformal uncertainty quantification using kernel depth measures in separable Hilbert spaces

    Authors: Marcos Matabuena, Rahul Ghosal, Pavlo Mozharovskyi, Oscar Hernan Madrid Padilla, Jukka-Pekka Onnela

    Abstract: Depth measures have gained popularity in the statistical literature for defining level sets in complex data structures like multivariate data, functional data, and graphs. Despite their versatility, integrating depth measures into regression modeling for establishing prediction regions remains underexplored. To address this gap, we propose a novel method utilizing a model-free uncertainty quantifi… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2404.01786  [pdf

    cs.CL

    Generative AI-Based Text Generation Methods Using Pre-Trained GPT-2 Model

    Authors: Rohit Pandey, Hetvi Waghela, Sneha Rakshit, Aparna Rangari, Anjali Singh, Rahul Kumar, Ratnadeep Ghosal, Jaydip Sen

    Abstract: This work delved into the realm of automatic text generation, exploring a variety of techniques ranging from traditional deterministic approaches to more modern stochastic methods. Through analysis of greedy search, beam search, top-k sampling, top-p sampling, contrastive searching, and locally typical searching, this work has provided valuable insights into the strengths, weaknesses, and potentia… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: This report pertains to the Capstone Project done by Group 5 of the Fall batch of 2023 students at Praxis Tech School, Kolkata, India. The reports consists of 57 pages and it includes 17 figures and 8 tables. This is the preprint which will be submitted to IEEE CONIT 2024 for review

  5. arXiv:2403.19752  [pdf, other

    stat.ME stat.ML

    Deep Learning Framework with Uncertainty Quantification for Survey Data: Assessing and Predicting Diabetes Mellitus Risk in the American Population

    Authors: Marcos Matabuena, Juan C. Vidal, Rahul Ghosal, Jukka-Pekka Onnela

    Abstract: Complex survey designs are commonly employed in many medical cohorts. In such scenarios, develo** case-specific predictive risk score models that reflect the unique characteristics of the study design is essential. This approach is key to minimizing potential selective biases in results. The objectives of this paper are: (i) To propose a general predictive framework for regression and classifica… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  6. arXiv:2403.18069  [pdf, other

    stat.ME stat.AP

    Personalized Imputation in metric spaces via conformal prediction: Applications in Predicting Diabetes Development with Continuous Glucose Monitoring Information

    Authors: Marcos Matabuena, Carla Díaz-Louzao, Rahul Ghosal, Francisco Gude

    Abstract: The challenge of handling missing data is widespread in modern data analysis, particularly during the preprocessing phase and in various inferential modeling tasks. Although numerous algorithms exist for imputing missing data, the assessment of imputation quality at the patient level often lacks personalized statistical approaches. Moreover, there is a scarcity of imputation methods for metric spa… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  7. arXiv:2403.06003  [pdf, other

    cs.RO cs.AI cs.LG

    A Generalized Acquisition Function for Preference-based Reward Learning

    Authors: Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Bıyık

    Abstract: Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task. Previous works have shown that actively synthesizing preference queries to maximize information gain about the reward function parameters improves data efficiency. The information gain criterion focuses on precisely identifying all parameters of the rewa… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  8. arXiv:2310.10494  [pdf, other

    stat.ME stat.AP

    Multivariate Scalar on Multidimensional Distribution Regression

    Authors: Rahul Ghosal, Marcos Matabuena

    Abstract: We develop a new method for multivariate scalar on multidimensional distribution regression. Traditional approaches typically analyze isolated univariate scalar outcomes or consider unidimensional distributional representations as predictors. However, these approaches are sub-optimal because: i) they fail to utilize the dependence between the distributional predictors: ii) neglect the correlation… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  9. arXiv:2306.15084  [pdf, other

    stat.ME stat.AP

    Functional Principal Component Analysis for Continuous non-Gaussian, Truncated, and Discrete Functional Data

    Authors: Debangan Dey, Rahul Ghosal, Kathleen Merikangas, Vadim Zipunnikov

    Abstract: Mobile health studies often collect multiple within-day self-reported assessments of participants' behavior and well-being on different scales such as physical activity (continuous), pain levels (truncated), mood states (ordinal), and life events (binary). These assessments, when indexed by time of day, can be treated as functional data of different types - continuous, truncated, ordinal, and bina… ▽ More

    Submitted 21 September, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 4 figures, 1 table

    MSC Class: 62M10; 62P10

  10. arXiv:2302.07340  [pdf, other

    stat.ME stat.AP

    Functional proportional hazards mixture cure model and its application to modelling the association between cancer mortality and physical activity in NHANES 2003-2006

    Authors: Rahul Ghosal, Marcos Matabuena, Jiajia Zhang

    Abstract: We develop a functional proportional hazards mixture cure (FPHMC) model with scalar and functional covariates measured at the baseline. The mixture cure model, useful in studying populations with a cure fraction of a particular event of interest is extended to functional data. We employ the EM algorithm and develop a semiparametric penalized spline-based approach to estimate the dynamic functional… ▽ More

    Submitted 30 March, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  11. arXiv:2301.11399  [pdf, other

    stat.ME

    Distributional outcome regression via quantile functions and its application to modelling continuously monitored heart rate and physical activity

    Authors: Rahul Ghosal, Sujit K. Ghosh, Jennifer A. Schrack, Vadim Zipunnikov

    Abstract: Modern clinical and epidemiological studies widely employ wearables to record parallel streams of real-time data on human physiology and behavior. With recent advances in distributional data analysis, these high-frequency data are now often treated as distributional observations resulting in novel regression settings. Motivated by these modelling setups, we develop a distributional outcome regress… ▽ More

    Submitted 14 February, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  12. arXiv:2209.04476  [pdf, other

    stat.ME math.ST stat.AP

    Shape-constrained Estimation in Functional Regression with Bernstein Polynomials

    Authors: Rahul Ghosal, Sujit Ghosh, Jacek Urbanek, Jennifer A. Schrack, Vadim Zipunnikov

    Abstract: Shape restrictions on functional regression coefficients such as non-negativity, monotonicity, convexity or concavity are often available in the form of a prior knowledge or required to maintain a structural consistency in functional regression models. A new estimation method is developed in shape-constrained functional regression models using Bernstein polynomials. Specifically, estimation approa… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

  13. arXiv:2208.10687  [pdf, other

    cs.LG cs.AI

    The Effect of Modeling Human Rationality Level on Learning Rewards from Multiple Feedback Types

    Authors: Gaurav R. Ghosal, Matthew Zurek, Daniel S. Brown, Anca D. Dragan

    Abstract: When inferring reward functions from human behavior (be it demonstrations, comparisons, physical corrections, or e-stops), it has proven useful to model the human as making noisy-rational choices, with a "rationality coefficient" capturing how much noise or entropy we expect to see in the human behavior. Prior work typically sets the rationality level to a constant value, regardless of the type, o… ▽ More

    Submitted 9 March, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Published at AAAI 2023; 10 pages, 5 figures plus appendices

  14. GRiD: GPU-Accelerated Rigid Body Dynamics with Analytical Gradients

    Authors: Brian Plancher, Sabrina M. Neuman, Radhika Ghosal, Scott Kuindersma, Vijay Janapa Reddi

    Abstract: We introduce GRiD: a GPU-accelerated library for computing rigid body dynamics with analytical gradients. GRiD was designed to accelerate the nonlinear trajectory optimization subproblem used in state-of-the-art robotic planning, control, and machine learning, which requires tens to hundreds of naturally parallel computations of rigid body dynamics and their gradients at each iteration. GRiD lever… ▽ More

    Submitted 25 February, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Camera Ready Update: 8 pages, 5 figures, 1 data table, 2 algorithm blocks

  15. arXiv:2108.13354  [pdf, other

    cs.RO

    RoboRun: A Robot Runtime to Exploit Spatial Heterogeneity

    Authors: Behzad Boroujerdian, Radhika Ghosal, Jonathan Cruz, Brian Plancher, Vijay Janapa Reddi

    Abstract: The limited onboard energy of autonomous mobile robots poses a tremendous challenge for practical deployment. Hence, efficient computing solutions are imperative. A crucial shortcoming of state-of-the-art computing solutions is that they ignore the robot's operating environment heterogeneity and make static, worst-case assumptions. As this heterogeneity impacts the system's computing payload, an o… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: will be published in Design Automation Conference (DAC) 2021

  16. Bayesian Inference for Generalized Linear Model with Linear Inequality Constraints

    Authors: Rahul Ghosal, Sujit K. Ghosh

    Abstract: Bayesian statistical inference for Generalized Linear Models (GLMs) with parameters lying on a constrained space is of general interest (e.g., in monotonic or convex regression), but often constructing valid prior distributions supported on a subspace spanned by a set of linear inequality constraints can be challenging, especially when some of the constraints might be binding leading to a lower di… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  17. arXiv:2106.09636  [pdf, other

    cs.LG

    Multi-Modal Prototype Learning for Interpretable Multivariable Time Series Classification

    Authors: Gaurav R. Ghosal, Reza Abbasi-Asl

    Abstract: Multivariable time series classification problems are increasing in prevalence and complexity in a variety of domains, such as biology and finance. While deep learning methods are an effective tool for these problems, they often lack interpretability. In this work, we propose a novel modular prototype learning framework for multivariable time series classification. In the first stage of our framew… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 14 pages, 6 figures

  18. arXiv:2106.04008  [pdf, other

    cs.LG

    Widening Access to Applied Machine Learning with TinyML

    Authors: Vijay Janapa Reddi, Brian Plancher, Susan Kennedy, Laurence Moroney, Pete Warden, Anant Agarwal, Colby Banbury, Massimo Banzi, Matthew Bennett, Benjamin Brown, Sharad Chitlangia, Radhika Ghosal, Sarah Grafman, Rupert Jaeger, Srivatsan Krishnan, Maximilian Lam, Daniel Leiker, Cara Mann, Mark Mazumder, Dominic Pajak, Dhilan Ramaprasad, J. Evan Smith, Matthew Stewart, Dustin Tingley

    Abstract: Broadening access to both computational and educational resources is critical to diffusing machine-learning (ML) innovation. However, today, most ML resources and experts are siloed in a few countries and organizations. In this paper, we describe our pedagogical approach to increasing access to applied ML through a massive open online course (MOOC) on Tiny Machine Learning (TinyML). We suggest tha… ▽ More

    Submitted 9 June, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Understanding the underpinnings of the TinyML edX course series: https://www.edx.org/professional-certificate/harvardx-tiny-machine-learning

  19. Scalar on time-by-distribution regression and its application for modelling associations between daily-living physical activity and cognitive functions in Alzheimer's Disease

    Authors: Rahul Ghosal, Vijay R. Varma, Dmitri Volfson, Jacek Urbanek, Jeffrey M. Hausdorff, Amber Watts, Vadim Zipunnikov

    Abstract: Wearable data is a rich source of information that can provide deeper understanding of links between human behaviours and human health. Existing modelling approaches use wearable data summarized at subject level via scalar summaries using regression techniques, temporal (time-of-day) curves using functional data analysis (FDA), and distributions using distributional data analysis (DDA). We propose… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  20. arXiv:2105.12882  [pdf, other

    cs.RO

    MAVFI: An End-to-End Fault Analysis Framework with Anomaly Detection and Recovery for Micro Aerial Vehicles

    Authors: Yu-Shun Hsiao, Zishen Wan, Tianyu Jia, Radhika Ghosal, Abdulrahman Mahmoud, Arijit Raychowdhury, David Brooks, Gu-Yeon Wei, Vijay Janapa Reddi

    Abstract: Safety and resilience are critical for autonomous unmanned aerial vehicles (UAVs). We introduce MAVFI, the micro aerial vehicles (MAVs) resilience analysis methodology to assess the effect of silent data corruption (SDC) on UAVs' mission metrics, such as flight time and success rate, for accurately measuring system resilience. To enhance the safety and resilience of robot systems bound by size, we… ▽ More

    Submitted 30 January, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: 6 pages, 9 figures; The first two authors have equal contributions; Accepted as a conference paper in DATE 2023

  21. Distributional data analysis via quantile functions and its application to modelling digital biomarkers of gait in Alzheimer's Disease

    Authors: Rahul Ghosal, Vijay R. Varma, Dmitri Volfson, Inbar Hillel, Jacek Urbanek, Jeffrey M. Hausdorff, Amber Watts, Vadim Zipunnikov

    Abstract: With the advent of continuous health monitoring with wearable devices, users now generate their unique streams of continuous data such as minute-level step counts or heartbeats. Summarizing these streams via scalar summaries often ignores the distributional nature of wearable data and almost unavoidably leads to the loss of critical information. We propose to capture the distributional nature of w… ▽ More

    Submitted 25 October, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

  22. arXiv:1904.08507  [pdf, ps, other

    stat.AP stat.ME

    Variable Selection in Functional Linear Concurrent Regression

    Authors: Rahul Ghosal, Arnab Maity, Timothy Clark, Stefano B Longo

    Abstract: We propose a novel method for variable selection in functional linear concurrent regression. Our research is motivated by a fisheries footprint study where the goal is to identify important time-varying socio-structural drivers influencing patterns of seafood consumption, and hence fisheries footprint, over time, as well as estimating their dynamic effects. We develop a variable selection method i… ▽ More

    Submitted 31 October, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

  23. arXiv:1812.04680  [pdf, ps, other

    stat.ME stat.AP

    A Score Based Test for Functional Linear Concurrent Regression

    Authors: Rahul Ghosal, Arnab Maity

    Abstract: We propose a novel method for testing the null hypothesis of no effect of a covariate on the response in the context of functional linear concurrent regression. We establish an equivalent random effects formulation of our functional regression model under which our testing problem reduces to testing for zero variance component for random effects. For this purpose, we use a one-sided score test app… ▽ More

    Submitted 12 December, 2019; v1 submitted 11 December, 2018; originally announced December 2018.

  24. arXiv:1810.04785  [pdf, other

    stat.ME stat.AP

    Estimating menarcheal age distribution from partially recalled data

    Authors: Sedigheh Mirzaei Salehabadi, Debasis Sengupta, Rahul Ghosal

    Abstract: In a cross-sectional study, adolescent and young adult females were asked to recall the time of menarche, if experienced. Some respondents recalled the date exactly, some recalled only the month or the year of the event, and some were unable to recall anything. We consider estimation of the menarcheal age distribution from this interval censored data. A~complicated interplay between age-at-event a… ▽ More

    Submitted 3 March, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

  25. arXiv:1810.00908  [pdf, ps, other

    stat.AP stat.ME

    A Statistical Exploration of Duckworth-Lewis Method Using Bayesian Inference

    Authors: Indrabati Bhattacharya, Rahul Ghosal, Sujit Ghosh

    Abstract: Duckworth-Lewis (D/L) method is the incumbent rain rule used to decide the result of a limited overs cricket match should it not be able to reach its natural conclusion. Duckworth and Lewis (1998) devised a two factor relationship between the numbers of overs a team had remaining and the number of wickets they had lost in order to quantify the percentage resources a team has at any stage of the ma… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

  26. arXiv:1808.03811  [pdf, ps, other

    cs.CR

    Privacy Preserving Multi-Server k-means Computation over Horizontally Partitioned Data

    Authors: Riddhi Ghosal, Sanjit Chatterjee

    Abstract: The k-means clustering is one of the most popular clustering algorithms in data mining. Recently a lot of research has been concentrated on the algorithm when the dataset is divided into multiple parties or when the dataset is too large to be handled by the data owner. In the latter case, usually some servers are hired to perform the task of clustering. The dataset is divided by the data owner amo… ▽ More

    Submitted 28 June, 2019; v1 submitted 11 August, 2018; originally announced August 2018.

    Comments: 19 pages, 4 tables. International Conference on Information Systems Security. Springer, Cham, 2018

  27. arXiv:1708.04495  [pdf, ps, other

    cs.CR

    Analysing Relations involving small number of Monomials in AES S- Box

    Authors: Riddhi Ghosal

    Abstract: In the present day, AES is one the most widely used and most secure Encryption Systems prevailing. So, naturally lots of research work is going on to mount a significant attack on AES. Many different forms of Linear and differential cryptanalysis have been performed on AES. Of late, an active area of research has been Algebraic Cryptanalysis of AES, where although fast progress is being made, ther… ▽ More

    Submitted 14 June, 2017; originally announced August 2017.

    Comments: 5 pages, 1 table