Showing 1–11 of 11 results for author: Samadi, S

  1. arXiv:2405.06582  [pdf, other]

    cs.LG cs.CY stat.ML

    The Role of Learning Algorithms in Collective Action

    Authors: Omri Ben-Dov, Jake Fawkes, Samira Samadi, Amartya Sanyal

    Abstract: Collective action in machine learning is the study of the control that a coordinated group can have over machine learning algorithms. While previous research has concentrated on assessing the impact of collectives against Bayes (sub-)optimal classifiers, this perspective is limited in that it does not account for the choice of learning algorithm. Since classifiers seldom behave like Bayes classifi…

    Submitted 4 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: Accepted at the International Conference on Machine Learning (ICML), 2024

  2. arXiv:2402.04579  [pdf, other]

    cs.LG stat.ME

    Collective Counterfactual Explanations via Optimal Transport

    Authors: Ahmad-Reza Ehyaei, Ali Shirali, Samira Samadi

    Abstract: Counterfactual explanations provide individuals with cost-optimal actions that can alter their labels to desired classes. However, if substantial instances seek state modification, such individual-centric methods can lead to new competitions and unanticipated costs. Furthermore, these recommendations, disregarding the underlying data distribution, may suggest actions that users perceive as outlier…

    Submitted 6 February, 2024; originally announced February 2024.

  3. arXiv:2312.02110  [pdf, ps, other]

    stat.ME econ.TH

    Fourier Methods for Sufficient Dimension Reduction in Time Series

    Authors: S. Yaser Samadi, Tharindu P. De Alwis

    Abstract: Dimensionality reduction has always been one of the most significant and challenging problems in the analysis of high-dimensional data. In the context of time series analysis, our focus is on the estimation and inference of conditional mean and variance functions. By using central mean and variance dimension reduction subspaces that preserve sufficient information about the response, one can effec…

    Submitted 4 December, 2023; originally announced December 2023.

  4. Reduced-rank Envelope Vector Autoregressive Models

    Authors: S. Yaser Samadi, Wiranthe B. Herath

    Abstract: The standard vector autoregressive (VAR) models suffer from overparameterization, which is a serious issue for high-dimensional time series data as it restricts the number of variables and lags that can be incorporated into the model. Several statistical methods, such as the reduced-rank model for multivariate (multiple) time series (Velu et al., 1986; Reinsel and Velu, 1998; Reinsel et al., 2022)…

    Submitted 22 September, 2023; originally announced September 2023.

    Journal ref: Journal of Business and Economic Statistics, 2023

  5. MLE for the parameters of bivariate interval-valued models

    Authors: S. Yaser Samadi, L. Billard, Jiin-Huarng Guo, Wei Xu

    Abstract: With contemporary data sets becoming too large to analyze the data directly, various forms of aggregated data are becoming common. The original individual data are points, but after aggregation, the observations are interval-valued (e.g.). While some researchers simply analyze the set of averages of the observations by aggregated class, it is easily established that approach ignores much of the in…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Will appear in ADAC

    Journal ref: Advances in Data Analysis and Classification, 2023

  6. arXiv:2204.08341  [pdf, other]

    stat.ME math.ST

    itdr: An R package of Integral Transformation Methods to Estimate the SDR Subspaces in Regression

    Authors: Tharindu P. De Alwis, S. Yaser Samadi, Jiaying Weng

    Abstract: Sufficient dimension reduction (SDR) is an effective tool for regression models, offering a viable approach to address and analyze the nonlinear nature of regression problems. This paper introduces the itdr R package, a comprehensive and user-friendly tool that introduces several functions based on integral transformation methods for estimating SDR subspaces. In particular, the itdr package incorp…

    Submitted 16 July, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: 17 pages, 1 figure

  7. arXiv:2105.03153  [pdf, other]

    stat.ML cs.LG

    Pairwise Fairness for Ordinal Regression

    Authors: Matthäus Kleindessner, Samira Samadi, Muhammad Bilal Zafar, Krishnaram Kenthapadi, Chris Russell

    Abstract: We initiate the study of fairness for ordinal regression. We adapt two fairness notions previously considered in fair ranking and propose a strategy for training a predictor that is approximately fair according to either notion. Our predictor has the form of a threshold model, composed of a scoring function and a set of thresholds, and our strategy is based on a reduction to fair binary classifica…

    Submitted 11 February, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

  8. Modeling Count Data via Copulas

    Authors: Hadi Safari-Katesari, S. Yaser Samadi, Samira Zaroudi

    Abstract: Copula models have been widely used to model the dependence between continuous random variables, but modeling count data via copulas has recently become popular in the statistics literature. Spearman's rho is an appropriate and effective tool to measure the degree of dependence between two random variables. In this paper, we derived the population version of Spearman's rho correlation via copulas…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 33 pages

    Report number: 2020

    MSC Class: 60E15; 62P10

    Journal ref: Statistics 2020

  9. arXiv:2006.10085  [pdf, other]

    cs.LG cs.AI cs.CG stat.ML

    Socially Fair k-Means Clustering

    Authors: Mehrdad Ghadiri, Samira Samadi, Santosh Vempala

    Abstract: We show that the popular k-means clustering algorithm (Lloyd's heuristic), used for a variety of scientific data, can result in outcomes that are unfavorable to subgroups of data (e.g., demographic groups). Such biased clusterings can have deleterious implications for human-centric applications such as resource allocation. We present a fair k-means objective and algorithm to choose cluster centers…

    Submitted 29 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: 12 pages, 11 figures

  10. arXiv:1901.08668  [pdf, other]

    stat.ML cs.DS cs.LG

    Guarantees for Spectral Clustering with Fairness Constraints

    Authors: Matthäus Kleindessner, Samira Samadi, Pranjal Awasthi, Jamie Morgenstern

    Abstract: Given the widespread popularity of spectral clustering (SC) for partitioning graph data, we study a version of constrained SC in which we try to incorporate the fairness notion proposed by Chierichetti et al. (2017). According to this notion, a clustering is fair if every demographic group is approximately proportionally represented in each cluster. To this end, we develop variants of both normali…

    Submitted 10 May, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

  11. arXiv:1811.00103  [pdf, other]

    cs.LG stat.ML

    The Price of Fair PCA: One Extra Dimension

    Authors: Samira Samadi, Uthaipon Tantipongpipat, Jamie Morgenstern, Mohit Singh, Santosh Vempala

    Abstract: We investigate whether the standard dimensionality reduction technique of PCA inadvertently produces data representations with different fidelity for two different populations. We show that on several real-world data sets, PCA has higher reconstruction error on population A than on B (for example, women versus men or lower- versus higher-educated individuals). This can happen even when the data set has…

    Submitted 31 October, 2018; originally announced November 2018.