Skip to main content

Showing 1–34 of 34 results for author: Sekhari, A

.
  1. arXiv:2406.17216  [pdf, other

    cs.LG cs.AI cs.CR cs.CY

    Machine Unlearning Fails to Remove Data Poisoning Attacks

    Authors: Martin Pawelczyk, Jimmy Z. Di, Yiwei Lu, Gautam Kamath, Ayush Sekhari, Seth Neel

    Abstract: We revisit the efficacy of several practical methods for approximate machine unlearning developed for large-scale deep learning. In addition to complying with data deletion requests, one often-cited potential application for unlearning methods is to remove the effects of training on poisoned data. We experimentally demonstrate that, while existing unlearning methods have been demonstrated to be ef… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.11810  [pdf, ps, other

    cs.LG cs.RO eess.SY

    Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

    Authors: Runzhe Wu, Ayush Sekhari, Akshay Krishnamurthy, Wen Sun

    Abstract: We study computationally and statistically efficient Reinforcement Learning algorithms for the linear Bellman Complete setting, a setting that uses linear function approximation to capture value functions and unifies existing models like linear Markov Decision Processes (MDP) and Linear Quadratic Regulators (LQR). While it is known from the prior works that this setting is statistically tractable,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2403.17091  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data

    Authors: Zeyu Jia, Alexander Rakhlin, Ayush Sekhari, Chen-Yu Wei

    Abstract: We revisit the problem of offline reinforcement learning with value function realizability but without Bellman completeness. Previous work by Xie and Jiang (2021) and Foster et al. (2022) left open the question whether a bounded concentrability coefficient along with trajectory-based offline data admits a polynomial sample complexity. In this work, we provide a negative answer to this question for… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2401.09681  [pdf, other

    cs.LG stat.ML

    Harnessing Density Ratios for Online Reinforcement Learning

    Authors: Philip Amortila, Dylan J. Foster, Nan Jiang, Ayush Sekhari, Tengyang Xie

    Abstract: The theories of offline and online reinforcement learning, despite having evolved in parallel, have begun to show signs of the possibility for a unification, with algorithms and analysis techniques for one setting often having natural counterparts in the other. However, the notion of density ratio modeling, an emerging paradigm in offline RL, has been largely absent from online RL, perhaps for goo… ▽ More

    Submitted 4 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: ICLR 2024

  5. arXiv:2311.08384  [pdf, other

    cs.LG cs.AI stat.ML

    Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees

    Authors: Yifei Zhou, Ayush Sekhari, Yuda Song, Wen Sun

    Abstract: Hybrid RL is the setting where an RL agent has access to both offline data and online data by interacting with the real-world environment. In this work, we propose a new hybrid RL algorithm that combines an on-policy actor-critic method with offline data. On-policy methods such as policy gradient and natural policy gradient (NPG) have shown to be more robust to model misspecification, though somet… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: The first two authors contributed equally

  6. arXiv:2310.06113  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    When is Agnostic Reinforcement Learning Statistically Tractable?

    Authors: Zeyu Jia, Gene Li, Alexander Rakhlin, Ayush Sekhari, Nathan Srebro

    Abstract: We study the problem of agnostic PAC reinforcement learning (RL): given a policy class $Π$, how many rounds of interaction with an unknown MDP (with a potentially large state and action space) are required to learn an $ε$-suboptimal policy with respect to $Π$? Towards that end, we introduce a new complexity measure, called the \emph{spanning capacity}, that depends solely on the set $Π$ and is ind… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  7. arXiv:2310.05926  [pdf

    physics.soc-ph cs.DL

    The using of bibliometric analysis to classify trends and future directions on ''Smart Farm''

    Authors: Paweena Suebsombut, Aicha Sekhari, Pradorn Sureepong, Pittawat Ueasangkomsate, Abdelaziz Bouras

    Abstract: Climate change has affected the cultivation in all countries with extreme drought, flooding, higher temperature, and changes in the season thus leaving behind the uncontrolled production. Consequently, the smart farm has become part of the crucial trend that is needed for application in certain farm areas. The aims of smart farm are to control and to enhance food production and productivity, and t… ▽ More

    Submitted 31 July, 2023; originally announced October 2023.

    Journal ref: 2017 International Conference on Digital Arts, Media and Technology (ICDAMT), Chiang Mai University, Mar 2017, Chiang Mai, Thailand. pp.136-141

  8. Training Evaluation in a Smart Farm using Kirkpatrick Model: A Case Study of Chiang Mai

    Authors: Suepphong Chernbumroong, Pradorn Sureephong, Paweena Suebsombut, Aicha Sekhari

    Abstract: Farmers can now use IoT to improve farm efficiency and productivity by using sensors for farm monitoring to enhance decision-making in areas such as fertilization, irrigation, climate forecast, and harvesting information. Local farmers in Chiang Mai, Thailand, on the other hand, continue to lack knowledge and experience with smart farm technology. As a result, the 'SUNSpACe' project, funded by the… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Journal ref: 2022 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT and NCON), Chiang Rai University, Jan 2022, Chiang Rai, Thailand. pp.463-466

  9. Chatbot Application to Support Smart Agriculture in Thailand

    Authors: Paweena Suebsombut, Pradorn Sureephong, Aicha Sekhari, Suepphong Chernbumroong, Abdelaziz Bouras

    Abstract: A chatbot is a software developed to help reply to text or voice conversations automatically and quickly in real time. In the agriculture sector, the existing smart agriculture systems just use data from sensing and internet of things (IoT) technologies that exclude crop cultivation knowledge to support decision-making by farmers. To enhance this, the chatbot application can be an assistant to far… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Journal ref: 2022 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT and NCON), Chiang Rai university, Jan 2022, Chiang Rai, Thailand. pp.364-367

  10. arXiv:2307.12926  [pdf, ps, other

    cs.LG cs.AI cs.HC

    Contextual Bandits and Imitation Learning via Preference-Based Active Queries

    Authors: Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

    Abstract: We consider the problem of contextual bandits and imitation learning, where the learner lacks direct knowledge of the executed action's reward. Instead, the learner can actively query an expert at each round to compare two actions and receive noisy preference feedback. The learner's objective is two-fold: to minimize the regret associated with the executed actions, while simultaneously, minimizing… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  11. arXiv:2307.04998  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Selective Sampling and Imitation Learning via Online Regression

    Authors: Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

    Abstract: We consider the problem of Imitation Learning (IL) by actively querying noisy expert for feedback. While imitation learning has been empirically successful, much of prior work assumes access to noiseless expert feedback which is not practical in many applications. In fact, when one only has access to noisy expert feedback, algorithms that rely on purely offline data (non-interactive IL) can be sho… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  12. arXiv:2306.15744  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Ticketed Learning-Unlearning Schemes

    Authors: Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Ayush Sekhari, Chiyuan Zhang

    Abstract: We consider the learning--unlearning paradigm defined as follows. First given a dataset, the goal is to learn a good predictor, such as one minimizing a certain loss. Subsequently, given any subset of examples that wish to be unlearnt, the goal is to learn, without the knowledge of the original training dataset, a good predictor that is identical to the predictor that would have been produced when… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Conference on Learning Theory (COLT) 2023

  13. arXiv:2212.10717  [pdf, other

    cs.LG cs.AI cs.CR cs.CY

    Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks

    Authors: Jimmy Z. Di, Jack Douglas, Jayadev Acharya, Gautam Kamath, Ayush Sekhari

    Abstract: We introduce camouflaged data poisoning attacks, a new attack vector that arises in the context of machine unlearning and other settings when model retraining may be induced. An adversary first adds a few carefully crafted points to the training dataset such that the impact on the model's predictions is minimal. The adversary subsequently triggers a request to remove a subset of the introduced poi… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  14. arXiv:2211.14250  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Model-Free Reinforcement Learning with the Decision-Estimation Coefficient

    Authors: Dylan J. Foster, Noah Golowich, Jian Qian, Alexander Rakhlin, Ayush Sekhari

    Abstract: We consider the problem of interactive decision making, encompassing structured bandits and reinforcement learning with general function approximation. Recently, Foster et al. (2021) introduced the Decision-Estimation Coefficient, a measure of statistical complexity that lower bounds the optimal regret for interactive decision making, as well as a meta-algorithm, Estimation-to-Decisions, which ach… ▽ More

    Submitted 12 August, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: V2 changes: Improved writing and added more examples

  15. arXiv:2210.06718  [pdf, other

    cs.LG

    Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient

    Authors: Yuda Song, Yifei Zhou, Ayush Sekhari, J. Andrew Bagnell, Akshay Krishnamurthy, Wen Sun

    Abstract: We consider a hybrid reinforcement learning setting (Hybrid RL), in which an agent has access to an offline dataset and the ability to collect experience via real-world online interaction. The framework mitigates the challenges that arise in both pure offline and online RL settings, allowing for the design of simple and highly effective algorithms, in both theory and practice. We demonstrate these… ▽ More

    Submitted 11 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 42 pages, 6 figures. Published at ICLR 2023. Code available at https://github.com/yudasong/HyQ

  16. arXiv:2210.06705  [pdf, ps, other

    cs.LG cs.AI math.OC

    From Gradient Flow on Population Loss to Learning with Stochastic Gradient Descent

    Authors: Satyen Kale, Jason D. Lee, Chris De Sa, Ayush Sekhari, Karthik Sridharan

    Abstract: Stochastic Gradient Descent (SGD) has been the method of choice for learning large-scale non-convex models. While a general analysis of when SGD works has been elusive, there has been a lot of recent progress in understanding the convergence of Gradient Flow (GF) on the population loss, partly due to the simplicity that a continuous-time analysis buys us. An overarching theme of our paper is provi… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  17. arXiv:2210.01940  [pdf, other

    cs.LG cs.AI cs.CR

    On the Robustness of Deep Clustering Models: Adversarial Attacks and Defenses

    Authors: Anshuman Chhabra, Ashwin Sekhari, Prasant Mohapatra

    Abstract: Clustering models constitute a class of unsupervised machine learning methods which are used in a number of application pipelines, and play a vital role in modern data science. With recent advancements in deep learning -- deep clustering models have emerged as the current state-of-the-art over traditional clustering approaches, especially for high-dimensional image datasets. While traditional clus… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted to the 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  18. arXiv:2206.13063  [pdf, other

    cs.LG math.OC math.ST stat.ML

    On the Complexity of Adversarial Decision Making

    Authors: Dylan J. Foster, Alexander Rakhlin, Ayush Sekhari, Karthik Sridharan

    Abstract: A central problem in online learning and decision making -- from bandits to reinforcement learning -- is to understand what modeling assumptions lead to sample-efficient learning guarantees. We consider a general adversarial decision making framework that encompasses (structured) bandit problems with adversarial rewards and reinforcement learning problems with adversarial dynamics. Our main result… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  19. arXiv:2206.12081  [pdf, other

    cs.LG stat.ME stat.ML

    Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings

    Authors: Masatoshi Uehara, Ayush Sekhari, Jason D. Lee, Nathan Kallus, Wen Sun

    Abstract: We study reinforcement learning with function approximation for large-scale Partially Observable Markov Decision Processes (POMDPs) where the state space and observation space are large or even continuous. Particularly, we consider Hilbert space embeddings of POMDP where the feature of latent states and the feature of observations admit a conditional Hilbert space embedding of the observation emis… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

  20. arXiv:2206.12020  [pdf, ps, other

    cs.LG math.ST stat.ME stat.ML

    Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems

    Authors: Masatoshi Uehara, Ayush Sekhari, Jason D. Lee, Nathan Kallus, Wen Sun

    Abstract: We study Reinforcement Learning for partially observable dynamical systems using function approximation. We propose a new \textit{Partially Observable Bilinear Actor-Critic framework}, that is general enough to include models such as observable tabular Partially Observable Markov Decision Processes (POMDPs), observable Linear-Quadratic-Gaussian (LQG), Predictive State Representations (PSRs), as we… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  21. arXiv:2206.09421  [pdf, other

    cs.LG

    Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation

    Authors: Christoph Dann, Yishay Mansour, Mehryar Mohri, Ayush Sekhari, Karthik Sridharan

    Abstract: Myopic exploration policies such as epsilon-greedy, softmax, or Gaussian noise fail to explore efficiently in some reinforcement learning tasks and yet, they perform well in many others. In fact, in practice, they are often selected as the top choices, due to their simplicity. But, for what tasks do such policies succeed? Can we give theoretical guarantees for their favorable performance? These cr… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: to appear at ICML 2022

  22. arXiv:2107.05074  [pdf, other

    cs.LG cs.AI

    SGD: The Role of Implicit Regularization, Batch-size and Multiple-epochs

    Authors: Satyen Kale, Ayush Sekhari, Karthik Sridharan

    Abstract: Multi-epoch, small-batch, Stochastic Gradient Descent (SGD) has been the method of choice for learning with large over-parameterized models. A popular theory for explaining why SGD works well in practice is that the algorithm has an implicit regularization that biases its output towards a good solution. Perhaps the theoretically most well understood learning setting for SGD is that of Stochastic C… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

  23. arXiv:2106.11519  [pdf, other

    cs.LG cs.AI eess.SY

    Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations

    Authors: Christoph Dann, Yishay Mansour, Mehryar Mohri, Ayush Sekhari, Karthik Sridharan

    Abstract: There have been many recent advances on provably efficient Reinforcement Learning (RL) in problems with rich observation spaces. However, all these works share a strong realizability assumption about the optimal value function of the true MDP. Such realizability assumptions are often too strong to hold in practice. In this work, we consider the more realistic setting of agnostic RL with rich obser… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  24. arXiv:2106.03243  [pdf, ps, other

    cs.LG

    Neural Active Learning with Performance Guarantees

    Authors: Pranjal Awasthi, Christoph Dann, Claudio Gentile, Ayush Sekhari, Zhilei Wang

    Abstract: We investigate the problem of active learning in the streaming setting in non-parametric regimes, where the labels are stochastically generated from a class of functions on which we make no assumptions whatsoever. We rely on recently proposed Neural Tangent Kernel (NTK) approximation tools to construct a suitable neural embedding that determines the feature space the algorithm operates on and the… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

    Comments: 30 pages

  25. arXiv:2103.03279  [pdf, ps, other

    cs.LG cs.AI

    Remember What You Want to Forget: Algorithms for Machine Unlearning

    Authors: Ayush Sekhari, Jayadev Acharya, Gautam Kamath, Ananda Theertha Suresh

    Abstract: We study the problem of unlearning datapoints from a learnt model. The learner first receives a dataset $S$ drawn i.i.d. from an unknown distribution, and outputs a model $\widehat{w}$ that performs well on unseen samples from the same distribution. However, at some point in the future, any training datapoint $z \in S$ can request to be unlearned, thus prompting the learner to modify its output mo… ▽ More

    Submitted 22 July, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

  26. arXiv:2006.13476  [pdf, other

    cs.LG math.OC stat.ML

    Second-Order Information in Non-Convex Stochastic Optimization: Power and Limitations

    Authors: Yossi Arjevani, Yair Carmon, John C. Duchi, Dylan J. Foster, Ayush Sekhari, Karthik Sridharan

    Abstract: We design an algorithm which finds an $ε$-approximate stationary point (with $\|\nabla F(x)\|\le ε$) using $O(ε^{-3})$ stochastic gradient and Hessian-vector products, matching guarantees that were previously available only under a stronger assumption of access to multiple queries with the same random seed. We prove a lower bound which establishes that this rate is optimal and---surprisingly---tha… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: Accepted to CONFERENCE ON LEARNING THEORY (COLT) 2020

  27. arXiv:2005.03789  [pdf, other

    cs.LG cs.AI stat.ML

    Reinforcement Learning with Feedback Graphs

    Authors: Christoph Dann, Yishay Mansour, Mehryar Mohri, Ayush Sekhari, Karthik Sridharan

    Abstract: We study episodic reinforcement learning in Markov decision processes when the agent receives additional feedback per step in the form of several transition observations. Such additional observations are available in a range of tasks through extended sensors or prior knowledge about the environment (e.g., when certain actions yield similar outcome). We formalize this setting using a feedback graph… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

  28. arXiv:1902.04686  [pdf, ps, other

    cs.LG math.OC stat.ML

    The Complexity of Making the Gradient Small in Stochastic Convex Optimization

    Authors: Dylan J. Foster, Ayush Sekhari, Ohad Shamir, Nathan Srebro, Karthik Sridharan, Blake Woodworth

    Abstract: We give nearly matching upper and lower bounds on the oracle complexity of finding $ε$-stationary points ($\| \nabla F(x) \| \leqε$) in stochastic convex optimization. We jointly analyze the oracle complexity in both the local stochastic oracle model and the global oracle (or, statistical learning) model. This allows us to decompose the complexity of finding near-stationary points into optimizatio… ▽ More

    Submitted 14 February, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

  29. arXiv:1811.09693  [pdf

    cs.CY

    Towards Realizing the Smart Product Traceability System

    Authors: Dharmendra Kumar Mishra, Aicha Sekhari, Sébastien Henry, Dharmendra Mishra, Yacine Ouzrout, Ajay Shrestha, Abdelaziz Bouras

    Abstract: The rapid technological enhancement and innovations in current days have changed people's thought. The use of Information Technology tools in people's daily life has changed their life style completely. The advent of various innovative smart products in the market has tremendous impact on people's lifestyle. They want to know their heart beat while they run, they need a smart car which makes them… ▽ More

    Submitted 7 November, 2018; originally announced November 2018.

    Journal ref: IEEE International Conference on Software, Knowledge, Information Management \& Applications (SKIMA 2015), Dec 2015, Kathmandu, Nepal. 2015

  30. arXiv:1811.06358  [pdf

    cs.CY

    Traceability as an integral part of supply chain logistics management: an analytical review

    Authors: Dharmendra Kumar Mishra, Sébastien Henry, Aicha Sekhari, Yacine Ouzrout

    Abstract: Purpose: Supply chain has become very complex today. There are multiple stakeholders at various points. All these stakeholders need to collaborate with each other in multiple directions for its effective and efficient management. The manufacturers need proper information and data about the product location, its processing history, raw materials, etc at each point so as to control the production pr… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Journal ref: International Conference on Logistics and Transport (ICLT 2015), Nov 2015, Lyon, France. 2015

  31. Jointly identifying opinion mining elements and fuzzy measurement of opinion intensity to analyze product features

    Authors: Haiqing Zhang, Aicha Sekhari, Yacine Ouzrout, Abdelaziz Bouras

    Abstract: Opinion mining mainly involves three elements: feature and feature-of relations, opinion expressions and the related opinion attributes (e.g. Polarity), and feature-opinion relations. Although many works have emerged to achieve its aim of gaining information, the previous researches typically handled each of the three elements in isolation, which cannot give sufficient information extraction resul… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

    Journal ref: Engineering Applications of Artificial Intelligence, Elsevier, 2016, 47, pp.122--139

  32. arXiv:1810.11059  [pdf, ps, other

    cs.LG math.OC stat.ML

    Uniform Convergence of Gradients for Non-Convex Learning and Optimization

    Authors: Dylan J. Foster, Ayush Sekhari, Karthik Sridharan

    Abstract: We investigate 1) the rate at which refined properties of the empirical risk---in particular, gradients---converge to their population counterparts in standard non-convex learning tasks, and 2) the consequences of this convergence for optimization. Our analysis follows the tradition of norm-based capacity control. We propose vector-valued Rademacher complexities as a simple, composable, and user-f… ▽ More

    Submitted 11 November, 2018; v1 submitted 25 October, 2018; originally announced October 2018.

    Comments: To appear in Neural Information Processing Systems (NIPS) 2018

  33. arXiv:1707.03979  [pdf, other

    cs.AI cs.LG

    A Brief Study of In-Domain Transfer and Learning from Fewer Samples using A Few Simple Priors

    Authors: Marc Pickett, Ayush Sekhari, James Davidson

    Abstract: Domain knowledge can often be encoded in the structure of a network, such as convolutional layers for vision, which has been shown to increase generalization and decrease sample complexity, or the number of samples required for successful learning. In this study, we ask whether sample complexity can be reduced for systems where the structure of the domain is unknown beforehand, and the structure a… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

    Comments: Accepted for ICML 2017 Workshop on Picky Learners

  34. arXiv:1607.07712  [pdf, other

    cs.CY

    Review on Telemonitoring of Maternal Health care Targeting Medical Cyber-Physical Systems

    Authors: Mohammod Abul Kashem, Md. Hanif Seddiqui, Nejib Moalla, Aicha Sekhari, Yacine Ouzrout

    Abstract: We aim to review available literature related to the telemonitoring of maternal health care for a comprehensive understanding of the roles of Medical Cyber-Physical-Systems (MCPS) as cutting edge technology in maternal risk factor management, and for understanding the possible research gap in the domain. In this regard, we search literature through google scholar and PubMed databases for published… ▽ More

    Submitted 26 April, 2016; originally announced July 2016.

    Comments: Submitted for the 1st International Conference on Advanced Information and Communication Technology (ICAICT), 2016 proceedings, 6 pages, LaTeX, 1 .png figures