Showing 1–22 of 22 results for author: Kachuee, M

  1. arXiv:2405.12933  [pdf, other]

    cs.CL cs.AI cs.LG

    Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs

    Authors: Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin

    Abstract: Large Language Models (LLMs) have shown remarkable capabilities in tasks such as summarization, arithmetic reasoning, and question answering. However, they encounter significant challenges in the domain of moral reasoning and ethical decision-making, especially in complex scenarios with multiple stakeholders. This paper introduces the Skin-in-the-Game (SKIG) framework, aimed at enhancing moral rea…

    Submitted 2 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: ACL 2024, long paper

  2. arXiv:2402.08968  [pdf, other]

    cs.AI

    GrounDial: Human-norm Grounded Safe Dialog Response Generation

    Authors: Siwon Kim, Shuyang Dai, Mohammad Kachuee, Shayan Ray, Tara Taghavi, Sungroh Yoon

    Abstract: Current conversational AI systems based on large language models (LLMs) are known to generate unsafe responses, agreeing to offensive user input or including toxic content. Previous research aimed to alleviate the toxicity, by fine-tuning LLM with manually annotated safe dialogue histories. However, the dependency on additional tuning requires substantial costs. To remove the dependency, we propos…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted to findings of EACL 2024

  3. arXiv:2306.04823  [pdf, other]

    cs.CL

    Data Augmentation for Improving Tail-traffic Robustness in Skill-routing for Dialogue Systems

    Authors: Ting-Wei Wu, Fatemeh Sheikholeslami, Mohammad Kachuee, Jaeyoung Do, Sungjin Lee

    Abstract: Large-scale conversational systems typically rely on a skill-routing component to route a user request to an appropriate skill and interpretation to serve the request. In such a system, the agent is responsible for serving thousands of skills and interpretations which create a long-tail distribution due to the natural frequency of requests. For example, the samples related to play music might be a t…

    Submitted 7 June, 2023; originally announced June 2023.

  4. arXiv:2305.10528  [pdf, other]

    cs.AI cs.CL cs.LG

    Scalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems

    Authors: Sarthak Ahuja, Mohammad Kachuee, Fateme Sheikholeslami, Weiqing Liu, Jaeyoung Do

    Abstract: Off-Policy reinforcement learning has been a driving force for the state-of-the-art conversational AIs leading to more natural human-agent interactions and improving the user satisfaction for goal-oriented agents. However, in large-scale commercial settings, it is often challenging to balance between policy improvements and experience continuity on the broad spectrum of applications handled by such…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 Industry Track

  5. arXiv:2209.08429  [pdf, other]

    cs.LG cs.AI cs.CL

    Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems

    Authors: Mohammad Kachuee, Sungjin Lee

    Abstract: Recently, self-learning methods based on user satisfaction metrics and contextual bandits have shown promising results to enable consistent improvements in conversational AI systems. However, directly targeting such metrics by off-policy bandit learning objectives often increases the risk of making abrupt policy changes that break the current user experience. In this study, we introduce a scalable…

    Submitted 17 September, 2022; originally announced September 2022.

    Report number: ACL 2023

  6. arXiv:2204.07135  [pdf, other]

    cs.LG cs.AI cs.CL cs.HC

    Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems

    Authors: Mohammad Kachuee, Jinseok Nam, Sarthak Ahuja, Jin-Myung Won, Sungjin Lee

    Abstract: Skill routing is an important component in large-scale conversational systems. In contrast to traditional rule-based skill routing, state-of-the-art systems use a model-based approach to enable natural conversations. To provide supervision signal required to train such models, ideas such as human annotation, replication of a rule-based system, relabeling based on user paraphrases, and bandit-based…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: NAACL 2022

  7. arXiv:2204.01916  [pdf, other]

    cs.LG cs.AI cs.NE

    Domain-Aware Contrastive Knowledge Transfer for Multi-domain Imbalanced Data

    Authors: Zixuan Ke, Mohammad Kachuee, Sungjin Lee

    Abstract: In many real-world machine learning applications, samples belong to a set of domains e.g., for product reviews each review belongs to a product category. In this paper, we study multi-domain imbalanced learning (MIL), the scenario that there is imbalance not only in classes but also in domains. In the MIL setting, different domains exhibit different patterns and there is a varying degree of simila…

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: ACL WASSA 2022

  8. arXiv:2011.05961  [pdf, other]

    cs.LG

    Real-Time Decentralized knowledge Transfer at the Edge

    Authors: Orpaz Goldstein, Mohammad Kachuee, Derek Shiell, Majid Sarrafzadeh

    Abstract: The proliferation of edge networks creates islands of learning agents working on local streams of data. Transferring knowledge between these agents in real-time without exposing private data allows for collaboration to decrease learning time and increase model confidence. Incorporating knowledge from data that a local model did not see creates an ability to debias a local model or add to classific…

    Submitted 1 October, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

  9. arXiv:2010.11230  [pdf, other]

    cs.LG cs.CL

    Self-Supervised Contrastive Learning for Efficient User Satisfaction Prediction in Conversational Agents

    Authors: Mohammad Kachuee, Hao Yuan, Young-Bum Kim, Sungjin Lee

    Abstract: Turn-level user satisfaction is one of the most important performance metrics for conversational agents. It can be used to monitor the agent's performance and provide insights about defective user experiences. Moreover, a powerful satisfaction model can be used as an objective function that a conversational agent continuously optimizes for. While end-to-end deep learning has shown promising result…

    Submitted 11 April, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: NAACL-HLT 2021

  10. arXiv:1912.09600  [pdf, other]

    cs.LG cs.NE stat.ML

    Group-Connected Multilayer Perceptron Networks

    Authors: Mohammad Kachuee, Sajad Darabi, Shayan Fazeli, Majid Sarrafzadeh

    Abstract: Despite the success of deep learning in domains such as image, voice, and graphs, there has been little progress in deep representation learning for domains without a known structure between features. For instance, a tabular dataset of different demographic and clinical factors where the feature interactions are not given as a prior. In this paper, we propose Group-Connected Multilayer Perceptron…

    Submitted 25 November, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

  11. arXiv:1912.08281  [pdf, other]

    cs.LG stat.ML

    Cost-Sensitive Feature-Value Acquisition Using Feature Relevance

    Authors: Kimmo Kärkkäinen, Mohammad Kachuee, Orpaz Goldstein, Majid Sarrafzadeh

    Abstract: In many real-world machine learning problems, feature values are not readily available. To make predictions, some of the missing features have to be acquired, which can incur a cost in money, computational time, or human time, depending on the problem domain. This leads us to the problem of choosing which features to use at the prediction time. The chosen features should increase the prediction ac…

    Submitted 18 December, 2019; v1 submitted 17 December, 2019; originally announced December 2019.

  12. arXiv:1910.01803  [pdf, other]

    cs.LG cs.AI stat.ML

    Unsupervised Representation for EHR Signals and Codes as Patient Status Vector

    Authors: Sajad Darabi, Mohammad Kachuee, Majid Sarrafzadeh

    Abstract: Effective modeling of electronic health records presents many challenges as they contain large amounts of irregularity most of which are due to the varying procedures and diagnosis a patient may have. Despite the recent progress in machine learning, unsupervised learning remains largely open, especially in the healthcare domain. In this work, we present a two-step unsupervised representation le…

    Submitted 4 October, 2019; originally announced October 2019.

  13. arXiv:1909.06772  [pdf, other]

    cs.LG stat.ML

    Target-Focused Feature Selection Using a Bayesian Approach

    Authors: Orpaz Goldstein, Mohammad Kachuee, Kimmo Karkkainen, Majid Sarrafzadeh

    Abstract: In many real-world scenarios where data is high dimensional, test time acquisition of features is a non-trivial task due to costs associated with feature acquisition and evaluating feature value. The need for highly confident models with an extremely frugal acquisition of features can be addressed by allowing a feature selection method to become target aware. We introduce an approach to feature se…

    Submitted 15 September, 2019; originally announced September 2019.

  14. arXiv:1908.03971  [pdf, other]

    cs.LG cs.CL stat.ML

    TAPER: Time-Aware Patient EHR Representation

    Authors: Sajad Darabi, Mohammad Kachuee, Shayan Fazeli, Majid Sarrafzadeh

    Abstract: Effective representation learning of electronic health records is a challenging task and is becoming more important as the availability of such data is becoming pervasive. The data contained in these records are irregular and contain multiple modalities such as notes, and medical codes. They are preempted by medical conditions the patient may have, and are typically jotted down by medical staff. A…

    Submitted 3 May, 2020; v1 submitted 11 August, 2019; originally announced August 2019.

  15. arXiv:1905.09340  [pdf, other]

    cs.LG cs.AI stat.ML

    Generative Imputation and Stochastic Prediction

    Authors: Mohammad Kachuee, Kimmo Karkkainen, Orpaz Goldstein, Sajad Darabi, Majid Sarrafzadeh

    Abstract: In many machine learning applications, we are faced with incomplete datasets. In the literature, missing data imputation techniques have been mostly concerned with filling missing values. However, the existence of missing values is synonymous with uncertainties not only over the distribution of missing values but also over target class assignments that require careful consideration. In this paper,…

    Submitted 4 September, 2020; v1 submitted 22 May, 2019; originally announced May 2019.

  16. Non-invasive Blood Pressure Estimation Using Phonocardiogram

    Authors: Amirhossein Esmaili, Mohammad Kachuee, Mahdi Shabany

    Abstract: This paper presents a novel approach based on pulse transit time (PTT) for the estimation of blood pressure (BP). In order to achieve this goal, a data acquisition hardware is designed for high-resolution sampling of phonocardiogram (PCG) and photoplethysmogram (PPG). These two signals can derive PTT values. Meanwhile, a force-sensing resistor (FSR) is placed under the cuff of the BP reference dev…

    Submitted 6 May, 2019; originally announced May 2019.

    Comments: The collected data set can be accessed using the following url link: http://www.kaggle.com/mkachuee/noninvasivebp

    Journal ref: 2017 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-4. IEEE, 2017

  17. arXiv:1902.07102  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Cost-Sensitive Diagnosis and Learning Leveraging Public Health Data

    Authors: Mohammad Kachuee, Kimmo Karkkainen, Orpaz Goldstein, Davina Zamanzadeh, Majid Sarrafzadeh

    Abstract: Traditionally, machine learning algorithms rely on the assumption that all features of a given dataset are available for free. However, there are many concerns such as monetary data collection costs, patient discomfort in medical procedures, and privacy impacts of data collection that require careful consideration in any real-world health analytics system. An efficient solution would only acquire…

    Submitted 30 June, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

  18. arXiv:1901.00243  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Opportunistic Learning: Budgeted Cost-Sensitive Learning from Data Streams

    Authors: Mohammad Kachuee, Orpaz Goldstein, Kimmo Karkkainen, Sajad Darabi, Majid Sarrafzadeh

    Abstract: In many real-world learning scenarios, features are only acquirable at a cost constrained under a budget. In this paper, we propose a novel approach for cost-sensitive feature acquisition at the prediction-time. The suggested method acquires features incrementally based on a context-aware feature-value function. We formulate the problem in the reinforcement learning paradigm, and introduce a rewar…

    Submitted 17 February, 2019; v1 submitted 1 January, 2019; originally announced January 2019.

    Comments: https://openreview.net/forum?id=S1eOHo09KX

    Journal ref: International Conference on Learning Representations (ICLR), 2019

  19. Nonlinear Cuff-less Blood Pressure Estimation of Healthy Subjects Using Pulse Transit Time and Arrival Time

    Authors: Amirhossein Esmaili, Mohammad Kachuee, Mahdi Shabany

    Abstract: This paper presents a novel blood pressure (BP) estimation method based on pulse transit time (PTT) and pulse arrival time (PAT) to estimate the systolic BP (SBP) and the diastolic BP (DBP). A data acquisition hardware is designed for high-resolution sampling of phonocardiogram (PCG), photoplethysmogram, and electrocardiogram (ECG). PCG and ECG perform as the proximal timing reference to obtain PT…

    Submitted 6 May, 2019; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: The collected data set can be accessed using the following url link: http://www.kaggle.com/mkachuee/noninvasivebp

    Journal ref: IEEE Transactions on Instrumentation and Measurement, 66(12), pp.3299-3308, December 2017

  20. Dynamic Feature Acquisition Using Denoising Autoencoders

    Authors: Mohammad Kachuee, Sajad Darabi, Babak Moatamed, Majid Sarrafzadeh

    Abstract: In real-world scenarios, different features have different acquisition costs at test-time which necessitates cost-aware methods to optimize the cost and performance trade-off. This paper introduces a novel and scalable approach for cost-aware feature acquisition at test-time. The method incrementally asks for features based on the available context that are known feature values. The proposed metho…

    Submitted 3 November, 2018; originally announced November 2018.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2018

  21. arXiv:1805.00794  [pdf, other]

    cs.CY cs.LG stat.ML

    ECG Heartbeat Classification: A Deep Transferable Representation

    Authors: Mohammad Kachuee, Shayan Fazeli, Majid Sarrafzadeh

    Abstract: Electrocardiogram (ECG) can be reliably used as a measure to monitor the functionality of the cardiovascular system. Recently, there has been great attention towards accurate categorization of heartbeats. While there are many commonalities between different ECG conditions, the focus of most studies has been classifying a set of conditions on a dataset annotated for that task rather than learning…

    Submitted 12 July, 2018; v1 submitted 19 April, 2018; originally announced May 2018.

  22. arXiv:1707.04364  [pdf, other]

    cs.CY

    Complex Event Processing of Health Data in Real-time to Predict Heart Failure Risk and Stress

    Authors: Sandeep Singh Sandha, Mohammad Kachuee, Sajad Darabi

    Abstract: In this paper, we develop a scalable system which can do real-time analytics for different health applications. The occurrence of different health conditions can be regarded as the complex events and thus this concept can be extended to other use cases easily. A large number of users should be able to send the data in real-time, and should be able to receive the feedback and result. Keeping the requ…

    Submitted 13 July, 2017; originally announced July 2017.