-
Human-AI collaboration is not very collaborative yet: A taxonomy of interaction patterns in AI-assisted decision making from a systematic review
Authors:
Catalina Gomez,
Sue Min Cho,
Shichang Ke,
Chien-Ming Huang,
Mathias Unberath
Abstract:
Leveraging Artificial Intelligence (AI) in decision support systems has disproportionately focused on technological advancements, often overlooking the alignment between algorithmic outputs and human expectations. A human-centered perspective attempts to alleviate this concern by designing AI solutions for seamless integration with existing processes. Determining what information AI should provide…
▽ More
Leveraging Artificial Intelligence (AI) in decision support systems has disproportionately focused on technological advancements, often overlooking the alignment between algorithmic outputs and human expectations. A human-centered perspective attempts to alleviate this concern by designing AI solutions for seamless integration with existing processes. Determining what information AI should provide to aid humans is vital, a concept underscored by explainable AI's efforts to justify AI predictions. However, how the information is presented, e.g., the sequence of recommendations and solicitation of interpretations, is equally crucial as complex interactions may emerge between humans and AI. While empirical studies have evaluated human-AI dynamics across domains, a common vocabulary for human-AI interaction protocols is lacking. To promote more deliberate consideration of interaction designs, we introduce a taxonomy of interaction patterns that delineate various modes of human-AI interactivity. We summarize the results of a systematic review of AI-assisted decision making literature and identify trends and opportunities in existing interactions across application domains from 105 articles. We find that current interactions are dominated by simplistic collaboration paradigms, leading to little support for truly interactive functionality. Our taxonomy offers a tool to understand interactivity with AI in decision-making and foster interaction designs for achieving clear communication, trustworthiness, and collaboration.
△ Less
Submitted 18 March, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Non-Autoregressive Sign Language Production via Knowledge Distillation
Authors:
Eui Jun Hwang,
Jung Ho Kim,
Suk Min Cho,
Jong C. Park
Abstract:
Sign Language Production (SLP) aims to translate expressions in spoken language into corresponding ones in sign language, such as skeleton-based sign poses or videos. Existing SLP models are either AutoRegressive (AR) or Non-Autoregressive (NAR). However, AR-SLP models suffer from regression to the mean and error propagation during decoding. NSLP-G, a NAR-based model, resolves these issues to some…
▽ More
Sign Language Production (SLP) aims to translate expressions in spoken language into corresponding ones in sign language, such as skeleton-based sign poses or videos. Existing SLP models are either AutoRegressive (AR) or Non-Autoregressive (NAR). However, AR-SLP models suffer from regression to the mean and error propagation during decoding. NSLP-G, a NAR-based model, resolves these issues to some extent but engenders other problems. For example, it does not consider target sign lengths and suffers from false decoding initiation. We propose a novel NAR-SLP model via Knowledge Distillation (KD) to address these problems. First, we devise a length regulator to predict the end of the generated sign pose sequence. We then adopt KD, which distills spatial-linguistic features from a pre-trained pose encoder to alleviate false decoding initiation. Extensive experiments show that the proposed approach significantly outperforms existing SLP models in both Frechet Gesture Distance and Back-Translation evaluation.
△ Less
Submitted 12 August, 2022;
originally announced August 2022.
-
Few-Layered Graphyne Growth by an On-Surface Coupling Reaction via Alkynyl Vapour Deposition
Authors:
Sohyeon Seo,
Jungsue Choi,
Soo Min Cho,
Seungeun Lee,
Hyoyoung Lee
Abstract:
A graphyne (GY) family composed of the triple (sp) and double (sp2) bonds-hybridized carbon atoms is a promising allotrope of carbon for the development of nanoscale electronic devices. Unlike graphene, the GY family containing carbon sp bonds remains an unsolved problem in efficient two-dimensional (2D) growth due to monomer instability and side reactions. Herein, we synthesize GY into a single 2…
▽ More
A graphyne (GY) family composed of the triple (sp) and double (sp2) bonds-hybridized carbon atoms is a promising allotrope of carbon for the development of nanoscale electronic devices. Unlike graphene, the GY family containing carbon sp bonds remains an unsolved problem in efficient two-dimensional (2D) growth due to monomer instability and side reactions. Herein, we synthesize GY into a single 2D layer by chemical vapor deposition (CVD) at low temperatures preventing unexpected reactions, using 1,3,5-tribromo-2,4,6-triethynylbenzene (TBTEB). The CVD-grown GY via a surface-confined coupling between TBTEB monomers on a copper surface can be rid of either one-dimensional (1D) or three-dimensional (3D) growths via side reactions. The CVD-grown GY exhibits a hexagonal lattice structure containing a single sp-carbon bond in a link between two adjacent hexagons, consisting with the structure of gamma-GY. This work opens a new avenue for synthesizing large-scale and single-crystalline GY that can be capable of versatile applications with theoretically expected properties.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
MEANTIME: Mixture of Attention Mechanisms with Multi-temporal Embeddings for Sequential Recommendation
Authors:
Sung Min Cho,
Eunhyeok Park,
Sungjoo Yoo
Abstract:
Recently, self-attention based models have achieved state-of-the-art performance in sequential recommendation task. Following the custom from language processing, most of these models rely on a simple positional embedding to exploit the sequential nature of the user's history. However, there are some limitations regarding the current approaches. First, sequential recommendation is different from l…
▽ More
Recently, self-attention based models have achieved state-of-the-art performance in sequential recommendation task. Following the custom from language processing, most of these models rely on a simple positional embedding to exploit the sequential nature of the user's history. However, there are some limitations regarding the current approaches. First, sequential recommendation is different from language processing in that timestamp information is available. Previous models have not made good use of it to extract additional contextual information. Second, using a simple embedding scheme can lead to information bottleneck since the same embedding has to represent all possible contextual biases. Third, since previous models use the same positional embedding in each attention head, they can wastefully learn overlap** patterns. To address these limitations, we propose MEANTIME (MixturE of AtteNTIon mechanisms with Multi-temporal Embeddings) which employs multiple types of temporal embeddings designed to capture various patterns from the user's behavior sequence, and an attention structure that fully leverages such diversity. Experiments on real-world data show that our proposed method outperforms current state-of-the-art sequential recommendation methods, and we provide an extensive ablation study to analyze how the model gains from the diverse positional information.
△ Less
Submitted 21 August, 2020; v1 submitted 19 August, 2020;
originally announced August 2020.
-
Automatic Tip Detection of Surgical Instruments in Biportal Endoscopic Spine Surgery
Authors:
Sue Min Cho,
Young-Gon Kim,
**hoon Jeong,
Ho-** Lee,
Namkug Kim
Abstract:
Some endoscopic surgeries require a surgeon to hold the endoscope with one hand and the surgical instruments with the other hand to perform the actual surgery with correct vision. Recent technical advances in deep learning as well as in robotics can introduce robotics to these endoscopic surgeries. This can have numerous advantages by freeing one hand of the surgeon, which will allow the surgeon t…
▽ More
Some endoscopic surgeries require a surgeon to hold the endoscope with one hand and the surgical instruments with the other hand to perform the actual surgery with correct vision. Recent technical advances in deep learning as well as in robotics can introduce robotics to these endoscopic surgeries. This can have numerous advantages by freeing one hand of the surgeon, which will allow the surgeon to use both hands and to use more intricate and sophisticated techniques. Recently, deep learning with convolutional neural network achieves state-of-the-art results in computer vision. Therefore, the aim of this study is to automatically detect the tip of the instrument, localize a point, and evaluate detection accuracy in biportal endoscopic spine surgery. The localized point could be used for the controller's inputs of robotic endoscopy in these types of endoscopic surgeries.
△ Less
Submitted 12 March, 2020; v1 submitted 6 November, 2019;
originally announced November 2019.
-
Direct Band Gap Semiconducting Holey Graphyne: Structure, Synthesis and Potential Applications
Authors:
Xinghui Liu1,
Soo Min Cho,
Shiru Lin,
Eunbhin Yun,
Eun Hee Baek,
Zhongfang Chen,
Do Hyun Ryu,
Hyoyoung Lee
Abstract:
Here we report two-dimensional (2D) single-crystalline holey-graphyne (HGY) created an interfacial two-solvent system through a Castro-Stephens coupling reaction from 1,3,5-tribromo-2,4,6-triethynylbenzene. HGY is a new type of 2D carbon allotrope whose structure is comprised of a pattern of six-vertex and eight-vertex rings. The carbon-carbon 2D network of HGY is alternately linked between benzen…
▽ More
Here we report two-dimensional (2D) single-crystalline holey-graphyne (HGY) created an interfacial two-solvent system through a Castro-Stephens coupling reaction from 1,3,5-tribromo-2,4,6-triethynylbenzene. HGY is a new type of 2D carbon allotrope whose structure is comprised of a pattern of six-vertex and eight-vertex rings. The carbon-carbon 2D network of HGY is alternately linked between benzene rings and sp (carbon-carbon triple bond) bonding. The ratio of the sp over sp2 bonding is 50%. It is confirmed that HGY is stable by DFT calculation. The vibrational, optic, and electric properties of HGY are investigated theoretically and experimentally. It is a p-type semiconductor that embraces a natural direct band gap (~ 1.0 eV) with high hole mobility and electron mobility at room temperature. This report is expected to help develop a new types of carbon-based semiconductor devices with high mobility.
△ Less
Submitted 8 July, 2019;
originally announced July 2019.